- 专利标题: Joint embedding of corpus pairs for domain mapping
-
申请号: US16591100申请日: 2019-10-02
-
公开(公告)号: US11436487B2公开(公告)日: 2022-09-06
- 发明人: Ashish Jagmohan , Elham Khabiri , Richard B. Segal , Roman Vaculin
- 申请人: International Business Machines Corporation
- 申请人地址: US NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: US NY Armonk
- 代理机构: Amin, Turocy & Watson, LLP
- 主分类号: G06N20/00
- IPC分类号: G06N20/00 ; G06N3/08 ; G06F40/30 ; G06F16/36
摘要:
Techniques for outside-in mapping for corpus pairs are provided. In one example, a computer-implemented method comprises: inputting first keywords associated with a first domain corpus; extracting a first keyword of the first keywords; inputting second keywords associated with a second domain corpus; generating an embedded representation of the first keyword via a trained model and generating an embedded representation of the second keywords via the trained model; and scoring a joint embedding affinity associated with a joint embedding. The scoring the joint embedding affinity comprises: transforming the embedded representation of the first keyword and the embedded representation of the second keywords via the trained model; determining an affinity value based on comparing the first keyword to the second keywords; and based on the affinity value, aggregating the joint embedding of the embedded representation of the first keyword and the embedded representation of the second keywords within the second domain corpus.
公开/授权文献
- US20200034741A1 JOINT EMBEDDING OF CORPUS PAIRS FOR DOMAIN MAPPING 公开/授权日:2020-01-30
信息查询