- 专利标题: Stacked cross-modal matching
-
申请号: US16138587申请日: 2018-09-21
-
公开(公告)号: US11093560B2公开(公告)日: 2021-08-17
- 发明人: Kuang-Huei Lee , Gang Hua , Xi Chen , Houdong Hu , He Xiaodong
- 申请人: Microsoft Technology Licensing, LLC
- 申请人地址: US WA Redmond
- 专利权人: Microsoft Technology Licensing, LLC
- 当前专利权人: Microsoft Technology Licensing, LLC
- 当前专利权人地址: US WA Redmond
- 代理机构: Rainier Patents, P.S.
- 主分类号: G06F16/20
- IPC分类号: G06F16/20 ; G06F16/951 ; G06N3/04 ; G06N3/08 ; G06F17/18 ; G06K9/62 ; G06F17/16 ; G06T7/11
摘要:
The present concepts relate to matching data of two different modalities using two stages of attention. First data is encoded as a set of first vectors representing components of the first data, and second data is encoded as a set of second vectors representing components of the second data. In the first stage, the components of the first data are attended by comparing the first vectors and the second vectors to generate a set of attended vectors. In the second stage, the components of the second data are attended by comparing the second vectors and the attended vectors to generate a plurality of relevance scores. Then, the relevance scores are pooled to calculate a similarity score that indicates a degree of similarity between the first data and the second data.
公开/授权文献
- US20200097604A1 STACKED CROSS-MODAL MATCHING 公开/授权日:2020-03-26
信息查询