Invention Grant
- Patent Title: Stacked cross-modal matching
-
Application No.: US16138587Application Date: 2018-09-21
-
Publication No.: US11093560B2Publication Date: 2021-08-17
- Inventor: Kuang-Huei Lee , Gang Hua , Xi Chen , Houdong Hu , He Xiaodong
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Agency: Rainier Patents, P.S.
- Main IPC: G06F16/20
- IPC: G06F16/20 ; G06F16/951 ; G06N3/04 ; G06N3/08 ; G06F17/18 ; G06K9/62 ; G06F17/16 ; G06T7/11

Abstract:
The present concepts relate to matching data of two different modalities using two stages of attention. First data is encoded as a set of first vectors representing components of the first data, and second data is encoded as a set of second vectors representing components of the second data. In the first stage, the components of the first data are attended by comparing the first vectors and the second vectors to generate a set of attended vectors. In the second stage, the components of the second data are attended by comparing the second vectors and the attended vectors to generate a plurality of relevance scores. Then, the relevance scores are pooled to calculate a similarity score that indicates a degree of similarity between the first data and the second data.
Public/Granted literature
- US20200097604A1 STACKED CROSS-MODAL MATCHING Public/Granted day:2020-03-26
Information query