Patent search: ap:("XIDIAN UNIVERSITY") AND inv:"Yujia Shi" (page 1)

1.

Granted invention patent
Bidirectional attention-based image-text cross-modal retrieval method (in force)

Publication (announcement) No.: US11373055B2

Publication (announcement) date: 2022-06-28

Application No.: US16946441

Filing date: 2020-06-22

Applicant: XIDIAN UNIVERSITY

Inventors: Jing Liu, Yujia Shi

IPC classification: G06K9/62, G06N3/08, G06V10/40, G06F16/44, G06V30/10

Abstract: The present disclosure provides a bidirectional attention-based image-text cross-modal retrieval method, applicable to cross-modal retrieval between natural images and electronic text. The method extracts initial image and text features with a neural network, then builds a bidirectional attention module that reconstructs those initial features so that the reconstructed features carry richer semantic information. The bidirectional attention module thus improves on the conventional feature extraction process, yielding higher-order features with richer image and text semantics, which are then used for image-text cross-modal retrieval.
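The abstract gives no formulas, so the following is only a minimal illustrative sketch of what a bidirectional attention step between image and text features could look like. It is not the patented method. It assumes that each image is represented by a list of region feature vectors and each text by a list of word feature vectors of the same dimension. It also assumes that "reconstruction" means reweighting each feature by a softmax over its mean cosine similarity to the other modality. The function names (`importance`, `reconstruct`, `match_score`) are hypothetical, and plain Python lists stand in for the neural-network features.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb + 1e-12)


def importance(feats, other):
    """Attention weight per feature (assumed form): softmax of its
    mean cosine similarity to all features of the other modality."""
    scores = [sum(cosine(f, o) for o in other) / len(other) for f in feats]
    return softmax(scores)


def reconstruct(feats, other):
    """Reweight each feature vector by its attention weight."""
    weights = importance(feats, other)
    return [[w * x for x in f] for w, f in zip(weights, feats)]


def match_score(image_feats, text_feats):
    """Apply attention in both directions, pool, and compare."""
    image_recon = reconstruct(image_feats, text_feats)  # text guides image
    text_recon = reconstruct(text_feats, image_feats)   # image guides text
    img_vec = [sum(col) for col in zip(*image_recon)]
    txt_vec = [sum(col) for col in zip(*text_recon)]
    return cosine(img_vec, txt_vec)


if __name__ == "__main__":
    image = [[1.0, 0.0], [0.2, 0.9]]  # two toy region features
    captions = {"a": [[1.0, 0.1]], "b": [[0.0, 1.0], [0.1, 0.8]]}
    # Rank candidate captions for the image by match score.
    ranked = sorted(captions, key=lambda k: match_score(image, captions[k]),
                    reverse=True)
    print(ranked)
```

Under these assumptions, retrieval in either direction reduces to ranking candidates by `match_score`: a query image is scored against candidate texts, or a query text against candidate images.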