Bidirectional attention-based image-text cross-modal retrieval method

    公开(公告)号:US11373055B2

    公开(公告)日:2022-06-28

    申请号:US16946441

    申请日:2020-06-22

    申请人: XIDIAN UNIVERSITY

    发明人: Jing Liu Yujia Shi

    摘要: The present disclosure provides a bidirectional attention-based image-text cross-modal retrieval method, applicable for cross-modal retrieval between natural image and electronic text. The present disclosure extracts initial image and text features by using a neural network, and builds a bidirectional attention module to reconstruct the initial image and text features extracted by the neural network, the reconstructed features containing richer semantic information. By using the bidirectional attention module, the present disclosure improves the conventional feature extraction process, obtaining higher-order features with richer image and text semantics, thereby realizing image-text cross-modal retrieval.