发明公开
- 专利标题: METHOD AND APPARATUS FOR EXTRACTING INFORMATION, ELECTRONIC DEVICE AND STORAGE MEDIUM
-
申请号: EP22191894.9申请日: 2022-08-24
-
公开(公告)号: EP4131024A1公开(公告)日: 2023-02-08
- 发明人: GAN, Jingru , WANG, Haiwei , LUO, Jinchang , CHEN, Kunbin , HE, Wei , WANG, Shuhui
- 申请人: Beijing Baidu Netcom Science Technology Co., Ltd.
- 申请人地址: CN Beijing 100085 2/F Baidu Campus, No. 10, Shangdi 10th Street, Haidian District
- 代理机构: Maiwald GmbH
- 优先权: CN202111006586 20210830
- 主分类号: G06F16/55
- IPC分类号: G06F16/55 ; G06F16/35
摘要:
A method for extracting information, includes: obtaining an information stream comprising text and an image; generating, according to the text, embedded representations of textual entity mentions and a textual similarity matrix of the textual entity mentions and candidate textual entities; generating, according to the image, embedded representations of image entity mentions and an image similarity matrix of the image entity mentions and candidate image entities; and determining, based on an optimal transport, target textual entities of the textual entity mentions and target image entities of the image entity mentions according to the embedded representations of the textual entity mentions, the embedded representations of the image entity mentions, the textual similarity matrix and the image similarity matrix.
信息查询