-
公开(公告)号:US20220108556A1
公开(公告)日:2022-04-07
申请号:US17552149
申请日:2021-12-15
Inventor: Yiyu PENG , Teng HU , Hua LU , Yongfeng CHEN
IPC: G06V30/418 , G06V30/412 , G06V30/10 , G06F40/103 , G06F16/93
Abstract: A method of comparing documents, an electronic device, and a readable storage medium are provided, which relate to the field of data processing technology, and specifically to the field of big data technology. In the present disclosure, an area division is performed on each document of two documents to be compared, according to a document layout of each document, so as to obtain at least two sets of comparison units. Each set of comparison units comprises comparison units for the two documents respectively and the comparison units for the two documents correspond to each other. Thus, a content comparison may be performed on between comparison units of each of the at least two sets, so as to obtain a content comparison result for each set of comparison units as a comparison result for the two documents.
-
公开(公告)号:US20210406619A1
公开(公告)日:2021-12-30
申请号:US17169112
申请日:2021-02-05
Inventor: Pengyuan LV , Xiaoqiang ZHANG , Shanshan LIU , Chengquan ZHANG , Qiming PENG , Sijin WU , Hua LU , Yongfeng CHEN
IPC: G06K9/72 , G06T7/70 , G06F40/30 , G06K9/46 , G06K9/00 , G06K9/32 , G06K9/20 , G06K9/62 , G06N20/00 , G06N5/04
Abstract: The present disclosure provides a method for visual question answering, which relates to fields of computer vision and natural language processing. The method includes: acquiring an input image and an input question; detecting visual information and position information of each of at least one text region in the input image; determining semantic information and attribute information of each of the at least one text region based on the visual information and the position information; determining a global feature of the input image based on the visual information, the position information, the semantic information, and the attribute information; determining a question feature based on the input question; and generating a predicted answer for the input image and the input question based on the global feature and the question feature. The present disclosure further provides a device for visual question answering, a computer device and a medium.
-
3.
公开(公告)号:US20250094789A1
公开(公告)日:2025-03-20
申请号:US18968810
申请日:2024-12-04
Inventor: Hua LU , Shilong FAN , Zeyang LEI , Bingjin CHEN , Siqi BAO , Hua WU
IPC: G06N3/0475
Abstract: A method for evaluating a large model, an electronic device and a computer readable storage medium are provided, which relate to a field of artificial intelligence technology, and in particular to fields of large models technology and deep learning technology. The method includes: evaluating a response information of each of M large language models for an input instruction based on a preset evaluation rule, so as to obtain a first evaluation information for each response information, where M is a positive integer greater than 1; evaluating, in response to the first evaluation information for the M large language models being consistent with each other, each response information in a plurality of evaluation dimensions, so as to obtain a second evaluation information for each response information; and determining an evaluation result representing a responsiveness of each large language model, according to the second evaluation information for each response information.
-
-