-
公开(公告)号:US20230073550A1
公开(公告)日:2023-03-09
申请号:US17988065
申请日:2022-11-16
Inventor: Han LIU , Teng Hu , Shikun Feng , Yongfeng Chen
IPC: G06F40/30 , G06F40/40 , G06F40/284
Abstract: A method for extracting text information includes: acquiring a text to be extracted and a target field name; extracting candidate text information matching the target field name from the text to be extracted based on the text to be extracted and the target field name; and acquiring target text information matching fusion semantics of the text to be extracted, the target field name and the candidate text information by filtering the candidate text information based on the fusion semantics. Therefore, when the candidate text information matching the target field name is extracted from the text to be extracted, the candidate text information is filtered based on the fusion semantics of the text to be extracted, the target field name and the candidate text information, which improves the accuracy of extracting text information.
-
公开(公告)号:US12131728B2
公开(公告)日:2024-10-29
申请号:US17828773
申请日:2022-05-31
Inventor: Siyu Ding , Chao Pang , Shuohuan Wang , Yanbin Zhao , Junyuan Shang , Yu Sun , Shikun Feng , Hao Tian , Hua Wu , Haifeng Wang
CPC classification number: G10L15/063 , G10L15/02 , G10L15/18
Abstract: The present application provides a method of training a natural language processing model, which relates to a field of artificial intelligence, and in particular to a field of natural language processing. A specific implementation scheme includes: performing a semantic learning for multi-tasks on an input text, so as to obtain a semantic feature for the multi-tasks, wherein the multi-tasks include a plurality of branch tasks; performing a feature learning for each branch task based on the semantic feature, so as to obtain a first output result for each branch task; calculating a loss for each branch task according to the first output result for the branch task; and adjusting a parameter of the natural language processing model according to the loss for each branch task. The present application further provides a method of processing a natural language, an electronic device, and a storage medium.
-
公开(公告)号:US20230222827A1
公开(公告)日:2023-07-13
申请号:US18181800
申请日:2023-03-10
Inventor: Wenjin Wang , Zhengjie Huang , Bin Luo , Qiming Peng , Weichong Yin , Shikun Feng , Shiwei Huang , Jingzhou He
IPC: G06V30/414 , G06V30/18 , G06F40/30 , G06F40/295
CPC classification number: G06V30/414 , G06F40/30 , G06F40/295 , G06V30/18143
Abstract: In a method for processing a document image, a document image to be processed is acquired. Text nodes of multiple granularities, visual nodes of multiple granularities, respective node information of the text nodes, and respective node information of the visual nodes in the document image are obtained. A multi-granularity and multi-modality document graph is construct based on the text nodes of multiple granularities, the visual nodes of multiple granularities, the respective node information of the text nodes and the respective node information of the visual nodes. Multi-granularity semantic feature information of the document image is determined based on the multi-granularity and multi-modality document graph, the respective node information of the text nodes and the respective node information of the visual nodes.
-
公开(公告)号:US20230073994A1
公开(公告)日:2023-03-09
申请号:US17988107
申请日:2022-11-16
Inventor: Han LIU , Teng Hu , Shikun Feng , Yongfeng Chen
Abstract: A method for extracting text information includes: acquiring a text to be extracted and a target field name; extracting candidate text information matching the target field name from the text to be extracted based on the text to be extracted and the target field name; and acquiring target text information matching fusion semantics of the text to be extracted, the target field name and the candidate text information by filtering the candidate text information based on the fusion semantics. Therefore, when the candidate text information matching the target field name is extracted from the text to be extracted, the candidate text information is filtered based on the fusion semantics of the text to be extracted, the target field name and the candidate text information, which improves the accuracy of extracting text information.
-
-
-