-
公开(公告)号:US12175379B2
公开(公告)日:2024-12-24
申请号:US17119651
申请日:2020-12-11
Inventor: Hongjian Shi , Wenbin Jiang , Xinwei Feng , Miao Yu , Huanyu Zhou , Meng Tian , Xueqian Wu , Xunchao Song
Abstract: The present disclosure discloses a method, apparatus, device, and storage medium for training a model, relates to the technical fields of knowledge graph, natural language processing, and deep learning. The method may include: acquiring a first annotation data set, the first annotation data set including sample data and a annotation classification result corresponding to the sample data; training a preset initial classification model based on the first annotation data set to obtain an intermediate model; performing prediction on the sample data in the first annotation data set using the intermediate model to obtain a prediction classification result corresponding to the sample data; generating a second annotation data set based on the sample data, the corresponding annotation classification result, and the corresponding prediction classification result; and training the intermediate model based on the second annotation data set to obtain a classification model.
-
12.
公开(公告)号:US11366819B2
公开(公告)日:2022-06-21
申请号:US16812062
申请日:2020-03-06
Inventor: Songtai Dai , Xinwei Feng , Miao Yu , Huanyu Zhou , Xunchao Song , Pengcheng Yuan
IPC: G06F16/2457 , G06N20/00 , G09B7/02
Abstract: A method for obtaining an answer to a question is provided. The method may include: acquiring a question; determining at least a part of articles in a preset article database as candidate articles, and determining first scores of the candidate articles respectively, the first score of any of the candidate articles representing a matching degree between the candidate article and the question; determining at least a part of texts in each of the candidate articles as candidate texts, and determining second scores of the candidate texts respectively, the second score of any of the candidate texts representing a matching degree between the candidate text and the question; and determining at least a part of the candidate texts as the answer based on a score set of each of the candidate texts, the score set of any of the candidate texts including the second score and the first score.
-
公开(公告)号:US11055373B2
公开(公告)日:2021-07-06
申请号:US16133483
申请日:2018-09-17
Inventor: Pengcheng Yuan , Renkai Yang , Xunchao Song , Xiaobo Liu , Xinwei Feng
IPC: G06F16/9535 , G06F40/247 , G06F16/955
Abstract: Embodiments of the disclosure disclose a method and apparatus for generating information. A specific embodiment of the method comprises: acquiring a historical click log, the historical click log comprising a historical search term and a clicked historical search result corresponding to the historical search term; determining whether matching clicked historical search results exist in the historical click log; establishing a synonymous relationship between historical search terms corresponding to the matching clicked historical search results, in response to determining the matching clicked historical search results existing in the historical click log; and generating a relational word list based on the established synonymous relationship. The embodiment helps to enrich the content of the relational word list, and improve the coverage of the relational word list.
-
14.
公开(公告)号:US11372942B2
公开(公告)日:2022-06-28
申请号:US16691017
申请日:2019-11-21
Inventor: Miao Yu , Xinwei Feng , Huanyu Zhou , Xunchao Song , Songtai Dai
IPC: G06F16/9536 , G06F16/9032 , G06F16/903 , G06F40/295
Abstract: Embodiments of the present disclosure provide a method, apparatus, computer device, and storage medium for verifying community question answer data. The method may include: acquiring a community question answer data set, and generating a plurality of question answer pairs based on the community question answer data set, a question answer pair including: a question, and a to-be-verified answer corresponding to the question; generating an authoritative data set based on data stored in at least one confidence source site; and performing an authority verification on the to-be-verified answer, based on a score of a similarity between the to-be-verified answer and authoritative data in the authoritative data set in at least one dimension.
-
公开(公告)号:US11216618B2
公开(公告)日:2022-01-04
申请号:US16538589
申请日:2019-08-12
Inventor: Xinwei Feng , Xunchao Song , Miao Yu , Huanyu Zhou , Shaoshun Kang
IPC: G06F17/00 , G06F40/30 , G06F16/33 , G06F40/295
Abstract: Embodiments of the present disclosure provide a query processing method and an apparatus, a server and a storage medium. The method includes: determining a word vector representation of a query sequence and an entity vector representation of the query sequence respectively based on respective words and respective entities included in the query sequence; determining a word vector representation of a paragraph and an entity vector representation of the paragraph respectively based on respective words and respective entities included in the paragraph; and determining a similarity between the query sequence and the paragraph according to the word vector representation of the query sequence, the entity vector representation of the query sequence, the word vector representation of the paragraph, and the entity vector representation of the paragraph.
-
公开(公告)号:US20210390428A1
公开(公告)日:2021-12-16
申请号:US17119651
申请日:2020-12-11
Inventor: Hongjian Shi , Wenbin Jiang , Xinwei Feng , Miao Yu , Huanyu Zhou , Meng Tian , Xueqian Wu , Xunchao Song
Abstract: The present disclosure discloses a method, apparatus, device, and storage medium for training a model, relates to the technical fields of knowledge graph, natural language processing, and deep learning. The method may include: acquiring a first annotation data set, the first annotation data set including sample data and a annotation classification result corresponding to the sample data; training a preset initial classification model based on the first annotation data set to obtain an intermediate model; performing prediction on the sample data in the first annotation data set using the intermediate model to obtain a prediction classification result corresponding to the sample data; generating a second annotation data set based on the sample data, the corresponding annotation classification result, and the corresponding prediction classification result; and training the intermediate model based on the second annotation data set to obtain a classification model.
-
公开(公告)号:US20210390260A1
公开(公告)日:2021-12-16
申请号:US17119323
申请日:2020-12-11
Inventor: Hongjian SHI , Wenbin JIANG , Xinwei FENG , Miao YU , Huanyu ZHOU , Meng Tian , Xueqian Wu , Xunchao Song
Abstract: The present disclosure discloses a method, apparatus, device, and storage medium for matching semantics, relates to the technical fields of knowledge graph, natural language processing, and deep learning. The method may include: acquiring a first text and a second text; acquiring language knowledge related to the first text and the second text; determining a target embedding vector based on the first text, the second text, and the language knowledge; and determining a semantic matching result of the first text and the second text, based on the target embedding vector.
-
公开(公告)号:US10983978B2
公开(公告)日:2021-04-20
申请号:US16161968
申请日:2018-10-16
Inventor: Yang Wang , Xi Chen , Pengcheng Yuan , Xunchao Song , Xiaobo Liu
Abstract: The present disclosure provides a method for updating a relational index, a storage medium and an electronic device. The method includes: reading out relational data of an entity to be operated from a disk to a memory; performing an updating operation on the relational data in the memory; storing the updated relational data into a memory relational index; writing content data of the entity to be operated into the disk; and synchronizing periodically the memory relational index to a disk relational index.
-
-
-
-
-
-
-