-
Publication number: US20190220749A1
Publication date: 2019-07-18
Application number: US16236570
Filing date: 2018-12-30
Inventor: Zhifan FENG, Chao LU, Yong ZHU, Ying LI
CPC classification number: G06N3/088, G06F17/278, G06F17/2785, G06N3/0454, G06N5/02, G06N5/022, G06N20/00
Abstract: The present disclosure provides a text processing method and device based on ambiguous entity words. The method includes: obtaining a context of a text to be disambiguated and at least two candidate entities represented by the text to be disambiguated; generating a semantic vector of the context based on a trained word vector model; generating a first entity vector of each of the at least two candidate entities based on a trained unsupervised neural network model; determining a similarity between the context and each candidate entity; and determining a target entity represented by the text to be disambiguated in the context.
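The final two steps of this abstract (scoring each candidate against the context, then picking the target entity) can be sketched as follows. This is a minimal illustration only: the abstract does not name the similarity measure, so cosine similarity is an assumption, and the vectors, entity names, and the `pick_target_entity` helper are hypothetical stand-ins for the outputs of the trained models.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors (assumed measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def pick_target_entity(context_vec, candidate_vecs):
    """Score every candidate entity against the context and return the best one."""
    scores = {name: cosine(context_vec, vec) for name, vec in candidate_vecs.items()}
    return max(scores, key=scores.get), scores

# Toy vectors standing in for the semantic vector of the context and the
# first entity vectors of two candidate entities (hypothetical values).
context_vec = [0.9, 0.1, 0.0]
candidates = {
    "Apple (company)": [0.8, 0.2, 0.1],
    "apple (fruit)": [0.1, 0.2, 0.9],
}
target, scores = pick_target_entity(context_vec, candidates)
print(target)
```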
-
Publication number: US20210383069A1
Publication date: 2021-12-09
Application number: US17117553
Filing date: 2020-12-10
Inventor: Zhijie LIU, Qi WANG, Zhifan FENG, Chunguang CHAI, Yong ZHU
IPC: G06F40/30, G06F40/295, G06F17/16
Abstract: A method, apparatus, device, and storage medium for linking an entity are provided, relating to the technical fields of knowledge graphs and deep learning. The method may include: acquiring a target text; determining at least one entity mention included in the target text and a candidate entity corresponding to each entity mention; determining an embedding vector of each candidate entity based on the candidate entity and a preset entity embedding vector determination model; determining context semantic information of the target text based on the target text and each embedding vector; determining type information of the at least one entity mention; and determining an entity linking result of the at least one entity mention, based on each embedding vector, the context semantic information, and the type information.
-
Publication number: US20210256051A1
Publication date: 2021-08-19
Application number: US17069410
Filing date: 2020-10-13
Inventor: Qi WANG, Zhifan FENG, Zhijie LIU, Chunguang CHAI, Yong ZHU
Abstract: A theme classification method based on multimodality relates to the field of knowledge graphs. The method includes obtaining text information and non-text information of an object to be classified. The non-text information includes at least one of visual information and audio information. The method also includes determining an entity set of the text information based on a pre-established knowledge base, and then extracting a text feature of the object based on the text information and the entity set. The method also includes determining a theme classification of the object based on the text feature and a non-text feature of the object.
-
Publication number: US20210216819A1
Publication date: 2021-07-15
Application number: US17149267
Filing date: 2021-01-14
Inventor: Wei HE, Shuangjie LI, Yabing SHI, Ye JIANG, Yang ZHANG, Yong ZHU
IPC: G06K9/62, G06F40/205, G06N20/00
Abstract: A method and an apparatus for extracting SPO triples, an electronic device, and a storage medium are related to the field of artificial intelligence technologies. The solution may include: inputting annotated training data into each of multiple extraction models; predicting SPO triples satisfying defined relations in the annotated training data through each of the multiple extraction models; combining the predicted SPO triples corresponding to each of the multiple extraction models; extracting SPO triples satisfying screening conditions from the combined SPO triples; in response to the SPO triples satisfying the screening conditions not satisfying output conditions, mining SPO triples with missing annotations from the annotated training data based on the SPO triples satisfying the screening conditions; supplementing the SPO triples with missing annotations into the annotated training data; and repeating the inputting, predicting, combining, extracting, mining and supplementing until the SPO triples satisfying the screening conditions satisfy the output conditions.
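One round of the combine, screen, and mine steps described in this abstract might look like the sketch below. The abstract does not define the screening conditions, so a minimum vote count across models is assumed here; `combine_and_screen`, `mine_missing`, and the toy triples are hypothetical illustrations, not the patented procedure.

```python
from collections import Counter

def combine_and_screen(predictions_per_model, min_votes):
    """Merge the triples predicted by each model and keep those that at
    least `min_votes` models agree on (assumed screening condition)."""
    votes = Counter()
    for triples in predictions_per_model:
        votes.update(set(triples))  # each model votes at most once per triple
    return {triple for triple, count in votes.items() if count >= min_votes}

def mine_missing(screened, annotated):
    """Screened triples that are absent from the current annotations."""
    return screened - annotated

# Toy predictions from three extraction models over the same training text.
model_a = [("Marie Curie", "born_in", "Warsaw"), ("Marie Curie", "field", "Physics")]
model_b = [("Marie Curie", "born_in", "Warsaw"), ("Marie Curie", "field", "Chemistry")]
model_c = [("Marie Curie", "born_in", "Warsaw"), ("Marie Curie", "field", "Physics")]
annotated = {("Marie Curie", "field", "Physics")}

screened = combine_and_screen([model_a, model_b, model_c], min_votes=2)
missing = mine_missing(screened, annotated)
annotated |= missing  # supplement the annotations before the next round
```

In a full loop, the models would be retrained on the supplemented annotations and the round repeated until the output conditions are met.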
-
Publication number: US20210216725A1
Publication date: 2021-07-15
Application number: US17147881
Filing date: 2021-01-13
Inventor: Shuangjie LI, Miao YU, Yabing SHI, Xuefeng HAO, Xunchao SONG, Ye JIANG, Yang ZHANG, Yong ZHU
IPC: G06F40/40, G06F40/289, G06N20/00
Abstract: A method and an apparatus for processing information are provided. The method can include: acquiring a word sequence obtained by performing word segmentation on two paragraphs in a text; inputting the word sequence into a to-be-trained natural language processing model to generate a word vector corresponding to a word in the word sequence; inputting the word vector into a preset processing layer of the to-be-trained natural language processing model; predicting whether the two paragraphs are adjacent, and a replaced word in the two paragraphs; and acquiring reference information of the two paragraphs, and training the to-be-trained natural language processing model to obtain a trained natural language processing model, based on the prediction result and the reference information.
-
Publication number: US20200057788A1
Publication date: 2020-02-20
Application number: US16539796
Filing date: 2019-08-13
Inventor: Fang HUANG, Shuangjie LI, Bingyang YU, Yabing SHI, Haijin LIANG, Yang ZHANG, Yong ZHU
IPC: G06F16/958, G06F16/953
Abstract: Embodiments of the present disclosure provide a method, an apparatus and a device for generating entity relationship data, and a storage medium. The method includes: obtaining webpage source data corresponding to a target webpage; identifying at least one key value block from the webpage source data, wherein the key value block comprises at least one key value pair; identifying body values corresponding to the at least one key value block from the webpage source data; and generating entity relationship data corresponding to the target webpage according to the key value blocks and the body values corresponding to the key value blocks. With the technical solution of the present disclosure, the webpage universality may be improved, labor cost may be reduced, and output quantity of the entity relationship data may be increased.
-
Publication number: US20190205384A1
Publication date: 2019-07-04
Application number: US16157204
Filing date: 2018-10-11
Inventor: Yong ZHU, Xunchao SONG, Ying LI, Yilin ZHANG
CPC classification number: G06F17/2785, G06F16/3344, G06F16/35
Abstract: The present disclosure provides a search method and device based on artificial intelligence and an electronic device. The search method based on artificial intelligence includes: obtaining a query; performing a word segmentation on the query to obtain a term sequence containing a plurality of terms; performing a structured analysis on the term sequence to generate a semantic pattern; performing a knowledge-based analysis on the term sequence based on the semantic pattern to generate a semantic analysis result; determining an understanding result corresponding to the query based on the semantic pattern and the semantic analysis result; and performing a search based on the understanding result corresponding to the query.
-
Publication number: US20210319335A1
Publication date: 2021-10-14
Application number: US17037612
Filing date: 2020-09-29
Inventor: Wenbin JIANG, Huanyu ZHOU, Meng TIAN, Ying LI, Xinwei FENG, Xunchao SONG, Pengcheng YUAN, Yajuan LYU, Yong ZHU
Abstract: The present disclosure discloses a question analysis method, a device, a knowledge base question answering system and electronic equipment. The method includes: analyzing a question to obtain N linearized sequences, N being an integer greater than 1; converting the N linearized sequences into N network topology maps; separately calculating a semantic matching degree of each of the N network topology maps to the question; and selecting, from the N network topology maps, the network topology map having the highest semantic matching degree to the question as a query graph of the question. According to the technology of the present disclosure, the query graph of the question can be obtained more accurately, and the accuracy of mapping the question to the query graph is improved, thereby improving the accuracy of question analysis.
-
Publication number: US20210216882A1
Publication date: 2021-07-15
Application number: US17025952
Filing date: 2020-09-18
Inventor: Fang HUANG, Shuangjie LI, Yabing SHI, Ye JIANG, Yang ZHANG, Yong ZHU
Abstract: A method and apparatus for generating a temporal knowledge graph, a device and a medium are provided. An embodiment of the method comprises: acquiring a corpus including time information; performing multivariate data extraction on the corpus, the multivariate data including an entity pair, an entity relationship and a target time interval of the entity relationship, the target time interval being used to indicate a valid period of the entity relationship; and generating a temporal knowledge graph based on the entity pair, the entity relationship and the target time interval of the entity relationship.
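The multivariate data named in this abstract (entity pair, entity relationship, and valid time interval) maps naturally onto a small record type. The sketch below is one possible representation under assumptions: the `TemporalFact` and `TemporalKG` names, the open-ended `end=None` convention, and the example facts are all hypothetical, and the extraction step itself is not shown.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional

@dataclass(frozen=True)
class TemporalFact:
    subject: str          # first entity of the pair
    relation: str         # entity relationship
    obj: str              # second entity of the pair
    start: date           # start of the valid period
    end: Optional[date]   # end of the valid period; None means still valid

class TemporalKG:
    """A flat store of facts, each carrying its own validity interval."""

    def __init__(self) -> None:
        self.facts: List[TemporalFact] = []

    def add(self, fact: TemporalFact) -> None:
        self.facts.append(fact)

    def valid_on(self, day: date) -> List[TemporalFact]:
        """Return the facts whose interval contains `day` (inclusive bounds)."""
        return [
            f for f in self.facts
            if f.start <= day and (f.end is None or day <= f.end)
        ]

kg = TemporalKG()
kg.add(TemporalFact("Alice", "works_at", "Acme", date(2015, 1, 1), date(2018, 12, 31)))
kg.add(TemporalFact("Alice", "works_at", "Globex", date(2019, 1, 1), None))
print([f.obj for f in kg.valid_on(date(2017, 6, 1))])
```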
-
Publication number: US20210216712A1
Publication date: 2021-07-15
Application number: US17149185
Filing date: 2021-01-14
Inventor: Shu WANG, Kexin REN, Xiaohan ZHANG, Zhifan FENG, Yang ZHANG, Yong ZHU
IPC: G06F40/279, G06F17/16, G06F17/18
Abstract: A method and an apparatus for labelling a core entity, and a related electronic device are proposed. A character vector sequence, a first word vector sequence and an entity vector sequence corresponding to a target text are obtained by performing character vector mapping, word vector mapping and entity vector mapping on the target text, so as to obtain a target vector sequence corresponding to the target text. A first probability that each character of the target text is a starting character of a core entity and a second probability that each character of the target text is an ending character of a core entity are determined by encoding and decoding the target vector sequence. One or more core entities of the target text are determined based on the first probability and the second probability.
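The last step of this abstract, turning per-character start and end probabilities into core entities, can be illustrated with a simple decoder. The abstract does not say how the two probabilities are combined, so the fixed threshold and the rule of pairing each start with the nearest end at or after it are assumptions; `decode_core_entities` and the probability values are hypothetical.

```python
def decode_core_entities(text, start_probs, end_probs, threshold=0.5):
    """Pair each likely start character with the nearest likely end character
    at or after it, and return the covered substrings (assumed decoding rule)."""
    starts = [i for i, p in enumerate(start_probs) if p >= threshold]
    ends = [i for i, p in enumerate(end_probs) if p >= threshold]
    entities = []
    for s in starts:
        e = next((e for e in ends if e >= s), None)
        if e is not None:
            entities.append(text[s:e + 1])
    return entities

text = "Paris is lovely"
# Hypothetical model outputs: high start probability on "P" (index 0),
# high end probability on "s" (index 4), low everywhere else.
start_probs = [0.9] + [0.1] * 14
end_probs = [0.1] * 4 + [0.8] + [0.1] * 10
print(decode_core_entities(text, start_probs, end_probs))
```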