-
Publication No.: US11574130B2
Publication date: 2023-02-07
Application No.: US17103649
Filing date: 2020-11-24
Inventors: Mihaela Ancuta Bornea, Lin Pan, Sara Rosenthal, Avirup Sil, Radu Florian
Abstract: A method includes receiving, by a question-answer system, a question in a first language and the question in a second language and predicting, by the question-answer system, a first answer to the question in the first language and a second answer to the question in the second language. The method also includes generating, by the question-answer system, a first vector representing the question in the first language and a second vector representing the question in the second language and adjusting the question-answer system based on the first and second answers and the first and second vectors such that when the question-answer system subsequently generates a third vector representing the question in the first language and a fourth vector representing the question in the second language, a distance between the third and fourth vectors is less than a distance between the first and second vectors.
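
The adjustment step in this abstract can be illustrated with a minimal sketch. It assumes a hypothetical linear encoder `W` and toy feature vectors, and it covers only the vector-distance part of the adjustment; the answer-prediction term that the abstract also uses is left out. None of the names below come from the patent.

```python
import math

def encode(W, x):
    """Apply a hypothetical linear encoder W to a feature vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def distance(W, x_a, x_b):
    """Euclidean distance between the two encoded question vectors."""
    va, vb = encode(W, x_a), encode(W, x_b)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(va, vb)))

def alignment_step(W, x_a, x_b, lr=0.1):
    """One gradient-descent step on the squared distance ||W x_a - W x_b||^2."""
    d = [a - b for a, b in zip(x_a, x_b)]
    diff = encode(W, d)  # equals W x_a - W x_b because the encoder is linear
    # The gradient with respect to W[i][j] is 2 * diff[i] * d[j].
    return [[w - lr * 2.0 * diff[i] * d[j] for j, w in enumerate(row)]
            for i, row in enumerate(W)]

# Toy inputs (assumptions): the same question featurized in two languages.
W = [[1.0, 0.0, 0.0, 0.0],
     [0.0, 1.0, 0.0, 0.0],
     [0.0, 0.0, 1.0, 1.0]]
x_first = [1.0, 0.0, 2.0, 0.0]
x_second = [0.0, 1.0, 2.0, 1.0]

before = distance(W, x_first, x_second)       # first vs. second vector
W_adjusted = alignment_step(W, x_first, x_second)
after = distance(W_adjusted, x_first, x_second)  # third vs. fourth vector
print(before, after)
```

After the step, the vectors produced for the same pair of questions are closer together than before, which is the property the abstract describes.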
-
Publication No.: US11966699B2
Publication date: 2024-04-23
Application No.: US17350116
Filing date: 2021-06-17
Inventors: Abhishek Shah, Ladislav Kunc, Haode Qi, Lin Pan, Saloni Potdar
IPC classification: G06F40/30, G06F16/33, G06F16/35, G06F40/284, G06N5/04, G06N20/00, G10L15/18, G06F40/263, G06F40/279, G06F40/295, G06F40/53
CPC classification: G06F40/284, G06F16/3344, G06F16/355, G06N5/04, G06N20/00, G10L15/1822, G06F40/263, G06F40/279, G06F40/295, G06F40/53
Abstract: A system for classifying a language sample intent by receiving a language sample including a set of features, identifying language sample features, determining a tokenization score for the language sample according to the language sample features, eliminating duplicate features according to the tokenization score, determining a term frequency (tf) according to the identified features and the tokenization score, determining an inverse document frequency (idf) according to the identified features and the tokenization score, and generating a term frequency-inverse document frequency (tf-idf) matrix for the identified features.
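
The tf-idf matrix at the end of this pipeline can be sketched as follows. This is a plain tf-idf computation over whitespace tokens, offered only as an illustration: the abstract does not define the tokenization score, so it is not modeled here, and the function name and sample sentences are assumptions.

```python
import math
from collections import Counter

def tfidf_matrix(samples):
    """Return (vocab, matrix); matrix[i][j] is the tf-idf of vocab[j] in samples[i]."""
    tokenized = [s.lower().split() for s in samples]
    # Building the vocabulary from a set drops duplicate features.
    vocab = sorted({tok for toks in tokenized for tok in toks})
    n = len(tokenized)
    # Document frequency: number of samples that contain each term.
    df = Counter(tok for toks in tokenized for tok in set(toks))
    # The +1 keeps a nonzero weight for terms that occur in every sample.
    idf = {t: math.log(n / df[t]) + 1.0 for t in vocab}
    matrix = []
    for toks in tokenized:
        counts = Counter(toks)
        total = len(toks) or 1
        matrix.append([counts[t] / total * idf[t] for t in vocab])
    return vocab, matrix

vocab, matrix = tfidf_matrix(["book a flight", "cancel a flight", "book a hotel"])
for row in matrix:
    print([round(v, 3) for v in row])
```

In this toy input, a term such as "cancel" that appears in one sample receives a higher weight than "a", which appears in all three.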
-
Publication No.: US11853712B2
Publication date: 2023-12-26
Application No.: US17303728
Filing date: 2021-06-07
Inventors: Haode Qi, Lin Pan, Abhishek Shah, Ladislav Kunc, Saloni Potdar
Abstract: A method, computer system, and computer program product for multi-lingual chatlog training are provided. The embodiment may include receiving, by a processor, a plurality of data related to conversational data in multiple languages. The embodiment may also include assigning an intent label to each conversational data. The embodiment may further include assigning a language label to each conversational data. The embodiment may also include pairing the plurality of the data related to the conversational data according to the intent label and the language label. The embodiment may further include training a machine learning model using a multi-lingual and multi-intent conversational data pairing. The embodiment may also include training the machine learning model using a single language and multi-intent conversational data pairing.
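
One possible reading of the pairing step is sketched below, under the assumption that the multi-lingual pairing holds pairs whose language labels differ and the single-language pairing holds pairs whose language labels match, with each pair carrying a flag for whether the intent labels agree. The abstract does not spell out the pairing rule, so the function, record layout, and sample utterances are all hypothetical.

```python
from itertools import combinations

def build_pairings(records):
    """records: list of (text, intent_label, language_label) tuples.

    Returns two lists of (text_a, text_b, same_intent) triples:
    pairs that cross languages and pairs within a single language.
    """
    multi_lingual, single_language = [], []
    for a, b in combinations(records, 2):
        pair = (a[0], b[0], a[1] == b[1])
        if a[2] != b[2]:
            multi_lingual.append(pair)
        else:
            single_language.append(pair)
    return multi_lingual, single_language

# Toy chat utterances (assumptions) with intent and language labels.
records = [
    ("book a flight", "book", "en"),
    ("reservar un vuelo", "book", "es"),
    ("cancel my trip", "cancel", "en"),
    ("cancelar mi viaje", "cancel", "es"),
]

multi_lingual, single_language = build_pairings(records)
print(len(multi_lingual), len(single_language))
```

A model could then be trained on the two pairings in sequence, as the abstract outlines; the training loop itself is outside the scope of this sketch.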
-
Publication No.: US20190188271A1
Publication date: 2019-06-20
Application No.: US15844289
Filing date: 2017-12-15
Inventors: James William Murdock, Eun Ha, Chung-Wei Hang, Kazi Hasan, Nisarga Markandaiah, Christopher Munjal Nolan, Lin Pan, Sai Prathyusha Peddi, Mary Diane Swift
IPC classification: G06F17/30
CPC classification: G06F16/24578, G06F16/24575, G06F16/3329, G06F16/3344
Abstract: Systems and methods for generating answers to questions. One method includes receiving a question having question terms; identifying candidate answers to the question having answer terms; searching data sources to determine passages including either a question term or an answer term in the candidate answer; scoring the passages for candidate answers using a scoring mechanism, the scoring mechanism computing a first degree of relevance of the passage to the question terms, computing a second degree of relevance of the passage to the answer terms of one of the candidate answers, and determining a score for the passage by combining the first degree of relevance and the second degree of relevance; ranking candidate answers to the question based on the scores associated with scoring each of the passages for each of the candidate answers; and providing an answer to the question based on the ranking of the candidate answers.
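
The scoring and ranking steps can be sketched as follows. The relevance measure (fraction of terms found in the passage), the weighted-sum combination, and the use of each candidate's best passage score are all assumptions chosen for illustration; the abstract only says that two degrees of relevance are combined.

```python
def overlap(passage_tokens, terms):
    """Fraction of the given terms that occur in the passage (stand-in relevance)."""
    terms = set(terms)
    return len(terms & passage_tokens) / len(terms) if terms else 0.0

def rank_candidates(question_terms, candidates, passages, weight=0.5):
    """Score each candidate by its best passage and return candidates, best first."""
    scores = {}
    for cand in candidates:
        answer_terms = cand.lower().split()
        best = 0.0
        for passage in passages:
            tokens = set(passage.lower().split())
            # Consider only passages containing a question term or an answer term.
            if not tokens & (set(question_terms) | set(answer_terms)):
                continue
            q_rel = overlap(tokens, question_terms)  # first degree of relevance
            a_rel = overlap(tokens, answer_terms)    # second degree of relevance
            best = max(best, weight * q_rel + (1.0 - weight) * a_rel)
        scores[cand] = best
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

# Toy question, candidates, and passages (assumptions).
ranking = rank_candidates(
    ["capital", "france"],
    ["Paris", "Lyon"],
    ["paris is the capital of france", "lyon is a city in france"],
)
print(ranking)
```

The top-ranked candidate would be returned as the answer. Other combinations, such as a product of the two relevance values or a sum over passages, would fit the abstract equally well.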
-
Publication No.: US10810215B2
Publication date: 2020-10-20
Application No.: US15844289
Filing date: 2017-12-15
Inventors: James William Murdock, IV, Eun Ha, Chung-Wei Hang, Kazi Hasan, Nisarga Markandaiah, Christopher Munjal Nolan, Lin Pan, Sai Prathyusha Peddi, Mary Diane Swift
IPC classification: G06F7/02, G06F16/00, G06F16/2457, G06F40/30, G06F16/332, G06F16/33
Abstract: Systems and methods for generating answers to questions. One method includes receiving a question having question terms; identifying candidate answers to the question having answer terms; searching data sources to determine passages including either a question term or an answer term in the candidate answer; scoring the passages for candidate answers using a scoring mechanism, the scoring mechanism computing a first degree of relevance of the passage to the question terms, computing a second degree of relevance of the passage to the answer terms of one of the candidate answers, and determining a score for the passage by combining the first degree of relevance and the second degree of relevance; ranking candidate answers to the question based on the scores associated with scoring each of the passages for each of the candidate answers; and providing an answer to the question based on the ranking of the candidate answers.