-
1.
公开(公告)号:US20240256906A1
公开(公告)日:2024-08-01
申请号:US18401074
申请日:2023-12-29
Applicant: Samsung Electronics Co., Ltd.
Inventor: Vikas Yadav , Hyuk Joon Kwon , Vijay Srinivasan , Hongxia Jin
IPC: G06N5/02 , G06F40/295
CPC classification number: G06N5/02 , G06F40/295
Abstract: A method includes predicting, using the at least one processing device, a question type for each section of a document using a trained question type prediction model, each section including a different portion of the document. The method also includes generating, using the at least one processing device, multiple question-answer pairs using a trained question-answer generation model that receives the predicted question types and the document as input. Each question-answer pair includes (i) a question having a type corresponding to one of the predicted question types and being associated with content in the section corresponding to the type and (ii) an answer to the question. The method further includes outputting, using the at least one processing device, the question-answer pairs for use in training a question answering model.