Publication No.: US20240265041A1
Publication Date: 2024-08-08
Application No.: US18640448
Filing Date: 2024-04-19
Applicant: Pryon Incorporated
Inventors: Steven John Rennie, Marie Wenzel Meteer, David Nahamoo Nahamoo, Dominique O'donnell, Vaibhava Goel, Etienne Marcheret, Chul Sung, Igor Roditis Jablokov, Soonthorn Ativanichayaphong, Ajinkya Jitendra Zadbuke, Carmi Rothberg, Ellen Eide Kislal
IPC Classification: G06F16/332, G06N5/04, G06N20/00
CPC Classification: G06F16/3329, G06N5/04, G06N20/00
Abstract: Disclosed are methods, systems, devices, apparatus, media, and other implementations that include a method for document processing (particularly for training of a machine learning question answering platform, and for ingestion of documents). The method includes obtaining a question dataset (e.g., from public or private repositories of questions) comprising one or more source questions for document processing by a machine learning question-and-answer system that provides answer data in response to question data submitted by a user, modifying a source question from the question dataset to generate one or more augmented questions whose semantic meaning is equivalent to that of the source question, and processing a document with the one or more augmented questions.
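
The three steps named in the abstract (obtain a source question, generate augmented questions with equivalent meaning, process a document with them) can be illustrated with a minimal Python sketch. This is not the method claimed in the application: the paraphrase table `PARAPHRASES`, the functions `augment_question` and `pair_with_document`, and the sample question and document are all hypothetical, and simple phrase substitution stands in for whatever augmentation technique the disclosure actually uses.

```python
from itertools import product

# Hypothetical paraphrase table (an assumption for illustration only):
# each phrase maps to rewordings treated as semantically equivalent.
PARAPHRASES = {
    "How do I": ["How do I", "How can I", "What is the way to"],
    "fix": ["fix", "repair"],
}


def augment_question(source: str) -> list[str]:
    """Return rewordings of `source`, excluding the source itself."""
    base = source.strip()
    # Phrases from the table that actually occur in the question.
    matched = [phrase for phrase in PARAPHRASES if phrase in base]
    variants: list[str] = []
    # Try every combination of rewordings for the matched phrases.
    for choice in product(*(PARAPHRASES[p] for p in matched)):
        text = base
        for phrase, replacement in zip(matched, choice):
            text = text.replace(phrase, replacement)
        if text != base and text not in variants:
            variants.append(text)
    return variants


def pair_with_document(document: str, source: str) -> list[tuple[str, str]]:
    """Pair each augmented question with the document, e.g. as training input."""
    return [(question, document) for question in augment_question(source)]


if __name__ == "__main__":
    doc = "Open the rear tray and pull the jammed sheet out slowly."
    for question, _ in pair_with_document(doc, "How do I fix a paper jam?"):
        print(question)
```

In a real system the substitution table would presumably be replaced by a learned paraphrase model, and the resulting question and document pairs would feed training or document ingestion as the abstract describes.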