Invention Grant
- Patent Title: Systems, apparatuses, and methods to generate synthetic queries from customer data for training of document querying machine learning models
-
Application No.: US16698080Application Date: 2019-11-27
-
Publication No.: US11475067B2Publication Date: 2022-10-18
- Inventor: Cicero Nogueira Dos Santos , Xiaofei Ma , Peng Xu , Ramesh M. Nallapati , Bing Xiang , Sudipta Sengupta , Zhiguo Wang , Patrick Ng
- Applicant: Amazon Technologies, Inc.
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Agency: Nicholson De Vos Webster & Elliott LLP
- Main IPC: G06F40/30
- IPC: G06F40/30 ; G06F16/9032 ; G06K9/62 ; G06F16/9038 ; G06N20/00 ; G06F16/903 ; G06F16/93 ; G06F40/20

Abstract:
Techniques for generation of synthetic queries from customer data for training of document querying machine learning (ML) models as a service are described. A service may receive one or more documents from a user, generate a set of question and answer pairs from the one or more documents from the user using a machine learning model trained to predict a question from an answer, and store the set of question and answer pairs generated from the one or more documents from the user. The question and answer pairs may be used to train another machine learning model, for example, a document ranking model, a passage ranking model, a question/answer model, or a frequently asked question (FAQ) model.
Public/Granted literature
Information query