-
Publication number: US12141553B2
Publication date: 2024-11-12
Application number: US17847113
Application date: 2022-06-22
Applicant: Amazon Technologies, Inc.
Inventor: Praphruetpong Athiwaratkun , Zixuan Lin , Ramana Keerthi , Zijian Wang , Yuchen Tian , Hantian Ding , Sri Ranga Akhilesh Bontala , Matthew Lee , Yanitsa Donchev , Ramesh M Nallapati , Parminder Bhatia , Andrew Oliver Arnold , Bing Xiang , Sudipta Sengupta , Rama Krishna Sandeep Pokkunuri , Srinivas Iragavarapu , Atul Deo , Ankur Deepak Desai
Abstract: Evaluation data sets may be programmatically generated for code generation models. An evaluation data set is obtained that includes items that correspond to different evaluation tests for a code generation system. The individual items of the evaluation data set may be converted, including converting the function signature and the test statements for the items and using a code generation system to generate the body of the function.
-
Publication number: US12014155B2
Publication date: 2024-06-18
Application number: US17847115
Application date: 2022-06-22
Applicant: Amazon Technologies, Inc.
Inventor: Praphruetpong Athiwaratkun , Yuchen Tian , Mingyue Shang , Zijian Wang , Ramesh M Nallapati , Parminder Bhatia , Andrew Oliver Arnold , Bing Xiang , Sudipta Sengupta , Yanitsa Donchev , Srinivas Iragavarapu , Matthew Lee , Vamshidhar Krishnamurthy Dantu , Atul Deo , Ankur Deepak Desai
IPC: G06F8/33
CPC classification number: G06F8/33
Abstract: Prefix matching may constrain the generation of next token predictions. Input text on which to perform a next token prediction may be received. Multiple tokens may be determined from the input text, including a partial token. From the possible tokens, one or more possible tokens that match the partial token may be identified. Next token predictions may then be filtered using the identified possible tokens in order to ensure that the partial token is matched.
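The filtering step in this abstract can be illustrated with a minimal sketch. It is not the patented method: the function name `filter_predictions`, the use of a plain dictionary of token probabilities, and the renormalization step are all assumptions made for illustration, and "matching" is simplified to "the candidate token starts with the partial token".

```python
from typing import Dict


def filter_predictions(scores: Dict[str, float], partial: str) -> Dict[str, float]:
    """Keep only candidate next tokens that begin with the partial token.

    `scores` maps candidate tokens to probabilities (hypothetical input shape).
    The surviving probabilities are renormalized so they sum to 1.
    """
    # Assumption: a candidate "matches" when it starts with the partial token.
    kept = {tok: p for tok, p in scores.items() if tok.startswith(partial)}
    total = sum(kept.values())
    if total == 0:
        # No candidate is consistent with the partial token.
        return {}
    return {tok: p / total for tok, p in kept.items()}


if __name__ == "__main__":
    candidates = {"print": 0.5, "private": 0.25, "return": 0.25}
    print(filter_predictions(candidates, "pri"))
```

With the partial token `pri`, only `print` and `private` survive, so a prediction such as `return` can no longer contradict what the user has already typed.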
-
Publication number: US20230418566A1
Publication date: 2023-12-28
Application number: US17847113
Application date: 2022-06-22
Applicant: Amazon Technologies, Inc.
Inventor: Praphruetpong Athiwaratkun , Zixuan Lin , Ramana Keerthi , Zijian Wang , Yuchen Tian , Hantian Ding , Sri Ranga Akhilesh Bontala , Matthew Lee , Yanitsa Donchev , Ramesh M Nallapati , Parminder Bhatia , Andrew Oliver Arnold , Bing Xiang , Sudipta Sengupta , Rama Krishna Sandeep Pokkunuri , Srinivas Iragavarapu , Atul Deo , Ankur Deepak Desai
CPC classification number: G06F8/33 , G06F8/447 , G06F11/3608
Abstract: Evaluation data sets may be programmatically generated for code generation models. An evaluation data set is obtained that includes items that correspond to different evaluation tests for a code generation system. The individual items of the evaluation data set may be converted, including converting the function signature and the test statements for the items and using a code generation system to generate the body of the function.
-
Publication number: US20230418565A1
Publication date: 2023-12-28
Application number: US17847112
Application date: 2022-06-22
Applicant: Amazon Technologies, Inc.
Inventor: Sathish Arumugam Selvaraj , Qiang Yu , Venkat Rakshith Reddy Swamireddy , Matthew Lee , Lei Gao , Wei Fang , Rama Krishna Sandeep Pokkunuri , Ramesh M Nallapati , Srinivas Iragavarapu , Alexander Johannes Smola , Sudipta Sengupta , Wasi Uddin Ahmad , Parminder Bhatia , Atul Deo , Ankur Deepak Desai , Bing Xiang , Andrew Oliver Arnold
IPC: G06F8/33 , G06F16/332
CPC classification number: G06F8/33 , G06F16/3322
Abstract: Code completion suggestions may be proactively obtained and validated. An event that triggers obtaining a code completion suggestion for inclusion in a code file being edited using an integrated development environment may be detected. The code completion suggestion may be obtained. The characters of the code completion suggestion may be compared with characters added to the code file after the detection of the event that triggered obtaining the code completion suggestion to determine whether the code completion suggestion is valid. A valid code completion suggestion may then be displayed.
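The validity check described in this abstract can be sketched as a simple character comparison. This is a hedged illustration only: the function name `validate_suggestion`, its parameters, and the choice to return just the not-yet-typed remainder are assumptions, not details taken from the filing.

```python
from typing import Optional


def validate_suggestion(suggestion: str, typed_since_trigger: str) -> Optional[str]:
    """Return the part of the suggestion still worth displaying, or None.

    `typed_since_trigger` holds the characters the user added to the file
    after the event that triggered the suggestion request (hypothetical name).
    """
    # The suggestion is treated as valid only if it agrees, character by
    # character, with everything typed since the trigger.
    if not suggestion.startswith(typed_since_trigger):
        return None
    remainder = suggestion[len(typed_since_trigger):]
    # Assumption: nothing is shown when the user already typed all of it.
    return remainder or None


if __name__ == "__main__":
    print(validate_suggestion("range(10):", "ra"))
    print(validate_suggestion("range(10):", "rx"))
```

Fetching the suggestion early and validating it against later keystrokes lets an editor hide request latency without showing a suggestion that conflicts with what the user typed in the meantime.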
-
Publication number: US11526557B2
Publication date: 2022-12-13
Application number: US16697979
Application date: 2019-11-27
Applicant: Amazon Technologies, Inc.
Inventor: Zhiguo Wang , Zhiheng Huang , Ramesh M. Nallapati , Bing Xiang
IPC: G06F16/9038 , G06F16/908 , G06F16/93 , G06N20/00
Abstract: Techniques for displaying a search are described. An exemplary method includes receiving a search query; performing the search query on a plurality of documents, the documents including text passages, to generate a search query result; determining an aspect of the search query result that has a confidence value that exceeds a first confidence threshold with respect to its relevance to the search query; and displaying the search result including an emphasis on the aspect of the result that exceeds the first confidence threshold.
-
Publication number: US11475067B2
Publication date: 2022-10-18
Application number: US16698080
Application date: 2019-11-27
Applicant: Amazon Technologies, Inc.
Inventor: Cicero Nogueira Dos Santos , Xiaofei Ma , Peng Xu , Ramesh M. Nallapati , Bing Xiang , Sudipta Sengupta , Zhiguo Wang , Patrick Ng
IPC: G06F40/30 , G06F16/9032 , G06K9/62 , G06F16/9038 , G06N20/00 , G06F16/903 , G06F16/93 , G06F40/20
Abstract: Techniques for generation of synthetic queries from customer data for training of document querying machine learning (ML) models as a service are described. A service may receive one or more documents from a user, generate a set of question and answer pairs from the one or more documents from the user using a machine learning model trained to predict a question from an answer, and store the set of question and answer pairs generated from the one or more documents from the user. The question and answer pairs may be used to train another machine learning model, for example, a document ranking model, a passage ranking model, a question/answer model, or a frequently asked question (FAQ) model.
-
Publication number: US12271698B1
Publication date: 2025-04-08
Application number: US17537273
Application date: 2021-11-29
Applicant: Amazon Technologies, Inc.
Inventor: Jun Wang , Sudipta Sengupta , Zhiguo Wang , Ramesh M Nallapati , Bing Xiang
IPC: G06F40/295 , G06F16/2452 , G06F16/2458 , G06F40/284
Abstract: A schema and cell value aware Named Entity Recognition (NER) model is used to perform natural language queries. Natural language queries may be received via an interface of a natural language query processing system. A fuzzy search may be performed that allows non-exact matches for column names or cell values of data sets potentially used to answer the natural language query. An NER model that adds a type embedding for an exact match of a column name or cell found in the fuzzy search that corresponds to a span of one or more words may be applied as part of generating the entity prediction for the natural language query. One or more queries to at least one of the data sets may be performed to return a result to the natural language query using the entity prediction generated by the NER machine learning model.
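The fuzzy search step mentioned in this abstract, which tolerates non-exact matches against column names, can be approximated with Python's standard `difflib` module. This sketch covers only that lookup, not the NER model or its type embeddings; the function name `fuzzy_column_matches`, the 0.6 cutoff, and the sample column names are assumptions.

```python
import difflib
from typing import List


def fuzzy_column_matches(span: str, column_names: List[str], cutoff: float = 0.6) -> List[str]:
    """Return column names that approximately match a span of query words."""
    # Compare case-insensitively, but report the original column spelling.
    lowered = {name.lower(): name for name in column_names}
    hits = difflib.get_close_matches(span.lower(), list(lowered), n=3, cutoff=cutoff)
    return [lowered[h] for h in hits]


if __name__ == "__main__":
    columns = ["Revenue", "Region", "Year"]  # hypothetical schema
    print(fuzzy_column_matches("revenu", columns))
```

A misspelled or truncated span such as `revenu` still surfaces the `Revenue` column as a candidate, which a downstream model could then use as an additional signal.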
-
Publication number: US12265528B1
Publication date: 2025-04-01
Application number: US18187553
Application date: 2023-03-21
Applicant: Amazon Technologies, Inc.
Inventor: Wuwei Lan , Patrick Ng , Zhiguo Wang , Ramesh M. Nallapati , Henghui Zhu , Anuj Chauhan , Sudipta Sengupta , Stephen Michael Ash , Bing Xiang , Gregory David Adams
IPC: G06F16/00 , G06F16/22 , G06F16/242 , G06F16/2457 , G06F16/248 , G06F16/25 , G06N3/0455 , G06N3/0499
Abstract: Techniques for handling natural language query processing are described. In some examples, a sequence-to-sequence model is used to handle a natural language query. Post-processing of a result of the sequence-to-sequence model utilizes fine-grained information from an entity linker. In some examples, the sequence-to-sequence model and aspects of a natural language query pipeline are used to handle a natural language query.
-
Publication number: US20230325384A1
Publication date: 2023-10-12
Application number: US18182303
Application date: 2023-03-10
Applicant: Amazon Technologies, Inc.
Inventor: Ramesh M Nallapati , Zhiguo Wang , Bing Xiang , Patrick Ng , Yung Haw Wang , Mukul Karnik , Nanyan Li , Sharanabasappa Parashuram Revadigar , Timothy Jones , Stephen Michael Ash , Sudipta Sengupta , Gregory David Adams , Deepak Shantha Murthy , Douglas Scott Cerny , Stephanie Weeks , Hanbo Li
IPC: G06F16/2452 , G06F16/242 , G06F40/295 , G06N20/00
CPC classification number: G06F16/24522 , G06F16/243 , G06F16/2423 , G06F40/295 , G06N20/00
Abstract: Interactive assistance for executing natural language queries to data sets may be performed. A natural language query may be received. Candidate entity linkages may be determined between an entity recognized in the natural language query and columns in data sets. The candidate linkages may be ranked according to confidence scores which may be evaluated to detect ambiguity for an entity linkage. Candidate entity linkages may be provided to a user via an interface to select an entity linkage to use as part of completing the natural language query.
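The ranking and ambiguity check in this abstract can be sketched as follows. The abstract does not say how ambiguity is detected, so the rule used here (the top two confidence scores lie within a fixed margin) is an assumption, as are the `Linkage` type, the function names, and the 0.1 margin.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Linkage:
    """A candidate link between a recognized entity and a data set column."""
    entity: str
    column: str
    confidence: float


def rank_linkages(candidates: List[Linkage]) -> List[Linkage]:
    """Order candidates from most to least confident."""
    return sorted(candidates, key=lambda c: c.confidence, reverse=True)


def is_ambiguous(ranked: List[Linkage], margin: float = 0.1) -> bool:
    """Assumed rule: ambiguous when the two best scores are within `margin`."""
    return len(ranked) >= 2 and (ranked[0].confidence - ranked[1].confidence) < margin


if __name__ == "__main__":
    ranked = rank_linkages([
        Linkage("sales", "orders.total_sales", 0.58),
        Linkage("sales", "stores.sales_region", 0.62),
    ])
    if is_ambiguous(ranked):
        # An interface would present these options for the user to choose from.
        for option in ranked:
            print(option.column, option.confidence)
```

When the check reports ambiguity, the ranked candidates are what an interface would offer the user, so that the chosen linkage can be used to complete the query.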
-
Publication number: US11314819B2
Publication date: 2022-04-26
Application number: US16697964
Application date: 2019-11-27
Applicant: Amazon Technologies, Inc.
Inventor: Jared Lee Katzman , Nithin Kunala , Bing Xiang , Krishnakumar Rajagopalan , Andrew M. Grant
IPC: G06F16/93 , G06F9/54 , G06F16/31 , G06F16/951
Abstract: Techniques for intaking one or more documents are described. An exemplary method includes receiving an ingestion request to ingest a document; extracting text from the document; pre-processing the extracted text to generate pre-processed text that is predictable and analyzable; generating an index entry for the extracted text, the index entry to map the extracted text to a reserved field of a plurality of reserved fields; and storing the extracted text, index entry, and pre-processed text in at least one data storage location.
-