-
公开(公告)号:US11842738B1
公开(公告)日:2023-12-12
申请号:US17208069
申请日:2021-03-22
Applicant: Amazon Technologies, Inc.
Inventor: Wenbo Yan , Ruiqi Luo , Prathap Ramachandra , Jingqian Zhao , Kyung Jae Lee , Liu Yang
IPC: G10L15/00 , G10L15/26 , G06N20/00 , G06F40/279 , G10L25/27
CPC classification number: G10L15/26 , G06F40/279 , G06N20/00 , G10L25/27
Abstract: Techniques are described and relate to providing computing services using embeddings of a transformer-based encoder. In an example, a computer system generates, by using a machine learning (ML) transformer, an embedding vector based at least in part on text. The computer system stores the embedding vector and an association between the embedding vector and the text in a data store. Further, the computer system determines that a task is to be performed based at least in part on natural language understanding (NLU) of the text. The computer system receives the embedding vector from the data store based at least in part on the association between the embedding vector and the text. The task is performed based at least in part on the embedding vector after being received from the data store.
-
公开(公告)号:US11657807B2
公开(公告)日:2023-05-23
申请号:US17357338
申请日:2021-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Ponnu Jacob , Jingqian Zhao , Prathap Ramachandra , Uday Kumar Kollu , Lior Maor Maimon , Sean Gunnar Skaar
IPC: G10L15/22 , G10L15/18 , G10L15/183 , G10L15/32
CPC classification number: G10L15/1815 , G10L15/183 , G10L15/22 , G10L15/32 , G10L2015/223
Abstract: A multi-tier architecture is provided for processing user voice queries and making routing decisions for generating responses, including responses to book browsing requests and other content requests. When an utterance is associated with multiple applications in a given domain, the applications may be organized into a subdomain and a tier of routing decisions may be added to the inter-domain and intra-domain routing decision system. The system uses contextual signals to make subdomain routing decisions, including signals regarding content items that are already in a user's content catalog, consumption status of individual content items in the user's catalog, and the like.
-
公开(公告)号:US20220415310A1
公开(公告)日:2022-12-29
申请号:US17304714
申请日:2021-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Ponnu Jacob , Uday Kumar Kollu , Jingqian Zhao , Prathap Ramachandra , Adam Kalman , Ruiqi Luo , Krupal Maddipati , Charlotte Alizerine Dzialo , Wenbo Yan , Liu Yang , Mohammad Alnuaimat , Meng Xie , Nalledath P Vinodkrishnan , Adriano Devillaine
IPC: G10L15/18 , H04L29/08 , G10L15/22 , G06F16/2457
Abstract: A speech processing system uses contextual data to determine the specific domains, subdomains, and applications appropriate for taking action in response to spoken commands and other utterances. The system can use signals and other contextual data associated with an utterance, such as location signals, content catalog data, data regarding historical usage patterns, data regarding content visually presented on a display screen of a computing device when an utterance was made, other data, or some combination thereof.
-
公开(公告)号:US11830497B2
公开(公告)日:2023-11-28
申请号:US17357025
申请日:2021-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Ponnu Jacob , Jingqian Zhao , Prathap Ramachandra , Krupal Maddipati , Jinning Wu , Charlotte Alizerine Dzialo , Daksh Gautam , Wenbo Yan , Liu Yang , Uday Kumar Kollu
Abstract: A multi-tier domain is provided for processing user voice queries and making routing decisions for generating responses, including for user voice queries that include multi-domain trigger words or phrases. When an utterance is recognized as different intents in different domains, a routing system for a domain may consider contextual signals, including those associated with other domains, to determine whether the domain is the proper one to handle the request. This determination can be performed with a statistical model specifically trained to make such determinations using the available contextual data.
-
公开(公告)号:US20220415326A1
公开(公告)日:2022-12-29
申请号:US17357025
申请日:2021-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Ponnu Jacob , Jingqian Zhao , Prathap Ramachandra , Krupal Maddipati , Jinning Wu , Charlotte Alizerine Dzialo , Daksh Gautam , Wenbo Yan , Liu Yang , Uday Kumar Kollu
Abstract: A multi-tier domain is provided for processing user voice queries and making routing decisions for generating responses, including for user voice queries that include multi-domain trigger words or phrases. When an utterance is recognized as different intents in different domains, a routing system for a domain may consider contextual signals, including those associated with other domains, to determine whether the domain is the proper one to handle the request. This determination can be performed with a statistical model specifically trained to make such determinations using the available contextual data.
-
公开(公告)号:US11302312B1
公开(公告)日:2022-04-12
申请号:US16585988
申请日:2019-09-27
Applicant: Amazon Technologies, Inc.
Inventor: Ajay Soni , Xi Chen , Jingqian Zhao , Liu Yang , Prathap Ramachandra , Ruiqi Luo
Abstract: A new model is introduced into a particular domain that receives a routing of a dialog from a speech processing component. A method associated with the model includes running a set of test utterances through the speech processing component that enables a spoken language dialog with a user to establish a base line score associated with processing for the set of test utterances. The speech processing component determines an intent of the user and routes the spoken language dialog to a network-based domain based on the intent. The method includes establishing an automatic test run of the set of test utterances to obtain a current score and, when a threshold associated with a difference between the current score and the base line score is breached, switching, at the network-based domain, from the false accept detection model to a second model.
-
公开(公告)号:US11657805B2
公开(公告)日:2023-05-23
申请号:US17304714
申请日:2021-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Ponnu Jacob , Uday Kumar Kollu , Jingqian Zhao , Prathap Ramachandra , Adam Kalman , Ruiqi Luo , Krupal Maddipati , Charlotte Alizerine Dzialo , Wenbo Yan , Liu Yang , Mohammad Alnuaimat , Meng Xie , Nalledath P Vinodkrishnan , Adriano Devillaine
IPC: G10L15/18 , H04L67/306 , G10L15/22 , G06F16/2457 , H04L67/10 , H04L67/63 , G10L15/30
CPC classification number: G10L15/1815 , G06F16/24578 , G10L15/22 , H04L67/10 , H04L67/306 , H04L67/63 , G10L15/30 , G10L2015/223
Abstract: A speech processing system uses contextual data to determine the specific domains, subdomains, and applications appropriate for taking action in response to spoken commands and other utterances. The system can use signals and other contextual data associated with an utterance, such as location signals, content catalog data, data regarding historical usage patterns, data regarding content visually presented on a display screen of a computing device when an utterance was made, other data, or some combination thereof.
-
公开(公告)号:US11604925B1
公开(公告)日:2023-03-14
申请号:US16881520
申请日:2020-05-22
Applicant: Amazon Technologies, Inc.
Inventor: Kyung Jae Lee , Charlotte Alizerine Dzialo , Lan Ma , Liu Yang , Yi Qin , Prathap Ramachandra , Wenbo Yan , Darshan Ashok Fofadiya
IPC: G06F17/00 , G06F40/295 , G10L15/16 , G06N3/04 , G10L15/197 , G06F40/30 , G06N3/08 , G06F17/18
Abstract: Features are disclosed for training and using named entity recognition models based on gazetteer information. A named entity recognition model can be trained with a gazetteer output at a layer of the model to provide deterministic data in the probabilistic model. The named entity recognition model can recognize named entities based on the word embedding and the gazetteer output. The named entity recognition model can tune the gazetteer output to include false positive name entities such that the gazetteer output is not deterministic of the output of the model. In some embodiments, the named entity recognition model can be tuned so as to adjust the gazetteer output.
-
公开(公告)号:US20220415309A1
公开(公告)日:2022-12-29
申请号:US17304712
申请日:2021-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Ponnu Jacob , Jingqian Zhao , Prathap Ramachandra , Jinning Wu , Uday Kumar Kollu , Xi Chen , Wenbo Yan , Charlotte Alizerine Dzialo , Liu Yang
IPC: G10L15/18 , G06F16/687 , G10L15/22
Abstract: A speech processing system uses contextual data to determine the specific domains, subdomains, and applications appropriate for taking action in response to spoken commands and other utterances. Some applications may be given priority over others such that some applications are general request applications to which responsibility for processing an intent is to be assigned as long as contextual criteria are satisfied, while other applications are specific request applications to which responsibility for processing an intent is to be assigned only if the applications are specifically requested, if the contextual criteria of priority applications are not satisfied, and/or if certain contextual criteria associated with the specific request applications are satisfied.
-
公开(公告)号:US11823671B1
公开(公告)日:2023-11-21
申请号:US16872047
申请日:2020-05-11
Applicant: Amazon Technologies, Inc.
Inventor: Prathap Ramachandra , Lan Ma , Liu Yang , Yi Qin , Kyung Jae Lee , Wenbo Yan , Charlotte Alizerine Dzialo , Darshan Ashok Fofadiya
IPC: G10L15/22 , G06F40/279
CPC classification number: G10L15/22 , G06F40/279
Abstract: Features are disclosed for training and using a word embedding model configured to receive textual and context data associated with an utterance of a user. A word embedding model can be trained with text data and context data to account for context associated with the text data. The word embedding model can receive an input vector including text data and one or more sets of context data associated with the text data and perform word embedding based on the input vector. In some embodiments, the input vector can include an automatic speech recognition (“ASR”) confidence score generated by an ASR model and one or more labels generated by an NLU model. In some embodiments, the input vector can include user information associated with the user.
-
-
-
-
-
-
-
-
-