-
公开(公告)号:US11657807B2
公开(公告)日:2023-05-23
申请号:US17357338
申请日:2021-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Ponnu Jacob , Jingqian Zhao , Prathap Ramachandra , Uday Kumar Kollu , Lior Maor Maimon , Sean Gunnar Skaar
IPC: G10L15/22 , G10L15/18 , G10L15/183 , G10L15/32
CPC classification number: G10L15/1815 , G10L15/183 , G10L15/22 , G10L15/32 , G10L2015/223
Abstract: A multi-tier architecture is provided for processing user voice queries and making routing decisions for generating responses, including responses to book browsing requests and other content requests. When an utterance is associated with multiple applications in a given domain, the applications may be organized into a subdomain and a tier of routing decisions may be added to the inter-domain and intra-domain routing decision system. The system uses contextual signals to make subdomain routing decisions, including signals regarding content items that are already in a user's content catalog, consumption status of individual content items in the user's catalog, and the like.
-
公开(公告)号:US20220415310A1
公开(公告)日:2022-12-29
申请号:US17304714
申请日:2021-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Ponnu Jacob , Uday Kumar Kollu , Jingqian Zhao , Prathap Ramachandra , Adam Kalman , Ruiqi Luo , Krupal Maddipati , Charlotte Alizerine Dzialo , Wenbo Yan , Liu Yang , Mohammad Alnuaimat , Meng Xie , Nalledath P Vinodkrishnan , Adriano Devillaine
IPC: G10L15/18 , H04L29/08 , G10L15/22 , G06F16/2457
Abstract: A speech processing system uses contextual data to determine the specific domains, subdomains, and applications appropriate for taking action in response to spoken commands and other utterances. The system can use signals and other contextual data associated with an utterance, such as location signals, content catalog data, data regarding historical usage patterns, data regarding content visually presented on a display screen of a computing device when an utterance was made, other data, or some combination thereof.
-
公开(公告)号:US11842738B1
公开(公告)日:2023-12-12
申请号:US17208069
申请日:2021-03-22
Applicant: Amazon Technologies, Inc.
Inventor: Wenbo Yan , Ruiqi Luo , Prathap Ramachandra , Jingqian Zhao , Kyung Jae Lee , Liu Yang
IPC: G10L15/00 , G10L15/26 , G06N20/00 , G06F40/279 , G10L25/27
CPC classification number: G10L15/26 , G06F40/279 , G06N20/00 , G10L25/27
Abstract: Techniques are described and relate to providing computing services using embeddings of a transformer-based encoder. In an example, a computer system generates, by using a machine learning (ML) transformer, an embedding vector based at least in part on text. The computer system stores the embedding vector and an association between the embedding vector and the text in a data store. Further, the computer system determines that a task is to be performed based at least in part on natural language understanding (NLU) of the text. The computer system receives the embedding vector from the data store based at least in part on the association between the embedding vector and the text. The task is performed based at least in part on the embedding vector after being received from the data store.
-
公开(公告)号:US11830497B2
公开(公告)日:2023-11-28
申请号:US17357025
申请日:2021-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Ponnu Jacob , Jingqian Zhao , Prathap Ramachandra , Krupal Maddipati , Jinning Wu , Charlotte Alizerine Dzialo , Daksh Gautam , Wenbo Yan , Liu Yang , Uday Kumar Kollu
Abstract: A multi-tier domain is provided for processing user voice queries and making routing decisions for generating responses, including for user voice queries that include multi-domain trigger words or phrases. When an utterance is recognized as different intents in different domains, a routing system for a domain may consider contextual signals, including those associated with other domains, to determine whether the domain is the proper one to handle the request. This determination can be performed with a statistical model specifically trained to make such determinations using the available contextual data.
-
公开(公告)号:US20220415326A1
公开(公告)日:2022-12-29
申请号:US17357025
申请日:2021-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Ponnu Jacob , Jingqian Zhao , Prathap Ramachandra , Krupal Maddipati , Jinning Wu , Charlotte Alizerine Dzialo , Daksh Gautam , Wenbo Yan , Liu Yang , Uday Kumar Kollu
Abstract: A multi-tier domain is provided for processing user voice queries and making routing decisions for generating responses, including for user voice queries that include multi-domain trigger words or phrases. When an utterance is recognized as different intents in different domains, a routing system for a domain may consider contextual signals, including those associated with other domains, to determine whether the domain is the proper one to handle the request. This determination can be performed with a statistical model specifically trained to make such determinations using the available contextual data.
-
公开(公告)号:US11302312B1
公开(公告)日:2022-04-12
申请号:US16585988
申请日:2019-09-27
Applicant: Amazon Technologies, Inc.
Inventor: Ajay Soni , Xi Chen , Jingqian Zhao , Liu Yang , Prathap Ramachandra , Ruiqi Luo
Abstract: A new model is introduced into a particular domain that receives a routing of a dialog from a speech processing component. A method associated with the model includes running a set of test utterances through the speech processing component that enables a spoken language dialog with a user to establish a base line score associated with processing for the set of test utterances. The speech processing component determines an intent of the user and routes the spoken language dialog to a network-based domain based on the intent. The method includes establishing an automatic test run of the set of test utterances to obtain a current score and, when a threshold associated with a difference between the current score and the base line score is breached, switching, at the network-based domain, from the false accept detection model to a second model.
-
公开(公告)号:US11705113B2
公开(公告)日:2023-07-18
申请号:US17304712
申请日:2021-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Ponnu Jacob , Jingqian Zhao , Prathap Ramachandra , Jinning Wu , Uday Kumar Kollu , Xi Chen , Wenbo Yan , Charlotte Alizerine Dzialo , Liu Yang
IPC: G10L15/183 , G10L15/18 , G10L15/22 , G06F16/687
CPC classification number: G10L15/1815 , G06F16/687 , G10L15/22 , G10L2015/223
Abstract: A speech processing system uses contextual data to determine the specific domains, subdomains, and applications appropriate for taking action in response to spoken commands and other utterances. Some applications may be given priority over others such that some applications are general request applications to which responsibility for processing an intent is to be assigned as long as contextual criteria are satisfied, while other applications are specific request applications to which responsibility for processing an intent is to be assigned only if the applications are specifically requested, if the contextual criteria of priority applications are not satisfied, and/or if certain contextual criteria associated with the specific request applications are satisfied.
-
公开(公告)号:US20220415312A1
公开(公告)日:2022-12-29
申请号:US17357338
申请日:2021-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Ponnu Jacob , Jingqian Zhao , Prathap Ramachandra , Uday Kumar Kollu , Lior Maor Maimon , Sean Gunnar Skaar
IPC: G10L15/18 , G10L15/183 , G10L15/22 , G10L15/32
Abstract: A multi-tier architecture is provided for processing user voice queries and making routing decisions for generating responses, including responses to book browsing requests and other content requests. When an utterance is associated with multiple applications in a given domain, the applications may be organized into a subdomain and a tier of routing decisions may be added to the inter-domain and intra-domain routing decision system. The system uses contextual signals to make subdomain routing decisions, including signals regarding content items that are already in a user's content catalog, consumption status of individual content items in the user's catalog, and the like
-
公开(公告)号:US11222630B1
公开(公告)日:2022-01-11
申请号:US16576115
申请日:2019-09-19
Applicant: Amazon Technologies, Inc.
Inventor: Ajay Soni , Jingqian Zhao , Ruiqi Luo , Adam Kalman , Prathap Ramachandra , Liu Yang , Simone Filice , Ponnu Jacob , Amitpal Singh Bhutani
IPC: G10L15/22 , G10L15/187 , G10L15/16
Abstract: A new model is introduced into a particular domain that receives a routing of a dialog from a speech processing component. The speech processing component is engaged in the dialog with a user and the speech processing component routes the dialog to the particular network-based domain according to a determination by the speech processing component that the user has an intent to perform a task handled by the domain. The model detects, at the domain, whether the user has the proper intent associated with the domain by using the user utterance in its entirety to yield a detection result. When the user does not have the proper intent based on the detection result, the domain drops the user utterance.
-
公开(公告)号:US12211493B2
公开(公告)日:2025-01-28
申请号:US17304720
申请日:2021-06-24
Applicant: Amazon Technologies, Inc.
Inventor: Ponnu Jacob , Adam Kalman , Uday Kumar Kollu , Ruiqi Luo , Xi Chen , Jingqian Zhao , Yunqiang Zhu Zhu , Adriano Devillaine
IPC: G10L15/22 , G06F40/30 , G10L15/18 , G10L15/183 , G06F40/35
Abstract: A speech processing system uses contextual data to determine the specific domains, subdomains, and applications appropriate for taking action in response to spoken commands and other utterances. The system can use signals and other contextual data associated with an utterance, such as location signals, content catalog data, data regarding historical usage patterns, data regarding content visually presented on a display screen of a computing device when an utterance was made, other data, or some combination thereof.
-
-
-
-
-
-
-
-
-