Patent search ap:("Amazon Technologies Page Inc.") AND inv:"Jingqian Zhao"

1.

Granted invention patent
Multi-tier speech processing and content operations (Active)

Publication number: US11657807B2

Publication date: 2023-05-23

Application number: US17357338

Application date: 2021-06-24

Applicant: Amazon Technologies, Inc.

Inventor: Ponnu Jacob, Jingqian Zhao, Prathap Ramachandra, Uday Kumar Kollu, Lior Maor Maimon, Sean Gunnar Skaar

IPC: G10L15/22, G10L15/18, G10L15/183, G10L15/32

CPC classification number: G10L15/1815, G10L15/183, G10L15/22, G10L15/32, G10L2015/223

Abstract: A multi-tier architecture is provided for processing user voice queries and making routing decisions for generating responses, including responses to book browsing requests and other content requests. When an utterance is associated with multiple applications in a given domain, the applications may be organized into a subdomain and a tier of routing decisions may be added to the inter-domain and intra-domain routing decision system. The system uses contextual signals to make subdomain routing decisions, including signals regarding content items that are already in a user's content catalog, consumption status of individual content items in the user's catalog, and the like.

2.

Invention patent application
DYNAMIC CONTEXT-BASED ROUTING OF SPEECH PROCESSING (Active)

Publication number: US20220415310A1

Publication date: 2022-12-29

Application number: US17304714

Application date: 2021-06-24

Applicant: Amazon Technologies, Inc.

Inventor: Ponnu Jacob, Uday Kumar Kollu, Jingqian Zhao, Prathap Ramachandra, Adam Kalman, Ruiqi Luo, Krupal Maddipati, Charlotte Alizerine Dzialo, Wenbo Yan, Liu Yang, Mohammad Alnuaimat, Meng Xie, Nalledath P Vinodkrishnan, Adriano Devillaine

IPC: G10L15/18, H04L29/08, G10L15/22, G06F16/2457

Abstract: A speech processing system uses contextual data to determine the specific domains, subdomains, and applications appropriate for taking action in response to spoken commands and other utterances. The system can use signals and other contextual data associated with an utterance, such as location signals, content catalog data, data regarding historical usage patterns, data regarding content visually presented on a display screen of a computing device when an utterance was made, other data, or some combination thereof.

3.

Granted invention patent
Computing services using embeddings of a transformer-based encoder (Active)

Publication number: US11842738B1

Publication date: 2023-12-12

Application number: US17208069

Application date: 2021-03-22

Applicant: Amazon Technologies, Inc.

Inventor: Wenbo Yan, Ruiqi Luo, Prathap Ramachandra, Jingqian Zhao, Kyung Jae Lee, Liu Yang

IPC: G10L15/00, G10L15/26, G06N20/00, G06F40/279, G10L25/27

CPC classification number: G10L15/26, G06F40/279, G06N20/00, G10L25/27

Abstract: Techniques are described and relate to providing computing services using embeddings of a transformer-based encoder. In an example, a computer system generates, by using a machine learning (ML) transformer, an embedding vector based at least in part on text. The computer system stores the embedding vector and an association between the embedding vector and the text in a data store. Further, the computer system determines that a task is to be performed based at least in part on natural language understanding (NLU) of the text. The computer system receives the embedding vector from the data store based at least in part on the association between the embedding vector and the text. The task is performed based at least in part on the embedding vector after being received from the data store.

4.

Granted invention patent
Multi-domain intent handling with cross-domain contextual signals (Active)

Publication number: US11830497B2

Publication date: 2023-11-28

Application number: US17357025

Application date: 2021-06-24

Applicant: Amazon Technologies, Inc.

Inventor: Ponnu Jacob, Jingqian Zhao, Prathap Ramachandra, Krupal Maddipati, Jinning Wu, Charlotte Alizerine Dzialo, Daksh Gautam, Wenbo Yan, Liu Yang, Uday Kumar Kollu

IPC: G10L15/26, G10L15/18, G10L15/30, G10L15/22

CPC classification number: G10L15/26, G10L15/18, G10L15/22, G10L15/30

Abstract: A multi-tier domain is provided for processing user voice queries and making routing decisions for generating responses, including for user voice queries that include multi-domain trigger words or phrases. When an utterance is recognized as different intents in different domains, a routing system for a domain may consider contextual signals, including those associated with other domains, to determine whether the domain is the proper one to handle the request. This determination can be performed with a statistical model specifically trained to make such determinations using the available contextual data.

5.

Invention patent application
MULTI-DOMAIN INTENT HANDLING WITH CROSS-DOMAIN CONTEXTUAL SIGNALS (Active)

Publication number: US20220415326A1

Publication date: 2022-12-29

Application number: US17357025

Application date: 2021-06-24

Applicant: Amazon Technologies, Inc.

Inventor: Ponnu Jacob, Jingqian Zhao, Prathap Ramachandra, Krupal Maddipati, Jinning Wu, Charlotte Alizerine Dzialo, Daksh Gautam, Wenbo Yan, Liu Yang, Uday Kumar Kollu

IPC: G10L15/26, G10L15/22, G10L15/30, G10L15/18

Abstract: A multi-tier domain is provided for processing user voice queries and making routing decisions for generating responses, including for user voice queries that include multi-domain trigger words or phrases. When an utterance is recognized as different intents in different domains, a routing system for a domain may consider contextual signals, including those associated with other domains, to determine whether the domain is the proper one to handle the request. This determination can be performed with a statistical model specifically trained to make such determinations using the available contextual data.

6.

Granted invention patent
Spoken language quality automatic regression detector background (Active)

Publication number: US11302312B1

Publication date: 2022-04-12

Application number: US16585988

Application date: 2019-09-27

Applicant: Amazon Technologies, Inc.

Inventor: Ajay Soni, Xi Chen, Jingqian Zhao, Liu Yang, Prathap Ramachandra, Ruiqi Luo

IPC: G10L15/07, G10L15/06, G10L15/18

Abstract: A new model is introduced into a particular domain that receives a routing of a dialog from a speech processing component. A method associated with the model includes running a set of test utterances through the speech processing component that enables a spoken language dialog with a user to establish a base line score associated with processing for the set of test utterances. The speech processing component determines an intent of the user and routes the spoken language dialog to a network-based domain based on the intent. The method includes establishing an automatic test run of the set of test utterances to obtain a current score and, when a threshold associated with a difference between the current score and the base line score is breached, switching, at the network-based domain, from the false accept detection model to a second model.

7.

Granted invention patent
Priority and context-based routing of speech processing (Active)

Publication number: US11705113B2

Publication date: 2023-07-18

Application number: US17304712

Application date: 2021-06-24

Applicant: Amazon Technologies, Inc.

Inventor: Ponnu Jacob, Jingqian Zhao, Prathap Ramachandra, Jinning Wu, Uday Kumar Kollu, Xi Chen, Wenbo Yan, Charlotte Alizerine Dzialo, Liu Yang

IPC: G10L15/183, G10L15/18, G10L15/22, G06F16/687

CPC classification number: G10L15/1815, G06F16/687, G10L15/22, G10L2015/223

Abstract: A speech processing system uses contextual data to determine the specific domains, subdomains, and applications appropriate for taking action in response to spoken commands and other utterances. Some applications may be given priority over others such that some applications are general request applications to which responsibility for processing an intent is to be assigned as long as contextual criteria are satisfied, while other applications are specific request applications to which responsibility for processing an intent is to be assigned only if the applications are specifically requested, if the contextual criteria of priority applications are not satisfied, and/or if certain contextual criteria associated with the specific request applications are satisfied.

8.

Invention patent application
MULTI-TIER SPEECH PROCESSING AND CONTENT OPERATIONS (Active)

Publication number: US20220415312A1

Publication date: 2022-12-29

Application number: US17357338

Application date: 2021-06-24

Applicant: Amazon Technologies, Inc.

Inventor: Ponnu Jacob, Jingqian Zhao, Prathap Ramachandra, Uday Kumar Kollu, Lior Maor Maimon, Sean Gunnar Skaar

IPC: G10L15/18, G10L15/183, G10L15/22, G10L15/32

Abstract: A multi-tier architecture is provided for processing user voice queries and making routing decisions for generating responses, including responses to book browsing requests and other content requests. When an utterance is associated with multiple applications in a given domain, the applications may be organized into a subdomain and a tier of routing decisions may be added to the inter-domain and intra-domain routing decision system. The system uses contextual signals to make subdomain routing decisions, including signals regarding content items that are already in a user's content catalog, consumption status of individual content items in the user's catalog, and the like.

9.

Granted invention patent
Detecting false accepts in a shopping domain for handling a spoken dialog (Active)

Publication number: US11222630B1

Publication date: 2022-01-11

Application number: US16576115

Application date: 2019-09-19

Applicant: Amazon Technologies, Inc.

Inventor: Ajay Soni, Jingqian Zhao, Ruiqi Luo, Adam Kalman, Prathap Ramachandra, Liu Yang, Simone Filice, Ponnu Jacob, Amitpal Singh Bhutani

IPC: G10L15/22, G10L15/187, G10L15/16

Abstract: A new model is introduced into a particular domain that receives a routing of a dialog from a speech processing component. The speech processing component is engaged in the dialog with a user and the speech processing component routes the dialog to the particular network-based domain according to a determination by the speech processing component that the user has an intent to perform a task handled by the domain. The model detects, at the domain, whether the user has the proper intent associated with the domain by using the user utterance in its entirety to yield a detection result. When the user does not have the proper intent based on the detection result, the domain drops the user utterance.

10.

Granted invention patent
Early invocation for contextual data processing (Active)

Publication number: US12211493B2

Publication date: 2025-01-28

Application number: US17304720

Application date: 2021-06-24

Applicant: Amazon Technologies, Inc.

Inventor: Ponnu Jacob, Adam Kalman, Uday Kumar Kollu, Ruiqi Luo, Xi Chen, Jingqian Zhao, Yunqiang Zhu Zhu, Adriano Devillaine

IPC: G10L15/22, G06F40/30, G10L15/18, G10L15/183, G06F40/35

Abstract: A speech processing system uses contextual data to determine the specific domains, subdomains, and applications appropriate for taking action in response to spoken commands and other utterances. The system can use signals and other contextual data associated with an utterance, such as location signals, content catalog data, data regarding historical usage patterns, data regarding content visually presented on a display screen of a computing device when an utterance was made, other data, or some combination thereof.
