Abstract:
Features are disclosed for processing a user utterance with respect to multiple subject matters or domains, and for selecting a likely result from a particular domain with which to respond to the utterance or otherwise take action. A user utterance may be transcribed by an automatic speech recognition (“ASR”) module, and the results may be provided to a multi-domain natural language understanding (“NLU”) engine. The multi-domain NLU engine may process the transcription(s) in multiple individual domains rather than in a single domain. In some cases, the transcription(s) may be processed in multiple individual domains in parallel or substantially simultaneously. In addition, hints may be generated based on previous user interactions and other data. The ASR module, multi-domain NLU engine, and other components of a spoken language processing system may use the hints to more efficiently process input or more accurately generate output.
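To make the parallel multi-domain flow concrete, the following Python sketch runs several stand-in domain interpreters on a transcription concurrently and applies hint-based score boosts before selecting the best result. The interpreters, score values, and hint format are illustrative assumptions, not the disclosed implementation.

```python
# Hypothetical sketch of parallel multi-domain NLU with hint-based score
# adjustment; the domain interpreters and scores are invented stand-ins.
from concurrent.futures import ThreadPoolExecutor

def interpret_music(text):
    # Stand-in for a real music-domain NLU model.
    if "play" in text:
        return {"domain": "music", "intent": "PlayTrack", "score": 0.72}
    return {"domain": "music", "intent": None, "score": 0.05}

def interpret_weather(text):
    if "weather" in text or "rain" in text:
        return {"domain": "weather", "intent": "GetForecast", "score": 0.81}
    return {"domain": "weather", "intent": None, "score": 0.04}

def interpret_shopping(text):
    if "buy" in text or "order" in text:
        return {"domain": "shopping", "intent": "AddToCart", "score": 0.77}
    return {"domain": "shopping", "intent": None, "score": 0.03}

DOMAIN_INTERPRETERS = [interpret_music, interpret_weather, interpret_shopping]

def multi_domain_nlu(transcription, hints=None):
    """Run every domain interpreter on the transcription in parallel,
    boost domains suggested by hints, and return the best result."""
    hints = hints or {}
    with ThreadPoolExecutor() as pool:
        results = list(pool.map(lambda f: f(transcription), DOMAIN_INTERPRETERS))
    for result in results:
        # A hint from a previous interaction (e.g. the user was just
        # browsing music) nudges that domain's score upward.
        result["score"] += hints.get(result["domain"], 0.0)
    return max(results, key=lambda r: r["score"])

print(multi_domain_nlu("play some jazz", hints={"music": 0.1}))
```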
Abstract:
Features are disclosed for processing and interpreting natural language, such as user utterances, in multi-turn dialog interactions. Context information regarding interpretations of user utterances and system responses to those utterances can be maintained. Subsequent user utterances can be interpreted using the context information rather than in isolation. In some cases, interpretations of subsequent user utterances can be merged with interpretations of prior user utterances using a rule-based framework. Rules may be defined to determine which interpretations may be merged and under what circumstances.
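A minimal sketch of the rule-based merging idea, assuming a simple interpretation shape with a domain, an intent, and slots; the single merge rule shown (same domain, new turn supplies slots but no new intent) is one invented example of the kind of rule such a framework could define.

```python
# Illustrative sketch of rule-based interpretation merging; the
# Interpretation shape and the merge rule are assumptions chosen for
# clarity, not the framework's actual rules.
from dataclasses import dataclass, field
from typing import Optional

@dataclass
class Interpretation:
    domain: str
    intent: Optional[str]
    slots: dict = field(default_factory=dict)

def can_merge(previous: Interpretation, current: Interpretation) -> bool:
    # Example rule: merge only within the same domain, and only when the
    # new turn refines slots rather than starting a different intent.
    return (previous.domain == current.domain
            and (current.intent is None or current.intent == previous.intent))

def merge(previous: Interpretation, current: Interpretation) -> Interpretation:
    # Carry the prior intent forward and overlay the new slot values.
    merged_slots = {**previous.slots, **current.slots}
    return Interpretation(previous.domain,
                          current.intent or previous.intent,
                          merged_slots)

# Turn 1: "what's the weather in Seattle"
turn1 = Interpretation("weather", "GetForecast", {"city": "Seattle"})
# Turn 2: "how about tomorrow" -- no intent of its own, just a new slot
turn2 = Interpretation("weather", None, {"date": "tomorrow"})

if can_merge(turn1, turn2):
    print(merge(turn1, turn2))
    # -> intent 'GetForecast' carried forward, slots for city and date merged
```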
Abstract:
A system capable of resolving anaphora using timing data received from a local device. The local device outputs audio representing a list of entries; the audio may be synthesized speech of the list. A user can interrupt the device to select an entry in the list, such as by saying “that one.” The local device can determine an offset time representing the time between when audio playback began and when the user interrupted. The local device sends the offset time and audio data representing the utterance to a speech processing system, which can then use the offset time and stored data to identify which entry in the list was most recently output by the local device when the user interrupted. The system can then resolve the anaphora to match that entry and can perform additional processing based on the referred-to item.
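The offset lookup itself can be illustrated with a short sketch: given stored start offsets for each entry in the synthesized list audio, a binary search over those offsets finds the entry that was playing when the user interrupted. The entry names and timings here are invented for the example.

```python
# Minimal sketch of resolving "that one" from a playback offset, assuming
# the system stored the start offset of each list entry when the audio
# was synthesized.
import bisect

# Start time (in seconds from playback start) of each entry in the TTS audio.
entry_offsets = [0.0, 2.4, 5.1, 7.9]
entries = ["Thai Palace", "Pizza Loft", "Sushi Go", "Taco Stand"]

def resolve_anaphora(interrupt_offset):
    """Return the entry most recently read out when the user interrupted.

    bisect_right finds how many entries had started playing by the
    interrupt time; the last of those is the referent.
    """
    index = bisect.bisect_right(entry_offsets, interrupt_offset) - 1
    return entries[max(index, 0)]

# User said "that one" 5.8 s after playback began, i.e. while the third
# entry (starting at 5.1 s) was being read.
print(resolve_anaphora(5.8))  # -> Sushi Go
```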
Abstract:
Features are disclosed for determining a definition or value of a nonstandard term. A user utterance may be processed into one or more candidate transcriptions, and an interpretation of the utterance can be generated from the transcriptions. If a transcription includes a word, phrase, or term that is not recognized or is used in a nonstandard way, one or more data stores may be queried for the proper value or definition of the term. If a definition or value is not available in the data stores, the user may be prompted to provide one. The user-supplied definition can be saved for future use and may be used as a general definition of the term for other users.
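The lookup-then-prompt flow might look like the following sketch, which consults a per-user store, then a shared store, and finally asks the user and saves the answer; the store contents and prompt wording are assumptions for illustration.

```python
# Hedged sketch of resolving a nonstandard term: check a per-user data
# store, then a shared one, and prompt the user only as a last resort,
# remembering the answer for future use.
user_definitions = {}                           # per-user data store
shared_definitions = {"lol": "laugh out loud"}  # community data store

def resolve_term(term, user_id):
    """Return a value for an unrecognized term, prompting if necessary."""
    personal = user_definitions.get((user_id, term))
    if personal is not None:
        return personal
    shared = shared_definitions.get(term)
    if shared is not None:
        return shared
    # No stored definition: ask the user and remember the answer. A saved
    # definition could later be promoted to the shared store for others.
    definition = input(f"What do you mean by '{term}'? ")
    user_definitions[(user_id, term)] = definition
    return definition

print(resolve_term("lol", user_id="u1"))
print(resolve_term("my jam", user_id="u1"))  # prompts once, then remembers
```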