Patent search ap:("Amazon Technologies Page Inc.") AND inv:"Xing Fan"

11.

发明申请
SPOKEN LANGUAGE UNDERSTANDING SYSTEM 有权

公开(公告)号：US20230047811A1

公开(公告)日：2023-02-16

申请号：US17856090

申请日：2022-07-01

Applicant: Amazon Technologies, Inc.

Inventor： Chenlei Guo , Xing Fan , Chengyuan Ma , Shuting Tang , Kai Wei

IPC: G10L15/06 , G10L15/18 , G10L15/22 , G06N3/08 , G10L15/16

Abstract: A system is provided for a self-learning policy engine that can be used by various spoken language understanding (SLU) processing components. The system also provides for sharing contextual information from processing performed by an upstream SLU component to a downstream SLU component to facilitate decision making by the downstream SLU component. The system also provides for a SLU component to select from a variety of actions to take. A SLU component may implement an instance of the self-learning policy that is specifically configured for the particular SLU component.

12.

发明授权
Proactive command framework 有权

公开(公告)号：US11270698B2

公开(公告)日：2022-03-08

申请号：US16550639

申请日：2019-08-26

Applicant: Amazon Technologies, Inc.

Inventor： Anjishnu Kumar , Xing Fan , Arpit Gupta , Ruhi Sarikaya

IPC: G10L15/22 , G10L13/00 , G10L15/14 , G10L15/18 , G06N5/02 , G06F40/30 , G10L17/00 , G06F40/295

Abstract: Techniques for determining a command or intent likely to be subsequently invoked by a user of a system are described. A user inputs a command (either via a spoken utterance or textual input) to a system. The system determines content responsive to the command. The system also determines a second command or corresponding intent likely to be invoked by the user subsequent to the previous command. Such determination may involve analyzing pairs of intents, with each pair being associated with a probability that one intent of the pair will be invoked by a user subsequent to a second intent of the pair. The system then outputs first content responsive to the first command and second content soliciting the user as to whether the system to execute the second command.

13.

发明授权
Disambiguation in automatic speech processing 有权

公开(公告)号：US11211058B1

公开(公告)日：2021-12-28

申请号：US16577394

申请日：2019-09-20

Applicant: Amazon Technologies, Inc.

Inventor： Aaron Eakin , Angela Sun , Ankur Gandhe , Ariya Rastrow , Chenlei Guo , Xing Fan

IPC: G10L15/197 , G10L15/30 , G10L15/22

Abstract: Described herein is a system for prompting a user for clarification when an automatic speech recognition (ASR) system encounters ambiguity with respect to the user's input. The feedback provided by the user is used to retrain machine-learning models and/or to generate new machine-learning models. Based on the type of ambiguity, the system may determine to retrain one or more ASR models that are widely used by the system or to generate/update one or more user-specific models that are used to process inputs from one or more particular users.

14.

发明公开
PROACTIVE COMMAND FRAMEWORK 审中-公开

公开(公告)号：US20240153505A1

公开(公告)日：2024-05-09

申请号：US18490029

申请日：2023-10-19

Applicant: Amazon Technologies, Inc.

Inventor： Anjishnu Kumar , Xing Fan , Arpit Gupta , Ruhi Sarikaya

IPC: G10L15/22 , G06F40/30 , G06N5/022 , G10L13/00 , G10L15/14 , G10L15/18 , G10L17/00

CPC classification number: G10L15/22 , G06F40/30 , G06N5/022 , G10L13/00 , G10L15/14 , G10L15/1815 , G10L17/00 , G06F40/295 , G10L2015/223

Abstract: Techniques for determining a command or intent likely to be subsequently invoked by a user of a system are described. A user inputs a command (either via a spoken utterance or textual input) to a system. The system determines content responsive to the command. The system also determines a second command or corresponding intent likely to be invoked by the user subsequent to the previous command. Such determination may involve analyzing pairs of intents, with each pair being associated with a probability that one intent of the pair will be invoked by a user subsequent to a second intent of the pair. The system then outputs first content responsive to the first command and second content soliciting the user as to whether the system to execute the second command.

15.

发明授权
Interaction data and processing natural language inputs 有权

公开(公告)号：US11837229B1

公开(公告)日：2023-12-05

申请号：US17363387

申请日：2021-06-30

Applicant: Amazon Technologies, Inc.

Inventor： Xing Fan , Saurabh Gupta , Chenlei Guo , Eunah Cho

IPC: G06F16/33 , G10L15/22 , G06N5/02 , G10L15/14 , G06F16/36

CPC classification number: G10L15/22 , G06N5/02 , G10L15/144 , G06F16/3338 , G06F16/367 , G10L2015/223

Abstract: Techniques for determining and using interaction affinity data are described. Interaction affinity data may indicate a latent affinity between information corresponding to an interaction, such as, intents, entities, device type from which a user input is received, domain, etc. A system may use the interaction affinity data to determine an alternative input representation for a spoken input to cause output of a desired response to the spoken input. The system may also use the interaction affinity data to recommend an action to a user.

16.

发明申请
ALTERNATE NATURAL LANGUAGE INPUT GENERATION 有权

公开(公告)号：US20230110205A1

公开(公告)日：2023-04-13

申请号：US17901209

申请日：2022-09-01

Applicant: Amazon Technologies, Inc.

Inventor： Chenlei Guo , Xing Fan , Jin Hock Ong , Kai Wei

IPC: G10L15/197 , G10L15/22 , G10L15/18

Abstract: Techniques for handling errors during processing of natural language inputs are described. A system may process a natural language input to generate an ASR hypothesis or NLU hypothesis. The system may use more than one data searching technique (e.g., deep neural network searching, convolutional neural network searching, etc.) to generate an alternate ASR hypothesis or NLU hypothesis, depending on the type of hypothesis input for alternate hypothesis processing.

17.

发明申请
NATURAL LANGUAGE UNDERSTANDING 有权

公开(公告)号：US20230089285A1

公开(公告)日：2023-03-23

申请号：US17853013

申请日：2022-06-29

Applicant: Amazon Technologies, Inc.

Inventor： Xing Fan , Zheng Chen , Yuan Ling , Lambert Leo Mathias , Chenlei Guo

IPC: G10L15/197 , G10L15/22 , G10L15/30 , G10L15/18

Abstract: A system is provided for reducing friction during user interactions with a natural language processing system, such as voice assistant systems. The system determines a pre-trained model using dialog session data corresponding to multiple user profiles. The system determines a fine-tuned model using the pre-trained model and a fine-tuning dataset that corresponds to a particular task, such as query rewriting. The system uses the fine-tuned model to process a user input and determine an alternative representation of the input that can result in a desired response from the natural language processing system.

18.

发明授权
Natural language understanding 有权

公开(公告)号：US11386890B1

公开(公告)日：2022-07-12

申请号：US16788085

申请日：2020-02-11

Applicant: Amazon Technologies, Inc.

Inventor： Xing Fan , Zheng Chen , Yuan Ling , Lambert Leo Mathias , Chenlei Guo

IPC: G10L15/197 , G10L15/22 , G10L15/30 , G10L15/18

Abstract: A system is provided for reducing friction during user interactions with a natural language processing system, such as voice assistant systems. The system determines a pre-trained model using dialog session data corresponding to multiple user profiles. The system determines a fine-tuned model using the pre-trained model and a fine-tuning dataset that corresponds to a particular task, such as query rewriting. The system uses the fine-tuned model to process a user input and determine an alternative representation of the input that can result in a desired response from the natural language processing system.

19.

发明授权
Speech detection and speech recognition 有权

公开(公告)号：US10923111B1

公开(公告)日：2021-02-16

申请号：US16368120

申请日：2019-03-28

Applicant: Amazon Technologies, Inc.

Inventor： Xing Fan , I-Fan Chen , Yuzong Liu , Bjorn Hoffmeister , Yiming Wang , Tongfei Chen

IPC: G10L15/02 , G10L15/16 , G10L15/26 , G10L15/10 , G10L17/00 , G10L15/08

Abstract: A system configured to recognize text represented by speech may determine that a first portion of audio data corresponds to speech from a first speaker and that a second portion of audio data corresponds to speech from the first speaker and a second speaker. Features of the first portion are compared to features of the second portion to determine a similarity therebetween. Based on this similarity, speech from the first speaker is distinguished from speech from the second speaker and text corresponding to speech from the first speaker is determined.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification