-
Publication No.: US20230047811A1
Publication date: 2023-02-16
Application No.: US17856090
Filing date: 2022-07-01
Applicant: Amazon Technologies, Inc.
Inventor: Chenlei Guo, Xing Fan, Chengyuan Ma, Shuting Tang, Kai Wei
Abstract: A system is provided for a self-learning policy engine that can be used by various spoken language understanding (SLU) processing components. The system also provides for sharing contextual information from processing performed by an upstream SLU component to a downstream SLU component to facilitate decision making by the downstream SLU component. The system also provides for an SLU component to select from a variety of actions to take. An SLU component may implement an instance of the self-learning policy that is specifically configured for the particular SLU component.
-
Publication No.: US11270698B2
Publication date: 2022-03-08
Application No.: US16550639
Filing date: 2019-08-26
Applicant: Amazon Technologies, Inc.
Inventor: Anjishnu Kumar, Xing Fan, Arpit Gupta, Ruhi Sarikaya
Abstract: Techniques for determining a command or intent likely to be subsequently invoked by a user of a system are described. A user inputs a command (either via a spoken utterance or textual input) to a system. The system determines content responsive to the command. The system also determines a second command or corresponding intent likely to be invoked by the user subsequent to the previous command. Such determination may involve analyzing pairs of intents, with each pair being associated with a probability that one intent of the pair will be invoked by a user subsequent to a second intent of the pair. The system then outputs first content responsive to the first command and second content soliciting the user as to whether the system is to execute the second command.
-
Publication No.: US11211058B1
Publication date: 2021-12-28
Application No.: US16577394
Filing date: 2019-09-20
Applicant: Amazon Technologies, Inc.
Inventor: Aaron Eakin, Angela Sun, Ankur Gandhe, Ariya Rastrow, Chenlei Guo, Xing Fan
IPC: G10L15/197, G10L15/30, G10L15/22
Abstract: Described herein is a system for prompting a user for clarification when an automatic speech recognition (ASR) system encounters ambiguity with respect to the user's input. The feedback provided by the user is used to retrain machine-learning models and/or to generate new machine-learning models. Based on the type of ambiguity, the system may determine to retrain one or more ASR models that are widely used by the system or to generate/update one or more user-specific models that are used to process inputs from one or more particular users.
-
Publication No.: US20240153505A1
Publication date: 2024-05-09
Application No.: US18490029
Filing date: 2023-10-19
Applicant: Amazon Technologies, Inc.
Inventor: Anjishnu Kumar, Xing Fan, Arpit Gupta, Ruhi Sarikaya
CPC classification number: G10L15/22, G06F40/30, G06N5/022, G10L13/00, G10L15/14, G10L15/1815, G10L17/00, G06F40/295, G10L2015/223
Abstract: Techniques for determining a command or intent likely to be subsequently invoked by a user of a system are described. A user inputs a command (either via a spoken utterance or textual input) to a system. The system determines content responsive to the command. The system also determines a second command or corresponding intent likely to be invoked by the user subsequent to the previous command. Such determination may involve analyzing pairs of intents, with each pair being associated with a probability that one intent of the pair will be invoked by a user subsequent to a second intent of the pair. The system then outputs first content responsive to the first command and second content soliciting the user as to whether the system is to execute the second command.
-
Publication No.: US11837229B1
Publication date: 2023-12-05
Application No.: US17363387
Filing date: 2021-06-30
Applicant: Amazon Technologies, Inc.
Inventor: Xing Fan, Saurabh Gupta, Chenlei Guo, Eunah Cho
CPC classification number: G10L15/22, G06N5/02, G10L15/144, G06F16/3338, G06F16/367, G10L2015/223
Abstract: Techniques for determining and using interaction affinity data are described. Interaction affinity data may indicate a latent affinity between pieces of information corresponding to an interaction, such as intents, entities, the device type from which a user input is received, the domain, etc. A system may use the interaction affinity data to determine an alternative input representation for a spoken input to cause output of a desired response to the spoken input. The system may also use the interaction affinity data to recommend an action to a user.
-
Publication No.: US20230110205A1
Publication date: 2023-04-13
Application No.: US17901209
Filing date: 2022-09-01
Applicant: Amazon Technologies, Inc.
Inventor: Chenlei Guo, Xing Fan, Jin Hock Ong, Kai Wei
IPC: G10L15/197, G10L15/22, G10L15/18
Abstract: Techniques for handling errors during processing of natural language inputs are described. A system may process a natural language input to generate an ASR hypothesis or NLU hypothesis. The system may use more than one data searching technique (e.g., deep neural network searching, convolutional neural network searching, etc.) to generate an alternate ASR hypothesis or NLU hypothesis, depending on the type of hypothesis input for alternate hypothesis processing.
-
Publication No.: US20230089285A1
Publication date: 2023-03-23
Application No.: US17853013
Filing date: 2022-06-29
Applicant: Amazon Technologies, Inc.
Inventor: Xing Fan, Zheng Chen, Yuan Ling, Lambert Leo Mathias, Chenlei Guo
IPC: G10L15/197, G10L15/22, G10L15/30, G10L15/18
Abstract: A system is provided for reducing friction during user interactions with a natural language processing system, such as voice assistant systems. The system determines a pre-trained model using dialog session data corresponding to multiple user profiles. The system determines a fine-tuned model using the pre-trained model and a fine-tuning dataset that corresponds to a particular task, such as query rewriting. The system uses the fine-tuned model to process a user input and determine an alternative representation of the input that can result in a desired response from the natural language processing system.
-
Publication No.: US11386890B1
Publication date: 2022-07-12
Application No.: US16788085
Filing date: 2020-02-11
Applicant: Amazon Technologies, Inc.
Inventor: Xing Fan, Zheng Chen, Yuan Ling, Lambert Leo Mathias, Chenlei Guo
IPC: G10L15/197, G10L15/22, G10L15/30, G10L15/18
Abstract: A system is provided for reducing friction during user interactions with a natural language processing system, such as voice assistant systems. The system determines a pre-trained model using dialog session data corresponding to multiple user profiles. The system determines a fine-tuned model using the pre-trained model and a fine-tuning dataset that corresponds to a particular task, such as query rewriting. The system uses the fine-tuned model to process a user input and determine an alternative representation of the input that can result in a desired response from the natural language processing system.
-
Publication No.: US10923111B1
Publication date: 2021-02-16
Application No.: US16368120
Filing date: 2019-03-28
Applicant: Amazon Technologies, Inc.
Inventor: Xing Fan, I-Fan Chen, Yuzong Liu, Bjorn Hoffmeister, Yiming Wang, Tongfei Chen
Abstract: A system configured to recognize text represented by speech may determine that a first portion of audio data corresponds to speech from a first speaker and that a second portion of audio data corresponds to speech from the first speaker and a second speaker. Features of the first portion are compared to features of the second portion to determine a similarity therebetween. Based on this similarity, speech from the first speaker is distinguished from speech from the second speaker and text corresponding to speech from the first speaker is determined.