-
Publication number: US11335346B1
Publication date: 2022-05-17
Application number: US16215061
Application date: 2018-12-10
Applicant: Amazon Technologies, Inc.
Inventor: Chengwei Su, Spyridon Matsoukas, Sankaranarayanan Ananthakrishnan, Shirin Saleem, Chungnam Chan, Yugang Li, Mallory McManamon, Rahul Gupta, Luca Soldaini
IPC: G10L15/26, G06K9/62, G06N20/10, G06N7/00, G06F40/295
Abstract: Techniques for processing a user input are described. Text data representing a user input is processed with respect to at least one finite state transducer (FST) to generate at least one FST hypothesis. Context information may be required to traverse one or more paths of the at least one FST. The text data is also processed using at least one statistical model (e.g., a model that performs intent classification, named entity recognition, and/or domain classification) to generate at least one statistical model hypothesis. The at least one FST hypothesis and the at least one statistical model hypothesis are input to a reranker that determines a most likely interpretation of the user input.
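The flow described in this abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration, not the patented implementation: the toy FST is reduced to a deterministic acceptor whose final states carry an intent label, the "statistical model" is a keyword stub standing in for trained classifiers, and the names `TOY_FST`, `SOURCE_WEIGHT`, `fst_hypotheses`, `stat_hypotheses`, `rerank`, and `interpret` are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Set, Tuple


@dataclass(frozen=True)
class Hypothesis:
    intent: str
    score: float
    source: str  # "fst" or "stat"


# Hypothetical toy FST: state -> list of (input token, next state,
# context flag required to traverse the arc, or None if always allowed).
Arc = Tuple[str, int, Optional[str]]
TOY_FST: Dict[int, List[Arc]] = {
    0: [("play", 1, None), ("resume", 2, "media_paused")],
    1: [("music", 3, None)],
}
# Final states and the intent each one emits (illustrative labels).
FINAL_STATES: Dict[int, str] = {2: "ResumeIntent", 3: "PlayMusicIntent"}


def fst_hypotheses(tokens: List[str], context: Set[str]) -> List[Hypothesis]:
    """Walk the toy FST; an arc is usable only if its context flag is present."""
    state = 0
    for tok in tokens:
        for label, nxt, required in TOY_FST.get(state, []):
            if label == tok and (required is None or required in context):
                state = nxt
                break
        else:
            return []  # no usable arc: the FST produces no hypothesis
    intent = FINAL_STATES.get(state)
    return [Hypothesis(intent, 1.0, "fst")] if intent else []


def stat_hypotheses(tokens: List[str]) -> List[Hypothesis]:
    """Keyword stub standing in for trained intent/NER/domain models."""
    keyword_intents = {"play": "PlayMusicIntent", "resume": "ResumeIntent"}
    hits = [keyword_intents[t] for t in tokens if t in keyword_intents]
    return [Hypothesis(i, 0.6, "stat") for i in dict.fromkeys(hits)]


# Assumed per-source weights; a real reranker would be trained.
SOURCE_WEIGHT = {"fst": 1.0, "stat": 0.8}


def rerank(hyps: List[Hypothesis]) -> List[Hypothesis]:
    """Order hypotheses from both sources by weighted score, best first."""
    return sorted(hyps, key=lambda h: h.score * SOURCE_WEIGHT[h.source], reverse=True)


def interpret(text: str, context: Set[str]) -> Optional[Hypothesis]:
    tokens = text.lower().split()
    ranked = rerank(fst_hypotheses(tokens, context) + stat_hypotheses(tokens))
    return ranked[0] if ranked else None
```

In this sketch, `interpret("resume", set())` falls back to the statistical hypothesis because the `resume` arc needs the `media_paused` context flag, while `interpret("resume", {"media_paused"})` returns the FST hypothesis.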
-
Publication number: US09792901B1
Publication date: 2017-10-17
Application number: US14567416
Application date: 2014-12-11
Applicant: Amazon Technologies, Inc.
Inventor: Shirin Saleem, Aimee Therese Piercy, Marcello Typrin, Shamitha Somashekar, Kurt Wesley Piersol
IPC: G10L15/22, B60R16/037, G06F3/16, G10L15/08, G10L17/22
CPC classification number: G10L15/22, B60R16/0373, G06F3/167, G10L2015/223
Abstract: A speech system may be configured to operate in conjunction with a stationary base device and a handheld remote device to receive voice commands from a user. A user may direct speech either to the base device or to the handheld device. In order to direct speech to the base device, the user first speaks a keyword. In order to direct speech to the handheld device, the user presses a talk control on the handheld device. A dialog may be conducted with the user in multiple turns, where each turn comprises user speech and a speech response by the speech system. The user speech in any given dialog turn may be provided from the base device and/or the handheld device.
-
Publication number: US11081104B1
Publication date: 2021-08-03
Application number: US15838917
Application date: 2017-12-12
Applicant: Amazon Technologies, Inc.
Inventor: Chengwei Su, Sankaranarayanan Ananthakrishnan, Spyridon Matsoukas, Shirin Saleem, Rahul Gupta, Kavya Ravikumar, John Will Crimmins, Kelly James Vanee, John Pelak, Melanie Chie Bomke Gens
IPC: G10L15/18, G10L15/22, G10L15/06, G10L15/183, H04L29/08, G10L15/32, G06K9/00, H04W4/02, G10L15/26, G06F16/31, G06F40/295
Abstract: A natural language understanding (NLU) system can determine an overall score for a natural language hypothesis using hypothesis-specific component scores from different aspects of NLU processing, as well as context data describing the context surrounding the utterance corresponding to the natural language hypotheses. The individual component scores may be input into a feature vector at a location corresponding to the type of device that captured the utterance. Other locations in the feature vector corresponding to other device types may be populated with zero values. The feature vector may also be populated with other values representing other context data. The feature vector may then be multiplied by a weight vector comprising trained weights corresponding to the feature vector positions to determine a new overall score for each hypothesis, where the overall score incorporates the impact of the context data. Natural language hypotheses can be ranked using their respective new overall scores.
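The scoring scheme in this abstract (component scores placed in a device-type-specific block, zeros elsewhere, extra context values appended, then a dot product with trained weights) can be sketched in Python as follows. The device types, component score names, context feature, and weight values are all hypothetical, chosen only to make the layout concrete; the patent's actual features and training procedure are not reproduced here.

```python
from typing import Dict, List, Sequence

# Hypothetical vocabularies that fix the layout of the feature vector.
DEVICE_TYPES = ["speaker", "tv", "car"]
COMPONENT_SCORES = ["asr", "intent", "ner"]
CONTEXT_FEATURES = ["is_playing_music"]


def build_feature_vector(
    component_scores: Dict[str, float],
    device_type: str,
    context: Dict[str, float],
) -> List[float]:
    """Place component scores in the block for this device type.

    Blocks for all other device types stay at zero; context values are
    appended after the device-type blocks.
    """
    block = len(COMPONENT_SCORES)
    vec = [0.0] * (block * len(DEVICE_TYPES) + len(CONTEXT_FEATURES))
    start = DEVICE_TYPES.index(device_type) * block
    for i, name in enumerate(COMPONENT_SCORES):
        vec[start + i] = component_scores[name]
    ctx_start = block * len(DEVICE_TYPES)
    for j, name in enumerate(CONTEXT_FEATURES):
        vec[ctx_start + j] = context.get(name, 0.0)
    return vec


def overall_score(features: Sequence[float], weights: Sequence[float]) -> float:
    """Dot product of the feature vector with the (assumed trained) weights."""
    if len(features) != len(weights):
        raise ValueError("feature and weight vectors must have equal length")
    return sum(f * w for f, w in zip(features, weights))


def rank_hypotheses(
    hypotheses: Dict[str, Dict[str, float]],
    device_type: str,
    context: Dict[str, float],
    weights: Sequence[float],
) -> List[str]:
    """Return hypothesis names ordered by overall score, highest first."""
    scored = [
        (overall_score(build_feature_vector(s, device_type, context), weights), name)
        for name, s in hypotheses.items()
    ]
    return [name for _, name in sorted(scored, key=lambda t: t[0], reverse=True)]
```

Because each device type has its own block of positions, the weight vector effectively holds a separate set of component weights per device type, which is how the same component scores can yield different overall scores on different devices.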
-
Publication number: US09293134B1
Publication date: 2016-03-22
Application number: US14502103
Application date: 2014-09-30
Applicant: Amazon Technologies, Inc.
Inventor: Shirin Saleem, Shamitha Somashekar, Aimee Therese Piercy, Kurt Wesley Piersol, Marcello Typrin
Abstract: A speech system may be configured to operate in conjunction with a stationary base device and a handheld remote device to receive voice commands from a user. Voice commands may be directed either to the base device or to the handheld device. When performing automatic speech recognition (ASR), natural language understanding (NLU), dialog management, text-to-speech (TTS) conversion, and other speech-related tasks, the system may utilize various models, including ASR models, NLU models, dialog models, and TTS models. Different models may be used depending on whether the user has chosen to speak into the base device or the handheld audio device. The different models may be designed to accommodate the different characteristics of audio and speech that are present in audio provided by the two different components and the different characteristics of the environmental situation of the user.
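The idea of choosing different ASR, NLU, dialog, and TTS models depending on which device the user spoke into can be pictured as a simple lookup. The Python sketch below is only an illustration under stated assumptions: the model identifiers (including the far-field versus near-field split) and the names `InputSource`, `ModelSet`, `MODELS`, and `select_models` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict


class InputSource(Enum):
    BASE = "base"          # stationary base device
    HANDHELD = "handheld"  # handheld remote device


@dataclass(frozen=True)
class ModelSet:
    asr: str
    nlu: str
    dialog: str
    tts: str


# Hypothetical model identifiers; a real system would hold loaded models here.
MODELS: Dict[InputSource, ModelSet] = {
    InputSource.BASE: ModelSet(
        asr="asr-far-field", nlu="nlu-base", dialog="dialog-base", tts="tts-base"
    ),
    InputSource.HANDHELD: ModelSet(
        asr="asr-near-field", nlu="nlu-handheld", dialog="dialog-handheld", tts="tts-handheld"
    ),
}


def select_models(source: InputSource) -> ModelSet:
    """Pick the model set matching the device the user spoke into."""
    return MODELS[source]
```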
-