-
公开(公告)号:US11094320B1
公开(公告)日:2021-08-17
申请号:US14579699
申请日:2014-12-22
Applicant: Amazon Technologies, Inc.
Inventor: Vikas Jain , Shishir Sridhar Bharathi , Giuseppe Pino Di Fabbrizio , Ling Hu , Sumedha Arvind Kshirsagar , Shamitha Somashekar , John Daniel Thimsen , Tudor Toma
IPC: G06F3/0481 , G10L15/08 , G10L15/22
Abstract: Dialog visualizations are created to enable analysis of interactions between a user and a speech recognition system used to implement user commands. Spoken commands from the user may be classified, along with system responses to the spoken commands, to enable aggregation of communication exchanges that form dialog. This data may then be used to create a dialog visualization. The dialog visualization may enable an analyst to visually explore different branches of the interactions represented in the dialog visualization. The dialog visualization may show a trajectory of the dialog, which may be explored in an interactive manner by the analyst.
-
公开(公告)号:US10522134B1
公开(公告)日:2019-12-31
申请号:US15388458
申请日:2016-12-22
Applicant: Amazon Technologies, Inc.
Inventor: Spyridon Matsoukas , Aparna Khare , Vishwanathan Krishnamoorthy , Shamitha Somashekar , Arindam Mandal
Abstract: Systems, methods, and devices for verifying a user are disclosed. A speech-controlled device captures a spoken command, and sends audio data corresponding thereto to a server. The server performs ASR on the audio data to determine ASR confidence data. The server, in parallel, performs user verification on the audio data to determine user verification confidence data. The server may modify the user verification confidence data using the ASR confidence data. In addition or alternatively, the server may modify the user verification confidence data using at least one of a location of the speech-controlled device within a building, a type of the speech-controlled device, or a geographic location of the speech-controlled device.
-
公开(公告)号:US09293134B1
公开(公告)日:2016-03-22
申请号:US14502103
申请日:2014-09-30
Applicant: Amazon Technologies, Inc.
Inventor: Shirin Saleem , Shamitha Somashekar , Aimee Therese Piercy , Kurt Wesley Piersol , Marcello Typrin
Abstract: A speech system may be configured to operate in conjunction with a stationary base device and a handheld remote device to receive voice commands from a user. Voice commands may be directed either to the base device or to the handheld device. When performing automatic speech recognition (ASR), natural language understanding (NLU), dialog management, text-to-speech (TTS) conversion, and other speech-related tasks, the system may utilize various models, including ASR models, NLU models, dialog models, and TTS models. Different models may be used depending on whether the user has chosen to speak into the base device or the handheld audio device. The different models may be designed to accommodate the different characteristics of audio and speech that are present in audio provided by the two different components and the different characteristics of the environmental situation of the user.
Abstract translation: 语音系统可以被配置为与固定基站设备和手持远程设备结合操作以从用户接收语音命令。 语音命令可以被引导到基本设备或手持设备。 当进行自动语音识别(ASR),自然语言理解(NLU),对话管理,文本到语音(TTS)转换和其他语音相关任务时,系统可以利用各种模型,包括ASR模型,NLU模型, 对话模型和TTS模型。 可以使用不同的型号,这取决于用户是否选择对基本设备或手持音频设备进行说话。 可以将不同的模型设计为适应由两个不同组件提供的音频中存在的音频和语音的不同特征以及用户的环境状况的不同特征。
-
-