-
公开(公告)号:US10325591B1
公开(公告)日:2019-06-18
申请号:US14478923
申请日:2014-09-05
Applicant: Amazon Technologies, Inc.
Inventor: Michael Alan Pogue , Kurt Wesley Piersol
Abstract: A speech interface device may capture user speech for analysis by automatic speech recognition (ASR) and natural language understanding (NLU) components. However, an audio signal representing the user speech may also contain interfering sound generated by a media player that is playing audio content such as music. Before performing ASR and NLU, a system attempts to identify the content being played by the media player, such as by querying the media player or by analyzing the audio signal. The system then obtains the same content from an available source and subtracts the audio represented by the content from the audio signal.
-
公开(公告)号:US10192546B1
公开(公告)日:2019-01-29
申请号:US14672277
申请日:2015-03-30
Applicant: Amazon Technologies, Inc.
Inventor: Kurt Wesley Piersol , Gabriel Beddingfield
Abstract: A system for capturing and processing portions of a spoken utterance command that may occur before a wakeword. The system buffers incoming audio and indicates locations in the audio where the utterance changes, for example when a long pause is detected. When the system detects a wakeword within a particular utterance, the system determines the most recent utterance change location prior to the wakeword and sends the audio from that location to the end of the command utterance to a server for further speech processing.
-
公开(公告)号:US09799329B1
公开(公告)日:2017-10-24
申请号:US14559687
申请日:2014-12-03
Applicant: Amazon Technologies, Inc.
Inventor: Michael Alan Pogue , Kurt Wesley Piersol
CPC classification number: G10L15/20 , G10L15/063 , G10L15/065 , G10L15/22 , G10L17/22 , G10L21/0208 , G10L25/51 , G10L25/78
Abstract: This disclosure describes, in part, techniques and devices for identifying recurring environmental sounds in an environment such that these sounds may be canceled out of corresponding audio signals to increase signal-to-noise ratios (SNRs) of the signals and, hence, improve automatic speech recognition (ASR) on the signals. Recurring environmental sounds may include the ringing of a mobile phone, the beeping sound of a microphone, the buzzing of a washing machine, or the like.
-
公开(公告)号:US11710478B2
公开(公告)日:2023-07-25
申请号:US17232609
申请日:2021-04-16
Applicant: Amazon Technologies, Inc.
Inventor: Kurt Wesley Piersol , Gabriel Beddingfield
Abstract: A system for capturing and processing portions of a spoken utterance command that may occur before a wakeword. The system buffers incoming audio and indicates locations in the audio where the utterance changes, for example when a long pause is detected. When the system detects a wakeword within a particular utterance, the system determines the most recent utterance change location prior to the wakeword and sends the audio from that location to the end of the command utterance to a server for further speech processing.
-
公开(公告)号:US20210210071A1
公开(公告)日:2021-07-08
申请号:US17146995
申请日:2021-01-12
Applicant: Amazon Technologies, Inc.
Inventor: James David Meyers , Kurt Wesley Piersol
IPC: G10L15/08 , G10L15/04 , G10L21/028 , G10L15/20
Abstract: Systems and methods for selectively ignoring an occurrence of a wakeword within audio input data is provided herein. In some embodiments, a wakeword may be detected to have been uttered by an individual within a modified time window, which may account for hardware delays and echoing offsets. The detected wakeword that occurs during this modified time window may, in some embodiments, correspond to a word included within audio that is outputted by a voice activated electronic device. This may cause the voice activated electronic device to activate itself, stopping the audio from being outputted. By identifying when these occurrences of the wakeword within outputted audio are going to happen, the voice activated electronic device may selectively determine when to ignore the wakeword, and furthermore, when not to ignore the wakeword.
-
公开(公告)号:US10930266B2
公开(公告)日:2021-02-23
申请号:US16665461
申请日:2019-10-28
Applicant: Amazon Technologies, Inc.
Inventor: James David Meyers , Kurt Wesley Piersol
IPC: G10L15/08 , G10L15/04 , G10L21/028 , G10L15/20 , G10L21/0208 , G10L15/22
Abstract: Systems and methods for selectively ignoring an occurrence of a wakeword within audio input data is provided herein. In some embodiments, a wakeword may be detected to have been uttered by an individual within a modified time window, which may account for hardware delays and echoing offsets. The detected wakeword that occurs during this modified time window may, in some embodiments, correspond to a word included within audio that is outputted by a voice activated electronic device. This may cause the voice activated electronic device to activate itself, stopping the audio from being outputted. By identifying when these occurrences of the wakeword within outputted audio are going to happen, the voice activated electronic device may selectively determine when to ignore the wakeword, and furthermore, when not to ignore the wakeword.
-
公开(公告)号:US20200279552A1
公开(公告)日:2020-09-03
申请号:US16813194
申请日:2020-03-09
Applicant: Amazon Technologies, Inc.
Inventor: Kurt Wesley Piersol , Gabriel Beddingfield
Abstract: A system for capturing and processing portions of a spoken utterance command that may occur before a wakeword. The system buffers incoming audio and indicates locations in the audio where the utterance changes, for example when a long pause is detected. When the system detects a wakeword within a particular utterance, the system determines the most recent utterance change location prior to the wakeword and sends the audio from that location to the end of the command utterance to a server for further speech processing.
-
公开(公告)号:US09792901B1
公开(公告)日:2017-10-17
申请号:US14567416
申请日:2014-12-11
Applicant: Amazon Technologies, Inc.
Inventor: Shirin Saleem , Aimee Therese Piercy , Marcello Typrin , Shamitha Somashekar , Kurt Wesley Piersol
IPC: G10L15/22 , B60R16/037 , G06F3/16 , G10L15/08 , G10L17/22
CPC classification number: G10L15/22 , B60R16/0373 , G06F3/167 , G10L2015/223
Abstract: A speech system may be configured to operate in conjunction with a stationary base device and a handheld remote device to receive voice commands from a user. A user may direct speech either to the base device or to the handheld device. In order to direct speech to the base device, the user first speaks a keyword. In order to direct speech to the handheld device, the user presses a talk control on the handheld device. A dialog may be conducted with the user in multiple turns, where each turn comprises user speech and a speech response by the speech system. The user speech in any given dialog turn may be provided from the base device and/or the handheld device.
-
公开(公告)号:US09774998B1
公开(公告)日:2017-09-26
申请号:US15285446
申请日:2016-10-04
Applicant: Amazon Technologies, Inc.
Inventor: Olusanya Temitope Soyannwo , Tina Yung-Ting Chen , Edward Dietz Crump , Kurt Wesley Piersol , Kavitha Velusamy
IPC: G06F15/173 , H04W4/02 , H04L29/08 , G10L17/22 , G01S3/802
CPC classification number: H04W4/023 , G01S3/802 , G01S11/14 , G06F9/4451 , G10L17/22 , H04L63/0861 , H04L67/141 , H04L67/18 , H04L67/24 , H04L67/306 , H04W4/026 , H04W4/33
Abstract: A computing system with multiple devices local to an environment facilitates active transfer among the multiple devices as a user moves about the environment. The devices may sense a presence or non-presence of the user and attempt to coordinate transfer to a device proximal to the user. In another implementation, the devices may communicate with a remote system that monitors a location of the user within the environment and causes content associated with the user to transfer between computing devices of the system based on the location and movement of the user.
-
公开(公告)号:US09552816B2
公开(公告)日:2017-01-24
申请号:US14578056
申请日:2014-12-19
Applicant: Amazon Technologies, Inc.
Inventor: Peter Spalding VanLund , Kurt Wesley Piersol , James David Meyers , Jacob Michael Simpson , Vikram Kumar Gundeti , David Robert Thomas , Andrew Christopher Miles
CPC classification number: G10L17/22 , G06F9/5011 , G10L15/22 , G10L2015/223 , G10L2015/228
Abstract: A speech-based system includes an audio device in a user premises and a network-based service that supports use of the audio device by multiple applications. The audio device may be directed to play audio content such as music, audio books, etc. The audio device may also be directed to interact with a user through speech. The network-based service monitors event messages received from the audio device to determine which of the multiple applications currently has speech focus. When receiving speech from a user, the service first offers the corresponding meaning to the application, if any, that currently has primary speech focus. If there is no application that currently has primary speech focus, or if the application having primary speech focus is not able to respond to the meaning, the service then offers the user meaning to the application that currently has secondary speech focus.
Abstract translation: 基于语音的系统包括用户场所中的音频设备和支持通过多个应用使用该音频设备的基于网络的服务。 音频设备可以被引导以播放诸如音乐,音频书籍等的音频内容。音频设备还可以被引导以通过语音与用户交互。 基于网络的服务监视从音频设备接收的事件消息,以确定当前具有语音焦点的多个应用中的哪一个。 当从用户接收到语音时,服务首先向当前具有主要语音焦点的应用(如果有的话)提供相应的含义。 如果没有目前具有主要语音焦点的应用程序,或者如果具有主要语音焦点的应用程序不能响应意义,则该服务然后向当前具有辅助语音焦点的应用程序提供用户意义。
-
-
-
-
-
-
-
-
-