Identifying and suppressing interfering audio content

    Publication number: US10325591B1

    Publication date: 2019-06-18

    Application number: US14478923

    Filing date: 2014-09-05

    Abstract: A speech interface device may capture user speech for analysis by automatic speech recognition (ASR) and natural language understanding (NLU) components. However, an audio signal representing the user speech may also contain interfering sound generated by a media player that is playing audio content such as music. Before performing ASR and NLU, a system attempts to identify the content being played by the media player, such as by querying the media player or by analyzing the audio signal. The system then obtains the same content from an available source and subtracts the audio represented by the content from the audio signal.
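The subtraction step in the abstract above can be illustrated with a minimal sketch. The abstract does not say how the obtained content is aligned with the captured signal, so the cross-correlation delay search and least-squares gain used here are assumptions, and the names `best_lag` and `subtract_reference` are hypothetical.

```python
import random

def best_lag(mic, ref, max_lag):
    """Return the delay in samples at which ref lines up best with mic."""
    def correlation(lag):
        return sum(m * r for m, r in zip(mic[lag:], ref))
    return max(range(max_lag + 1), key=correlation)

def subtract_reference(mic, ref, max_lag=50):
    """Remove a delayed, scaled copy of ref (the known content) from mic."""
    lag = best_lag(mic, ref, max_lag)
    # Shift the reference by the estimated delay and pad/trim to mic's length.
    aligned = ([0.0] * lag + list(ref))[: len(mic)]
    aligned += [0.0] * (len(mic) - len(aligned))
    # Least-squares gain: how loudly the content appears in the mic signal.
    energy = sum(a * a for a in aligned)
    gain = sum(m * a for m, a in zip(mic, aligned)) / energy if energy else 0.0
    return [m - gain * a for m, a in zip(mic, aligned)]

# Synthetic demo: noise-like "music" reaches the mic 3 samples late at half volume.
rng = random.Random(0)
music = [rng.uniform(-1.0, 1.0) for _ in range(200)]
speech = [0.0] * 100 + [0.3] * 20 + [0.0] * 83
mic = [s + 0.5 * m for s, m in zip(speech, [0.0] * 3 + music)]
cleaned = subtract_reference(mic, music)
```

After subtraction, `cleaned` approximates `speech`. A real system would also have to cope with room acoustics and clock drift, which this sketch ignores.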

    Pre-wakeword speech processing
    Granted patent

    Publication number: US10192546B1

    Publication date: 2019-01-29

    Application number: US14672277

    Filing date: 2015-03-30

    Abstract: A system for capturing and processing portions of a spoken utterance command that may occur before a wakeword. The system buffers incoming audio and indicates locations in the audio where the utterance changes, for example when a long pause is detected. When the system detects a wakeword within a particular utterance, the system determines the most recent utterance change location prior to the wakeword and sends the audio from that location to the end of the command utterance to a server for further speech processing.
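The buffering scheme in the abstract above can be sketched as follows. This is a minimal illustration, not the patented implementation: the class name `PreWakewordBuffer`, the frame-level `is_speech` flag, and the three-frame pause threshold are all assumptions.

```python
from collections import deque

class PreWakewordBuffer:
    """Buffers audio frames and marks where a new utterance begins."""

    def __init__(self, max_frames=500, pause_frames=3):
        self.frames = deque(maxlen=max_frames)   # (index, frame) pairs
        self.pause_frames = pause_frames         # silence run that counts as a long pause
        self.silence_run = pause_frames          # so the first speech frame starts an utterance
        self.boundaries = []                     # frame indices where an utterance starts
        self.next_index = 0

    def push(self, frame, is_speech):
        if is_speech:
            if self.silence_run >= self.pause_frames:
                self.boundaries.append(self.next_index)
            self.silence_run = 0
        else:
            self.silence_run += 1
        self.frames.append((self.next_index, frame))
        self.next_index += 1

    def audio_from_utterance_start(self, wakeword_index):
        """Return buffered frames from the latest boundary at or before the wakeword."""
        starts = [b for b in self.boundaries if b <= wakeword_index]
        start = starts[-1] if starts else wakeword_index
        return [frame for index, frame in self.frames if index >= start]

# Demo: words stand in for audio frames; empty strings are silent frames.
buf = PreWakewordBuffer()
for word in ["old", "chat", "", "", "", "play", "music", "alexa", "now"]:
    buf.push(word, is_speech=bool(word))
command = buf.audio_from_utterance_start(wakeword_index=7)
```

Here the wakeword "alexa" sits at frame 7, and the most recent utterance change before it is frame 5, so `command` includes the words spoken before the wakeword.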

    Pre-wakeword speech processing
    Granted patent

    Publication number: US11710478B2

    Publication date: 2023-07-25

    Application number: US17232609

    Filing date: 2021-04-16

    CPC classification number: G10L15/08 G10L17/22 G10L25/87

    Abstract: A system for capturing and processing portions of a spoken utterance command that may occur before a wakeword. The system buffers incoming audio and indicates locations in the audio where the utterance changes, for example when a long pause is detected. When the system detects a wakeword within a particular utterance, the system determines the most recent utterance change location prior to the wakeword and sends the audio from that location to the end of the command utterance to a server for further speech processing.

    METHODS AND DEVICES FOR SELECTIVELY IGNORING CAPTURED AUDIO DATA

    Publication number: US20210210071A1

    Publication date: 2021-07-08

    Application number: US17146995

    Filing date: 2021-01-12

    Abstract: Systems and methods for selectively ignoring an occurrence of a wakeword within audio input data are provided herein. In some embodiments, a wakeword may be detected to have been uttered by an individual within a modified time window, which may account for hardware delays and echoing offsets. The detected wakeword that occurs during this modified time window may, in some embodiments, correspond to a word included within audio that is outputted by a voice activated electronic device. This may cause the voice activated electronic device to activate itself, stopping the audio from being outputted. By identifying when these occurrences of the wakeword within outputted audio are going to happen, the voice activated electronic device may selectively determine when to ignore the wakeword, and furthermore, when not to ignore the wakeword.
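The modified time window described in the abstract above might look like the sketch below. The abstract does not give the exact adjustment, so shifting the window by a hardware delay and extending its end by an echo offset is an assumption, as are the names `Window`, `modified_window`, and `should_ignore` and the default delay values.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Window:
    start: float  # seconds at which the wakeword begins in the output audio
    end: float    # seconds at which it ends

def modified_window(window, hardware_delay, echo_offset):
    """Shift the window by the playback delay and stretch it to cover echoes."""
    return Window(window.start + hardware_delay,
                  window.end + hardware_delay + echo_offset)

def should_ignore(detect_time, playback_windows,
                  hardware_delay=0.05, echo_offset=0.2):
    """True if a detected wakeword falls where the device's own audio contains it."""
    for window in playback_windows:
        adjusted = modified_window(window, hardware_delay, echo_offset)
        if adjusted.start <= detect_time <= adjusted.end:
            return True
    return False

# Demo: the output audio is known to contain the wakeword from 10.0 s to 10.6 s.
known = [Window(10.0, 10.6)]
```

With these values, a detection at 10.3 s or 10.8 s is ignored, while one at 12.0 s is treated as a genuine wakeword from the user.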

    Methods and devices for selectively ignoring captured audio data

    Publication number: US10930266B2

    Publication date: 2021-02-23

    Application number: US16665461

    Filing date: 2019-10-28

    Abstract: Systems and methods for selectively ignoring an occurrence of a wakeword within audio input data are provided herein. In some embodiments, a wakeword may be detected to have been uttered by an individual within a modified time window, which may account for hardware delays and echoing offsets. The detected wakeword that occurs during this modified time window may, in some embodiments, correspond to a word included within audio that is outputted by a voice activated electronic device. This may cause the voice activated electronic device to activate itself, stopping the audio from being outputted. By identifying when these occurrences of the wakeword within outputted audio are going to happen, the voice activated electronic device may selectively determine when to ignore the wakeword, and furthermore, when not to ignore the wakeword.

    PRE-WAKEWORD SPEECH PROCESSING
    Patent application

    Publication number: US20200279552A1

    Publication date: 2020-09-03

    Application number: US16813194

    Filing date: 2020-03-09

    Abstract: A system for capturing and processing portions of a spoken utterance command that may occur before a wakeword. The system buffers incoming audio and indicates locations in the audio where the utterance changes, for example when a long pause is detected. When the system detects a wakeword within a particular utterance, the system determines the most recent utterance change location prior to the wakeword and sends the audio from that location to the end of the command utterance to a server for further speech processing.

    Application focus in speech-based systems
    Granted patent (in force)

    Publication number: US09552816B2

    Publication date: 2017-01-24

    Application number: US14578056

    Filing date: 2014-12-19

    Abstract: A speech-based system includes an audio device in a user premises and a network-based service that supports use of the audio device by multiple applications. The audio device may be directed to play audio content such as music, audio books, etc. The audio device may also be directed to interact with a user through speech. The network-based service monitors event messages received from the audio device to determine which of the multiple applications currently has speech focus. When receiving speech from a user, the service first offers the corresponding meaning to the application, if any, that currently has primary speech focus. If there is no application that currently has primary speech focus, or if the application having primary speech focus is not able to respond to the meaning, the service then offers the user meaning to the application that currently has secondary speech focus.
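The fallback order in the abstract above (primary speech focus first, then secondary) can be sketched as below. The `App` class, its `can_handle` method, and the intent strings are hypothetical; how focus is derived from device event messages is not modeled here.

```python
class App:
    """A hypothetical application that can respond to a fixed set of meanings."""

    def __init__(self, name, intents):
        self.name = name
        self.intents = set(intents)

    def can_handle(self, meaning):
        return meaning in self.intents

def route_meaning(meaning, primary=None, secondary=None):
    """Offer the meaning to the primary-focus app, then to the secondary-focus app."""
    for app in (primary, secondary):
        if app is not None and app.can_handle(meaning):
            return app
    return None  # neither focused application can respond

# Demo: a trivia game holds primary focus while music plays with secondary focus.
trivia = App("trivia", {"answer"})
music = App("music", {"pause", "next"})
```

For example, `route_meaning("pause", trivia, music)` falls through to the music application because the trivia game cannot respond to "pause".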
