Detecting self-generated wake expressions

    公开(公告)号:US11600271B2

    公开(公告)日:2023-03-07

    申请号:US16933931

    申请日:2020-07-20

    摘要: A speech-based audio device may be configured to detect a user-uttered wake expression. For example, the audio device may generate a parameter indicating whether output audio is currently being produced by an audio speaker, whether the output audio contains speech, whether the output audio contains a predefined expression, loudness of the output audio, loudness of input audio, and/or an echo characteristic. Based on the parameter, the audio device may determine whether an occurrence of the predefined expression in the input audio is a result of an utterance of the predefined expression by a user.

    Multichannel noise cancellation using frequency domain spectrum masking

    公开(公告)号:US10755728B1

    公开(公告)日:2020-08-25

    申请号:US15906949

    申请日:2018-02-27

    摘要: A system configured to improve noise cancellation by using portions of multiple reference signals instead of using a complete reference signal. The system divides a frequency spectrum into frequency bands and selects a single reference signal from a group of potential reference signals for every frequency band. For example, a first reference signal is selected for a first frequency band while a second reference signal is selected for a second frequency band. The system may generate a combined reference signal using portions of each of the selected reference signals, such as a portion of the first reference signal corresponding to the first frequency band and a portion of the second reference signal corresponding to the second frequency band. Additionally or alternatively, the system may perform noise cancellation using each of the selected reference signals and filter the outputs based on the corresponding frequency band to generate combined audio output data.

    Detecting Self-Generated Wake Expressions
    3.
    发明申请

    公开(公告)号:US20180130468A1

    公开(公告)日:2018-05-10

    申请号:US15652019

    申请日:2017-07-17

    IPC分类号: G10L15/22

    摘要: A speech-based audio device may be configured to detect a user-uttered wake expression. For example, the audio device may generate a parameter indicating whether output audio is currently being produced by an audio speaker, whether the output audio contains speech, whether the output audio contains a predefined expression, loudness of the output audio, loudness of input audio, and/or an echo characteristic. Based on the parameter, the audio device may determine whether an occurrence of the predefined expression in the input audio is a result of an utterance of the predefined expression by a user.

    ADAPTIVE BEAMFORMING TO CREATE REFERENCE CHANNELS

    公开(公告)号:US20170178662A1

    公开(公告)日:2017-06-22

    申请号:US14973274

    申请日:2015-12-17

    IPC分类号: G10L21/0216 H04R5/04

    摘要: An echo cancellation system that performs audio beamforming to separate audio input into multiple directions and determines a target signal and a reference signal from the multiple directions. For example, the system may detect a strong signal associated with a speaker and select the strong signal as a reference signal, selecting another direction as a target signal. The system may determine a speech position and may select the speech position as a target signal and an opposite direction as a reference signal. The system may create pairwise combinations of opposite directions, with an individual direction being selected as a target signal and a reference signal. The system may select a fixed beamformer output for the target signal and an adaptive beamformer output for the reference signal, or vice versa. The system may remove the reference signal (e.g., audio output by the loudspeaker) to isolate speech included in the target signal.

    Low latency limiter
    5.
    发明授权

    公开(公告)号:US09661438B1

    公开(公告)日:2017-05-23

    申请号:US14669559

    申请日:2015-03-26

    IPC分类号: H04S7/00

    摘要: A limiter for an audio system prevents loud audio signals that exceed a threshold from being output. Output of the audio signals are delayed. When a loud signal exceeds the threshold, the gain applied to the delayed signals is gradually reduced so that by the time the loud signal reaches the output, the gain is at a level that reduces the loud audio signal to be within the threshold. Thereafter the gain is gradually restored to normal over a longer period of time than the audio signals are delayed.

    Beamforming using filter coefficients corresponding to virtual microphones

    公开(公告)号:US11483646B1

    公开(公告)日:2022-10-25

    申请号:US15995994

    申请日:2018-06-01

    IPC分类号: H04R3/00 H04R1/40

    摘要: Techniques for improving beamforming using filter coefficient values corresponding to virtual microphones are described. A system may define “virtual” microphone positions and determine corresponding filter coefficient values. These filter coefficient values may be applied to input audio data captured by actual physical microphones, enabling the system to improve performance of beamforming and/or to reduce a number of physical microphones without degrading performance. Offline testing and simulations may be performed to identify the best combination of virtual microphones and/or filter coefficient values for a particular look-direction. For example, the simulations may identify that a first filter coefficient corresponding to a first virtual microphone and a first direction will be associated with a first physical microphone and the first direction. During run-time processing, a device may generate beamformed audio data for the first direction by applying the first filter coefficient to input audio data captured by the first physical microphone.

    Method for autonomous loudspeaker room adaptation

    公开(公告)号:US10959018B1

    公开(公告)日:2021-03-23

    申请号:US16252066

    申请日:2019-01-18

    摘要: A system that performs wall detection, range estimation, and/or corner detection to determine a position of a device relative to acoustically reflective surfaces. The device generates output audio using a loudspeaker, generates microphone audio data using a microphone array, performs beamforming to generate directional audio data and then generates impulse response data for each of a plurality of directions. The device may detect a peak in the impulse response data and determine a distance and/or direction to a reflective surface based on the peak. Based on a number of reflected surfaces and/or direction(s) of the reflected surfaces detected by the device, the device may classify the different directions and estimate where it is in the room, such as whether the device is in a corner, along one wall, or in an open area. By knowing its position relative to the room surfaces, the device may improve sound equalization.

    Detecting Self-Generated Wake Expressions

    公开(公告)号:US20210005198A1

    公开(公告)日:2021-01-07

    申请号:US16933931

    申请日:2020-07-20

    IPC分类号: G10L15/22

    摘要: A speech-based audio device may be configured to detect a user-uttered wake expression. For example, the audio device may generate a parameter indicating whether output audio is currently being produced by an audio speaker, whether the output audio contains speech, whether the output audio contains a predefined expression, loudness of the output audio, loudness of input audio, and/or an echo characteristic. Based on the parameter, the audio device may determine whether an occurrence of the predefined expression in the input audio is a result of an utterance of the predefined expression by a user.

    Method for efficient autonomous loudspeaker room adaptation

    公开(公告)号:US10777214B1

    公开(公告)日:2020-09-15

    申请号:US16456247

    申请日:2019-06-28

    摘要: A system that performs wall detection, range estimation, and/or corner detection to determine a position of a device relative to acoustically reflective surfaces. The device generates output audio using loudspeaker(s), generates microphone audio data using a microphone array, and generates impulse response data for each of the microphones. The device may generate the impulse response data using an acoustic echo cancellation (AEC) component or multi-channel AEC (MC-AEC). The device may detect a peak in the impulse response data and determine a distance to a reflective surface based on the peak. Based on a number of reflected surfaces detected by the device, the device may classify a position of the device within the room, such as whether the device is in a corner, along one wall, or in an open area. By knowing the position relative to the room surfaces, the device may improve sound equalization and other processing.