-
公开(公告)号:US11600271B2
公开(公告)日:2023-03-07
申请号:US16933931
申请日:2020-07-20
IPC分类号: G10L15/22 , G10L21/0208 , G10L15/08 , G10L21/0216
摘要: A speech-based audio device may be configured to detect a user-uttered wake expression. For example, the audio device may generate a parameter indicating whether output audio is currently being produced by an audio speaker, whether the output audio contains speech, whether the output audio contains a predefined expression, loudness of the output audio, loudness of input audio, and/or an echo characteristic. Based on the parameter, the audio device may determine whether an occurrence of the predefined expression in the input audio is a result of an utterance of the predefined expression by a user.
-
公开(公告)号:US10755728B1
公开(公告)日:2020-08-25
申请号:US15906949
申请日:2018-02-27
IPC分类号: G10L21/038 , G10L21/0272 , G10L21/0208
摘要: A system configured to improve noise cancellation by using portions of multiple reference signals instead of using a complete reference signal. The system divides a frequency spectrum into frequency bands and selects a single reference signal from a group of potential reference signals for every frequency band. For example, a first reference signal is selected for a first frequency band while a second reference signal is selected for a second frequency band. The system may generate a combined reference signal using portions of each of the selected reference signals, such as a portion of the first reference signal corresponding to the first frequency band and a portion of the second reference signal corresponding to the second frequency band. Additionally or alternatively, the system may perform noise cancellation using each of the selected reference signals and filter the outputs based on the corresponding frequency band to generate combined audio output data.
-
公开(公告)号:US20180130468A1
公开(公告)日:2018-05-10
申请号:US15652019
申请日:2017-07-17
IPC分类号: G10L15/22
摘要: A speech-based audio device may be configured to detect a user-uttered wake expression. For example, the audio device may generate a parameter indicating whether output audio is currently being produced by an audio speaker, whether the output audio contains speech, whether the output audio contains a predefined expression, loudness of the output audio, loudness of input audio, and/or an echo characteristic. Based on the parameter, the audio device may determine whether an occurrence of the predefined expression in the input audio is a result of an utterance of the predefined expression by a user.
-
公开(公告)号:US20170178662A1
公开(公告)日:2017-06-22
申请号:US14973274
申请日:2015-12-17
IPC分类号: G10L21/0216 , H04R5/04
CPC分类号: G10L21/0216 , G10L21/0208 , G10L2021/02082 , G10L2021/02166 , H04R3/005 , H04R5/04 , H04R2201/40 , H04R2203/12 , H04R2420/07
摘要: An echo cancellation system that performs audio beamforming to separate audio input into multiple directions and determines a target signal and a reference signal from the multiple directions. For example, the system may detect a strong signal associated with a speaker and select the strong signal as a reference signal, selecting another direction as a target signal. The system may determine a speech position and may select the speech position as a target signal and an opposite direction as a reference signal. The system may create pairwise combinations of opposite directions, with an individual direction being selected as a target signal and a reference signal. The system may select a fixed beamformer output for the target signal and an adaptive beamformer output for the reference signal, or vice versa. The system may remove the reference signal (e.g., audio output by the loudspeaker) to isolate speech included in the target signal.
-
公开(公告)号:US09661438B1
公开(公告)日:2017-05-23
申请号:US14669559
申请日:2015-03-26
发明人: Jun Yang , Philip Ryan Hilmes
IPC分类号: H04S7/00
CPC分类号: H04R3/007 , H04R2430/01 , H04R2499/15
摘要: A limiter for an audio system prevents loud audio signals that exceed a threshold from being output. Output of the audio signals are delayed. When a loud signal exceeds the threshold, the gain applied to the delayed signals is gradually reduced so that by the time the loud signal reaches the output, the gain is at a level that reduces the loud audio signal to be within the threshold. Thereafter the gain is gradually restored to normal over a longer period of time than the audio signals are delayed.
-
公开(公告)号:US12087318B1
公开(公告)日:2024-09-10
申请号:US17963652
申请日:2022-10-11
发明人: Michael Alan Pogue , Kavitha Velusamy , Preethi Parasseri Narayanan , Tony David , Philip Ryan Hilmes
CPC分类号: G10L25/00 , G06F3/167 , H04R3/005 , H04R27/00 , H04R2227/003 , H04R2227/005 , H04R2227/007
摘要: A distributed voice controlled system has a primary assistant and at least one secondary assistant. The primary assistant has a housing to hold one or more microphones, one or more speakers, and various computing components. The secondary assistant is similar in structure, but is void of speakers. The voice controlled assistants perform transactions and other functions primarily based on verbal interactions with a user. The assistants within the system are coordinated and synchronized to perform acoustic echo cancellation, selection of a best audio input from among the assistants, and distributed processing.
-
公开(公告)号:US11483646B1
公开(公告)日:2022-10-25
申请号:US15995994
申请日:2018-06-01
摘要: Techniques for improving beamforming using filter coefficient values corresponding to virtual microphones are described. A system may define “virtual” microphone positions and determine corresponding filter coefficient values. These filter coefficient values may be applied to input audio data captured by actual physical microphones, enabling the system to improve performance of beamforming and/or to reduce a number of physical microphones without degrading performance. Offline testing and simulations may be performed to identify the best combination of virtual microphones and/or filter coefficient values for a particular look-direction. For example, the simulations may identify that a first filter coefficient corresponding to a first virtual microphone and a first direction will be associated with a first physical microphone and the first direction. During run-time processing, a device may generate beamformed audio data for the first direction by applying the first filter coefficient to input audio data captured by the first physical microphone.
-
公开(公告)号:US10959018B1
公开(公告)日:2021-03-23
申请号:US16252066
申请日:2019-01-18
摘要: A system that performs wall detection, range estimation, and/or corner detection to determine a position of a device relative to acoustically reflective surfaces. The device generates output audio using a loudspeaker, generates microphone audio data using a microphone array, performs beamforming to generate directional audio data and then generates impulse response data for each of a plurality of directions. The device may detect a peak in the impulse response data and determine a distance and/or direction to a reflective surface based on the peak. Based on a number of reflected surfaces and/or direction(s) of the reflected surfaces detected by the device, the device may classify the different directions and estimate where it is in the room, such as whether the device is in a corner, along one wall, or in an open area. By knowing its position relative to the room surfaces, the device may improve sound equalization.
-
公开(公告)号:US20210005198A1
公开(公告)日:2021-01-07
申请号:US16933931
申请日:2020-07-20
IPC分类号: G10L15/22
摘要: A speech-based audio device may be configured to detect a user-uttered wake expression. For example, the audio device may generate a parameter indicating whether output audio is currently being produced by an audio speaker, whether the output audio contains speech, whether the output audio contains a predefined expression, loudness of the output audio, loudness of input audio, and/or an echo characteristic. Based on the parameter, the audio device may determine whether an occurrence of the predefined expression in the input audio is a result of an utterance of the predefined expression by a user.
-
公开(公告)号:US10777214B1
公开(公告)日:2020-09-15
申请号:US16456247
申请日:2019-06-28
摘要: A system that performs wall detection, range estimation, and/or corner detection to determine a position of a device relative to acoustically reflective surfaces. The device generates output audio using loudspeaker(s), generates microphone audio data using a microphone array, and generates impulse response data for each of the microphones. The device may generate the impulse response data using an acoustic echo cancellation (AEC) component or multi-channel AEC (MC-AEC). The device may detect a peak in the impulse response data and determine a distance to a reflective surface based on the peak. Based on a number of reflected surfaces detected by the device, the device may classify a position of the device within the room, such as whether the device is in a corner, along one wall, or in an open area. By knowing the position relative to the room surfaces, the device may improve sound equalization and other processing.
-
-
-
-
-
-
-
-
-