-
公开(公告)号:US20190096429A1
公开(公告)日:2019-03-28
申请号:US15714190
申请日:2017-09-25
Inventor: Narayan Kovvali , Seth Suppappola
IPC: G10L25/84 , H04R1/40 , G10L21/0208 , H04R3/00
Abstract: A multi-microphone algorithm for detecting and differentiating interference sources from desired talker speech in advanced audio processing for smart home applications is described. The approach is based on characterizing a persistent interference source when sounds repeated occur from a fixed spatial location relative to the device, which is also fixed. Some examples of such interference sources include TV, music system, air-conditioner, washing machine, and dishwasher. Real human talkers, in contrast, are not expected to remain stationary and speak continuously from the same position for a long time. The persistency of an acoustic source is established based on identifying historically-recurring inter-microphone frequency-dependent phase profiles in multiple time periods of the audio data. The detection algorithm can be used with a beamforming processor to suppress the interference and for achieving voice quality and automatic speech recognition rate improvements in smart home applications.
-
公开(公告)号:US11189303B2
公开(公告)日:2021-11-30
申请号:US15714190
申请日:2017-09-25
Inventor: Narayan Kovvali , Seth Suppappola
IPC: G10L25/84 , G01S3/808 , G01S3/80 , H04R1/40 , H04R3/00 , G10L21/0208 , G10L21/0216 , G10L21/0224 , G10L21/0232
Abstract: A multi-microphone algorithm for detecting and differentiating interference sources from desired talker speech in advanced audio processing for smart home applications is described. The approach is based on characterizing a persistent interference source when sounds repeated occur from a fixed spatial location relative to the device, which is also fixed. Some examples of such interference sources include TV, music system, air-conditioner, washing machine, and dishwasher. Real human talkers, in contrast, are not expected to remain stationary and speak continuously from the same position for a long time. The persistency of an acoustic source is established based on identifying historically-recurring inter-microphone frequency-dependent phase profiles in multiple time periods of the audio data. The detection algorithm can be used with a beamforming processor to suppress the interference and for achieving voice quality and automatic speech recognition rate improvements in smart home applications.
-
公开(公告)号:US10733276B2
公开(公告)日:2020-08-04
申请号:US15836677
申请日:2017-12-08
Inventor: Narayan Kovvali , Ying Li , Nima Yousefian Jazi , Seth Suppappola
Abstract: The reliable differentiation of human and artificial talkers is important for many automatic speaker verification applications, such as in developing anti-spoofing countermeasures against replay attacks for voice biometric authentication. A multi-microphone approach may exploit small movements of human talkers to differentiate between a human talker and an artificial talker. One method of determining the presence or absence of talker movement includes monitoring the variation of the inter-mic frequency-dependent phase profile of the received microphone array data over a period of time. Using spatial information with spectral-based techniques for determining whether an audio source is a human or artificial talker may reduce the likelihood of success of spoofing attacks against a voice biometric authentication system. The anti-spoofing countermeasure may be used in electronic devices including smart home devices, cellular phones, tablets, and personal computers.
-
公开(公告)号:US10580411B2
公开(公告)日:2020-03-03
申请号:US15714296
申请日:2017-09-25
Inventor: Ying Li , Ghassan Maalouli , Narayan Kovvali , Seth Suppappola
IPC: G10L21/028 , G10L17/00 , G10L15/22 , G10L17/06 , H04R1/40 , H04R3/00 , G10L25/21 , G01S5/18 , G06F3/16
Abstract: A change in the phase pattern of the inter-mic impulse response (IMIR), determined by a cross power spectral density, may be used to detect the appearance of a new talker or a dramatic movement of the current talker. For example, the phase of the IMIR is dependent on a location of the sound source relative to the microphone array. Any signal originating from a specific location has a specific phase pattern on the IMIR across the frequency domain. By comparing phase patterns of the current cross power spectral density with a recorded talker phase profile, a talker change can be detected. This detection can be used to control signal processing algorithms. For example, when talker change is detected, the step size of an adaptive filter can be increased to track the changes efficiently.
-
公开(公告)号:US10559317B2
公开(公告)日:2020-02-11
申请号:US16024618
申请日:2018-06-29
Inventor: Justin L. Allen , Narayan Kovvali
IPC: G10L21/0232 , H04R3/00 , H04R1/40 , G10L25/78 , G10L21/0208
Abstract: An apparatus includes a beamformer, an echo suppression control unit, and a residual echo cancellation unit. The beamformer is configured to pass desired portions of audio signals and to suppress undesired portions of the audio signals. The beamformer includes a speech blocking filter to prevent suppression of near-end desired talker speech in the audio signals and an echo suppression filter to suppress echo in the audio signals. An echo suppression control unit is coupled to the beamformer and receives signals and determines whether to dynamically adapt the speech blocking filter or to dynamically adapt the echo suppression filter. The speech blocking filter remains unchanged during dynamic adaptation of the echo suppression filter, and the echo suppression filter remains unchanged during dynamic adaptation of the speech blocking filter. The residual echo cancellation unit is coupled to the beamformer and receives output audio signals from the beamformer and further suppresses residual echo.
-
公开(公告)号:US11533559B2
公开(公告)日:2022-12-20
申请号:US16684190
申请日:2019-11-14
Inventor: Narayan Kovvali , Ghassan Maalouli , Seth Suppappola
IPC: H04R3/00 , G01S3/80 , G10L21/0264 , G10L21/0208 , G10L21/0216
Abstract: An estimator of direction of arrival (DOA) of speech from a far-field talker to a device in the presence of room reverberation and directional noise includes audio inputs received from multiple microphones and one or more beamformer outputs generated by processing the microphone inputs. A first DOA estimate is obtained by performing generalized cross-correlation between two or more of the microphone inputs. A second DOA estimate is obtained by performing generalized cross-correlation between one of the one or more beamformer outputs and one or more of: the microphone inputs and other of the one or more beamformer outputs. A selector selects the first or second DOA estimate based on an SNR estimate at the microphone inputs and a noise reduction amount estimate at the beamformer outputs. The SNR and noise reduction estimates may be obtained based on the detection of a keyword spoken by a desired talker.
-
公开(公告)号:US11315543B2
公开(公告)日:2022-04-26
申请号:US16773259
申请日:2020-01-27
Inventor: Khosrow Lashkari , Narayan Kovvali , Seth Suppappola
IPC: G10K11/178 , H04R3/00 , H04S7/00
Abstract: A system performs pole-zero or IIR modeling and estimation of an inter-microphone transfer function between first and second microphones that output respective first and second microphone signals. The system includes a first adaptive FIR filter to which the first microphone signal is provided, a delay element that delays the second microphone signal by a predetermined delay amount, and a second adaptive FIR filter to which the delayed second microphone signal is provided. A first coefficient of the second adaptive FIR filter is constrained to a fixed non-zero value. The filters are jointly adapted to minimize an error signal that is a difference of the two filters outputs. The delay is small: approximately the acoustic propagation delay between the two microphones and is not determined by the environmental reverberation characteristics. The error signal may serve as a noise reference in a noise canceller, for implementing far-field beamforming with low delay.
-
公开(公告)号:US20210233509A1
公开(公告)日:2021-07-29
申请号:US16773259
申请日:2020-01-27
Inventor: Khosrow Lashkari , Narayan Kovvali , Seth Suppappola
IPC: G10K11/178 , H04S7/00 , H04R3/00
Abstract: A system performs pole-zero or IIR modeling and estimation of an inter-microphone transfer function between first and second microphones that output respective first and second microphone signals. The system includes a first adaptive FIR filter to which the first microphone signal is provided, a delay element that delays the second microphone signal by a predetermined delay amount, and a second adaptive FIR filter to which the delayed second microphone signal is provided. A first coefficient of the second adaptive FIR filter is constrained to a fixed non-zero value. The filters are jointly adapted to minimize an error signal that is a difference of the two filters outputs. The delay is small: approximately the acoustic propagation delay between the two microphones and is not determined by the environmental reverberation characteristics. The error signal may serve as a noise reference in a noise canceller, for implementing far-field beamforming with low delay.
-
公开(公告)号:US20190180014A1
公开(公告)日:2019-06-13
申请号:US15836677
申请日:2017-12-08
Inventor: Narayan Kovvali , Ying Li , Nima Yousefian Jazi , Seth Suppappola
Abstract: The reliable differentiation of human and artificial talkers is important for many automatic speaker verification applications, such as in developing anti-spoofing countermeasures against replay attacks for voice biometric authentication. A multi-microphone approach may exploit small movements of human talkers to differentiate between a human talker and an artificial talker. One method of determining the presence or absence of talker movement includes monitoring the variation of the inter-mic frequency-dependent phase profile of the received microphone array data over a period of time. Using spatial information with spectral-based techniques for determining whether an audio source is a human or artificial talker may reduce the likelihood of success of spoofing attacks against a voice biometric authentication system. The anti-spoofing countermeasure may be used in electronic devices including smart home devices, cellular phones, tablets, and personal computers.
-
公开(公告)号:US20190096408A1
公开(公告)日:2019-03-28
申请号:US15714296
申请日:2017-09-25
Inventor: Ying Li , Ghassan Maalouli , Narayan Kovvali , Seth Suppappola
Abstract: A change in the phase pattern of the inter-mic impulse response (IMIR), determined by a cross power spectral density, may be used to detect the appearance of a new talker or a dramatic movement of the current talker. For example, the phase of the IMIR is dependent on a location of the sound source relative to the microphone array. Any signal originating from a specific location has a specific phase pattern on the IMIR across the frequency domain. By comparing phase patterns of the current cross power spectral density with a recorded talker phase profile, a talker change can be detected. This detection can be used to control signal processing algorithms. For example, when talker change is detected, the step size of an adaptive filter can be increased to track the changes efficiently.
-
-
-
-
-
-
-
-
-