-
公开(公告)号:US10811032B2
公开(公告)日:2020-10-20
申请号:US16225023
申请日:2018-12-19
Inventor: Ghassan Maalouli , Seth Suppappola
Abstract: A method and apparatus to determine a direction of arrival (DOA) of a talker in the presence of a source of spatially-coherent noise. A time sequence of audio samples that include the spatially-coherent noise is received and buffered. Aided by previously known data, a trigger point is detected in the time sequence of audio samples when the talker begins to talk. The buffered time sequence of audio samples is separated into a noise segment and a signal-plus-noise segment based on the trigger point. For each direction of a plurality of distinct directions: an energy difference is computed for the direction between the noise segment and the signal-plus-noise segment, and the DOA of the talker is selected as the direction of the plurality of distinct directions having a largest of the computed energy differences.
-
公开(公告)号:US20190096429A1
公开(公告)日:2019-03-28
申请号:US15714190
申请日:2017-09-25
Inventor: Narayan Kovvali , Seth Suppappola
IPC: G10L25/84 , H04R1/40 , G10L21/0208 , H04R3/00
Abstract: A multi-microphone algorithm for detecting and differentiating interference sources from desired talker speech in advanced audio processing for smart home applications is described. The approach is based on characterizing a persistent interference source when sounds repeated occur from a fixed spatial location relative to the device, which is also fixed. Some examples of such interference sources include TV, music system, air-conditioner, washing machine, and dishwasher. Real human talkers, in contrast, are not expected to remain stationary and speak continuously from the same position for a long time. The persistency of an acoustic source is established based on identifying historically-recurring inter-microphone frequency-dependent phase profiles in multiple time periods of the audio data. The detection algorithm can be used with a beamforming processor to suppress the interference and for achieving voice quality and automatic speech recognition rate improvements in smart home applications.
-
公开(公告)号:US20190043509A1
公开(公告)日:2019-02-07
申请号:US15669607
申请日:2017-08-04
Inventor: Seth Suppappola
Abstract: A method and apparatus for audio privacy may be based on user identification. An audio signal containing speech may be analyzed, identifying a user to which the speech belongs and determining a user class for the user. The speech may be uploaded to a remote device based on whether the user class for the user is a public user class or a private user class. This allows certain users to opt-out of having their speech uploaded through public networks. The user identification may be based on voice biometrics.
-
公开(公告)号:US10580411B2
公开(公告)日:2020-03-03
申请号:US15714296
申请日:2017-09-25
Inventor: Ying Li , Ghassan Maalouli , Narayan Kovvali , Seth Suppappola
IPC: G10L21/028 , G10L17/00 , G10L15/22 , G10L17/06 , H04R1/40 , H04R3/00 , G10L25/21 , G01S5/18 , G06F3/16
Abstract: A change in the phase pattern of the inter-mic impulse response (IMIR), determined by a cross power spectral density, may be used to detect the appearance of a new talker or a dramatic movement of the current talker. For example, the phase of the IMIR is dependent on a location of the sound source relative to the microphone array. Any signal originating from a specific location has a specific phase pattern on the IMIR across the frequency domain. By comparing phase patterns of the current cross power spectral density with a recorded talker phase profile, a talker change can be detected. This detection can be used to control signal processing algorithms. For example, when talker change is detected, the step size of an adaptive filter can be increased to track the changes efficiently.
-
公开(公告)号:US11189303B2
公开(公告)日:2021-11-30
申请号:US15714190
申请日:2017-09-25
Inventor: Narayan Kovvali , Seth Suppappola
IPC: G10L25/84 , G01S3/808 , G01S3/80 , H04R1/40 , H04R3/00 , G10L21/0208 , G10L21/0216 , G10L21/0224 , G10L21/0232
Abstract: A multi-microphone algorithm for detecting and differentiating interference sources from desired talker speech in advanced audio processing for smart home applications is described. The approach is based on characterizing a persistent interference source when sounds repeated occur from a fixed spatial location relative to the device, which is also fixed. Some examples of such interference sources include TV, music system, air-conditioner, washing machine, and dishwasher. Real human talkers, in contrast, are not expected to remain stationary and speak continuously from the same position for a long time. The persistency of an acoustic source is established based on identifying historically-recurring inter-microphone frequency-dependent phase profiles in multiple time periods of the audio data. The detection algorithm can be used with a beamforming processor to suppress the interference and for achieving voice quality and automatic speech recognition rate improvements in smart home applications.
-
公开(公告)号:US10733276B2
公开(公告)日:2020-08-04
申请号:US15836677
申请日:2017-12-08
Inventor: Narayan Kovvali , Ying Li , Nima Yousefian Jazi , Seth Suppappola
Abstract: The reliable differentiation of human and artificial talkers is important for many automatic speaker verification applications, such as in developing anti-spoofing countermeasures against replay attacks for voice biometric authentication. A multi-microphone approach may exploit small movements of human talkers to differentiate between a human talker and an artificial talker. One method of determining the presence or absence of talker movement includes monitoring the variation of the inter-mic frequency-dependent phase profile of the received microphone array data over a period of time. Using spatial information with spectral-based techniques for determining whether an audio source is a human or artificial talker may reduce the likelihood of success of spoofing attacks against a voice biometric authentication system. The anti-spoofing countermeasure may be used in electronic devices including smart home devices, cellular phones, tablets, and personal computers.
-
公开(公告)号:US10142730B1
公开(公告)日:2018-11-27
申请号:US15714262
申请日:2017-09-25
Inventor: Nima Yousefian , Seth Suppappola
Abstract: Noise sources may be identified as either an interference source, such as a television, or a talker source by analyzing phase information of the microphone signals. A phase delay variance may be computed from pairs of microphone signals. A profile of an interference source may be learned over time by updating a stored profile when the phase delay variance is below a threshold. The stored profile may be used to identify interference sources received by the microphones by determining a correlation between the microphone signals and the stored profile. When an interference source is detected, control parameters may be generated to control a beamformer to reduce contribution of the interference source to an output audio signal. The output audio signal may be used for speech processing, such as in a smart home device.
-
公开(公告)号:US11533559B2
公开(公告)日:2022-12-20
申请号:US16684190
申请日:2019-11-14
Inventor: Narayan Kovvali , Ghassan Maalouli , Seth Suppappola
IPC: H04R3/00 , G01S3/80 , G10L21/0264 , G10L21/0208 , G10L21/0216
Abstract: An estimator of direction of arrival (DOA) of speech from a far-field talker to a device in the presence of room reverberation and directional noise includes audio inputs received from multiple microphones and one or more beamformer outputs generated by processing the microphone inputs. A first DOA estimate is obtained by performing generalized cross-correlation between two or more of the microphone inputs. A second DOA estimate is obtained by performing generalized cross-correlation between one of the one or more beamformer outputs and one or more of: the microphone inputs and other of the one or more beamformer outputs. A selector selects the first or second DOA estimate based on an SNR estimate at the microphone inputs and a noise reduction amount estimate at the beamformer outputs. The SNR and noise reduction estimates may be obtained based on the detection of a keyword spoken by a desired talker.
-
公开(公告)号:US11315543B2
公开(公告)日:2022-04-26
申请号:US16773259
申请日:2020-01-27
Inventor: Khosrow Lashkari , Narayan Kovvali , Seth Suppappola
IPC: G10K11/178 , H04R3/00 , H04S7/00
Abstract: A system performs pole-zero or IIR modeling and estimation of an inter-microphone transfer function between first and second microphones that output respective first and second microphone signals. The system includes a first adaptive FIR filter to which the first microphone signal is provided, a delay element that delays the second microphone signal by a predetermined delay amount, and a second adaptive FIR filter to which the delayed second microphone signal is provided. A first coefficient of the second adaptive FIR filter is constrained to a fixed non-zero value. The filters are jointly adapted to minimize an error signal that is a difference of the two filters outputs. The delay is small: approximately the acoustic propagation delay between the two microphones and is not determined by the environmental reverberation characteristics. The error signal may serve as a noise reference in a noise canceller, for implementing far-field beamforming with low delay.
-
公开(公告)号:US10827076B1
公开(公告)日:2020-11-03
申请号:US16815936
申请日:2020-03-11
Inventor: Ying Li , Venkat Anant , Wilbur Lawrence , Seth Suppappola
Abstract: An acoustic echo path change detector provides a monitoring process in an acoustic echo canceler that removes echo from a microphone signal using an adaptive echo path model that generates an echo estimate from a playback signal. The acoustic echo canceler removes the echo estimate from the microphone signal to provide an echo-canceled output signal. The path change detector receives the microphone signal, the echo estimate and the output signal and determines a rate of change of one or more statistical values dependent on the microphone signal, the echo estimate and the output signal. If the rate of change exceeds a threshold value, the echo path change detector generates an indication that causes a supervisory process to change adaptation of the adaptive echo path model to increase responsiveness to the change in the acoustic echo path, e.g., by increasing the step size.
-
-
-
-
-
-
-
-
-