PERSISTENT INTERFERENCE DETECTION
    2.
    发明申请

    公开(公告)号:US20190096429A1

    公开(公告)日:2019-03-28

    申请号:US15714190

    申请日:2017-09-25

    Abstract: A multi-microphone algorithm for detecting and differentiating interference sources from desired talker speech in advanced audio processing for smart home applications is described. The approach is based on characterizing a persistent interference source when sounds repeated occur from a fixed spatial location relative to the device, which is also fixed. Some examples of such interference sources include TV, music system, air-conditioner, washing machine, and dishwasher. Real human talkers, in contrast, are not expected to remain stationary and speak continuously from the same position for a long time. The persistency of an acoustic source is established based on identifying historically-recurring inter-microphone frequency-dependent phase profiles in multiple time periods of the audio data. The detection algorithm can be used with a beamforming processor to suppress the interference and for achieving voice quality and automatic speech recognition rate improvements in smart home applications.

    AUDIO PRIVACY BASED ON USER IDENTIFICATION
    3.
    发明申请

    公开(公告)号:US20190043509A1

    公开(公告)日:2019-02-07

    申请号:US15669607

    申请日:2017-08-04

    Inventor: Seth Suppappola

    Abstract: A method and apparatus for audio privacy may be based on user identification. An audio signal containing speech may be analyzed, identifying a user to which the speech belongs and determining a user class for the user. The speech may be uploaded to a remote device based on whether the user class for the user is a public user class or a private user class. This allows certain users to opt-out of having their speech uploaded through public networks. The user identification may be based on voice biometrics.

    Talker change detection
    4.
    发明授权

    公开(公告)号:US10580411B2

    公开(公告)日:2020-03-03

    申请号:US15714296

    申请日:2017-09-25

    Abstract: A change in the phase pattern of the inter-mic impulse response (IMIR), determined by a cross power spectral density, may be used to detect the appearance of a new talker or a dramatic movement of the current talker. For example, the phase of the IMIR is dependent on a location of the sound source relative to the microphone array. Any signal originating from a specific location has a specific phase pattern on the IMIR across the frequency domain. By comparing phase patterns of the current cross power spectral density with a recorded talker phase profile, a talker change can be detected. This detection can be used to control signal processing algorithms. For example, when talker change is detected, the step size of an adaptive filter can be increased to track the changes efficiently.

    Persistent interference detection

    公开(公告)号:US11189303B2

    公开(公告)日:2021-11-30

    申请号:US15714190

    申请日:2017-09-25

    Abstract: A multi-microphone algorithm for detecting and differentiating interference sources from desired talker speech in advanced audio processing for smart home applications is described. The approach is based on characterizing a persistent interference source when sounds repeated occur from a fixed spatial location relative to the device, which is also fixed. Some examples of such interference sources include TV, music system, air-conditioner, washing machine, and dishwasher. Real human talkers, in contrast, are not expected to remain stationary and speak continuously from the same position for a long time. The persistency of an acoustic source is established based on identifying historically-recurring inter-microphone frequency-dependent phase profiles in multiple time periods of the audio data. The detection algorithm can be used with a beamforming processor to suppress the interference and for achieving voice quality and automatic speech recognition rate improvements in smart home applications.

    Multi-microphone human talker detection

    公开(公告)号:US10733276B2

    公开(公告)日:2020-08-04

    申请号:US15836677

    申请日:2017-12-08

    Abstract: The reliable differentiation of human and artificial talkers is important for many automatic speaker verification applications, such as in developing anti-spoofing countermeasures against replay attacks for voice biometric authentication. A multi-microphone approach may exploit small movements of human talkers to differentiate between a human talker and an artificial talker. One method of determining the presence or absence of talker movement includes monitoring the variation of the inter-mic frequency-dependent phase profile of the received microphone array data over a period of time. Using spatial information with spectral-based techniques for determining whether an audio source is a human or artificial talker may reduce the likelihood of success of spoofing attacks against a voice biometric authentication system. The anti-spoofing countermeasure may be used in electronic devices including smart home devices, cellular phones, tablets, and personal computers.

    Temporal and spatial detection of acoustic sources

    公开(公告)号:US10142730B1

    公开(公告)日:2018-11-27

    申请号:US15714262

    申请日:2017-09-25

    Abstract: Noise sources may be identified as either an interference source, such as a television, or a talker source by analyzing phase information of the microphone signals. A phase delay variance may be computed from pairs of microphone signals. A profile of an interference source may be learned over time by updating a stored profile when the phase delay variance is below a threshold. The stored profile may be used to identify interference sources received by the microphones by determining a correlation between the microphone signals and the stored profile. When an interference source is detected, control parameters may be generated to control a beamformer to reduce contribution of the interference source to an output audio signal. The output audio signal may be used for speech processing, such as in a smart home device.

    Beamformer enhanced direction of arrival estimation in a reverberant environment with directional noise

    公开(公告)号:US11533559B2

    公开(公告)日:2022-12-20

    申请号:US16684190

    申请日:2019-11-14

    Abstract: An estimator of direction of arrival (DOA) of speech from a far-field talker to a device in the presence of room reverberation and directional noise includes audio inputs received from multiple microphones and one or more beamformer outputs generated by processing the microphone inputs. A first DOA estimate is obtained by performing generalized cross-correlation between two or more of the microphone inputs. A second DOA estimate is obtained by performing generalized cross-correlation between one of the one or more beamformer outputs and one or more of: the microphone inputs and other of the one or more beamformer outputs. A selector selects the first or second DOA estimate based on an SNR estimate at the microphone inputs and a noise reduction amount estimate at the beamformer outputs. The SNR and noise reduction estimates may be obtained based on the detection of a keyword spoken by a desired talker.

    Pole-zero blocking matrix for low-delay far-field beamforming

    公开(公告)号:US11315543B2

    公开(公告)日:2022-04-26

    申请号:US16773259

    申请日:2020-01-27

    Abstract: A system performs pole-zero or IIR modeling and estimation of an inter-microphone transfer function between first and second microphones that output respective first and second microphone signals. The system includes a first adaptive FIR filter to which the first microphone signal is provided, a delay element that delays the second microphone signal by a predetermined delay amount, and a second adaptive FIR filter to which the delayed second microphone signal is provided. A first coefficient of the second adaptive FIR filter is constrained to a fixed non-zero value. The filters are jointly adapted to minimize an error signal that is a difference of the two filters outputs. The delay is small: approximately the acoustic propagation delay between the two microphones and is not determined by the environmental reverberation characteristics. The error signal may serve as a noise reference in a noise canceller, for implementing far-field beamforming with low delay.

    Echo path change monitoring in an acoustic echo canceler

    公开(公告)号:US10827076B1

    公开(公告)日:2020-11-03

    申请号:US16815936

    申请日:2020-03-11

    Abstract: An acoustic echo path change detector provides a monitoring process in an acoustic echo canceler that removes echo from a microphone signal using an adaptive echo path model that generates an echo estimate from a playback signal. The acoustic echo canceler removes the echo estimate from the microphone signal to provide an echo-canceled output signal. The path change detector receives the microphone signal, the echo estimate and the output signal and determines a rate of change of one or more statistical values dependent on the microphone signal, the echo estimate and the output signal. If the rate of change exceeds a threshold value, the echo path change detector generates an indication that causes a supervisory process to change adaptation of the adaptive echo path model to increase responsiveness to the change in the acoustic echo path, e.g., by increasing the step size.

Patent Agency Ranking