ECHO CANCELLATION FOR KEYWORD SPOTTING
    11.
    发明申请

    公开(公告)号:US20180174598A1

    公开(公告)日:2018-06-21

    申请号:US15846049

    申请日:2017-12-18

    Applicant: GOOGLE LLC

    Abstract: Techniques of performing linear acoustic echo cancellation performing a phase correction operation on the estimate of the echo signal based on a clock drift between a capture of an input microphone signal and a playout of a loudspeaker signal. Along these lines, the existence of the clock drift, i.e., a small difference in the sampling rates of the input microphone signal and the loudspeaker signal, can cause processing circuitry in a device configured to perform LAEC operations to generate a filter based on the magnitudes of the short-term Fourier transforms (STFTs) of the input microphone signal and the loudspeaker signal. Such a filter is real-valued and results in a positive estimate of the acoustic echo signal included in the input microphone signal. The phase of this estimate may then be aligned with the phase of the input microphone signal.

    HIERARCHICAL DECORRELATION OF MULTICHANNEL AUDIO

    公开(公告)号:US20200176009A1

    公开(公告)日:2020-06-04

    申请号:US16780506

    申请日:2020-02-03

    Applicant: GOOGLE LLC

    Abstract: Provided are methods, systems, and apparatus for hierarchical decorrelation of multichannel audio. A hierarchical decorrelation algorithm is designed to adapt to possibly changing characteristics of an input signal, and also preserves the energy of the original signal. The algorithm is invertible in that the original signal can be retrieved if needed. Furthermore, the proposed algorithm decomposes the decorrelation process into multiple low-complexity steps. The contribution of these steps is generally in a decreasing order, and thus the complexity of the algorithm can be scaled.

    ECHO CANCELLATION FOR KEYWORD SPOTTING
    13.
    发明申请

    公开(公告)号:US20200152220A1

    公开(公告)日:2020-05-14

    申请号:US16598462

    申请日:2019-10-10

    Applicant: GOOGLE LLC

    Abstract: Techniques of performing linear acoustic echo cancellation performing a phase correction operation on the estimate of the echo signal based on a clock drift between a capture of an input microphone signal and a playout of a loudspeaker signal. Along these lines, the existence of the clock drift, i.e., a small difference in the sampling rates of the input microphone signal and the loudspeaker signal, can cause processing circuitry in a device configured to perform LAEC operations to generate a filter based on the magnitudes of the short-term Fourier transforms (STFTs) of the input microphone signal and the loudspeaker signal. Such a filter is real-valued and results in a positive estimate of the acoustic echo signal included in the input microphone signal. The phase of this estimate may then be aligned with the phase of the input microphone signal.

    Hierarchical decorrelation of multichannel audio

    公开(公告)号:US10553234B2

    公开(公告)日:2020-02-04

    申请号:US16197645

    申请日:2018-11-21

    Applicant: GOOGLE LLC

    Abstract: Provided are methods, systems, and apparatus for hierarchical decorrelation of multichannel audio. A hierarchical decorrelation algorithm is designed to adapt to possibly changing characteristics of an input signal, and also preserves the energy of the original signal. The algorithm is invertible in that the original signal can be retrieved if needed. Furthermore, the proposed algorithm decomposes the decorrelation process into multiple low-complexity steps. The contribution of these steps is generally in a decreasing order, and thus the complexity of the algorithm can be scaled.

    Detection and suppression of keyboard transient noise in audio streams with auxiliary keybed microphone

    公开(公告)号:US10755726B2

    公开(公告)日:2020-08-25

    申请号:US14591418

    申请日:2015-01-07

    Applicant: Google LLC

    Abstract: Provided are methods and systems for enhancing speech when corrupted by transient noise (e.g., keyboard typing noise). The methods and systems utilize a reference microphone input signal for the transient noise in a signal restoration process used for the voice part of the signal. A robust Bayesian statistical model is used to regress the voice microphone on the reference microphone, which allows for direct inference about the desired voice signal while marginalizing the unwanted power spectral values of the voice and transient noise. Also provided is a straightforward and efficient Expectation-maximization (EM) procedure for fast enhancement of the corrupted signal. The methods and systems are designed to operate easily in real-time on standard hardware, and have very low latency so that there is no irritating delay in speaker response.

    OBJECTIVE QUALITY METRICS FOR AMBISONIC SPATIAL AUDIO

    公开(公告)号:US20190341060A1

    公开(公告)日:2019-11-07

    申请号:US15973287

    申请日:2018-05-07

    Applicant: GOOGLE LLC

    Abstract: A computing device includes a processor and a memory. The processor is configured to generate spectrograms, for example, using short-time Fourier transform, for a plurality of channels of reference and test ambisonic signals. In some implementations, the test ambisonic signal may be generated by decoding an encoded version of the reference ambisonic signal. The processor is further configured to compare, for each of the plurality of channels of a reference ambisonic signal, at least a patch associated with a channel of the reference ambisonic signal with at least a corresponding patch of a corresponding channel of the test ambisonic signal and determine a localization accuracy of the test ambisonic signal based on the comparison. In some implementations, the comparing may be based on phaseograms of the reference and test ambisonic signals.

Patent Agency Ranking