Filter adaptation step size control for echo cancellation

    Publication number: US11837248B2

    Publication date: 2023-12-05

    Application number: US17786138

    Application date: 2020-12-11

    CPC classification number: G10L21/0208 G10L2021/02082

    Abstract: In some embodiments, an echo cancellation method includes adaptation of at least one prediction filter, with the adaptation step size controlled using gradient descent on a set of filter coefficients of the filter, where control of the adaptation step size is based at least in part on a direction of adaptation and a predictability of a gradient of adaptation (e.g., a gradient vector). Other aspects of embodiments of the invention include systems, methods, and computer program products for controlling adaptation step size of adaptive (e.g., low-complexity adaptive) echo cancellation. In some embodiments, adaptation step size control is based on a normalized, scaled gradient of adaptation, or includes smoothing of a normalized gradient of adaptation.
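
The idea of tying step size to gradient predictability can be illustrated with a small sketch. This is not the patented method: it assumes a plain NLMS canceller and uses the cosine between the current normalized gradient and its smoothed history as a hypothetical predictability measure. The function name `adaptive_step_nlms` and the parameters `beta`, `rate`, `mu_min`, and `mu_max` are invented for illustration.

```python
def adaptive_step_nlms(ref, mic, taps=4, mu=0.1, mu_min=0.01, mu_max=1.0,
                       beta=0.9, rate=0.05, eps=1e-8):
    """Illustrative NLMS echo canceller with a predictability-driven step size.

    ref: reference (far-end) samples; mic: microphone samples containing echo.
    Returns the final filter coefficients and the residual signal.
    """
    w = [0.0] * taps        # prediction filter coefficients
    smooth = [0.0] * taps   # smoothed normalized gradient (assumed measure)
    buf = [0.0] * taps      # recent reference samples, newest first
    residual = []
    for x, d in zip(ref, mic):
        buf = [x] + buf[:-1]
        y = sum(wi * bi for wi, bi in zip(w, buf))   # predicted echo
        e = d - y                                    # residual after cancellation
        power = sum(b * b for b in buf) + eps
        grad = [e * b / power for b in buf]          # normalized gradient
        # Assumed predictability: cosine between the current gradient
        # and its smoothed history (zero while the history is empty).
        dot = sum(g * s for g, s in zip(grad, smooth))
        norm_g = sum(g * g for g in grad) ** 0.5
        norm_s = sum(s * s for s in smooth) ** 0.5
        cos = dot / (norm_g * norm_s + eps)
        # Grow the step when gradients agree, shrink it when they disagree.
        mu = min(mu_max, max(mu_min, mu * (1.0 + rate * cos)))
        w = [wi + mu * g for wi, g in zip(w, grad)]
        smooth = [beta * s + (1.0 - beta) * g for s, g in zip(smooth, grad)]
        residual.append(e)
    return w, residual
```

Clamping the step size to `[mu_min, mu_max]` with `mu_max` at or below 1 keeps the NLMS update inside its usual stable range, so the step-size control can only change how fast the filter converges, not whether it does.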

    Methods, apparatus and systems for asymmetric speaker processing

    Publication number: US10659880B2

    Publication date: 2020-05-19

    Application number: US16191123

    Application date: 2018-11-14

    Abstract: A method of processing audio data for replay on a mobile device with a first speaker and a second speaker, wherein the audio data comprises a respective audio signal for each of the first and second speakers, includes: determining a device orientation of the mobile device; if the determined device orientation is vertical orientation, applying a first processing mode to the audio signals for the first and second speakers; and if the determined device orientation is horizontal orientation, applying a second processing mode to the audio signals for the first and second speakers. Applying the first processing mode involves: determining respective mono audio signals in at least two frequency bands based on the audio signals for the first and second speakers; in a first one of the at least two frequency bands, routing a larger portion of the respective mono audio signal to one of the first and second speakers; and in a second one of the at least two frequency bands, routing a larger portion of the respective mono audio signal to the other one of the first and second speakers. Applying the second processing mode involves applying cross-talk cancellation to the audio signals for the first and second speakers.
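
The two processing modes described in this abstract might be sketched as follows. Everything concrete here is an assumption made for illustration: the one-pole low-pass used as a band split, the 0.8/0.2 routing ratio, and the single delayed-and-subtracted path standing in for cross-talk cancellation. The names `one_pole_lowpass` and `process_for_orientation` are hypothetical.

```python
def one_pole_lowpass(signal, alpha=0.2):
    """Very simple low-pass filter used here as a crude band split."""
    out, state = [], 0.0
    for s in signal:
        state += alpha * (s - state)
        out.append(state)
    return out


def process_for_orientation(left, right, orientation, major=0.8,
                            xtalk_gain=0.5, xtalk_delay=2):
    """Return the two speaker feeds for the given device orientation."""
    if orientation == "vertical":
        # First mode: mono signal per band, routed unevenly to the speakers.
        mono = [0.5 * (l + r) for l, r in zip(left, right)]
        low = one_pole_lowpass(mono)
        high = [m - lo for m, lo in zip(mono, low)]
        minor = 1.0 - major
        # Larger share of the low band goes to the first speaker,
        # larger share of the high band goes to the second speaker.
        first = [major * lo + minor * hi for lo, hi in zip(low, high)]
        second = [minor * lo + major * hi for lo, hi in zip(low, high)]
        return first, second
    if orientation == "horizontal":
        # Second mode: simplified feed-forward cross-talk cancellation,
        # subtracting a delayed, attenuated copy of the opposite channel.
        def delayed(sig):
            keep = max(0, len(sig) - xtalk_delay)
            return [0.0] * xtalk_delay + list(sig[:keep])
        delayed_left, delayed_right = delayed(left), delayed(right)
        out_left = [l - xtalk_gain * r for l, r in zip(left, delayed_right)]
        out_right = [r - xtalk_gain * l for r, l in zip(right, delayed_left)]
        return out_left, out_right
    raise ValueError("orientation must be 'vertical' or 'horizontal'")
```

Because the two routing weights sum to one and the bands are complementary, the two speaker feeds in the vertical mode add back up to the mono signal, which makes the split easy to check.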

    Processing high-definition audio data

    Publication number: US10586553B2

    Publication date: 2020-03-10

    Application number: US15747735

    Application date: 2016-09-21

    Abstract: In an apparatus configured to perform signal processing on audio data of a first sampling rate, methods disclosed herein comprise receiving audio data of a second sampling rate, the second sampling rate being higher than the first sampling rate. The methods comprise applying filtering to the audio data of the second sampling rate to thereby produce first filtered audio data and second filtered audio data, the first filtered audio data comprising mainly component frequencies which are audible to the human ear, the second filtered audio data comprising mainly component frequencies which are substantially inaudible to the human ear. The methods further comprise applying first signal processing to the first filtered audio data; and applying second signal processing to the second filtered audio data, the second signal processing having a lower computational complexity than the first signal processing. Corresponding apparatus and computer readable media are also disclosed herein.
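
A rough sketch of the split-then-process structure in this abstract is given below. It assumes a centered moving average as a crude stand-in for a real crossover filter, a `tanh` nonlinearity as a placeholder for the costlier first processing, and a constant gain as the cheap second processing. None of these choices come from the patent, and the names `split_bands` and `process_hd` are hypothetical.

```python
import math


def split_bands(samples, taps=9):
    """Split into a low band (centered moving average) and its complement.

    taps should be odd so the averaging window is centered on each sample.
    """
    half = taps // 2
    padded = [0.0] * half + list(samples) + [0.0] * half
    low = [sum(padded[i:i + taps]) / taps for i in range(len(samples))]
    high = [s - lo for s, lo in zip(samples, low)]
    return low, high


def process_hd(samples, light_gain=0.5):
    """Apply costlier processing to the low band, cheap processing to the rest."""
    low, high = split_bands(samples)
    first = [math.tanh(x) for x in low]        # placeholder for heavy processing
    second = [light_gain * x for x in high]    # low-complexity processing
    return first, second
```

Since the high band is defined as the input minus the low band, the two bands always sum back to the original samples before any processing is applied.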

    Sample-accurate delay identification in a frequency domain

    Publication number: US11437054B2

    Publication date: 2022-09-06

    Application number: US17022423

    Application date: 2020-09-16

    Abstract: Systems, methods, and computer program products for frequency-domain estimation of latency between audio signals. In some embodiments, the estimation is performed on first blocks of data indicative of samples of a first audio signal and second blocks of data indicative of samples of a second audio signal, and includes determining a coarse latency estimate, including by determining gains which, when applied to some of the second blocks, determine estimates of one of the first blocks, and identifying one of the estimates as having a best spectral match to said one of the first blocks. A refined latency estimate is determined from the coarse estimate and some of the gains. Optionally, at least one metric indicative of confidence in the refined latency estimate is generated. Audio processing (e.g., echo cancellation) may be performed on the frequency-domain data, including by performing time alignment based on the refined latency estimate.
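
The coarse stage described above might look something like the following sketch. It covers only the coarse estimate, not the refinement or the confidence metrics. It also makes simplifying assumptions that are not from the patent: a single least-squares gain per candidate block, magnitude spectra only, and non-overlapping blocks. The names `dft` and `coarse_latency` and all parameters are hypothetical.

```python
import cmath


def dft(block):
    """Plain discrete Fourier transform of one block (O(n^2), for clarity)."""
    n = len(block)
    return [sum(x * cmath.exp(-2j * cmath.pi * k * t / n)
                for t, x in enumerate(block))
            for k in range(n)]


def coarse_latency(first, second, block=16, index=6, max_blocks=5):
    """Estimate latency of `first` relative to `second` in whole blocks.

    Returns (latency_in_samples, gains), where gains[k] is the gain that
    best maps the candidate block k blocks in the past onto the target block.
    """
    if index < max_blocks:
        raise ValueError("index must be at least max_blocks")
    target = [abs(v) for v in dft(first[index * block:(index + 1) * block])]
    best, best_err, gains = 0, float("inf"), []
    for k in range(max_blocks + 1):
        start = (index - k) * block
        cand = [abs(v) for v in dft(second[start:start + block])]
        energy = sum(c * c for c in cand) + 1e-12
        # Least-squares gain that scales the candidate onto the target.
        gain = sum(t * c for t, c in zip(target, cand)) / energy
        # Spectral mismatch left over after applying that gain.
        err = sum((t - gain * c) ** 2 for t, c in zip(target, cand))
        gains.append(gain)
        if err < best_err:
            best, best_err = k, err
    return best * block, gains
```

The result is only accurate to one block length; a sample-accurate figure would need a refinement step of the kind the abstract mentions, which this sketch leaves out.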
