-
Publication number: US11837248B2
Publication date: 2023-12-05
Application number: US17786138
Application date: 2020-12-11
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Nicholas Luke Appleton , Jenean Jiaying Lee
IPC: H04B3/20 , G10L21/0208
CPC classification number: G10L21/0208 , G10L2021/02082
Abstract: In some embodiments, an echo cancellation method includes adaptation of at least one prediction filter, with adaptation step size controlled using gradient descent on a set of filter coefficients of the filter, where control of the adaptation step size is based at least in part on a direction of adaptation and a predictability of a gradient of adaptation (e.g., a gradient vector). Other aspects of embodiments of the invention include systems, methods, and computer program products for controlling adaptation step size of adaptive (e.g., low-complexity adaptive) echo cancellation. In some embodiments, adaptation step size control is based on a normalized, scaled gradient of adaptation, or includes smoothing of a normalized gradient of adaptation.
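For orientation only, the general idea of adapting a prediction filter with a normalized gradient can be sketched with the textbook NLMS update. This is a generic stand-in, not the step-size control described in the patent; the function name, tap count, step size, and the synthetic echo path are all assumptions made for illustration.

```python
import random

def nlms_echo_cancel(reference, mic, num_taps=8, step=0.5, eps=1e-8):
    """Textbook NLMS adaptive filter (illustrative; not the patented method)."""
    weights = [0.0] * num_taps
    history = [0.0] * num_taps  # most recent reference samples, newest first
    residual = []
    for x, d in zip(reference, mic):
        history = [x] + history[:-1]
        echo_estimate = sum(w * h for w, h in zip(weights, history))
        err = d - echo_estimate
        energy = sum(h * h for h in history) + eps
        # The raw gradient (err * history) is normalized by the input energy,
        # so the effective step size does not depend on the signal level.
        weights = [w + step * err * h / energy for w, h in zip(weights, history)]
        residual.append(err)
    return weights, residual

# Hypothetical noise-free test signal: the "microphone" hears the reference
# through a short, made-up echo path.
random.seed(0)
echo_path = [0.6, -0.3, 0.1]
ref = [random.uniform(-1.0, 1.0) for _ in range(4000)]
mic = [
    sum(echo_path[k] * ref[n - k] for k in range(len(echo_path)) if n - k >= 0)
    for n in range(len(ref))
]
weights, residual = nlms_echo_cancel(ref, mic)
```

In this noise-free setting the first three weights approach the assumed echo path and the residual decays toward zero; a real echo canceller would also have to cope with near-end speech and noise.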
-
Publication number: US12080317B2
Publication date: 2024-09-03
Application number: US17639317
Application date: 2020-08-27
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Hadis Nosrati , Glenn N. Dickins , Nicholas Luke Appleton
IPC: G10L15/20 , G10L21/02 , G10L21/0208 , G10L21/0316
CPC classification number: G10L21/0316 , G10L15/20 , G10L21/0208 , G10L2021/02082
Abstract: An apparatus and method of pre-conditioning audio for machine perception. Machine perception differs from human perception, and different processing parameters are used for machine perception applications (e.g., speech to text processing) as compared to those used for human perception applications (e.g., voice communications). These different parameters may result in pre-conditioned audio that is worsened for human perception yet improved for machine perception.
-
Publication number: US09979369B2
Publication date: 2018-05-22
Application number: US15788443
Application date: 2017-10-19
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Nicholas Luke Appleton , Christopher Mark Tubb , William Thomas Rowley , Imankalyan Mukherjee
CPC classification number: H03G11/04 , H03G3/20 , H03G7/002 , H03G7/007 , H03G11/008
Abstract: An audio peak limiter apparatus which calculates a smoothed sequence of gains for application to a sequence of blocks of samples of an audio signal. The apparatus sometimes calculates a candidate gain, to replace a too-large smoothed gain, such that applying the candidate gain would produce no scaled sample whose magnitude exceeds a predetermined limit. The apparatus sometimes calculates and stores a final gain to replace the too-large smoothed gain (where the final gain can be obtained e.g. by dividing the to-be-replaced smoothed gain by a prior smoothed gain and multiplying the result by the corresponding prior final gain), if it is determined that the candidate gain is not less than the immediately previous final gain.
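Two of the quantities named in this abstract can be written out as simple arithmetic. The sketch below assumes a candidate gain computed as the limit divided by the block peak (capped at unity, which is an added assumption) and reproduces the ratio formula given in the abstract; the function names are hypothetical, and the decision logic for when each gain is used is not modeled.

```python
def candidate_gain(block, limit=1.0):
    """Largest gain (capped at 1.0, an assumption) keeping every scaled sample within the limit."""
    peak = max((abs(s) for s in block), default=0.0)
    return 1.0 if peak <= limit else limit / peak

def ratio_scaled_gain(smoothed, prior_smoothed, prior_final):
    """Final gain per the abstract's example: smoothed / prior smoothed * prior final."""
    return smoothed / prior_smoothed * prior_final

block = [0.5, -2.0, 1.0]           # made-up block of samples
gain = candidate_gain(block)        # peak is 2.0, so the gain is 0.5
scaled = [gain * s for s in block]  # no magnitude exceeds the limit of 1.0
final = ratio_scaled_gain(0.8, 1.0, 0.5)
```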
-
Publication number: US20250061914A1
Publication date: 2025-02-20
Application number: US18820282
Application date: 2024-08-30
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Hadis Nosrati , Glenn N. Dickins , Nicholas Luke Appleton
IPC: G10L21/0316 , G10L15/20 , G10L21/0208
Abstract: An apparatus and method of pre-conditioning audio for machine perception. Machine perception differs from human perception, and different processing parameters are used for machine perception applications (e.g., speech to text processing) as compared to those used for human perception applications (e.g., voice communications). These different parameters may result in pre-conditioned audio that is worsened for human perception yet improved for machine perception.
-
Publication number: US11184706B2
Publication date: 2021-11-23
Application number: US17055985
Application date: 2019-05-14
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Brian George Arnott , Nicholas Luke Appleton , Juan Felix Torres , William Thomas Rowley , Ho Young Sung , Michael J. Smithers
Abstract: An apparatus and method of excursion protection of a loudspeaker. The method includes attenuating selected bands in a transform domain, controlled by a feedback signal resulting from an excursion transfer function that has been modified according to the real-time operational characteristics of the loudspeaker. In this manner, the system reduces the amount of wideband attenuation needed to address the predicted excursion, resulting in a better listening experience.
-
Publication number: US10659880B2
Publication date: 2020-05-19
Application number: US16191123
Application date: 2018-11-14
Inventor: Dirk Jeroen Breebaart , Mark David de Burgh , Nicholas Luke Appleton , Heiko Purnhagen , Mark William Gerrard , David Matthew Cooper
IPC: H04R5/02 , H04R3/14 , H04R5/04 , H04S1/00 , H04M1/03 , H04M1/60 , H04R3/04 , H04S3/00 , H04S7/00
Abstract: A method of processing audio data for replay on a mobile device with a first speaker and a second speaker, wherein the audio data comprises a respective audio signal for each of the first and second speakers, includes: determining a device orientation of the mobile device; if the determined device orientation is vertical orientation, applying a first processing mode to the audio signals for the first and second speakers; and if the determined device orientation is horizontal orientation, applying a second processing mode to the audio signals for the first and second speakers. Applying the first processing mode involves: determining respective mono audio signals in at least two frequency bands based on the audio signals for the first and second speakers; in a first one of the at least two frequency bands, routing a larger portion of the respective mono audio signal to one of the first and second speakers; and in a second one of the at least two frequency bands, routing a larger portion of the respective mono audio signal to the other one of the first and second speakers. Applying the second processing mode involves applying cross-talk cancellation to the audio signals for the first and second speakers.
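The first (vertical-orientation) processing mode can be illustrated with a toy per-band routing step. This is a minimal sketch under stated assumptions: the input is taken to be already split into two frequency bands, the mono signal is assumed to be the average of the two channels, and the 0.8/0.2 split is an arbitrary choice. The function name is hypothetical, and cross-talk cancellation for the second mode is not shown.

```python
def route_vertical(band_signals, major=0.8):
    """band_signals: two (left, right) sample pairs, one per frequency band.

    Returns one output sample for each of the two speakers.
    """
    (l0, r0), (l1, r1) = band_signals
    mono0 = 0.5 * (l0 + r0)  # mono signal in the first band (assumed average)
    mono1 = 0.5 * (l1 + r1)  # mono signal in the second band
    minor = 1.0 - major
    # The larger portion of band 0 goes to the first speaker, and the larger
    # portion of band 1 goes to the second speaker.
    first_speaker = major * mono0 + minor * mono1
    second_speaker = minor * mono0 + major * mono1
    return first_speaker, second_speaker

# Energy only in the first band: most of it lands on the first speaker.
out_first, out_second = route_vertical([(1.0, 1.0), (0.0, 0.0)])
```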
-
Publication number: US10586553B2
Publication date: 2020-03-10
Application number: US15747735
Application date: 2016-09-21
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Nicholas L. Engel , Nicholas Luke Appleton , Alan J. Seefeldt
Abstract: In an apparatus configured to perform signal processing on audio data of a first sampling rate, methods disclosed herein comprise receiving audio data of a second sampling rate, the second sampling rate being higher than the first sampling rate. The methods comprise applying filtering to the audio data of the second sampling rate to thereby produce first filtered audio data and second filtered audio data, the first filtered audio data comprising mainly component frequencies which are audible to the human ear, the second filtered audio data comprising mainly component frequencies which are substantially inaudible to the human ear. The methods further comprise applying first signal processing to the first filtered audio data; and applying second signal processing to the second filtered audio data, the second signal processing having a lower computational complexity than the first signal processing. Corresponding apparatus and computer readable media are also disclosed herein.
-
Publication number: US12205608B2
Publication date: 2025-01-21
Application number: US17905860
Application date: 2021-03-17
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Nicholas Luke Appleton , Jenean Jiaying Lee
IPC: G10L21/02 , G10L21/0232 , G10L21/0264 , G10L21/0208 , G10L21/0316
Abstract: Systems, methods, and computer program products for echo cancellation with prediction filter adaptation and detection of wideband offset between a reference signal (available to the echo canceller) and an output signal (unavailable to the echo canceller), where the output signal has been generated by applying at least one level shift to the reference signal, e.g. such that the level shift is unknown to the echo canceller.
-
Publication number: US20220319532A1
Publication date: 2022-10-06
Application number: US17639317
Application date: 2020-08-27
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Hadis Nosrati , Glenn N. Dickins , Nicholas Luke Appleton
IPC: G10L21/0316 , G10L15/20 , G10L21/0208
Abstract: An apparatus and method of pre-conditioning audio for machine perception. Machine perception differs from human perception, and different processing parameters are used for machine perception applications (e.g., speech to text processing) as compared to those used for human perception applications (e.g., voice communications). These different parameters may result in pre-conditioned audio that is worsened for human perception yet improved for machine perception.
-
Publication number: US11437054B2
Publication date: 2022-09-06
Application number: US17022423
Application date: 2020-09-16
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Nicholas Luke Appleton , Shanush Prema Thasarathan
IPC: G10L21/0224 , G10L19/008 , G10L21/0216
Abstract: Systems, methods, and computer program products for frequency-domain estimation of latency between audio signals. In some embodiments, the estimation is performed on first blocks of data indicative of samples of a first audio signal and second blocks of data indicative of samples of a second audio signal, and includes determining a coarse latency estimate, including by determining gains which, when applied to some of the second blocks, determine estimates of one of the first blocks, and identifying one of the estimates as having a best spectral match to said one of the first blocks. A refined latency estimate is determined from the coarse estimate and some of the gains. Optionally, at least one metric indicative of confidence in the refined latency estimate is generated. Audio processing (e.g., echo cancellation) may be performed on the frequency-domain data, including by performing time alignment based on the refined latency estimate.
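The coarse step described here (apply a gain to each candidate second block, then keep the candidate with the best spectral match) can be sketched as a least-squares fit per candidate. This is a simplified illustration, assuming real-valued magnitude spectra and a single scalar gain per block; the function name and data are made up, and the refinement and confidence metrics are not modeled.

```python
def coarse_latency(first_block, second_blocks):
    """Return (index, gain) of the second block whose gain-scaled copy best matches first_block."""
    best = None
    for idx, cand in enumerate(second_blocks):
        energy = sum(c * c for c in cand)
        if energy == 0.0:
            continue  # a silent block cannot be scaled to match anything
        # Least-squares scalar gain mapping the candidate onto the first block.
        gain = sum(f * c for f, c in zip(first_block, cand)) / energy
        err = sum((f - gain * c) ** 2 for f, c in zip(first_block, cand))
        if best is None or err < best[0]:
            best = (err, idx, gain)
    if best is None:
        raise ValueError("no candidate block with non-zero energy")
    return best[1], best[2]

# Made-up magnitude spectra: the first block is a half-amplitude copy of second[1].
second = [[1.0, 0.0, 2.0, 1.0], [0.0, 3.0, 1.0, 2.0], [2.0, 1.0, 0.0, 1.0]]
first = [0.5 * v for v in second[1]]
index, gain = coarse_latency(first, second)
```

The returned index is a latency in units of blocks; a finer estimate would need additional information, such as the phase of complex gains.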