-
公开(公告)号:US20250166637A1
公开(公告)日:2025-05-22
申请号:US19030555
申请日:2025-01-17
Applicant: DOLBY INTERNATIONAL AB
Inventor: Heiko PURNHAGEN , Pontus CARLSSON , Kristofer KJOERLING
IPC: G10L19/002 , G10L19/008 , G10L19/18 , H04S3/02 , H04S5/00 , H04S5/02
Abstract: Methods and systems for advanced stereo processing of an audio signal are disclosed. The methods and systems include selecting a coding mode of either transform coding or linear predictive coding and performing advanced stereo processing when in the selected coding mode. Both encoding and decoding operations are provided.
-
2.
公开(公告)号:US20250118314A1
公开(公告)日:2025-04-10
申请号:US18982303
申请日:2024-12-16
Applicant: DOLBY INTERNATIONAL AB
Inventor: Kristofer KJOERLING , Lars VILLEMOES , Heiko PURNHAGEN , Per EKSTRAND
IPC: G10L19/18 , G10L21/038
Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.
-
公开(公告)号:US20240304196A1
公开(公告)日:2024-09-12
申请号:US18551134
申请日:2022-04-01
Inventor: Rishabh TYAGI , Heiko PURNHAGEN
IPC: G10L19/02 , G10L19/008
CPC classification number: G10L19/0204 , G10L19/008
Abstract: A method for multi-band ducking of audio signals is provided. In some implementations, the method involves receiving, at a decoder, an input audio signal, wherein the input audio signal is a downmixed audio signal. In some implementations, the method involves separating the input audio signal into a first set of frequency bands. In some implementations, the method involves determining a set of ducking gains, a ducking gain corresponding to a frequency band of the first set of frequency bands. In some implementations, the method involves generating a broadband decorrelated audio signal, wherein ducking gains of the set of ducking gains are applied to at least one of: 1) a second set of frequency bands prior to generating the at least one broadband decorrelated audio signal: or 2) a third set of frequency bands that separates the at least one broadband decorrelated audio signal.
-
公开(公告)号:US20240087590A1
公开(公告)日:2024-03-14
申请号:US18508415
申请日:2023-11-14
Applicant: DOLBY INTERNATIONAL AB
Inventor: Kristofer KJOERLING , Lars VILLEMOES , Heiko PURNHAGEN , Per EKSTRAND
IPC: G10L21/0388 , G10L19/008 , G10L19/02 , H04S3/00
CPC classification number: G10L21/0388 , G10L19/008 , G10L19/02 , H04S3/008
Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.
-
5.
公开(公告)号:US20230232176A1
公开(公告)日:2023-07-20
申请号:US18008431
申请日:2021-06-10
Inventor: Aaron Steven MASTER , Lie LU , Heiko PURNHAGEN
IPC: H04S7/00 , G10L21/0308 , H04S1/00 , G10L25/18
CPC classification number: H04S7/30 , G10L21/0308 , G10L25/18 , H04S1/007 , H04S2400/11
Abstract: A method comprises: obtaining softmask values for frequency bins of time-frequency tiles representing an audio signal; reducing, or expanding and limiting, the softmask values; and applying the reduced, or expanded and limited, softmask values to the frequency bins to create a time-frequency representation of an estimated target source. An alternative method comprises, for each time-frequency tile: obtaining softmask values; applying the softmask values to the frequency bins to create a time-frequency domain representation of an estimated target source; obtaining a panning parameter and a source concentration estimates for the target source; determining, using the panning parameter estimate and the softmask values, a magnitude for the time-frequency representation of the estimated target source; determining, using the panning parameter estimate and the source phase concentration estimate, a phase for the time-frequency representation of the estimated target source; and combining the magnitude and the phase.
-
6.
公开(公告)号:US20230049695A1
公开(公告)日:2023-02-16
申请号:US17973406
申请日:2022-10-25
Applicant: DOLBY INTERNATIONAL AB
Inventor: Kristofer KJOERLING , Lars VILLEMOES , Heiko PURNHAGEN , Per EKSTRAND
Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.
-
公开(公告)号:US20200302943A1
公开(公告)日:2020-09-24
申请号:US16842212
申请日:2020-04-07
Applicant: DOLBY INTERNATIONAL AB
Inventor: Lars VILLEMOES , Heidi-Maria LEHTONEN , Heiko PURNHAGEN , Toni HIRVONEN
IPC: G10L19/16 , G10L19/26 , G10L19/008 , H04S5/00
Abstract: An encoding system encodes an N-channel audio signal (X), wherein N≥3, as a single-channel downmix signal (Y) together with dry and wet upmix parameters ({tilde over (C)}, {tilde over (P)}). In a decoding system, a decorrelating section outputs, based on the downmix signal, an (N−1)-channel decorrelated signal (Z); a dry upmix section maps the downmix signal linearly in accordance with dry upmix coefficients (C) determined based on the dry upmix parameters; a wet upmix section populates an intermediate matrix based on the wet upmix parameters and knowing that the intermediate matrix belongs to a predefined matrix class, obtains wet upmix coefficients (P) by multiplying the intermediate matrix by a predefined matrix, and maps the decorrelated signal linearly in accordance with the wet upmix coefficients; and a combining section combines outputs from the upmix sections to obtain a reconstructed signal ({circumflex over (X)}) corresponding to the signal to be reconstructed.
-
公开(公告)号:US20200098381A1
公开(公告)日:2020-03-26
申请号:US16593830
申请日:2019-10-04
Applicant: DOLBY INTERNATIONAL AB
Inventor: Kristofer KJOERLING , Heiko PURNHAGEN , Harald MUNDT , Karl Jonas ROEDEN , Leif SEHLSTROM
Abstract: A method for decoding an encoded audio bitstream in an audio processing system is disclosed. The method includes extracting from the encoded audio bitstream a first waveform-coded signal comprising spectral coefficients corresponding to frequencies up to a first cross-over frequency for a time frame and performing parametric decoding at a second cross-over frequency for the time frame to generate a reconstructed signal. The second cross-over frequency is above the first cross-over frequency and the parametric decoding uses reconstruction parameters derived from the encoded audio bitstream to generate the reconstructed signal. The method also includes extracting from the encoded audio bitstream a second waveform-coded signal comprising spectral coefficients corresponding to a subset of frequencies above the first cross-over frequency for the time frame and interleaving the second waveform-coded signal with the reconstructed signal to produce an interleaved signal for the time frame.
-
公开(公告)号:US20200066282A1
公开(公告)日:2020-02-27
申请号:US16673042
申请日:2019-11-04
Applicant: Dolby International AB
Inventor: Kristofer KJOERLING , Harald MUNDT , Heiko PURNHAGEN
IPC: G10L19/008
Abstract: Encoding and decoding devices for encoding the channels of an audio system having at least four channels are disclosed. The decoding device has a first stereo decoding component which subjects a first pair of input channels to a first stereo decoding, and a second stereo decoding component which subjects a second pair of input channels to a second stereo decoding. The results of the first and second stereo decoding components are crosswise coupled to a third and a fourth stereo decoding component which each performs stereo decoding on one channel resulting from the first stereo decoding component, and one channel resulting from the second stereo decoding component.
-
公开(公告)号:US20190320263A1
公开(公告)日:2019-10-17
申请号:US16454250
申请日:2019-06-27
Applicant: DOLBY INTERNATIONAL AB
Inventor: Heiko PURNHAGEN , Lars VILLEMOES , Jonas ENGDEGARD , Jonas ROEDEN , Kristofer KJOERLING
IPC: H04R5/00 , H04S5/00 , G10L19/16 , G10L19/008 , G10L19/032 , G10L19/26 , H04S3/02 , G10L19/02
Abstract: A method and apparatus for reconstructing N audio channels from M audio channels is disclosed. The method includes receiving a bitstream containing an encoded audio signal representing the M audio channels and decoding the encoded audio signal to obtain a frequency domain representation of the M audio channels. The method further includes extracting a parameter from the bitstream and reconstructing at least one of the N audio channels using the parameter. The parameter represents an angle between two signals, at least one of which is included in the M audio channels.
-
-
-
-
-
-
-
-
-