-
31.
公开(公告)号:US11081117B2
公开(公告)日:2021-08-03
申请号:US16580738
申请日:2019-09-24
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Oliver Wuebbolt , Johannes Boehm , Peter Jax
IPC: G10L19/008 , H04S3/00 , H04R5/027 , G10L19/16
Abstract: Conventional audio compression technologies perform a standardized signal transformation, independent of the type of the content. Multi-channel signals are decomposed into their signal components, subsequently quantized and encoded. This is disadvantageous due to lack of knowledge on the characteristics of scene composition, especially for e.g. multi-channel audio or Higher-Order Ambisonics (HOA) content. A method for decoding an encoded bitstream of multi-channel audio data and associated metadata is provided, including transforming the first Ambisonics format of the multi-channel audio data to a second Ambisonics format representation of the multi-channel audio data, wherein the transforming maps the first Ambisonics format of the multi-channel audio data into the second Ambisonics format representation of the multi-channel audio data. A method for encoding multi-channel audio data that includes audio data in an Ambisonics format, wherein the encoding includes transforming the audio data in an Ambisonics format into encoded multi-channel audio data is also provided.
-
32.
公开(公告)号:US20210027795A1
公开(公告)日:2021-01-28
申请号:US16925334
申请日:2020-07-10
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Sven Kordon , Alexander Krueger , Oliver Wuebbolt
IPC: G10L19/20 , G10L19/008 , H04S3/00
Abstract: Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. For coding, portions of the original HOA representation are predicted from the directional signal components. This prediction provides side information which is required for a corresponding decoding. By using some additional specific purpose bits, a known side information coding processing is improved in that the required number of bits for coding that side information is reduced on average.
-
33.
公开(公告)号:US20200020344A1
公开(公告)日:2020-01-16
申请号:US16580738
申请日:2019-09-24
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Oliver Wuebbolt , Johannes Boehm , Peter Jax
IPC: G10L19/008 , H04S3/00
Abstract: Conventional audio compression technologies perform a standardized signal transformation, independent of the type of the content. Multi-channel signals are decomposed into their signal components, subsequently quantized and encoded. This is disadvantageous due to lack of knowledge on the characteristics of scene composition, especially for e.g. multi-channel audio or Higher-Order Ambisonics (HOA) content. A method for decoding an encoded bitstream of multi-channel audio data and associated metadata is provided, including transforming the first Ambisonics format of the multi-channel audio data to a second Ambisonics format representation of the multi-channel audio data, wherein the transforming maps the first Ambisonics format of the multi-channel audio data into the second Ambisonics format representation of the multi-channel audio data. A method for encoding multi-channel audio data that includes audio data in an Ambisonics format, wherein the encoding includes transforming the audio data in an Ambisonics format into encoded multi-channel audio data is also provided.
-
公开(公告)号:US10516414B2
公开(公告)日:2019-12-24
申请号:US15952082
申请日:2018-04-12
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Oliver Wuebbolt
IPC: H04N11/02 , H03M7/40 , H04N19/124 , H04N19/182
Abstract: The invention proposes a method and a device for arithmetic encoding of a current spectral coefficient using preceding spectral coefficients. Said preceding spectral coefficients are already encoded and both, said preceding and current spectral coefficients, are comprised in one or more quantized spectra resulting from quantizing time-frequency-transform of video, audio or speech signal sample values.Said method comprises processing the preceding spectral coefficients, using the processed preceding spectral coefficients for determining a context class being one of at least two different context classes, using the determined context class and a mapping from the at least two different context classes to at least two different probability density functions for determining the probability density function, and arithmetic encoding the current spectral coefficient based on the determined probability density function wherein processing the preceding spectral coefficients comprises non-uniformly quantizing absolutes of the preceding spectral coefficients for use in determining of the context class.
-
公开(公告)号:US10334382B2
公开(公告)日:2019-06-25
申请号:US15891606
申请日:2018-02-08
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Sven Kordon , Alexander Krueger , Oliver Wuebbolt
IPC: H04S3/00 , G10L19/008 , G10L19/24
Abstract: A method for compressing a HOA signal being an input HOA representation with input time frames (C(k)) of HOA coefficient sequences comprises spatial HOA encoding of the input time frames and subsequent perceptual encoding and source encoding. Each input time frame is decomposed (802) into a frame of predominant sound signals (XPS(k−1)) and a frame of an ambient HOA component ({tilde over (C)}AMB(k−1)). The ambient HOA component ({tilde over (C)}AMB(k−1)) comprises, in a layered mode, first HOA coefficient sequences of the input HOA representation (cn(k−1)) in lower positions and second HOA coefficient sequences (cAMB,n(k−1)) in remaining higher positions. The second HOA coefficient sequences are part of an HOA representation of a residual between the input HOA representation and the HOA representation of the predominant sound signals.
-
公开(公告)号:US09990934B2
公开(公告)日:2018-06-05
申请号:US15110354
申请日:2014-12-19
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Alexander Krueger , Sven Kordon , Oliver Wuebbolt
IPC: H04R5/00 , G10L19/20 , G10L19/008 , H04S3/00
CPC classification number: G10L19/20 , G10L19/008 , H04S3/008 , H04S2420/11
Abstract: Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. For coding, portions of the original HOA representation are predicted from the directional signal components. This prediction provides side information which is required for a corresponding decoding. By using some additional specific purpose bits, a known side information coding processing is improved in that the required number of bits for coding that side information is reduced on average.
-
37.
公开(公告)号:US12205600B2
公开(公告)日:2025-01-21
申请号:US18489606
申请日:2023-10-18
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Oliver Wuebbolt , Peter Jax , Johannes Boehm
IPC: G10L19/008 , H04S3/00 , G10L19/16 , H04R5/027
Abstract: Conventional audio compression technologies perform a standardized signal transformation, independent of the type of the content. Multi-channel signals are decomposed into their signal components, subsequently quantized and encoded. This is disadvantageous due to lack of knowledge on the characteristics of scene composition, especially for e.g. multi-channel audio or Higher-Order Ambisonics (HOA) content. A method for decoding an encoded bitstream of multi-channel audio data and associated metadata is provided, including transforming the first Ambisonics format of the multi-channel audio data to a second Ambisonics format representation of the multi-channel audio data, wherein the transforming maps the first Ambisonics format of the multi-channel audio data into the second Ambisonics format representation of the multi-channel audio data. A method for encoding multi-channel audio data that includes audio data in an Ambisonics format, wherein the encoding includes transforming the audio data in an Ambisonics format into encoded multi-channel audio data is also provided.
-
38.
公开(公告)号:US11798568B2
公开(公告)日:2023-10-24
申请号:US17392210
申请日:2021-08-02
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Oliver Wuebbolt , Johannes Boehm , Peter Jax
IPC: G10L19/008 , H04S3/00 , H04R5/027 , G10L19/16
CPC classification number: G10L19/008 , H04S3/008 , G10L19/167 , H04R5/027 , H04S2400/01 , H04S2400/03 , H04S2400/15 , H04S2420/03 , H04S2420/11
Abstract: Conventional audio compression technologies perform a standardized signal transformation, independent of the type of the content. Multi-channel signals are decomposed into their signal components, subsequently quantized and encoded. This is disadvantageous due to lack of knowledge on the characteristics of scene composition, especially for e.g. multi-channel audio or Higher-Order Ambisonics (HOA) content. A method for decoding an encoded bitstream of multi-channel audio data and associated metadata is provided, including transforming the first Ambisonics format of the multi-channel audio data to a second Ambisonics format representation of the multi-channel audio data, wherein the transforming maps the first Ambisonics format of the multi-channel audio data into the second Ambisonics format representation of the multi-channel audio data. A method for encoding multi-channel audio data that includes audio data in an Ambisonics format, wherein the encoding includes transforming the audio data in an Ambisonics format into encoded multi-channel audio data is also provided.
-
公开(公告)号:US11770131B2
公开(公告)日:2023-09-26
申请号:US17854866
申请日:2022-06-30
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Oliver Wuebbolt
IPC: H04N11/02 , H03M7/40 , H04N19/124 , H04N19/182
CPC classification number: H03M7/4006 , H04N19/124 , H04N19/182
Abstract: The invention proposes a method and a device for arithmetic encoding of a current spectral coefficient using preceding spectral coefficients. Said preceding spectral coefficients are already encoded and both, said preceding and current spectral coefficients, are comprised in one or more quantized spectra resulting from quantizing time-frequency-transform of video, audio or speech signal sample values.
Said method comprises processing the preceding spectral coefficients, using the processed preceding spectral coefficients for determining a context class being one of at least two different context classes, using the determined context class and a mapping from the at least two different context classes to at least two different probability density functions for determining the probability density function, and arithmetic encoding the current spectral coefficient based on the determined probability density function wherein processing the preceding spectral coefficients comprises non-uniformly quantizing absolutes of the preceding spectral coefficients for use in determining of the context class.-
40.
公开(公告)号:US20220115027A1
公开(公告)日:2022-04-14
申请号:US17558550
申请日:2021-12-21
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Sven Kordon , Alexander Krueger , Oliver Wuebbolt
IPC: G10L19/20 , H04S3/00 , G10L19/008
Abstract: Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. For coding, portions of the original HOA representation are predicted from the directional signal components. This prediction provides side information which is required for a corresponding decoding. By using some additional specific purpose bits, a known side information coding processing is improved in that the required number of bits for coding that side information is reduced on average.
-
-
-
-
-
-
-
-
-