-
61.
Publication No.: US11211078B2
Publication date: 2021-12-28
Application No.: US16925334
Filing date: 2020-07-10
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Sven Kordon, Alexander Krueger, Oliver Wuebbolt
IPC: H04R5/00, G10L19/20, G10L19/008, H04S3/00
Abstract: Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. For coding, portions of the original HOA representation are predicted from the directional signal components. This prediction provides side information which is required for a corresponding decoding. By using some additional specific purpose bits, a known side information coding processing is improved in that the required number of bits for coding that side information is reduced on average.
-
62.
Publication No.: US20210027795A1
Publication date: 2021-01-28
Application No.: US16925334
Filing date: 2020-07-10
Applicant: Dolby Laboratories Licensing Corporation
Inventor: Sven Kordon, Alexander Krueger, Oliver Wuebbolt
IPC: G10L19/20, G10L19/008, H04S3/00
Abstract: Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. For coding, portions of the original HOA representation are predicted from the directional signal components. This prediction provides side information which is required for a corresponding decoding. By using some additional specific purpose bits, a known side information coding processing is improved in that the required number of bits for coding that side information is reduced on average.
-
63.
Publication No.: US10516958B2
Publication date: 2019-12-24
Application No.: US16210957
Filing date: 2018-12-05
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Sven Kordon, Alexander Krueger
IPC: H04S3/02, H04S5/02, H04S3/00, G10L19/008
Abstract: When compressing an HOA data frame representation, a gain control (15, 151) is applied for each channel signal before it is perceptually encoded (16). The gain values are transferred in a differential manner as side information. However, for starting decoding of such streamed compressed HOA data frame representation absolute gain values are required, which should be coded with a minimum number of bits. For determining such lowest integer number ($\beta_e$) of bits the HOA data frame representation (C(k)) is rendered in spatial domain to virtual loudspeaker signals lying on a unit sphere, followed by normalisation of the HOA data frame representation (C(k)). Then the lowest integer number of bits is set to $\beta_e = \lceil \log_2(\lceil \log_2(\sqrt{K_{\mathrm{MAX}}} \cdot O) \rceil + 1) \rceil$.
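The bit-count formula at the end of this abstract can be evaluated directly. The following is a minimal sketch, not the patented procedure: the function name `gain_exponent_bits` and the sample values for K_MAX and O are illustrative assumptions, since the abstract does not state how K_MAX is obtained.

```python
import math


def gain_exponent_bits(k_max: float, num_coeffs: int) -> int:
    """Evaluate beta_e = ceil(log2(ceil(log2(sqrt(K_MAX) * O)) + 1)).

    k_max      -- the quantity K_MAX from the abstract (value assumed here)
    num_coeffs -- the number O of HOA coefficient sequences
    """
    if k_max <= 0 or num_coeffs <= 0:
        raise ValueError("k_max and num_coeffs must be positive")
    # Inner ceiling: ceil(log2(sqrt(K_MAX) * O)).
    inner = math.ceil(math.log2(math.sqrt(k_max) * num_coeffs))
    # Outer ceiling: number of bits needed to code the values 0..inner.
    return math.ceil(math.log2(inner + 1))


if __name__ == "__main__":
    # Hypothetical inputs chosen only to exercise the formula.
    for k_max, o in [(1, 4), (2, 25), (4, 16)]:
        print(f"K_MAX={k_max}, O={o}: beta_e={gain_exponent_bits(k_max, o)}")
```

Because of the nested logarithm, the result grows very slowly with O, which is consistent with the abstract's aim of coding the absolute gain values with few bits.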
-
64.
Publication No.: US10515645B2
Publication date: 2019-12-24
Application No.: US16457501
Filing date: 2019-06-28
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Florian Keiler, Sven Kordon, Alexander Krueger
IPC: G10L19/008, H04S3/00, H04S3/02
Abstract: The present invention relates to methods and apparatus for encoding an HOA signal representation ($c(t)$) of a sound field having an order of $N$ and a number $O=(N+1)^2$ of coefficient sequences to a mezzanine HOA signal representation ($w_{\mathrm{MEZZ}}(t)$) that consists of an arbitrary number $I$ of virtual loudspeaker signals $w_{\mathrm{MEZZ},1}(t), w_{\mathrm{MEZZ},2}(t), \ldots, w_{\mathrm{MEZZ},I}(t)$. The present invention further relates to methods and apparatus for decoding a reconstructed HOA signal representation from the mezzanine HOA signal representation.
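The relation between the HOA order N and the number O of coefficient sequences stated in this abstract is simple to tabulate. The sketch below is illustrative only; the function name `num_hoa_coefficients` is an assumption and does not come from the patent.

```python
def num_hoa_coefficients(order: int) -> int:
    """Return O = (N + 1)^2, the number of HOA coefficient sequences for order N."""
    if order < 0:
        raise ValueError("HOA order must be non-negative")
    return (order + 1) ** 2


if __name__ == "__main__":
    # The coefficient count grows quadratically with the order.
    for n in range(1, 5):
        print(f"N={n}: O={num_hoa_coefficients(n)}")
```

For example, order 1 gives 4 coefficient sequences and order 4 gives 25. The number I of virtual loudspeaker signals in the mezzanine representation is described as arbitrary and is therefore not tied to O by this relation.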
-
65.
Publication No.: US20190297443A1
Publication date: 2019-09-26
Application No.: US16379091
Filing date: 2019-04-09
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Sven Kordon, Alexander Krueger
IPC: H04S3/00, G10L19/008
Abstract: Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore, compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. The ambient HOA component is represented by a minimum number of HOA coefficient sequences. The remaining channels contain either directional signals or additional coefficient sequences of the ambient HOA component, depending on what will result in optimum perceptual quality. This processing can change on a frame-by-frame basis.
-
66.
Publication No.: US10390164B2
Publication date: 2019-08-20
Application No.: US15927985
Filing date: 2018-03-21
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Alexander Krüger, Sven Kordon, Johannes Boehm, Johann-Markus Batke
Abstract: A method and apparatus for decompressing a Higher Order Ambisonics (HOA) signal representation is disclosed. The apparatus includes an input interface that receives an encoded directional signal and an encoded ambient signal and an audio decoder that perceptually decodes the encoded directional signal and encoded ambient signal to produce a decoded directional signal and a decoded ambient signal, respectively. The apparatus further includes an extractor for obtaining side information related to the directional signal and an inverse transformer for converting the decoded ambient signal from a spatial domain to an HOA domain representation of the ambient signal. The apparatus also includes a synthesizer for recomposing a Higher Order Ambisonics (HOA) signal from the HOA domain representation of the ambient signal and the decoded directional signal. The side information includes a direction of the directional signal selected from a set of uniformly spaced directions.
-
67.
Publication No.: US10341802B2
Publication date: 2019-07-02
Application No.: US15768695
Filing date: 2016-11-11
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Alexander Krueger, Johannes Boehm, Sven Kordon, Xiaoming Chen, Stefan Abeling, Florian Keiler, Holger Kropp
Abstract: Currently there is no simple and satisfying way to create 3D audio from existing 2D content. The conversion from 2D to 3D sound should spatially redistribute the sound from existing channels. From a multi-channel 2D audio input signal (x(k)(t)) a 3D sound representation is generated which includes an HOA representation Formula (I) and channel object signals Formula (II) scaled from channels of the 2D audio input signal. Additional signals Formula (III) placed in the 3D space are generated by scaling (21, 222; 41, 422; Formula (IV)) channels from the 2D audio input signal and by decorrelating (24, 25; 44, 45, 451; Formula (V)) a scaled version of a mix of channels from the 2D audio input signal, whereby spatial positions for the additional signals are predetermined. The additional signals Formula (III) are converted (27; 47) to a HOA representation Formula (I).
-
68.
Publication No.: US10334382B2
Publication date: 2019-06-25
Application No.: US15891606
Filing date: 2018-02-08
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Sven Kordon, Alexander Krueger, Oliver Wuebbolt
IPC: H04S3/00, G10L19/008, G10L19/24
Abstract: A method for compressing a HOA signal being an input HOA representation with input time frames ($C(k)$) of HOA coefficient sequences comprises spatial HOA encoding of the input time frames and subsequent perceptual encoding and source encoding. Each input time frame is decomposed (802) into a frame of predominant sound signals ($X_{\mathrm{PS}}(k-1)$) and a frame of an ambient HOA component ($\tilde{C}_{\mathrm{AMB}}(k-1)$). The ambient HOA component ($\tilde{C}_{\mathrm{AMB}}(k-1)$) comprises, in a layered mode, first HOA coefficient sequences of the input HOA representation ($c_n(k-1)$) in lower positions and second HOA coefficient sequences ($c_{\mathrm{AMB},n}(k-1)$) in remaining higher positions. The second HOA coefficient sequences are part of an HOA representation of a residual between the input HOA representation and the HOA representation of the predominant sound signals.
-
69.
Publication No.: US10262663B2
Publication date: 2019-04-16
Application No.: US15509596
Filing date: 2015-09-25
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Alexander Krueger, Sven Kordon, Florian Keiler
IPC: H04S3/02, G10L19/008
Abstract: The invention is suited for improving a low bit rate compressed and decompressed Higher Order Ambisonics HOA signal representation of a sound field, wherein the decompression provides a spatially sparse decoded HOA representation and a set of indices of coefficient sequences of this representation. From reconstructed signals of the original HOA representation a number of modified phase spectra signals are created using de-correlation filters, which modified phase spectra signals are uncorrelated with the signals of said original representation. The modified phase spectra signals are mixed with each other using predetermined mixing parameters, in order to provide a replicated ambient HOA component. Finally the spatially sparse decoded HOA representation is enhanced with the replicated time domain HOA representation.
-
70.
Publication No.: US10038965B2
Publication date: 2018-07-31
Application No.: US15435175
Filing date: 2017-02-16
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Alexander Krueger, Sven Kordon, Johannes Boehm
IPC: H04R5/00, H04S7/00, H04S3/00, G10L19/008
CPC classification number: H04S7/302, G10L19/008, H04S3/008, H04S2400/01, H04S2420/11
Abstract: The invention improves HOA sound field representation compression. The HOA representation is analyzed for the presence of dominant sound sources and their directions are estimated. Then the HOA representation is decomposed into a number of dominant directional signals and a residual component. This residual component is transformed into the discrete spatial domain in order to obtain general plane wave functions at uniform sampling directions, which are predicted from the dominant directional signals. Finally, the prediction error is transformed back to the HOA domain and represents the residual ambient HOA component for which an order reduction is performed, followed by perceptual encoding of the dominant directional signals and the residual component.
-