METHOD AND APPARATUS FOR GENERATING FROM A COEFFICIENT DOMAIN REPRESENTATION OF HOA SIGNALS A MIXED SPATIAL/COEFFICIENT DOMAIN REPRESENTATION OF SAID HOA SIGNALS

    Publication No.: EP4456567A2

    Publication date: 2024-10-30

    Application No.: EP24190333.5

    Filing date: 2014-06-24

    IPC classification: H04S3/00

    Abstract: Higher Order Ambisonics (HOA) signals have two representations: spatial domain and coefficient domain. The invention generates from a coefficient domain representation a mixed spatial/coefficient domain representation, wherein the number of said HOA signals can be variable. A vector of coefficient domain signals is separated into a vector of coefficient domain signals having a constant number of HOA coefficients and a vector of coefficient domain signals having a variable number of HOA coefficients. The constant-number HOA coefficients vector is transformed to a corresponding spatial domain signal vector. To facilitate high-quality coding without creating signal discontinuities, the variable-number HOA coefficients vector of coefficient domain signals is adaptively normalised and multiplexed with the vector of spatial domain signals.
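
The processing chain in this abstract (split, transform, normalise, multiplex) can be sketched roughly as follows. This is only an illustration under assumptions: the function names, the per-frame list-of-channels layout, the peak-based target gain, and the linear gain ramp are all hypothetical stand-ins and are not taken from the patent.

```python
def normalise_frame(samples, prev_gain, limit=1.0):
    """Scale one channel towards a peak of `limit`, ramping the gain
    linearly from `prev_gain` so the gain itself has no jump at the
    frame boundary (hypothetical smoothing rule)."""
    peak = max((abs(s) for s in samples), default=0.0)
    target = min(1.0, limit / peak) if peak > 0 else 1.0
    n = len(samples)
    scaled = [
        s * (prev_gain + (target - prev_gain) * (i + 1) / n)
        for i, s in enumerate(samples)
    ]
    return scaled, target


def to_mixed_frame(frame, n_const, transform, prev_gains):
    """frame: list of channels, each a list of samples.
    The first `n_const` channels are always present; the rest may vary
    in number from frame to frame."""
    const_part, var_part = frame[:n_const], frame[n_const:]
    n_samples = len(const_part[0])

    # Coefficient domain -> spatial domain for the constant-number part:
    # one matrix-vector product per time sample.
    spatial = [
        [sum(row[c] * const_part[c][t] for c in range(n_const))
         for t in range(n_samples)]
        for row in transform
    ]

    # Adaptive normalisation of the variable-number part.
    mixed, gains = list(spatial), []
    for idx, channel in enumerate(var_part):
        start_gain = prev_gains[idx] if idx < len(prev_gains) else 1.0
        scaled, gain = normalise_frame(channel, start_gain)
        mixed.append(scaled)   # multiplex after the spatial signals
        gains.append(gain)
    return mixed, gains
```

With a linear ramp the scaled signal can still exceed `limit` partway through the frame, so a practical scheme would need look-ahead or a different transition function.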

    LAYERED CODING AND DATA STRUCTURE FOR COMPRESSED HIGHER-ORDER AMBISONICS SOUND OR SOUND FIELD REPRESENTATIONS

    Publication No.: EP4411732A2

    Publication date: 2024-08-07

    Application No.: EP24175983.6

    Filing date: 2016-10-07

    IPC classification: G10L19/008

    CPC classification: G10L19/008

    Abstract: The present document relates to a method of layered encoding of a frame of a compressed higher-order Ambisonics (HOA) representation of a sound or sound field. The compressed HOA representation comprises a plurality of transport signals. The method comprises assigning the plurality of transport signals to a plurality of hierarchical layers, the plurality of layers including a base layer and one or more hierarchical enhancement layers; generating, for each layer, a respective HOA extension payload including side information for parametrically enhancing a reconstructed HOA representation obtainable from the transport signals assigned to the respective layer and any layers lower than the respective layer; assigning the generated HOA extension payloads to their respective layers; and signaling the generated HOA extension payloads in an output bitstream. The present document further relates to a method of decoding a frame of a compressed HOA representation of a sound or sound field, an encoder and a decoder for layered coding of a compressed HOA representation, and a data structure representing a frame of a compressed HOA representation of a sound or sound field.
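
A minimal sketch of the layer assignment described in this abstract is given below. The names `assign_layers`, `build_layered_frame` and `make_payload`, the contiguous split by `layer_sizes`, and the dictionary layout of a frame are assumptions for illustration; the actual bitstream syntax is not specified in the abstract.

```python
def assign_layers(transport_signals, layer_sizes):
    """Split the transport signals into contiguous groups:
    index 0 is the base layer, higher indices are enhancement layers."""
    layers, start = [], 0
    for size in layer_sizes:
        layers.append(transport_signals[start:start + size])
        start += size
    if start != len(transport_signals):
        raise ValueError("layer sizes must cover all transport signals")
    return layers


def build_layered_frame(transport_signals, layer_sizes, make_payload):
    """Attach one HOA extension payload per layer. Each payload is derived
    from the signals of that layer and of all lower layers, because that
    is what a decoder stopping at this layer can reconstruct."""
    frame, available = [], []
    for idx, signals in enumerate(assign_layers(transport_signals, layer_sizes)):
        available = available + signals
        frame.append({
            "layer": idx,
            "signals": signals,
            "hoa_ext_payload": make_payload(available),
        })
    return frame
```

For example, six transport signals split as `[2, 2, 2]` give a base layer whose payload is computed from two signals and a top enhancement layer whose payload is computed from all six.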

    LAYERED CODING FOR COMPRESSED SOUND OR SOUND FIELD REPRESENTATIONS

    Publication No.: EP3992963A1

    Publication date: 2022-05-04

    Application No.: EP21201640.6

    Filing date: 2016-10-07

    Abstract: The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation. The method comprises sub-dividing the plurality of components into a plurality of groups of components and assigning each of the plurality of groups to a respective one of a plurality of hierarchical layers, the number of groups corresponding to the number of layers, and the plurality of layers including a base layer and one or more hierarchical enhancement layers, adding the basic side information to the base layer, and determining a plurality of portions of enhancement side information from the enhancement side information and assigning each of the plurality of portions of enhancement side information to a respective one of the plurality of layers, wherein each portion of enhancement side information includes parameters for improving a reconstructed sound representation obtainable from data included in the respective layer and any layers lower than the respective layer. The document further relates to a method of decoding a compressed sound representation of a sound or sound field, wherein the compressed sound representation is encoded in a plurality of hierarchical layers that include a base layer and one or more hierarchical enhancement layers, as well as to an encoder and a decoder for layered coding of a compressed sound representation.
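
On the decoding side, one plausible reading is that a receiver uses every layer up to the highest one it has received without a gap, together with the enhancement side information of that highest layer. The sketch below follows that reading; the function name, the dictionary keys, and the gap rule are hypothetical and are not taken from the patent.

```python
def select_for_decoding(layers, received):
    """layers: list ordered base layer first. Each entry is a dict with
    'components' and 'enh_side_info'; the base layer also carries
    'basic_side_info'. received: set of layer indices that arrived."""
    highest = -1
    for idx in range(len(layers)):
        if idx not in received:
            break          # a missing layer makes higher layers unusable
        highest = idx
    if highest < 0:
        raise ValueError("base layer missing, nothing can be decoded")

    # Components of the highest usable layer and of all lower layers.
    components = [c for layer in layers[:highest + 1]
                  for c in layer["components"]]
    return (components,
            layers[0]["basic_side_info"],
            layers[highest]["enh_side_info"])
```

Because each portion of enhancement side information is tailored to its own layer plus the layers below it, only the portion belonging to the highest usable layer is selected here.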

    VALUE RANGE FOR HOA AND MODIFICATION OF GAIN CONTROL FOR PARAMETERS FOR HOA COMPRESSION

    Publication No.: EP3860154A1

    Publication date: 2021-08-04

    Application No.: EP21159478.3

    Filing date: 2015-06-22

    IPC classification: H04S3/02; G10L19/008

    Abstract: When compressing an HOA data frame representation, a gain control (15, 151) is applied to each channel signal before it is perceptually encoded (16). The gain values are transferred differentially as side information. However, to start decoding such a streamed compressed HOA data frame representation, absolute gain values are required, which should be coded with a minimum number of bits. To determine this lowest integer number of bits ($\beta_e$), the HOA data frame representation ($C(k)$) is rendered in the spatial domain to virtual loudspeaker signals lying on a unit sphere, followed by normalisation of the HOA data frame representation ($C(k)$). The lowest integer number of bits is then set to $\beta_e = \lceil \log_2 ( \lceil \log_2 ( K_{\mathrm{MAX}} \cdot O ) \rceil + 1 ) \rceil$.
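
The reason absolute gain values are needed at a decoding start point can be seen in a small sketch of differential coding. Everything here is an assumption for illustration: gains are modelled as integer exponents, `absolute_every` is a hypothetical refresh interval, and the `("abs", ...)` / `("diff", ...)` tuples stand in for whatever the real side-information syntax is.

```python
def encode_gains(exponents, absolute_every):
    """Send an absolute gain exponent every `absolute_every` frames and
    the difference to the previous frame otherwise."""
    stream, prev = [], None
    for k, e in enumerate(exponents):
        if k % absolute_every == 0:
            stream.append(("abs", e))
        else:
            stream.append(("diff", e - prev))
        prev = e
    return stream


def decode_gains(stream):
    """Reconstruct the exponents. Frames seen before the first absolute
    value cannot be resolved and are reported as None."""
    current, decoded = None, []
    for kind, value in stream:
        if kind == "abs":
            current = value
        elif current is None:
            decoded.append(None)
            continue
        else:
            current += value
        decoded.append(current)
    return decoded
```

A decoder that joins the stream between two absolute values has to skip frames until the next one arrives, which is why the absolute values are worth sending and why keeping their bit width small matters.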

    METHOD AND APPARATUS FOR GENERATING FROM AN HOA SIGNAL REPRESENTATION A MEZZANINE HOA SIGNAL REPRESENTATION

    Publication No.: EP3739578A1

    Publication date: 2020-11-18

    Application No.: EP20179680.2

    Filing date: 2016-07-29

    IPC classification: G10L19/008; H04S3/02

    Abstract: From an HOA signal representation ($c(t)$) of a sound field having an order of $N$ and a number $O = (N+1)^2$ of coefficient sequences, a mezzanine HOA signal representation ($W_{\mathrm{MEZZ}}(t)$) is generated that consists of an arbitrary number $I < O$ of virtual loudspeaker signals $W_{\mathrm{MEZZ},1}(t), W_{\mathrm{MEZZ},2}(t), \ldots, W_{\mathrm{MEZZ},I}(t)$. $O$ directions are computed which are nearly uniformly distributed on the unit sphere. The mode vectors with respect to these directions are linearly weighted to construct a matrix, the pseudo-inverse of which is multiplied with the HOA signal representation ($c(t)$) in order to form (11) the mezzanine HOA signal representation ($W_{\mathrm{MEZZ}}(t)$).
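
As a worked illustration of the quantities in this abstract, the relations can be written as below. The symbol $\Psi$ for the matrix built from the linearly weighted mode vectors is a name assumed here for illustration and does not appear in the abstract; $\Psi^{+}$ denotes its pseudo-inverse.

```latex
% Number of coefficient sequences for HOA order N, with a numeric example
O = (N+1)^2, \qquad \text{e.g. } N = 4 \;\Rightarrow\; O = 25

% Mezzanine representation formed by multiplying with the pseudo-inverse
W_{\mathrm{MEZZ}}(t) = \Psi^{+}\, c(t)
```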

    METHOD AND APPARATUS FOR GENERATING FROM A COEFFICIENT DOMAIN REPRESENTATION OF HOA SIGNALS A MIXED SPATIAL/COEFFICIENT DOMAIN REPRESENTATION OF SAID HOA SIGNALS

    Publication No.: EP3518235A1

    Publication date: 2019-07-31

    Application No.: EP18205365.2

    Filing date: 2014-06-24

    IPC classification: G10L19/008; H04S3/00

    Abstract: Higher Order Ambisonics (HOA) signals have two representations: spatial domain and coefficient domain. The invention generates from a coefficient domain representation a mixed spatial/coefficient domain representation, wherein the number of said HOA signals can be variable. A vector of coefficient domain signals is separated into a vector of coefficient domain signals having a constant number of HOA coefficients and a vector of coefficient domain signals having a variable number of HOA coefficients. The constant-number HOA coefficients vector is transformed to a corresponding spatial domain signal vector. To facilitate high-quality coding without creating signal discontinuities, the variable-number HOA coefficients vector of coefficient domain signals is adaptively normalised and multiplexed with the vector of spatial domain signals.

    LAYERED CODING FOR COMPRESSED SOUND OR SOUND FIELD REPRESENTATIONS

    Publication No.: EP3360133A1

    Publication date: 2018-08-15

    Application No.: EP16778365.3

    Filing date: 2016-10-07

    IPC classification: G10L19/008

    Abstract: The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation. The method comprises sub-dividing the plurality of components into a plurality of groups of components and assigning each of the plurality of groups to a respective one of a plurality of hierarchical layers, the number of groups corresponding to the number of layers, and the plurality of layers including a base layer and one or more hierarchical enhancement layers, adding the basic side information to the base layer, and determining a plurality of portions of enhancement side information from the enhancement side information and assigning each of the plurality of portions of enhancement side information to a respective one of the plurality of layers, wherein each portion of enhancement side information includes parameters for improving a reconstructed sound representation obtainable from data included in the respective layer and any layers lower than the respective layer. The document further relates to a method of decoding a compressed sound representation of a sound or sound field, wherein the compressed sound representation is encoded in a plurality of hierarchical layers that include a base layer and one or more hierarchical enhancement layers, as well as to an encoder and a decoder for layered coding of a compressed sound representation.