Methods and apparatus for compressing and decompressing a higher order ambisonics representation

    Publication No.: US10264382B2

    Publication date: 2019-04-16

    Application No.: US15876442

    Filing date: 2018-01-22

    Abstract: Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. The ambient HOA component is represented by a minimum number of HOA coefficient sequences. The remaining channels contain either directional signals or additional coefficient sequences of the ambient HOA component, depending on what will result in optimum perceptual quality. This processing can change on a frame-by-frame basis.

    Method and apparatus for compressing and decompressing a higher order ambisonics signal representation

    Patent type: invention grant (status: in force)

    Publication No.: US09454971B2

    Publication date: 2016-09-27

    Application No.: US14400039

    Filing date: 2013-05-06

    Abstract: Higher Order Ambisonics (HOA) represents a complete sound field in the vicinity of a sweet spot, independent of loudspeaker set-up. The high spatial resolution requires a high number of HOA coefficients. In the invention, dominant sound directions are estimated and the HOA signal representation is decomposed into dominant directional signals in time domain and related direction information, and an ambient component in HOA domain, followed by compression of the ambient component by reducing its order. The reduced-order ambient component is transformed to the spatial domain, and is perceptually coded together with the directional signals. At receiver side, the encoded directional signals and the order-reduced encoded ambient component are perceptually decompressed, the perceptually decompressed ambient signals are transformed to an HOA domain representation of reduced order, followed by order extension. The total HOA representation is recomposed from the directional signals, the corresponding direction information, and the original-order ambient HOA component.
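The order reduction and order extension steps named in this abstract can be illustrated with a minimal sketch. It relies on the standard fact that a full HOA representation of order N has (N+1)^2 coefficient sequences. Everything else is an assumption made for illustration: the coefficient sequences are taken to be sorted by ascending order (as in ACN ordering), so that truncation keeps the low orders, and order extension is taken to be zero-padding, which the abstract does not specify. The function names are hypothetical.

```python
def num_hoa_coeffs(order):
    """Number of coefficient sequences in a full HOA representation of this order."""
    if order < 0:
        raise ValueError("order must be non-negative")
    return (order + 1) ** 2


def reduce_order(frame, reduced_order):
    """Keep only the coefficient sequences up to reduced_order.

    frame is a list of coefficient sequences (one list of samples each),
    assumed to be sorted by ascending HOA order.
    """
    needed = num_hoa_coeffs(reduced_order)
    if len(frame) < needed:
        raise ValueError("frame has fewer coefficients than the reduced order needs")
    return [list(seq) for seq in frame[:needed]]


def extend_order(frame, full_order):
    """Extend a reduced-order frame to full_order by appending zero sequences.

    Zero-padding is an assumption; the abstract only says 'order extension'.
    """
    target = num_hoa_coeffs(full_order)
    if len(frame) > target:
        raise ValueError("frame already exceeds the requested order")
    n_samples = len(frame[0]) if frame else 0
    padding = [[0.0] * n_samples for _ in range(target - len(frame))]
    return [list(seq) for seq in frame] + padding


# Order-2 ambient frame: 9 coefficient sequences of 2 samples each.
ambient = [[float(i), float(i) + 0.5] for i in range(9)]
reduced = reduce_order(ambient, 1)    # 4 sequences remain
restored = extend_order(reduced, 2)   # 9 sequences again, the last 5 are zero
```

In this sketch the information in the higher-order sequences is lost after reduction; in the scheme described above, the directional signals carry the dominant spatial detail instead.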


    Method and apparatus for compressing and decompressing a higher order ambisonics signal representation

    Publication No.: US12245012B2

    Publication date: 2025-03-04

    Application No.: US18487280

    Filing date: 2023-10-16

    Abstract: A method and apparatus for decompressing a Higher Order Ambisonics (HOA) signal representation is disclosed. The apparatus includes an input interface that receives an encoded directional signal and an encoded ambient signal and an audio decoder that perceptually decodes the encoded directional signal and encoded ambient signal to produce a decoded directional signal and a decoded ambient signal, respectively. The apparatus further includes an extractor for obtaining side information related to the directional signal and an inverse transformer for converting the decoded ambient signal from a spatial domain to an HOA domain representation of the ambient signal. The apparatus also includes a synthesizer for recomposing a Higher Order Ambisonics (HOA) signal from the HOA domain representation of the ambient signal and the decoded directional signal. The side information includes a direction of the directional signal selected from a set of uniformly spaced directions.

    Methods and apparatus for determining for decoding a compressed HOA sound representation

    Publication No.: US11875803B2

    Publication date: 2024-01-16

    Application No.: US17733757

    Filing date: 2022-04-29

    CPC classification number: G10L19/008 H04S7/30 H04S2400/11 H04S2420/11

    Abstract: When compressing an HOA data frame representation, a gain control (15, 151) is applied to each channel signal before it is perceptually encoded (16). The gain values are transferred differentially as side information. However, to start decoding such a streamed compressed HOA data frame representation, absolute gain values are required, which should be coded with a minimum number of bits. To determine this lowest integer number of bits ($\beta_e$), the HOA data frame representation ($C(k)$) is rendered in the spatial domain to virtual loudspeaker signals lying on a unit sphere, followed by normalisation of the HOA data frame representation ($C(k)$). Then the lowest integer number of bits is set to $\beta_e = \left\lceil \log_2\left( \left\lceil \log_2\left( \sqrt{K_{\mathrm{MAX}}} \cdot O \right) \right\rceil + 1 \right) \right\rceil$.
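The bit-count formula at the end of this abstract can be evaluated directly. The sketch below is only an illustration: the abstract does not define K_MAX or O, so the code treats K_MAX as a positive constant supplied by the caller and assumes O is the number of HOA coefficient sequences, (N+1)^2 for order N. The function name is hypothetical.

```python
import math


def lowest_gain_bits(k_max, num_coeffs):
    """Evaluate beta_e = ceil(log2(ceil(log2(sqrt(K_MAX) * O)) + 1)).

    k_max      -- the constant K_MAX (assumed to be given, must be positive)
    num_coeffs -- O, assumed here to be the number of HOA coefficient sequences
    """
    if k_max <= 0 or num_coeffs < 1:
        raise ValueError("k_max must be positive and num_coeffs at least 1")
    inner = math.ceil(math.log2(math.sqrt(k_max) * num_coeffs))
    if inner < 0:
        # The outer logarithm needs a positive argument (inner + 1 > 0).
        raise ValueError("sqrt(k_max) * num_coeffs is too small for the formula")
    return math.ceil(math.log2(inner + 1))


# Example with assumed values: order N = 3 gives O = 16, and K_MAX = 1.5.
bits = lowest_gain_bits(1.5, 16)   # 3
```

With these assumed inputs the inner ceiling is 5, so the result is the ceiling of log2(6), which is 3 bits.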

    Methods and apparatus for decoding encoded HOA signals

    Publication No.: US11863958B2

    Publication date: 2024-01-02

    Application No.: US18081956

    Filing date: 2022-12-15

    CPC classification number: H04S3/008 G10L19/008 H04S2420/11

    Abstract: There are two representations for Higher Order Ambisonics (HOA): spatial domain and coefficient domain. The invention generates from a coefficient domain representation a mixed spatial/coefficient domain representation, wherein the number of said HOA signals can be variable. An aspect of the invention further relates to methods and apparatus for decoding multiplexed and perceptually encoded HOA signals, including transforming a vector of PCM encoded spatial domain signals of the HOA representation to a corresponding vector of coefficient domain signals by multiplying the vector of PCM encoded spatial domain signals with a transform matrix and de-normalizing the vector of PCM encoded and normalized coefficient domain signals, wherein said de-normalizing comprises. The methods may include combining a vector of coefficient domain signals and the vector of de-normalized coefficient domain signals to determine a combined vector of HOA coefficient domain signals that can have a variable number of HOA coefficients.

    Methods and apparatus for decoding a compressed HOA signal

    Publication No.: US11462222B2

    Publication date: 2022-10-04

    Application No.: US16892154

    Filing date: 2020-06-03

    Abstract: Methods and apparatus for decoding a compressed Higher Order Ambisonics (HOA) representation of a sound or soundfield. The method may include receiving a bitstream containing the compressed HOA representation and decoding, based on a determination that there are multiple layers, the compressed HOA representation from the bitstream to obtain a sequence of decoded HOA representations. A first subset of the sequence of decoded HOA representations is determined based only on corresponding ambient HOA components. A second subset of the sequence of decoded HOA representations is determined based on corresponding ambient HOA components and corresponding predominant sound components. For a frame $k$, the sequence of decoded HOA representations is represented at least in part by

    $$\hat{\tilde{c}}_n(k-1) = \begin{cases} \hat{c}_{\mathrm{AMB},n}(k-1) & \text{for } n \text{ in the first subset} \\ \hat{c}_n(k-1) = \hat{c}_{\mathrm{PS},n}(k-1) + \hat{c}_{\mathrm{AMB},n}(k-1) & \text{for } n \text{ in the second subset} \end{cases}$$

    where $\hat{c}_{\mathrm{AMB},n}(k-1)$ corresponds to the corresponding ambient HOA components and $\hat{c}_{\mathrm{PS},n}(k-1)$ corresponds to the corresponding predominant sound components.
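The two-case rule in this abstract can be sketched per coefficient index n: indices in the first subset take the ambient component alone, and the remaining indices take the sum of the predominant sound and ambient components. The sketch is illustrative only. The function name, the 0-based indexing, and the treatment of every index outside the first subset as belonging to the second subset are assumptions, not details from the abstract.

```python
def combine_layers(ambient, predominant, first_subset):
    """Combine decoded components for one frame, one coefficient index at a time.

    ambient      -- list of ambient HOA coefficient sequences, indexed by n
    predominant  -- list of predominant sound coefficient sequences, indexed by n
    first_subset -- set of indices n decoded from the ambient component only
    Indices not in first_subset are treated as the second subset (assumption).
    """
    if len(ambient) != len(predominant):
        raise ValueError("both components need one sequence per coefficient index")
    combined = []
    for n, amb_seq in enumerate(ambient):
        if n in first_subset:
            # First subset: ambient component only.
            combined.append(list(amb_seq))
        else:
            # Second subset: predominant sound plus ambient, sample by sample.
            combined.append([p + a for p, a in zip(predominant[n], amb_seq)])
    return combined


# Three coefficient indices, two samples each; index 0 is in the first subset.
amb = [[1.0, 2.0], [0.5, 0.5], [0.0, 1.0]]
ps = [[9.0, 9.0], [1.0, -1.0], [2.0, 2.0]]
frame = combine_layers(amb, ps, first_subset={0})
# frame == [[1.0, 2.0], [1.5, -0.5], [2.0, 3.0]]
```

Note that the predominant sound values for index 0 are ignored in this example, which mirrors the first case of the rule.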
