Method and device for decoding a higher-order ambisonics (HOA) representation of an audio soundfield

    公开(公告)号:US11451920B2

    公开(公告)日:2022-09-20

    申请号:US17189067

    申请日:2021-03-01

    Abstract: The invention discloses rendering sound field signals, such as Higher-Order Ambisonics (HOA), for arbitrary loudspeaker setups, where the rendering results in highly improved localization properties and is energy preserving. This is obtained by rendering an audio sound field representation for arbitrary spatial loudspeaker setups and/or by a a decoder that decodes based on a decode matrix (D). The decode matrix (D) is based on smoothing and scaling of a first decode matrix {circumflex over (D)} with smoothing coefficients. The first decode matrix {circumflex over (D)} is based on a mix matrix G and a mode matrix {tilde over (ψ)}, where the mix matrix G was determined based on L speakers and positions of a spherical modelling grid related to a HOA order N, and the mode matrix {tilde over (ψ)} was determined based on the spherical modelling grid and the HOA order N.

    METHODS, APPARATUS AND SYSTEMS FOR ENCODING AND DECODING OF MULTI-CHANNEL AMBISONICS AUDIO DATA

    公开(公告)号:US20220020382A1

    公开(公告)日:2022-01-20

    申请号:US17392210

    申请日:2021-08-02

    Abstract: Conventional audio compression technologies perform a standardized signal transformation, independent of the type of the content. Multi-channel signals are decomposed into their signal components, subsequently quantized and encoded. This is disadvantageous due to lack of knowledge on the characteristics of scene composition, especially for e.g. multi-channel audio or Higher-Order Ambisonics (HOA) content. A method for decoding an encoded bitstream of multi-channel audio data and associated metadata is provided, including transforming the first Ambisonics format of the multi-channel audio data to a second Ambisonics format representation of the multi-channel audio data, wherein the transforming maps the first Ambisonics format of the multi-channel audio data into the second Ambisonics format representation of the multi-channel audio data. A method for encoding multi-channel audio data that includes audio data in an Ambisonics format, wherein the encoding includes transforming the audio data in an Ambisonics format into encoded multi-channel audio data is also provided.

    Method and device for decoding an audio soundfield representation

    公开(公告)号:US11217258B2

    公开(公告)日:2022-01-04

    申请号:US16852459

    申请日:2020-04-18

    Abstract: Soundfield signals such as e.g. Ambisonics carry a representation of a desired sound field. The Ambisonics format is based on spherical harmonic decomposition of the soundfield, and Higher Order Ambisonics (HOA) uses spherical harmonics of at least 2nd order. However, commonly used loudspeaker setups are irregular and lead to problems in decoder design. A method for improved decoding an audio soundfield representation for audio playback comprises calculating a panning function (W) using a geometrical method based on the positions of a plurality of loudspeakers and a plurality of source directions, calculating a mode matrix (Ξ) from the loudspeaker positions, calculating a pseudo-inverse mode matrix (Ξ+) and decoding the audio soundfield representation. The decoding is based on a decode matrix (D) that is obtained from the panning function (W) and the pseudo-inverse mode matrix (Ξ+).

    METHOD AND DEVICE FOR DECODING A HIGHER-ORDER AMBISONICS (HOA) REPRESENTATION OF AN AUDIO SOUNDFIELD

    公开(公告)号:US20210258708A1

    公开(公告)日:2021-08-19

    申请号:US17189067

    申请日:2021-03-01

    Abstract: The invention discloses rendering sound field signals, such as Higher-Order Ambisonics (HOA), for arbitrary loudspeaker setups, where the rendering results in highly improved localization properties and is energy preserving. This is obtained by rendering an audio sound field representation for arbitrary spatial loudspeaker setups and/or by a a decoder that decodes based on a decode matrix (D). The decode matrix (D) is based on smoothing and scaling of a first decode matrix {circumflex over (D)} with smoothing coefficients. The first decode matrix {circumflex over (D)} is based on a mix matrix G and a mode matrix {tilde over (ψ)}, where the mix matrix G was determined based on L speakers and positions of a spherical modelling grid related to a HOA order N, and the mode matrix {tilde over (ψ)} was determined based on the spherical modelling grid and the HOA order N.

    Method and device for applying dynamic range compression to a higher order ambisonics signal

    公开(公告)号:US10893372B2

    公开(公告)日:2021-01-12

    申请号:US16857093

    申请日:2020-04-23

    Abstract: A method for performing DRC on a HOA signal comprises transforming the HOA signal to the spatial domain, analyzing the transformed HOA signal, and obtaining, from results of said analyzing, gain factors that are usable for dynamic compression. The gain factors can be transmitted together with the HOA signal. When applying the DRC, the HOA signal is transformed to the spatial domain, the gain factors are extracted and multiplied with the transformed HOA signal in the spatial domain, wherein a gain compensated transformed HOA signal is obtained. The gain compensated transformed HOA signal is transformed back into the HOA domain, wherein a gain compensated HOA signal is obtained. The DRC may be applied in the QMF-filter bank domain.

    Method and apparatus for generating 3D audio content from two-channel stereo content

    公开(公告)号:US10827295B2

    公开(公告)日:2020-11-03

    申请号:US16560733

    申请日:2019-09-04

    Abstract: For generating 3D audio content from a two-channel stereo signal, the stereo signal (x(t)) is partitioned into overlapping sample blocks and is transformed into time-frequency domain. From the stereo signal directional and ambient signal components are separated, wherein the estimated directions of the directional components are changed by a predetermined factor, wherein, if changes are within a predetermined interval, they are combined in order to form a directional centre channel object signal. For the other directions an encoding to Higher Order Ambisonics HOA is performed. Additional ambient signal channels are generated by de-correlation and rating by gain factors, followed by encoding to HOA. The directional HOA signals and the ambient HOA signals are combined, and the combined HOA signal and the centre channel object signals are transformed to time domain.

    Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field

    公开(公告)号:US10609501B2

    公开(公告)日:2020-03-31

    申请号:US16276363

    申请日:2019-02-14

    Abstract: The invention improves HOA sound field representation compression and decompression. A decoder decodes compressed dominant directional signals and compressed residual component signals so as to provide decompressed dominant directional signals and decompressed time domain signals representing a residual HOA component in a spatial domain. A re-correlator re-correlates the decompressed time domain signals to obtain a corresponding reduced-order residual HOA component. A processor determines a decompressed residual HOA component based on the corresponding reduced-order residual HOA component, and determines predicted directional signals based on at least a parameter. The processor is further configured to determine an HOA sound field representation based on the decompressed dominant directional signals, the predicted directional signals, and the decompressed residual HOA component.

    METHODS, APPARATUS AND SYSTEMS FOR ENCODING AND DECODING OF MULTI-CHANNEL AUDIO DATA

    公开(公告)号:US20190259396A1

    公开(公告)日:2019-08-22

    申请号:US16403224

    申请日:2019-05-03

    Abstract: Conventional audio compression technologies perform a standardized signal transformation, independent of the type of the content. Multi-channel signals are decomposed into their signal components, subsequently quantized and encoded. This is disadvantageous due to lack of knowledge on the characteristics of scene composition, especially for e.g. multi-channel audio or Higher-Order Ambisonics (HOA) content. A method for decoding an encoded bitstream of multi-channel audio data and associated metadata is provided, including transforming the first Ambisonics format of the multi-channel audio data to a second Ambisonics format representation of the multi-channel audio data, wherein the transforming maps the firstAmbisonics format of the multi-channel audio data into the second Ambisonics format representation of the multi-channel audio data. A method for encoding multi-channel audio data that includes audio data in an Ambisonics format, wherein the encoding includes transforming the encoded multi-channel audio data into a second format encoded multi-channel audio data is also provided.

Patent Agency Ranking