摘要:
In general, techniques are described for signaling channels for scalable coding of higher order ambisonic audio data. A device comprising a memory and a processor may be configured to perform the techniques. The memory may be configured to store the bitstream. The processor may be configured to obtain, from the bitstream, an indication of a number of channels specified in one or more layers in the bitstream, and obtain the channels specified in the one or more layers in the bitstream based on the indication of the number of channels.
摘要:
In general, techniques are described for coding of vectors decomposed from higher order ambisonic coefficients. A device comprising a processor and a memory may perform the techniques. The processor may be configured to obtain from a bitstream data indicative of a plurality of weight values that represent a vector that is included in a decomposed version of the plurality of HOA coefficients. Each of the weight values may correspond to a respective one of a plurality of weights in a weighted sum of code vectors that represents the vector and that includes a set of code vectors. The processor may further be configured to reconstruct the vector based on the weight values and the code vectors. The memory may be configured to store the reconstructed vector.
摘要:
In general, techniques are described for coding higher-order ambisonic coefficients during multiple transitions. A device comprising a processor and a memory coupled to the processor may be configured to perform the techniques. The processor may be configured to obtain a multi-transition indication of whether an ambient HOA coefficient is in transition during a same frame of the bitstream as a foreground audio signal is in transition. The processor may also be configured to obtain a vector that describes a spatial characteristic of a corresponding foreground audio signal based on the multi-transition indication, both the vector and the corresponding HOA audio signal decomposed from the HOA audio data. The memory may be configured to store the vector.
摘要:
A device obtains an object-based representation of an audio signal of an audio object. The audio signal corresponds to a time interval. Additionally, the device obtains a representation of a spatial vector for the audio object, wherein the spatial vector is defined in a Higher-Order Ambisonics (HOA) domain and is based on a first plurality of loudspeaker locations. The device generates, based on the audio signal of the audio object and the spatial vector, a plurality of audio signals. Each respective audio signal of the plurality of audio signals corresponds to a respective loudspeaker in a plurality of local loudspeakers at the second plurality of loudspeaker locations different from the first plurality of loudspeaker locations.
摘要:
A device comprising a memory and one or more processors may be configured extract, from the bitstream, a type of quantization mode. The one or more processors may also be configured to switch, based on the type of quantization mode, between non-predictive vector dequantization to reconstruct a first set of one or more weights used to approximate the multi-directional V-Vector in the higher order ambisonics domain, and predictive vector dequantization to reconstruct a second set of one or more weights used to approximate the multi-directional V-Vector in the higher order ambisonics domain. The memory may be configured to store the reconstructed first set of one or more weights used to approximate the multi-directional V-Vector in the higher order ambisonics domain, and the reconstructed second set of one or more weights used to approximate the multi-directional V-Vector in the higher order ambisonics domain.
摘要:
In general, techniques are described for closed loop quantization of HOA coefficients that provide a three-dimensional representation of the sound field. An audio encoding device may perform closed loop quantization of an audio object based at least in part on a result of performing quantization of directional information associated with the audio object. An audio decoding device may obtain an audio object that has been closed loop quantized based at least in part on a result of performing quantization of directional information associated with the audio object, and may dequantize the audio object.