摘要:
The present research relates to controlling rendering of multi-object or multi-channel audio signals. The present research provides a method and apparatus for controlling rendering of multi-object or multi-channel audio signals based on spatial cues in a process of decoding the multi-object or multi-channel audio signals. To achieve the purpose, the method suggested in the research controls rendering in a spatial cue domain in the process of decoding the multi-object or multi-channel audio signals.
摘要:
Provided is an apparatus and method for coding and decoding multi-object audio signals with various channels and providing backward compatibility with a conventional spatial audio coding (SAC) bitstream. The apparatus includes: an audio object coding unit for coding audio-object signals inputted to the coding apparatus based on a spatial cue and creating rendering information for the coded audio-object signals, where the rendering information provides a coding apparatus including spatial cue information for audio-object signals; channel information of the audio-object signals; and identification information of the audio-object signals, and used in coding and decoding of the audio signals.
摘要:
The present invention discloses an encoding apparatus using a Discrete Cosine Transform (DCT) scanning, which includes a mode selection means for selecting an optimal mode for intra prediction; an intra prediction means for performing intra prediction onto video inputted based on the mode selected in the mode selection means; a DCT and quantization means for performing DCT and quantization onto residual coefficients of a block outputted from the intra prediction means; and an entropy encoding means for performing entropy encoding onto DCT coefficients acquired from the DCT and quantization by using a scanning mode decided based on pixel similarity of the residual coefficients.
摘要:
Provided is a method for controlling synchronizing downmix signals and MPEG surround side information signals by controlling a delay according to the kind of downmix audio signals in an MPEG surround decoder. When multi-channel audio signals are restored using an HE-AAC decoder and a low-power MPEG surround decoder and complex QMF signals outputted from the HE-AAC decoder are used as downmix signals, a delay unit compensates for a delay caused in a real-to-complex converter. Anther delay unit delays spatial parameters to compensate for a delay caused in QMF and Nyquist banks when time-domain downmix signals are used. Also, when multi-channel audio signals are restored using an HE-AAC decoder and a high-quality MPEG surround decoder and complex QMF signals outputted from the HE-AAC decoder are used as downmix signals, a delay unit compensates for a delay caused in a real-to-complex converter.
摘要:
Provided are an apparatus and method for coding and decoding a multi-object audio signal. The apparatus includes a down-mixer for down-mixing the audio signals into one down-mixed audio signal and extracting supplementary information including header information and spatial cue information for each of the audio signals, a coder for coding the down-mixed audio signal, and a supplementary information coder for generating the supplementary information as a bit stream. The header information includes identification information for each of the audio signals and channel information for the audio signals.
摘要:
The present invention discloses an encoding apparatus using a Discrete Cosine Transform (DCT) scanning, which includes a mode selection means for selecting an optimal mode for intra prediction; an intra prediction means for performing intra prediction onto video inputted based on the mode selected in the mode selection means; a DCT and quantization means for performing DCT and quantization onto residual coefficients of a block outputted from the intra prediction means; and an entropy encoding means for performing entropy encoding onto DCT coefficients acquired from the DCT and quantization by using a scanning mode decided based on pixel similarity of the residual coefficients.
摘要:
The present invention discloses an encoding apparatus using a Discrete Cosine Transform (DCT) scanning, which includes a mode selection means for selecting an optimal mode for intra prediction; an intra prediction means for performing intra prediction onto video inputted based on the mode selected in the mode selection means; a DCT and quantization means for performing DCT and quantization onto residual coefficients of a block outputted from the intra prediction means; and an entropy encoding means for performing entropy encoding onto DCT coefficients acquired from the DCT and quantization by using a scanning mode decided based on pixel similarity of the residual coefficients.
摘要:
The present research relates to controlling rendering of multi-object or multi-channel audio signals. The present research provides a method and apparatus for controlling rendering of multi-object or multi-channel audio signals based on spatial cues in a process of decoding the multi-object or multi-channel audio signals. To achieve the purpose, the method suggested in the research controls rendering in a spatial cue domain in the process of decoding the multi-object or multi-channel audio signals.
摘要:
The present invention discloses an encoding apparatus using a Discrete Cosine Transform (DCT) scanning, which includes a mode selection means for selecting an optimal mode for intra prediction; an intra prediction means for performing intra prediction onto video inputted based on the mode selected in the mode selection means; a DCT and quantization means for performing DCT and quantization onto residual coefficients of a block outputted from the intra prediction means; and an entropy encoding means for performing entropy encoding onto DCT coefficients acquired from the DCT and quantization by using a scanning mode decided based on pixel similarity of the residual coefficients.
摘要:
Provided is a method and apparatus for generating a side information bitstream of a multi-object audio signal. The apparatus for generating a side information bitstream of a multi-object audio signal includes a spatial cue information input unit configured to receive spatial cue information generated in an encoder of the multi-object audio signal, a preset information input unit configured to receive preset information for the multi-object audio signal, and a side information bitstream generator configured to generate the side information bitstream based on the spatial cue information and the preset information. The side information bitstream includes a header region and a frame region, and the preset information is included in the frame region.