摘要:
In the conventional art inventions for coding multi-channel audio signals, three of the major processes involved are: generation of a reverberation signal using an all-pass filter; segmentation of a signal in the time and frequency domains for the purpose of level adjustment; and mixing of a coded binaural signal with an original signal coded up to a fixed crossover frequency. These processes pose the problems mentioned in the present invention. The present invention proposes the following three embodiments: to control the extent of reverberations by dynamically adjusting all-pass filter coefficients with the inter-channel coherence cues; to segment a signal in the time domain finely in the lower frequency region and coarsely in the higher frequency region; and to control a crossover frequency used for mixing based on a bit rate, and if the original signal is coarsely quantized, to mix a downmix signal with an original signal in proportions determined by an inter-channel coherence cue.
摘要:
An audio encoder, which is capable of encoding multiple-channel signals so that only a downmixed signal is decoded and of further generating specific auxiliary information necessary for dividing the downmixed signal, is provided. An audio encoder (10), which compresses and encodes audio signals of N-channels (N>1), includes a downmixed signal encoding unit (11) which encodes the downmixed signal obtained by downmixing the audio signals, and an auxiliary information generation unit (12a) which generates auxiliary information necessary for decoding the downmixed signal encoded by the downmixed signal encoding unit (11) into N-channel audio signals. The auxiliary information generation unit (12a) includes transformation units (121) and (122) which transform audio signals respectively into frequency domain signals, a detection unit (123) which detects phase difference information and gain ratio information each indicates a degree of difference between frequency domain signals, and a quantization unit (125) which quantizes, for each frequency band, the phase difference information and gain ratio information detected by the detection unit (123) using the quantization precision setting table (124). The quantization precision setting table (124) functions as a division unit which divides a frequency band of a frequency domain signal into plural sub-bands.
摘要:
An audio signal encoding device includes a downmix signal encoding unit 203 and an auxiliary information generation unit 204. The downmix signal encoding unit 203 generates a downmix signal acquired by adding input signals each other using a predetermined method, encodes the downmix signal, and outputs downmix signal information 206. The auxiliary information generation unit 204 generates auxiliary information 205 using the downmix signal and the downmix signal information 206 generated by the downmix signal encoding unit 203. The auxiliary information generation unit 204 efficiently quantizes the auxiliary information 205 using human's characteristics of a perceptual direction of a sound source, a perceptual broadening, and a perceptual distance.