Abstract:
Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “.1” indicates a single low-frequency effects (LFE) channel and “.2” indicates two LFE channels.
Abstract:
An embodiment of an apparatus for computing control information for a suppression filter for filtering a second audio signal to suppress an echo based on a first audio signal includes a computer having a value determiner for determining at least one energy-related value for a band-pass signal of at least two temporally successive data blocks of at least one signal of a group of signals. The computer further includes a mean value determiner for determining at least one mean value of the at least one determined energy-related value for the band-pass signal. The computer further includes a modifier for modifying the at least one energy-related value for the band-pass signal on the basis of the determined mean value for the band-pass signal. The computer further includes a control information computer for computing the control information for the suppression filter on the basis of the at least one modified energy-related value.
Abstract:
The present invention includes an audio signal receiving unit receiving the audio signal having a plurality of channel signals including an ambient component signal and a source component signal; an ambient component signal extracting unit extracting the ambient component signal of each of the channels based on correlation between the channel signals; an ambient component signal modifying unit modifying the ambient component signal using surround effect information; a source component signal extracting unit extracting the source component signal of each of the channels based on the correlation between the channel signals; a first signal output unit outputting the modified ambient component signal and the source component signal; and a second signal output unit outputting the audio signal or the source component signal. Accordingly, in an apparatus for decoding an audio signal and method thereof according to the present invention, an ambient component signal is extracted and modified based on correlation and the modified ambient and source component signals are outputted using different signal output units, respectively. Therefore, the present invention enhances a stereo effect of the audio signal. And, a signal output unit for outputting an ambient component signal is arranged to have an output direction different from that of another signal output unit for outputting a source component signal, whereby a listener can be provided with an audio signal of which ambient sound is enhanced.
Abstract:
For a multi-channel audio signal, parametric coding is applied to different subsets of audio input channels for different frequency regions. For example, for a 5.1 surround sound signal having five regular channels and one low-frequency (LFE) channel, binaural cue coding (BCC) can be applied to all six audio channels for sub-bands at or below a specified cut-off frequency, but to only five audio channels (excluding the LFE channel) for sub-bands above the cut-off frequency. Such frequency-based coding of channels can reduce the encoding and decoding processing loads and/or size of the encoded audio bitstream relative to parametric coding techniques that are applied to all input channels over the entire frequency range.
Abstract:
A method of decoding an audio signal is disclosed, The present invention includes the steps of receiving the audio signal having a plurality of channel signals including an ambient component signal and a source component signal, extracting the ambient component signal and the source component signal of each of the channels based on correlation between the channel signals, modifying the ambient component signal using surround effect information, and generating the audio signal including a plurality of channels using the modified ambient component signal and the source component signal. Accordingly, in an apparatus for decoding an audio signal and method thereof according to the present invention, an ambient component signal is extracted and modified based on correlation and the modified ambient and source component signals are outputted using different signal output units, respectively. Therefore, the present invention enhances a stereo effect of the audio signal. And, a signal output unit for outputting a ambient component signal is arranged to have an output direction different from that of another signal output unit for outputting a source component signal, whereby a listener can be provided with an audio signal of which ambient sound is enhanced.
Abstract:
A scheme for stereo and multi-channel synthesis of inter-channel correlation (ICC) (normalized cross-correlation) cues for parametric stereo and multi-channel coding. The scheme synthesizes ICC cues such that they approximate those of the original. For that purpose, diffuse audio channels are generated and mixed with the transmitted combined (e.g., sum) signal(s). The diffuse audio channels are preferably generated using relatively long filters with exponentially decaying Gaussian impulse responses. Such impulse responses generate diffuse sound similar to late reverberation. An alternative implementation for reduced computational complexity is proposed, where inter-channel level difference (ICLD), inter-channel time difference (ICTD), and ICC synthesis are all carried out in the domain of a single short-time Fourier transform (STFT), including the filtering for diffuse sound generation.
Abstract:
A method and apparatus are disclosed for controlling a buffer in a digital audio broadcasting (DAB) communication system. An audio encoder marks a frame as “dropped” whenever a buffer overflow might occur. Only a small number of bits are utilized to process a lost frame, thereby preventing the buffer from overflowing and allowing the encoder buffer-level to quickly recover from the potential overflow condition. The audio encoder optionally sets a flag that provides an indication to the receivers that a frame has been lost. If a “frame lost” condition is detected by a receiver, the receiver can optionally employ mitigation techniques to reduce the impact of the lost frame(s).
Abstract:
The purpose of the invention is to bridge the gap between parametric multi-channel audio coding and matrixed-surround multi-channel coding by gradually improving the sound of an up-mix signal while raising the bit-rate consumed by the side-information starting from 0 up to the bit-rates of the parametric methods. More specifically, it provides a method of flexibly choosing an “operating point” somewhere between matrixed-surround (no side-information, limited audio quality) and fully parametric reconstruction (full side-information rate required, good quality). This operating point can be chosen dynamically (i.e. varying over time) and in response to the permissible side-information rate, as it is dictated by the individual application.
Abstract:
A binaural cue coding scheme in which cue codes are derived from the transmitted audio signal. In one embodiment, an encoder downmixes C input channels to generate E transmitted channels, where C>E>1. A decoder derives cue codes from the transmitted channels and uses those cue codes to synthesize playback channels. For example, in one 5-to-2 BCC embodiment, the encoder downmixes a 5-channel surround signal to generate left and right channels of a stereo signal. The decoder derives stereo cues from the transmitted stereo signal, maps those stereo cues to surround cues, and applies the surround cues to the transmitted stereo channels to generate playback channels of a 5-channel synthesized surround signal.
Abstract:
An input audio signal having an input temporal envelope is converted into an output audio signal having an output temporal envelope. The input temporal envelope of the input audio signal is characterized. The input audio signal is processed to generate a processed audio signal, wherein the processing de-correlates the input audio signal. The processed audio signal is adjusted based on the characterized input temporal envelope to generate the output audio signal, wherein the output temporal envelope substantially matches the input temporal envelope.