Abstract:
An adaptive time/frequency-based encoding mode determination apparatus including a time domain feature extraction unit to generate a time domain feature by analysis of a time domain signal of an input audio signal, a frequency domain feature extraction unit to generate a frequency domain feature corresponding to each frequency band generated by division of a frequency domain corresponding to a frame of the input audio signal into a plurality of frequency domains, by analysis of a frequency domain signal of the input audio signal, and a mode determination unit to determine any one of a time-based encoding mode and a frequency-based encoding mode, with respect to the each frequency band, by use of the time domain feature and the frequency domain feature.
Abstract:
A surround sound virtualization apparatus and method. The surround sound virtualization apparatus may include an audio decoder to perform head-related transfer function (HRTF) filtering, and a time delay unit to provide a time delay to a plurality of output signals of the audio decoder.
Abstract:
An error concealment method and apparatus for an audio signal and a decoding method and apparatus for an audio signal using the error concealment method and apparatus. The error concealment method includes selecting one of an error concealment in a frequency domain and an error concealment in a time domain as an error concealment scheme for a current frame based on a predetermined criteria when an error occurs in the current frame, selecting one of a repetition scheme and an interpolation scheme in the frequency domain as the error concealment scheme for the current frame based on a predetermined criteria when the error concealment in the frequency domain is selected, and concealing the error of the current frame using the selected scheme.
Abstract:
A virtual speaker apparatus and a virtual speaker processing method is disclosed. The virtual speaker apparatus uses the closest surround speaker when a virtual speaker to be virtually generated is a back left virtual speaker or a back right virtual speaker.
Abstract:
A speech compression apparatus including: a first band-transform unit transforming a wideband speech signal to a narrowband low-band speech signal; a narrowband speech compressor compressing the narrowband low-band speech signal and outputting a result of the compressing as a low-band speech packet; a decompression unit decompressing the low-band speech packet and obtaining a decompressed wideband low-band speech signal; an error detection unit detecting an error signal that corresponds to a difference between the wideband speech signal and the decompressed wideband low-band speech signal; and a high-band speech compression unit compressing the error signal and a high-band speech signal of the wideband speech signal and outputting the result of the compressing as a high-band speech packet.
Abstract:
A speech compression apparatus including: a first band-transform unit transforming a wideband speech signal to a narrowband low-band speech signal; a narrowband speech compressor compressing the narrowband low-band speech signal and outputting a result of the compressing as a low-band speech packet; a decompression unit decompressing the low-band speech packet and obtaining a decompressed wideband low-band speech signal; an error detection unit detecting an error signal that corresponds to a difference between the wideband speech signal and the decompressed wideband low-band speech signal; and a high-band speech compression unit compressing the error signal and a high-band speech signal of the wideband speech signal and outputting the result of the compressing as a high-band speech packet.
Abstract:
Encoding and/or decoding a wideband signal produces high frequency band spectra from low frequency band spectral information. Linear prediction filter coefficients are determined for the entire wideband spectrum of an input signal. An energy value in each of a plurality of sub-bands in the high frequency band is determined and encoded. The short-term correlation removed input signal is then down-sampled to form a low frequency band signal. At a decoder, the high frequency band signal is generated using the encoded low frequency band signal. The energy in each sub-band of the high frequency band is adjusted using the encoded energy value. Thus, the spectral envelope for the entire wideband signal is synthesized and decoded using linear predictive synthesis.
Abstract:
A method of multi-path trellis coded quantization (TCQ) usable in a speech coding system, and a quantizer using the method. Specifically the method includes calculating accumulated distortions corresponding to 2N survivor paths, wherein N indicates an integer greater than two, each of the 2N survivor paths is going towards one of nodes at an i th stage of a trellis, and i indicates an integer greater than zero, comparing the accumulated distortions respectively corresponding to the 2N survivor paths to select N paths among the 2N survivor paths, wherein the accumulated distortions corresponding to selected N paths are smaller than the accumulated distortions corresponding to unselected N paths establishing the selected N paths as survivor paths going toward an i+1 th stage, and selecting an optimal path among the 2N survivor paths corresponding to each node of a last stage.
Abstract:
An apparatus to compress a wide-band speech signal, the apparatus including a narrow-band speech compressor to compress a low-band speech signal of the wide-band speech signal and output the compressed low-band speech signal as a low-band speech packet; and a high-band speech compressor to compress a high-band speech signal of the wide-band speech signal using energy information of the low-band speech signal provided from the narrow-band speech compressor, and outputs the compressed high-band speech signal as a high-band speech packet.
Abstract:
A method and apparatus to determine an encoding mode of an audio signal, and a method and apparatus to encode an audio signal according to the encoding mode. In the encoding mode determination method, a mode determination threshold for the current frame that is subject to encoding mode determination is adaptively adjusted according to a long-term feature of the audio signal for a frame (the current frame) that is subject to encoding mode determination, thereby improving the hit rate of encoding mode determination and signal classification, suppressing frequent oscillation of an encoding mode in frame units, improving noise tolerance, and improving smoothness of a reconstructed audio signal.