摘要:
A method and apparatus to scalably encode and/or decode an audio signal includes encoding a specific band signal included in an input signal, encoding a frequency envelope of an excited signal in which the encoded specific band signal is removed from the input signal, encoding a residual signal in which the encoded frequency envelope is removed from the excited signal, and forming a bit-stream by scalably packing the encoded specific band signal, frequency envelop, and residual signal.
摘要:
A speech signal compression and/or decompression method, medium, and apparatus in which the speech signal is transformed into the frequency domain for quantizing and dequantizing information of frequency coefficients. The speech signal compression apparatus includes a transform unit to transform a speech signal into the frequency domain and obtain frequency coefficients, a magnitude quantization unit to transform magnitudes of the frequency coefficients, quantize the transformed magnitudes and obtain magnitude quantization indices, a sign quantization unit to quantize signs of the frequency coefficients and obtain sign quantization indices, and a packetizing unit to generate the magnitude and sign quantization indices as a speech packet.
摘要:
A method and apparatus to search a codebook including pulses that model a predetermined component of a speech signal. The method includes the operations of selecting a predetermined number of paths corresponding to a predetermined number of pulse locations that are most consistent with the predetermined component, from among paths corresponding to pulse locations of a predetermined pulse location set allocated to at least one branch that connects one state of a predetermined Trellis structure to another state, performing the path selecting operation on each of states other than the one state, and selecting a path corresponding to pulse locations that are most consistent with the predetermined component from among paths including the selected paths, wherein each path corresponds to a union of plural tracks of an algebraic codebook. Accordingly, a number of calculations required during a codebook search is reduced.
摘要:
An audio coding terminal and method is provided. The terminal includes a coding mode setting unit to set an operation mode, from plural operation modes, for input audio coding by a codec, configured to code the input audio based on the set operation mode such that when the set operation mode is a high frame erasure rate (FER) mode the codec codes a current frame of the input audio according to a select frame erasure concealment (FEC) mode of one or more FEC modes. Upon the setting of the operation mode to be the High FER mode, the one FEC mode is selected, from the one or more FEC modes predetermined for the High FER mode, to control the codec by incorporating of redundancy within a coding of the input audio or as separate redundancy information separate from the coded input audio according to the selected one FEC mode.
摘要:
A speech signal compression and/or decompression method, medium, and apparatus in which the speech signal is transformed into the frequency domain for quantizing and dequantizing information of frequency coefficients. The speech signal compression apparatus includes a transform unit to transform a speech signal into the frequency domain and obtain frequency coefficients, a magnitude quantization unit to transform magnitudes of the frequency coefficients, quantize the transformed magnitudes and obtain magnitude quantization indices, a sign quantization unit to quantize signs of the frequency coefficients and obtain sign quantization indices, and a packetizing unit to generate the magnitude and sign quantization indices as a speech packet.
摘要:
A method and apparatus to quantize/dequantize frequency amplitude data and a method and apparatus to audio encode/decode using the method and apparatus to quantize/dequantize the frequency amplitude data. The method includes calculating and quantizing power of frequency amplitudes for each of a plurality of bands constituting an audio frame, normalizing frequency amplitude data for each of the bands using the quantized power, and quantizing a first one of even-numbered or odd-numbered data among the normalized frequency amplitude data. The method may further include interpolating frequency amplitude data that corresponds to a second one of the even-numbered or odd-numbered frequency amplitude that is not quantized from among the normalized frequency amplitude data using the quantized first one of the even-numbered or odd-numbered data, and quantizing an interpolation error corresponding to a difference between the second frequency amplitude data that is not quantized and the interpolated frequency amplitude data.
摘要:
An encoder and decoder to encode one or more input signals into a scalable codec and to decode the scalable codec, and encoding and decoding methods using a bitstream with a layered structure in the scalable codec change a top coding bit rate to encode the input signals according to a network status, and the bitstream is decoded by analyzing the top coding bit rate included in the bitstream.
摘要:
A method, apparatus, and medium for classifying a speech signal and a method, apparatus, and medium for encoding the speech signal using the same are provided. The method for classifying a speech signal includes calculating classification parameters from an input signal having block units, calculating a plurality of classification criteria from the classification parameters, and classifying the level of the input signal using the plurality of classification criteria. The classification parameters include at least one of an energy parameter of the input signal, a cross-correlation parameter between a specific block of a present frame and the input signal, and an integrated cross-correlation parameter obtained by accumulating the cross-correlation parameter.
摘要:
A method and apparatus to quantize/dequantize frequency amplitude data and a method and apparatus to audio encode/decode using the method and apparatus to quantize/dequantize the frequency amplitude data. The method includes calculating and quantizing power of frequency amplitudes for each of a plurality of bands constituting an audio frame, normalizing frequency amplitude data for each of the bands using the quantized power, and quantizing a first one of even-numbered or odd-numbered data among the normalized frequency amplitude data. The method may further include interpolating frequency amplitude data that corresponds to a second one of the even-numbered or odd-numbered frequency amplitude that is not quantized from among the normalized frequency amplitude data using the quantized first one of the even-numbered or odd-numbered data, and quantizing an interpolation error corresponding to a difference between the second frequency amplitude data that is not quantized and the interpolated frequency amplitude data.
摘要:
Audio coding and decoding apparatuses and methods that can optimize the quality of an audio signal including harmonics, and recording mediums storing the methods. An audio coding apparatus includes: a first harmonic coding module performing first harmonic coding on an input audio signal using a pitch lag of the input audio signal and producing a quantized linear prediction coding coefficient; a first detector detecting a first difference audio signal from a difference between an audio signal output from the first harmonic coding module and the input audio signal; a second harmonic coding module performing harmonic coding on the first difference audio signal using the quantized linear prediction coding coefficient and a previous harmonic coding result; a second detector detecting a second difference audio signal obtained from a difference between an audio signal output from the second harmonic coding module and the first difference audio signal; and a code excited linear prediction (CELP) module CELP coding the second difference audio signal using the quantized linear prediction coding coefficient obtained from the first harmonic coding module.