Abstract:
Provided are a scalable encoding method, apparatus, and medium. The method includes: encoding a base layer, and encoding a first enhancement layer and a second enhancement layer in a frame having the base layer; and generating an encoded frame by synthesizing the encoded results. Accordingly, as long as the loss in the encoded frame is not so great that the encoded first enhancement layer is damaged, speech restoration for partial frequency bands does not have to be given up. Furthermore, since an encoder divides the second enhancement layer into a plurality of layers in a horizontal or vertical direction, considering a distribution pattern of data belonging to the second enhancement layer, and encodes first the layer, among the divided layers, in which a large amount of data is distributed, loss of audio information can be minimized even if a portion of the encoded second enhancement layer is damaged.
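The division and ordering of the second enhancement layer can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that the layer's data form a 2-D grid in which zero means "no data", that "horizontal" means splitting by rows and "vertical" by columns, and that the amount of data is the count of non-zero entries. The name `split_and_order` and its parameters are hypothetical and do not come from the abstract.

```python
def split_and_order(grid, direction, parts):
    """Split a 2-D grid into sub-layers and order them by amount of data.

    grid: list of rows; a zero entry means "no data" (assumption).
    direction: "horizontal" splits by rows, anything else by columns (assumption).
    parts: desired number of sub-layers.
    """
    if direction == "horizontal":
        units = [list(row) for row in grid]
    else:
        units = [list(col) for col in zip(*grid)]  # transpose to get columns

    size = -(-len(units) // parts)  # ceiling division
    layers = [units[i:i + size] for i in range(0, len(units), size)]

    def density(layer):
        # Number of non-zero entries in the sub-layer.
        return sum(1 for unit in layer for value in unit if value != 0)

    # Sub-layers holding the most data come first in the encoding order.
    return sorted(layers, key=density, reverse=True)


grid = [
    [1, 0, 0, 0],
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [1, 1, 1, 1],
]
by_columns = split_and_order(grid, "vertical", 2)
by_rows = split_and_order(grid, "horizontal", 2)
```

With this ordering, a truncation that cuts off the tail of the encoded second enhancement layer discards the sparsest sub-layer first.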
Abstract:
An apparatus and method encode audio data, and an apparatus and method decode encoded audio data. An audio data encoding apparatus includes: a scalable encoding unit dividing audio data into a plurality of layers, representing the audio data in predetermined numbers of bits in each of the plurality of layers, and encoding a lower layer prior to encoding an upper layer and an upper bit of each layer prior to encoding a lower bit of each layer; an SBR encoding unit generating spectral band replication (SBR) data that has information with respect to audio data in a frequency band of frequencies equal to or greater than a predetermined frequency among the audio data to be encoded, and encoding the SBR data; and a bitstream production unit generating a bitstream using the encoded SBR data and the encoded audio data corresponding to a predetermined bitrate.
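The encoding order described above (lower layers before upper layers and, within each layer, upper bits before lower bits) can be illustrated with a minimal Python sketch. The function name `bit_sliced_order`, the layer boundaries, and the per-layer bit widths are assumptions made for illustration; the abstract does not specify them, and neither entropy coding nor the SBR data is shown.

```python
def bit_sliced_order(coeffs, layer_bounds, bits_per_layer):
    """Return (layer, bit_position, bits) tuples in emission order.

    coeffs: non-negative integer magnitudes of quantized audio data (assumption).
    layer_bounds: (start, end) index ranges, lowest layer first (assumption).
    bits_per_layer: number of bits used to represent the data in each layer.
    """
    order = []
    for layer, ((start, end), nbits) in enumerate(zip(layer_bounds, bits_per_layer)):
        # Within a layer, walk from the most significant bit to the least.
        for bit in range(nbits - 1, -1, -1):
            plane = [(c >> bit) & 1 for c in coeffs[start:end]]
            order.append((layer, bit, plane))
    return order


example = bit_sliced_order([5, 2, 1, 3], [(0, 2), (2, 4)], [3, 2])
```

Truncating the resulting list at any point keeps the lowest layers and their most significant bits, which is what makes the stream scalable to a target bitrate.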
Abstract:
A device and method for encoding/decoding a speech signal and an audio signal. The device for encoding the speech signal and the audio signal includes a speech encoding unit which speech-encodes an input signal; a speech decoding unit which speech-decodes the speech-encoded signal; and an audio encoding unit which divides a difference signal between the speech-decoded signal and the input signal into a low band and a high band, allocates bits to the divided bands, and audio-encodes the difference signal.
Abstract:
A method and apparatus to scalably encode and/or decode an audio signal includes encoding a specific band signal included in an input signal, encoding a frequency envelope of an excited signal in which the encoded specific band signal is removed from the input signal, encoding a residual signal in which the encoded frequency envelope is removed from the excited signal, and forming a bitstream by scalably packing the encoded specific band signal, frequency envelope, and residual signal.
Abstract:
An apparatus and method encode audio data, and an apparatus and method decode encoded audio data. An audio data encoding apparatus includes: a scalable encoding unit dividing audio data into a plurality of layers, representing the audio data in predetermined numbers of bits in each of the plurality of layers, and encoding a lower layer prior to encoding an upper layer and an upper bit of each layer prior to encoding a lower bit of each layer; an SBR encoding unit generating spectral band replication (SBR) data that has information with respect to audio data in a frequency band of frequencies equal to or greater than a predetermined frequency among the audio data to be encoded, and encoding the SBR data; and a bitstream production unit generating a bitstream using the encoded SBR data and the encoded audio data corresponding to a predetermined bitrate.
Abstract:
A method and apparatus to encode/decode an audio signal, in which a bit rate for each bit plane can be controlled. A method of encoding an audio signal for each of a plurality of bit planes can include dividing the audio signal into a plurality of frequency bands and encoding the bit planes of the frequency bands from a low frequency band to a high frequency band, wherein, in the encoding of the bit planes of the frequency bands, the bit planes are encoded from the most significant bit (MSB) to the least significant bit (LSB) within bits allocated for the frequency bands, and when there are allocated bits remaining after the encoding of the currently encoded frequency band, un-encoded bit planes corresponding to the MSB in a frequency band that has the fewest encoded bit planes among frequency bands with a lower frequency than the currently encoded frequency band are encoded using the remaining allocated bits. Accordingly, when encoding/decoding an audio signal, an encoding sequence of bit planes is determined so that the part of the audio signal that most significantly affects audio quality during decoding is encoded first, thereby reducing audio quality deterioration at a low bit rate.
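The bit-plane ordering rule can be sketched in Python as follows. The cost model (one bit per coefficient per bit plane), the name `plan_bit_planes`, and the restriction to lower bands whose next plane still fits in the leftover bits are assumptions made for illustration only; the abstract does not define how a plane's cost is measured.

```python
def plan_bit_planes(band_sizes, planes_per_band, band_budgets):
    """Return the order in which (band, plane) pairs are encoded.

    Assumptions (not from the abstract): each bit plane of a band costs one
    bit per coefficient, and plane index 0 denotes the MSB plane.
    """
    encoded = [0] * len(band_sizes)  # bit planes encoded so far, per band
    order = []
    for band, budget in enumerate(band_budgets):
        cost = band_sizes[band]
        # Encode this band's planes from MSB towards LSB within its budget.
        while encoded[band] < planes_per_band[band] and budget >= cost:
            order.append((band, encoded[band]))
            encoded[band] += 1
            budget -= cost
        # Spend leftover bits on lower-frequency bands, always choosing the
        # band with the fewest encoded planes and taking its next MSB plane.
        while True:
            candidates = [
                b for b in range(band)
                if encoded[b] < planes_per_band[b] and band_sizes[b] <= budget
            ]
            if not candidates:
                break
            target = min(candidates, key=lambda b: encoded[b])
            order.append((target, encoded[target]))
            encoded[target] += 1
            budget -= band_sizes[target]
    return order


plan = plan_bit_planes([4, 4, 4], [3, 3, 3], [4, 8, 16])
```

In this example the third band finishes with four bits to spare, and those bits go to the lowest band, which had only its MSB plane encoded.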
Abstract:
Provided are a scalable encoding method, apparatus, and medium. The method includes: encoding a base layer, and encoding a first enhancement layer and a second enhancement layer in a frame having the base layer; and generating an encoded frame by synthesizing the encoded results. Accordingly, as long as the loss in the encoded frame is not so great that the encoded first enhancement layer is damaged, speech restoration for partial frequency bands does not have to be given up. Furthermore, since an encoder divides the second enhancement layer into a plurality of layers, considering a distribution pattern of data belonging to the second enhancement layer, and encodes first the layer, among the divided layers, in which a large amount of data is distributed, loss of audio information can be minimized even if a portion of the encoded second enhancement layer is damaged.
Abstract:
Provided are an audio encoding method and apparatus capable of fast bit rate control. The audio encoding method includes: converting audio sampling data into frequency domain data; adjusting a scalefactor value in each predetermined frequency band based on the available bits and the allowed distortion of a psychoacoustic model to allocate a number of necessary bits to the frequency domain data and quantize the frequency domain data; and generating a bitstream based on the quantized data. The quantizing of the frequency domain data includes: obtaining the available bits for the frequency domain data; obtaining a common scalefactor value for which the number of used bits is not larger than the number of available bits, using a difference between the available bits and the used bits, to quantize the audio data; calculating quantization noise in each predetermined quantization band; and adjusting a scalefactor value of a quantization band in which the quantization noise exceeds the allowed distortion of the psychoacoustic model to quantize the audio data.
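The two quantization stages described above (a common scalefactor search driven by the gap between used and available bits, followed by a per-band noise check) can be sketched in Python. Everything concrete here is an assumption for illustration: the bit-cost model in `count_bits`, the multiplicative step update, the uniform quantizer, and the choice to halve the step of a band that is too noisy. The sketch does not re-check the bit budget after the per-band adjustment.

```python
def count_bits(quantized):
    # Hypothetical cost model: bit length of each coefficient's magnitude.
    return sum(abs(q).bit_length() for q in quantized)


def quantize(values, step):
    # Uniform quantizer used only for this illustration.
    return [round(v / step) for v in values]


def rate_control(bands, available_bits, allowed_distortion, step=1.0):
    """bands: one non-empty list of frequency-domain values per band."""
    flat = [v for band in bands for v in band]

    # Stage 1: grow the common step until the used bits fit the budget,
    # scaling each update by the relative excess of used over available bits.
    used = count_bits(quantize(flat, step))
    while used > available_bits:
        step *= 1.0 + (used - available_bits) / used
        used = count_bits(quantize(flat, step))

    # Stage 2: measure quantization noise per band and refine noisy bands.
    band_steps = []
    for band, limit in zip(bands, allowed_distortion):
        q = quantize(band, step)
        noise = sum((v - qi * step) ** 2 for v, qi in zip(band, q)) / len(band)
        band_steps.append(step / 2 if noise > limit else step)
    return step, band_steps


bands = [[10.0, -7.5], [0.4, 0.3]]
common_step, band_steps = rate_control(bands, 4, [2.0, 0.001])
```

Using the size of the bit overshoot to scale each update lets the search take larger steps when far from the budget, which is one plausible reading of how the difference between available and used bits speeds up rate control.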
Abstract:
A method, medium, and apparatus for converting compressed audio data, including decoding compressed audio input data, in accordance with a corresponding compression format, coding a result of the decoding, in accordance with a predetermined compression format, and combining a result of the coding with the side information to generate audio output data to be compressed according to the predetermined compression format.
Abstract:
A digital signal encoding method and apparatus using a plurality of lookup tables. The method includes: preparing a plurality of lookup tables storing, in a predetermined number of addresses, numbers of allocated bits for encoding frequency bands of an input signal according to a characteristic of the input signal; dividing an input signal in the time domain into signals in predetermined frequency bands; calculating address values of the frequency bands; selecting one of the plurality of lookup tables according to the characteristic of the input signal; extracting, from the selected lookup table, the numbers of allocated bits stored at the addresses having the calculated address values and allocating those numbers of bits to the frequency bands; and generating a bitstream by quantizing the input signal according to the numbers of allocated bits. Bit amount control suited to a characteristic of an input signal can be performed by extracting the numbers of allocated bits of the frequency bands from an optimal lookup table selected according to that characteristic. Also, additional computation time can be reduced by using, as the characteristic of the input signal, the occupancy rate of each frequency band that equals each address of the lookup table.
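The lookup-based allocation can be sketched in Python as follows. The sketch assumes, for illustration only, that a band's occupancy rate is its share of the total energy, that the address is this share scaled onto the table length, and that the tables are keyed by a characteristic label supplied by the caller. The name `allocate_bits`, the labels `"tonal"` and `"noisy"`, and the table contents are hypothetical.

```python
def allocate_bits(band_energies, tables, characteristic):
    """Look up the number of allocated bits for each frequency band.

    band_energies: non-negative energy per frequency band (assumed input).
    tables: dict mapping a characteristic label to a list of bit counts
            indexed by address; all tables share one length (assumption).
    characteristic: key selecting the table; how it is derived from the
            input signal is left to the caller.
    """
    table = tables[characteristic]
    total = sum(band_energies) or 1.0  # avoid division by zero on silence
    last = len(table) - 1
    # Address: the band's share of total energy scaled onto the table range.
    addresses = [min(last, int(e / total * len(table))) for e in band_energies]
    return [table[a] for a in addresses]


tables = {
    "tonal": [0, 2, 4, 6, 8, 10, 12, 14],
    "noisy": [0, 1, 2, 3, 4, 5, 6, 7],
}
tonal_bits = allocate_bits([6.0, 1.0, 1.0], tables, "tonal")
noisy_bits = allocate_bits([6.0, 1.0, 1.0], tables, "noisy")
```

Because allocation reduces to one table read per band, no iterative bit-allocation loop is needed at encode time.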