摘要:
Provided is a scalable encoding method, apparatus, and medium. The method includes: encoding a base layer and encoding a first enhancement layer and a second enhancement layer in a frame having the base layer; and generating an encoded frame by synthesizing the encoded results. Accordingly, only if the loss of the encoding frame is not as great as the encoded first enhancement layer is damaged, a case where speech restoration with respect to partial frequency bands must be given up does not occur. Furthermore, since an encoder divides the second enhancement layer into a plurality of layers in a horizontal or vertical direction, considering a distribution pattern of data belonging to the second enhancement layer and first encodes a layer in which lots of data are distributed among the divided layers, loss of audio information can be minimized even if a portion of the encoded second enhancement layer is damaged.
摘要:
An apparatus and method encode audio data, and an apparatus and method decode encoded audio data. An audio data encoding apparatus includes: a scalable encoding unit dividing audio data into a plurality of layers, representing the audio data in predetermined numbers of bits in each of the plurality of layers, and encoding a lower layer prior to encoding an upper layer and an upper bit of each layer prior to encoding a lower bit of each layer; an SBR encoding unit generating spectral band replication (SBR) data that has information with respect to audio data in a frequency band of frequencies equal to or greater than a predetermined frequency among the audio data to be encoded, and encoding the SBR data; and a bitstream production unit generating a bitstream using the encoded SBR data and the encoded audio data corresponding to a predetermined bitrate.
摘要:
A device and method for encoding/decoding a speech signal and an audio signal. The device for encoding the speech signal and the audio signal includes a speech encoding unit which speech-encodes an input signal; an speech decoding unit which speech-decodes the speech-encoded signal; and an audio encoding unit which divides a difference signal between the speech-decoded signal and the input signal into a low band and a high band, allocates the number of bits to the divided bands, and audio-encodes the difference signal.
摘要:
A method and apparatus to scalably encode and/or decode an audio signal includes encoding a specific band signal included in an input signal, encoding a frequency envelope of an excited signal in which the encoded specific band signal is removed from the input signal, encoding a residual signal in which the encoded frequency envelope is removed from the excited signal, and forming a bit-stream by scalably packing the encoded specific band signal, frequency envelop, and residual signal.
摘要:
An apparatus and method encode audio data, and an apparatus and method decode encoded audio data. An audio data encoding apparatus includes: a scalable encoding unit dividing audio data into a plurality of layers, representing the audio data in predetermined numbers of bits in each of the plurality of layers, and encoding a lower layer prior to encoding an upper layer and an upper bit of each layer prior to encoding a lower bit of each layer; an SBR encoding unit generating spectral band replication (SBR) data that has information with respect to audio data in a frequency band of frequencies equal to or greater than a predetermined frequency among the audio data to be encoded, and encoding the SBR data; and a bitstream production unit generating a bitstream using the encoded SBR data and the encoded audio data corresponding to a predetermined bitrate.
摘要:
A method and apparatus to encode/decode an audio signal, in which a bit rate for each bit plane can be controlled. A method of encoding an audio signal for each of a plurality of bit plane can include dividing the audio signal into a plurality of frequency bands and encoding the bit planes of the frequency bands from a low frequency band to a high frequency band, wherein, in the encoding the bit planes of the frequency bands, the bit planes are encoded from the most significant bit (MSB) to the least significant bit (LSB) within bits allocated for the frequency bands, and when there are allocated bits remaining after the encoding of the currently encoded frequency band, un-encoded bit planes corresponding to the MSB in a frequency band that has the fewest encoded bit planes among frequency bands with a lower frequency than the currently encoded frequency band are encoded using the remaining allocated bits. Accordingly, when encoding/decoding an audio signal, an encoding sequence of bit planes is determined so that an audio signal that significantly affects audio quality during decoding is first encoded, thereby reducing audio quality deterioration at a low bit rate.
摘要:
Provided is a scalable encoding method, apparatus, and medium. The method includes: encoding a base layer and encoding a first enhancement layer and a second enhancement layer in a frame having the base layer; and generating an encoded frame by synthesizing the encoded results. Accordingly, only if the loss of the encoding frame is not as great as the encoded first enhancement layer is damaged, a case where speech restoration with respect to partial frequency bands must be given up does not occur. Furthermore, since an encoder divides the second enhancement layer into a plurality of layers considering a distribution pattern of data belonging to the second enhancement layer and first encodes a layer in which lots of data are distributed among the divided layers, loss of audio information can be minimized even if a portion of the encoded second enhancement layer is damaged.
摘要:
Provided are an audio encoding method and apparatus capable of fast bit rate control. The audio encoding method includes: converting audio sampling data into frequency domain data; adjusting a scalefactor value in each predetermined frequency band based on an available bits and allowed distortion of a psychoacoustic model to allocate a number of necessary bits to the frequency domain data and quantize the frequency domain data; and generating a bit stream based on the quantized data. The quantizing of the frequency domain data includes: obtaining the available bits for the frequency domain data; obtaining the common scalefactor value satisfying that the used bits is not larger than the available bits, using a difference the available bits and the used bits to quantize the audio data; calculating quantization noise in the each predetermined quantization band; and adjusting a scalefactor value of a quantization band in which the quantization noise exceeds the allowed distortion of the psychoacoustic model to quantize the audio data.
摘要:
A method and apparatus for generating a bitstream of an audio signal, in which an audio signal can be easily extended to a multichannel audio signal, the processing speed of an audio signal can be improved, and channel signals of an audio signal can be processed simultaneously, and an audio encoding/decoding method and apparatus using the method and apparatus. The method for generating a bitstream of an audio signal using an encoded audio signal and encoding information includes generating a flag indicating whether the encoded audio signal is a multichannel audio signal, generating a bitstream header including the generated flag, and generating the bitstream using the generated bitstream header and the encoded audio signal.
摘要:
A multichannel audio data encoding and/or decoding method and apparatus. The encoding method includes: encoding mono and/or stereo audio data; and encoding extended multichannel audio data other than the mono and/or stereo audio data. The decoding method includes: decoding mono and/or stereo audio data; examining whether there is extended multichannel audio data to be decoded other than the mono and/or stereo audio data; and when there is extended data to be decoded, decoding the extended multichannel audio data.