Scalable audio encoding and decoding apparatus, method, and medium
    1.
    发明授权
    Scalable audio encoding and decoding apparatus, method, and medium 有权
    可扩展音频编解码设备,方法和介质

    公开(公告)号:US08069048B2

    公开(公告)日:2011-11-29

    申请号:US11528314

    申请日:2006-09-28

    IPC分类号: G10L19/00

    CPC分类号: G10L19/24

    摘要: Provided is a scalable encoding method, apparatus, and medium. The method includes: encoding a base layer and encoding a first enhancement layer and a second enhancement layer in a frame having the base layer; and generating an encoded frame by synthesizing the encoded results. Accordingly, only if the loss of the encoding frame is not as great as the encoded first enhancement layer is damaged, a case where speech restoration with respect to partial frequency bands must be given up does not occur. Furthermore, since an encoder divides the second enhancement layer into a plurality of layers in a horizontal or vertical direction, considering a distribution pattern of data belonging to the second enhancement layer and first encodes a layer in which lots of data are distributed among the divided layers, loss of audio information can be minimized even if a portion of the encoded second enhancement layer is damaged.

    摘要翻译: 提供了可扩展的编码方法,装置和介质。 该方法包括:在具有基本层的帧中编码基本层并对第一增强层和第二增强层进行编码; 以及通过合成编码结果来生成编码帧。 因此,只有当编码帧的损失不如编码的第一增强层损耗那么大时,不会发生关于部分频带的语音恢复的情况。 此外,由于编码器在水平或垂直方向上将第二增强层划分为多个层,考虑属于第二增强层的数据的分布模式,并且首先编码在划分层之间分布有大量数据的层 即使编码的第二增强层的一部分被损坏,音频信息的丢失也可以被最小化。

    Apparatus and method of encoding audio data and apparatus and method of decoding encoded audio data
    2.
    发明申请
    Apparatus and method of encoding audio data and apparatus and method of decoding encoded audio data 失效
    编码音频数据的装置和方法以及对编码音频数据进行解码的方法

    公开(公告)号:US20100332239A1

    公开(公告)日:2010-12-30

    申请号:US12923171

    申请日:2010-09-07

    IPC分类号: G10L19/00

    摘要: An apparatus and method encode audio data, and an apparatus and method decode encoded audio data. An audio data encoding apparatus includes: a scalable encoding unit dividing audio data into a plurality of layers, representing the audio data in predetermined numbers of bits in each of the plurality of layers, and encoding a lower layer prior to encoding an upper layer and an upper bit of each layer prior to encoding a lower bit of each layer; an SBR encoding unit generating spectral band replication (SBR) data that has information with respect to audio data in a frequency band of frequencies equal to or greater than a predetermined frequency among the audio data to be encoded, and encoding the SBR data; and a bitstream production unit generating a bitstream using the encoded SBR data and the encoded audio data corresponding to a predetermined bitrate.

    摘要翻译: 一种对音频数据进行编码的装置和方法,并且对编码音频数据进行解码的装置和方法。 音频数据编码装置包括:可分级编码单元,将音频数据划分成多个层,以多个层中的每个层中的预定位数表示音频数据;以及在对上层进行编码之前编码下层, 在对每个层的较低位进行编码之前的每一层的高位; SBR编码单元,生成频谱带复制(SBR)数据,其对于要编码的音频数据中具有等于或大于预定频率的频率的频带中的音频数据具有信息,并对SBR数据进行编码; 以及比特流产生单元,其使用编码的SBR数据和对应于预定比特率的编码音频数据来生成比特流。

    Device and method for encoding, decoding speech and audio signal
    3.
    发明申请
    Device and method for encoding, decoding speech and audio signal 审中-公开
    用于编码,解码语音和音频信号的装置和方法

    公开(公告)号:US20070078651A1

    公开(公告)日:2007-04-05

    申请号:US11527550

    申请日:2006-09-27

    IPC分类号: G10L19/02

    CPC分类号: G10L19/0204 G10L19/20

    摘要: A device and method for encoding/decoding a speech signal and an audio signal. The device for encoding the speech signal and the audio signal includes a speech encoding unit which speech-encodes an input signal; an speech decoding unit which speech-decodes the speech-encoded signal; and an audio encoding unit which divides a difference signal between the speech-decoded signal and the input signal into a low band and a high band, allocates the number of bits to the divided bands, and audio-encodes the difference signal.

    摘要翻译: 一种用于对语音信号和音频信号进行编码/解码的装置和方法。 用于编码语音信号和音频信号的装置包括对输入信号进行语音编码的语音编码单元; 语音解码单元,其对语音编码信号进行语音解码; 以及音频编码单元,其将语音解码信号和输入信号之间的差分信号分成低频带和高频带,将分配的频带数分配给分频频带,并对差分信号进行音频编码。

    Scalable audio encoding and/or decoding method and apparatus
    4.
    发明申请
    Scalable audio encoding and/or decoding method and apparatus 审中-公开
    可扩展音频编码和/或解码方法和装置

    公开(公告)号:US20070040709A1

    公开(公告)日:2007-02-22

    申请号:US11485468

    申请日:2006-07-13

    IPC分类号: H03M7/00

    CPC分类号: G10L19/0208

    摘要: A method and apparatus to scalably encode and/or decode an audio signal includes encoding a specific band signal included in an input signal, encoding a frequency envelope of an excited signal in which the encoded specific band signal is removed from the input signal, encoding a residual signal in which the encoded frequency envelope is removed from the excited signal, and forming a bit-stream by scalably packing the encoded specific band signal, frequency envelop, and residual signal.

    摘要翻译: 一种用于对音频信号进行可缩放编码和/或解码的方法和装置包括编码包括在输入信号中的特定频带信号,对其中去除编码的特定频带信号的激励信号的频率包络进行编码, 残留信号,其中编码的频率包络从激励信号中去除,以及通过可扩展地打包编码的特定频带信号,频率包络和残余信号来形成比特流。

    Apparatus and method of encoding audio data and apparatus and method of decoding encoded audio data
    5.
    发明授权
    Apparatus and method of encoding audio data and apparatus and method of decoding encoded audio data 失效
    编码音频数据的装置和方法以及对编码音频数据进行解码的方法

    公开(公告)号:US08046235B2

    公开(公告)日:2011-10-25

    申请号:US12923171

    申请日:2010-09-07

    IPC分类号: G10L19/00

    摘要: An apparatus and method encode audio data, and an apparatus and method decode encoded audio data. An audio data encoding apparatus includes: a scalable encoding unit dividing audio data into a plurality of layers, representing the audio data in predetermined numbers of bits in each of the plurality of layers, and encoding a lower layer prior to encoding an upper layer and an upper bit of each layer prior to encoding a lower bit of each layer; an SBR encoding unit generating spectral band replication (SBR) data that has information with respect to audio data in a frequency band of frequencies equal to or greater than a predetermined frequency among the audio data to be encoded, and encoding the SBR data; and a bitstream production unit generating a bitstream using the encoded SBR data and the encoded audio data corresponding to a predetermined bitrate.

    摘要翻译: 一种对音频数据进行编码的装置和方法,并且对编码音频数据进行解码的装置和方法。 音频数据编码装置包括:可分级编码单元,将音频数据划分成多个层,以多个层中的每个层中的预定位数表示音频数据;以及在对上层进行编码之前编码下层, 在对每个层的较低位进行编码之前的每一层的高位; SBR编码单元,生成频谱带复制(SBR)数据,其对于要编码的音频数据中具有等于或大于预定频率的频率的频带中的音频数据具有信息,并对SBR数据进行编码; 以及比特流产生单元,其使用编码的SBR数据和对应于预定比特率的编码音频数据来生成比特流。

    METHOD AND APPARATUS TO ENCODE/DECODE AUDIO SIGNAL
    6.
    发明申请
    METHOD AND APPARATUS TO ENCODE/DECODE AUDIO SIGNAL 审中-公开
    编码/解码音频信号的方法和设备

    公开(公告)号:US20070078646A1

    公开(公告)日:2007-04-05

    申请号:US11535638

    申请日:2006-09-27

    IPC分类号: G10L19/00

    CPC分类号: G10L19/0208 G10L19/24

    摘要: A method and apparatus to encode/decode an audio signal, in which a bit rate for each bit plane can be controlled. A method of encoding an audio signal for each of a plurality of bit plane can include dividing the audio signal into a plurality of frequency bands and encoding the bit planes of the frequency bands from a low frequency band to a high frequency band, wherein, in the encoding the bit planes of the frequency bands, the bit planes are encoded from the most significant bit (MSB) to the least significant bit (LSB) within bits allocated for the frequency bands, and when there are allocated bits remaining after the encoding of the currently encoded frequency band, un-encoded bit planes corresponding to the MSB in a frequency band that has the fewest encoded bit planes among frequency bands with a lower frequency than the currently encoded frequency band are encoded using the remaining allocated bits. Accordingly, when encoding/decoding an audio signal, an encoding sequence of bit planes is determined so that an audio signal that significantly affects audio quality during decoding is first encoded, thereby reducing audio quality deterioration at a low bit rate.

    摘要翻译: 一种编码/解码音频信号的方法和装置,其中可以控制每个比特平面的比特率。 对多个位平面中的每一个编码音频信号的方法可以包括将音频信号划分成多个频带并将频带的位平面从低频带编码到高频带,其中,在 对频带的位平面进行编码,在针对频带分配的位内,将位平面从最高有效位(MSB)编码为最低有效位(LSB),并且当编码 使用剩余的分配比特来编码当前编码的频带,对应于具有比当前编码的频带更低的频率的频带中具有最少编码比特平面的频带中的MSB的未编码比特平面。 因此,当对音频信号进行编码/解码时,确定位平面的编码序列,使得首先编码在解码期间显着影响音频质量的音频信号,从而以低比特率降低音频质量恶化。

    Scalable audio encoding and decoding apparatus, method, and medium
    7.
    发明申请
    Scalable audio encoding and decoding apparatus, method, and medium 有权
    可扩展音频编解码设备,方法和介质

    公开(公告)号:US20070071089A1

    公开(公告)日:2007-03-29

    申请号:US11528314

    申请日:2006-09-28

    IPC分类号: H04B1/66 H04N7/12

    CPC分类号: G10L19/24

    摘要: Provided is a scalable encoding method, apparatus, and medium. The method includes: encoding a base layer and encoding a first enhancement layer and a second enhancement layer in a frame having the base layer; and generating an encoded frame by synthesizing the encoded results. Accordingly, only if the loss of the encoding frame is not as great as the encoded first enhancement layer is damaged, a case where speech restoration with respect to partial frequency bands must be given up does not occur. Furthermore, since an encoder divides the second enhancement layer into a plurality of layers considering a distribution pattern of data belonging to the second enhancement layer and first encodes a layer in which lots of data are distributed among the divided layers, loss of audio information can be minimized even if a portion of the encoded second enhancement layer is damaged.

    摘要翻译: 提供了可扩展的编码方法,装置和介质。 该方法包括:在具有基本层的帧中编码基本层并编码第一增强层和第二增强层; 以及通过合成编码结果来生成编码帧。 因此,只有当编码帧的损失不如编码的第一增强层损耗那么大时,不会发生关于部分频带的语音恢复的情况。 此外,由于考虑到属于第二增强层的数据的分布模式,编码器将第二增强层划分成多个层,并且首先编码分割层之间分布有大量数据的层,音频信息的丢失可以是 即使编码的第二增强层的一部分被损坏,也被最小化。

    Audio encoding method and apparatus capable of fast bit rate control
    8.
    发明申请
    Audio encoding method and apparatus capable of fast bit rate control 有权
    能够进行快速比特率控制的音频编码方法和装置

    公开(公告)号:US20060053006A1

    公开(公告)日:2006-03-09

    申请号:US11220568

    申请日:2005-09-08

    IPC分类号: G10L21/00

    CPC分类号: G10L19/035

    摘要: Provided are an audio encoding method and apparatus capable of fast bit rate control. The audio encoding method includes: converting audio sampling data into frequency domain data; adjusting a scalefactor value in each predetermined frequency band based on an available bits and allowed distortion of a psychoacoustic model to allocate a number of necessary bits to the frequency domain data and quantize the frequency domain data; and generating a bit stream based on the quantized data. The quantizing of the frequency domain data includes: obtaining the available bits for the frequency domain data; obtaining the common scalefactor value satisfying that the used bits is not larger than the available bits, using a difference the available bits and the used bits to quantize the audio data; calculating quantization noise in the each predetermined quantization band; and adjusting a scalefactor value of a quantization band in which the quantization noise exceeds the allowed distortion of the psychoacoustic model to quantize the audio data.

    摘要翻译: 提供能够进行快速比特率控制的音频编码方法和装置。 音频编码方法包括:将音频采样数据转换成频域数据; 基于心理声学模型的可用位和允许的失真来调整每个预定频带中的比例因子值,以向频域数据分配多个必需比特并量化频域数据; 以及基于所述量化数据生成比特流。 频域数据的量化包括:获得频域数据的可用比特; 使用所述可用位和所使用的比特来量化所述音频数据的差异来获得满足所使用的比特不大于所述可用比特的公共比例因子值; 计算每个预定量化频带中的量化噪声; 以及调整所述量化噪声超过所述心理声学模型的允许失真量化所述音频数据的量化频带的比例因子值。

    Method and apparatus for generating bitstream of audio signal and audio encoding/decoding method and apparatus thereof
    9.
    发明申请
    Method and apparatus for generating bitstream of audio signal and audio encoding/decoding method and apparatus thereof 失效
    用于产生音频信号和音频编码/解码方法的比特流的方法和装置及其装置

    公开(公告)号:US20060293902A1

    公开(公告)日:2006-12-28

    申请号:US11445312

    申请日:2006-06-02

    IPC分类号: G10L21/00

    CPC分类号: G10L19/167 G10L19/008

    摘要: A method and apparatus for generating a bitstream of an audio signal, in which an audio signal can be easily extended to a multichannel audio signal, the processing speed of an audio signal can be improved, and channel signals of an audio signal can be processed simultaneously, and an audio encoding/decoding method and apparatus using the method and apparatus. The method for generating a bitstream of an audio signal using an encoded audio signal and encoding information includes generating a flag indicating whether the encoded audio signal is a multichannel audio signal, generating a bitstream header including the generated flag, and generating the bitstream using the generated bitstream header and the encoded audio signal.

    摘要翻译: 一种用于生成音频信号的比特流的方法和装置,其中音频信号可以容易地扩展到多声道音频信号,可以提高音频信号的处理速度,并且可以同时处理音频信号的信道信号 以及使用该方法和装置的音频编码/解码方法和装置。 使用编码音频信号和编码信息生成音频信号的比特流的方法包括:生成指示编码音频信号是多声道音频信号的标志,生成包括所生成的标志的比特流头部,以及使用生成的 比特流报头和编码音频信号。

    Multichannel audio data encoding/decoding method and apparatus
    10.
    发明申请
    Multichannel audio data encoding/decoding method and apparatus 审中-公开
    多声道音频数据编码/解码方法及装置

    公开(公告)号:US20060013405A1

    公开(公告)日:2006-01-19

    申请号:US11180625

    申请日:2005-07-14

    IPC分类号: H04R5/00

    CPC分类号: H04S3/008 G10L19/008

    摘要: A multichannel audio data encoding and/or decoding method and apparatus. The encoding method includes: encoding mono and/or stereo audio data; and encoding extended multichannel audio data other than the mono and/or stereo audio data. The decoding method includes: decoding mono and/or stereo audio data; examining whether there is extended multichannel audio data to be decoded other than the mono and/or stereo audio data; and when there is extended data to be decoded, decoding the extended multichannel audio data.

    摘要翻译: 多声道音频数据编码和/或解码方法和装置。 编码方法包括:对单声道和/或立体声音频数据进行编码; 以及对单声道和/或立体声音频数据以外的扩展多声道音频数据进行编码。 解码方法包括:对单声道和/或立体声音频数据进行解码; 检查除了单声道和/或立体声音频数据之外是否存在要被解码的扩展多声道音频数据; 并且当存在要解码的扩展数据时,对扩展的多声道音频数据进行解码。