Scalable audio encoding and decoding apparatus, method, and medium
    1.
    发明授权
    Scalable audio encoding and decoding apparatus, method, and medium 有权
    可扩展音频编解码设备,方法和介质

    公开(公告)号:US08069048B2

    公开(公告)日:2011-11-29

    申请号:US11528314

    申请日:2006-09-28

    IPC分类号: G10L19/00

    CPC分类号: G10L19/24

    摘要: Provided is a scalable encoding method, apparatus, and medium. The method includes: encoding a base layer and encoding a first enhancement layer and a second enhancement layer in a frame having the base layer; and generating an encoded frame by synthesizing the encoded results. Accordingly, only if the loss of the encoding frame is not as great as the encoded first enhancement layer is damaged, a case where speech restoration with respect to partial frequency bands must be given up does not occur. Furthermore, since an encoder divides the second enhancement layer into a plurality of layers in a horizontal or vertical direction, considering a distribution pattern of data belonging to the second enhancement layer and first encodes a layer in which lots of data are distributed among the divided layers, loss of audio information can be minimized even if a portion of the encoded second enhancement layer is damaged.

    摘要翻译: 提供了可扩展的编码方法,装置和介质。 该方法包括:在具有基本层的帧中编码基本层并对第一增强层和第二增强层进行编码; 以及通过合成编码结果来生成编码帧。 因此,只有当编码帧的损失不如编码的第一增强层损耗那么大时,不会发生关于部分频带的语音恢复的情况。 此外,由于编码器在水平或垂直方向上将第二增强层划分为多个层,考虑属于第二增强层的数据的分布模式,并且首先编码在划分层之间分布有大量数据的层 即使编码的第二增强层的一部分被损坏,音频信息的丢失也可以被最小化。

    Apparatus and method of encoding audio data and apparatus and method of decoding encoded audio data
    2.
    发明申请
    Apparatus and method of encoding audio data and apparatus and method of decoding encoded audio data 失效
    编码音频数据的装置和方法以及对编码音频数据进行解码的方法

    公开(公告)号:US20100332239A1

    公开(公告)日:2010-12-30

    申请号:US12923171

    申请日:2010-09-07

    IPC分类号: G10L19/00

    摘要: An apparatus and method encode audio data, and an apparatus and method decode encoded audio data. An audio data encoding apparatus includes: a scalable encoding unit dividing audio data into a plurality of layers, representing the audio data in predetermined numbers of bits in each of the plurality of layers, and encoding a lower layer prior to encoding an upper layer and an upper bit of each layer prior to encoding a lower bit of each layer; an SBR encoding unit generating spectral band replication (SBR) data that has information with respect to audio data in a frequency band of frequencies equal to or greater than a predetermined frequency among the audio data to be encoded, and encoding the SBR data; and a bitstream production unit generating a bitstream using the encoded SBR data and the encoded audio data corresponding to a predetermined bitrate.

    摘要翻译: 一种对音频数据进行编码的装置和方法,并且对编码音频数据进行解码的装置和方法。 音频数据编码装置包括:可分级编码单元,将音频数据划分成多个层,以多个层中的每个层中的预定位数表示音频数据;以及在对上层进行编码之前编码下层, 在对每个层的较低位进行编码之前的每一层的高位; SBR编码单元,生成频谱带复制(SBR)数据,其对于要编码的音频数据中具有等于或大于预定频率的频率的频带中的音频数据具有信息,并对SBR数据进行编码; 以及比特流产生单元,其使用编码的SBR数据和对应于预定比特率的编码音频数据来生成比特流。

    Device and method for encoding, decoding speech and audio signal
    3.
    发明申请
    Device and method for encoding, decoding speech and audio signal 审中-公开
    用于编码,解码语音和音频信号的装置和方法

    公开(公告)号:US20070078651A1

    公开(公告)日:2007-04-05

    申请号:US11527550

    申请日:2006-09-27

    IPC分类号: G10L19/02

    CPC分类号: G10L19/0204 G10L19/20

    摘要: A device and method for encoding/decoding a speech signal and an audio signal. The device for encoding the speech signal and the audio signal includes a speech encoding unit which speech-encodes an input signal; an speech decoding unit which speech-decodes the speech-encoded signal; and an audio encoding unit which divides a difference signal between the speech-decoded signal and the input signal into a low band and a high band, allocates the number of bits to the divided bands, and audio-encodes the difference signal.

    摘要翻译: 一种用于对语音信号和音频信号进行编码/解码的装置和方法。 用于编码语音信号和音频信号的装置包括对输入信号进行语音编码的语音编码单元; 语音解码单元,其对语音编码信号进行语音解码; 以及音频编码单元,其将语音解码信号和输入信号之间的差分信号分成低频带和高频带,将分配的频带数分配给分频频带,并对差分信号进行音频编码。

    Scalable audio encoding and/or decoding method and apparatus
    4.
    发明申请
    Scalable audio encoding and/or decoding method and apparatus 审中-公开
    可扩展音频编码和/或解码方法和装置

    公开(公告)号:US20070040709A1

    公开(公告)日:2007-02-22

    申请号:US11485468

    申请日:2006-07-13

    IPC分类号: H03M7/00

    CPC分类号: G10L19/0208

    摘要: A method and apparatus to scalably encode and/or decode an audio signal includes encoding a specific band signal included in an input signal, encoding a frequency envelope of an excited signal in which the encoded specific band signal is removed from the input signal, encoding a residual signal in which the encoded frequency envelope is removed from the excited signal, and forming a bit-stream by scalably packing the encoded specific band signal, frequency envelop, and residual signal.

    摘要翻译: 一种用于对音频信号进行可缩放编码和/或解码的方法和装置包括编码包括在输入信号中的特定频带信号,对其中去除编码的特定频带信号的激励信号的频率包络进行编码, 残留信号,其中编码的频率包络从激励信号中去除,以及通过可扩展地打包编码的特定频带信号,频率包络和残余信号来形成比特流。

    Apparatus and method of encoding audio data and apparatus and method of decoding encoded audio data
    5.
    发明授权
    Apparatus and method of encoding audio data and apparatus and method of decoding encoded audio data 失效
    编码音频数据的装置和方法以及对编码音频数据进行解码的方法

    公开(公告)号:US08046235B2

    公开(公告)日:2011-10-25

    申请号:US12923171

    申请日:2010-09-07

    IPC分类号: G10L19/00

    摘要: An apparatus and method encode audio data, and an apparatus and method decode encoded audio data. An audio data encoding apparatus includes: a scalable encoding unit dividing audio data into a plurality of layers, representing the audio data in predetermined numbers of bits in each of the plurality of layers, and encoding a lower layer prior to encoding an upper layer and an upper bit of each layer prior to encoding a lower bit of each layer; an SBR encoding unit generating spectral band replication (SBR) data that has information with respect to audio data in a frequency band of frequencies equal to or greater than a predetermined frequency among the audio data to be encoded, and encoding the SBR data; and a bitstream production unit generating a bitstream using the encoded SBR data and the encoded audio data corresponding to a predetermined bitrate.

    摘要翻译: 一种对音频数据进行编码的装置和方法,并且对编码音频数据进行解码的装置和方法。 音频数据编码装置包括:可分级编码单元,将音频数据划分成多个层,以多个层中的每个层中的预定位数表示音频数据;以及在对上层进行编码之前编码下层, 在对每个层的较低位进行编码之前的每一层的高位; SBR编码单元,生成频谱带复制(SBR)数据,其对于要编码的音频数据中具有等于或大于预定频率的频率的频带中的音频数据具有信息,并对SBR数据进行编码; 以及比特流产生单元,其使用编码的SBR数据和对应于预定比特率的编码音频数据来生成比特流。

    METHOD AND APPARATUS TO ENCODE/DECODE AUDIO SIGNAL
    6.
    发明申请
    METHOD AND APPARATUS TO ENCODE/DECODE AUDIO SIGNAL 审中-公开
    编码/解码音频信号的方法和设备

    公开(公告)号:US20070078646A1

    公开(公告)日:2007-04-05

    申请号:US11535638

    申请日:2006-09-27

    IPC分类号: G10L19/00

    CPC分类号: G10L19/0208 G10L19/24

    摘要: A method and apparatus to encode/decode an audio signal, in which a bit rate for each bit plane can be controlled. A method of encoding an audio signal for each of a plurality of bit plane can include dividing the audio signal into a plurality of frequency bands and encoding the bit planes of the frequency bands from a low frequency band to a high frequency band, wherein, in the encoding the bit planes of the frequency bands, the bit planes are encoded from the most significant bit (MSB) to the least significant bit (LSB) within bits allocated for the frequency bands, and when there are allocated bits remaining after the encoding of the currently encoded frequency band, un-encoded bit planes corresponding to the MSB in a frequency band that has the fewest encoded bit planes among frequency bands with a lower frequency than the currently encoded frequency band are encoded using the remaining allocated bits. Accordingly, when encoding/decoding an audio signal, an encoding sequence of bit planes is determined so that an audio signal that significantly affects audio quality during decoding is first encoded, thereby reducing audio quality deterioration at a low bit rate.

    摘要翻译: 一种编码/解码音频信号的方法和装置,其中可以控制每个比特平面的比特率。 对多个位平面中的每一个编码音频信号的方法可以包括将音频信号划分成多个频带并将频带的位平面从低频带编码到高频带,其中,在 对频带的位平面进行编码,在针对频带分配的位内,将位平面从最高有效位(MSB)编码为最低有效位(LSB),并且当编码 使用剩余的分配比特来编码当前编码的频带,对应于具有比当前编码的频带更低的频率的频带中具有最少编码比特平面的频带中的MSB的未编码比特平面。 因此,当对音频信号进行编码/解码时,确定位平面的编码序列,使得首先编码在解码期间显着影响音频质量的音频信号,从而以低比特率降低音频质量恶化。

    Scalable audio encoding and decoding apparatus, method, and medium
    7.
    发明申请
    Scalable audio encoding and decoding apparatus, method, and medium 有权
    可扩展音频编解码设备,方法和介质

    公开(公告)号:US20070071089A1

    公开(公告)日:2007-03-29

    申请号:US11528314

    申请日:2006-09-28

    IPC分类号: H04B1/66 H04N7/12

    CPC分类号: G10L19/24

    摘要: Provided is a scalable encoding method, apparatus, and medium. The method includes: encoding a base layer and encoding a first enhancement layer and a second enhancement layer in a frame having the base layer; and generating an encoded frame by synthesizing the encoded results. Accordingly, only if the loss of the encoding frame is not as great as the encoded first enhancement layer is damaged, a case where speech restoration with respect to partial frequency bands must be given up does not occur. Furthermore, since an encoder divides the second enhancement layer into a plurality of layers considering a distribution pattern of data belonging to the second enhancement layer and first encodes a layer in which lots of data are distributed among the divided layers, loss of audio information can be minimized even if a portion of the encoded second enhancement layer is damaged.

    摘要翻译: 提供了可扩展的编码方法,装置和介质。 该方法包括:在具有基本层的帧中编码基本层并编码第一增强层和第二增强层; 以及通过合成编码结果来生成编码帧。 因此,只有当编码帧的损失不如编码的第一增强层损耗那么大时,不会发生关于部分频带的语音恢复的情况。 此外,由于考虑到属于第二增强层的数据的分布模式,编码器将第二增强层划分成多个层,并且首先编码分割层之间分布有大量数据的层,音频信息的丢失可以是 即使编码的第二增强层的一部分被损坏,也被最小化。

    Audio encoding method and apparatus capable of fast bit rate control
    8.
    发明申请
    Audio encoding method and apparatus capable of fast bit rate control 有权
    能够进行快速比特率控制的音频编码方法和装置

    公开(公告)号:US20060053006A1

    公开(公告)日:2006-03-09

    申请号:US11220568

    申请日:2005-09-08

    IPC分类号: G10L21/00

    CPC分类号: G10L19/035

    摘要: Provided are an audio encoding method and apparatus capable of fast bit rate control. The audio encoding method includes: converting audio sampling data into frequency domain data; adjusting a scalefactor value in each predetermined frequency band based on an available bits and allowed distortion of a psychoacoustic model to allocate a number of necessary bits to the frequency domain data and quantize the frequency domain data; and generating a bit stream based on the quantized data. The quantizing of the frequency domain data includes: obtaining the available bits for the frequency domain data; obtaining the common scalefactor value satisfying that the used bits is not larger than the available bits, using a difference the available bits and the used bits to quantize the audio data; calculating quantization noise in the each predetermined quantization band; and adjusting a scalefactor value of a quantization band in which the quantization noise exceeds the allowed distortion of the psychoacoustic model to quantize the audio data.

    摘要翻译: 提供能够进行快速比特率控制的音频编码方法和装置。 音频编码方法包括:将音频采样数据转换成频域数据; 基于心理声学模型的可用位和允许的失真来调整每个预定频带中的比例因子值,以向频域数据分配多个必需比特并量化频域数据; 以及基于所述量化数据生成比特流。 频域数据的量化包括:获得频域数据的可用比特; 使用所述可用位和所使用的比特来量化所述音频数据的差异来获得满足所使用的比特不大于所述可用比特的公共比例因子值; 计算每个预定量化频带中的量化噪声; 以及调整所述量化噪声超过所述心理声学模型的允许失真量化所述音频数据的量化频带的比例因子值。

    Digital signal encoding method and apparatus using plural lookup tables
    10.
    发明申请
    Digital signal encoding method and apparatus using plural lookup tables 有权
    使用多个查找表的数字信号编码方法和装置

    公开(公告)号:US20050254588A1

    公开(公告)日:2005-11-17

    申请号:US11080409

    申请日:2005-03-16

    CPC分类号: G10L19/035

    摘要: A digital signal encoding method and apparatus using a plurality of lookup tables. The method includes: preparing a plurality of lookup tables storing numbers of allocated bits for encoding frequency bands of an input signal according to a characteristic of the input signal in a predetermined number of addresses; dividing an input signal in the time domain into signals in predetermined frequency bands; calculating address values of the frequency bands; selecting one of the plurality of lookup tables according to the characteristic of the input signal; extracting numbers of allocated bits of addresses having the calculated address values from the selected lookup table with respect to the frequency bands and allocating the numbers of bits to the frequency bands; and generating a bitstream by quantizing the input signal according to the numbers of allocated bits. Bit amount control suitable for a characteristic of an input signal can be performed by extracting numbers of allocated bits of frequency bands from an optimal lookup table selected according to the characteristic of the input signal. Also, an additional computational time can be reduced by using each occupancy rate per frequency band equal to each address of the lookup table as the characteristic of the input signal.

    摘要翻译: 一种使用多个查找表的数字信号编码方法和装置。 该方法包括:根据预定数量的地址中的输入信号的特性,准备多个查找表,该查找表存储用于编码输入信号频带的分配位数; 将时域中的输入信号划分成预定频带中的信号; 计算频带的地址值; 根据输入信号的特性选择多个查找表之一; 从所选择的查找表中提取相对于频带的具有所计算的地址值的地址的分配比特数,并将比特数分配给频带; 以及通过根据所分配的比特数来量化输入信号来生成比特流。 可以通过从根据输入信号的特性选择的最佳查找表中提取频带的分配比特数来执行适合于输入信号特性的比特量控制。 而且,通过使用等于查找表的每个地址的每个频带的每个占用率作为输入信号的特性,可以减少额外的计算时间。