Audio transcoder using encoder-generated side information to transcode to target bit-rate
    1.
    发明授权
    Audio transcoder using encoder-generated side information to transcode to target bit-rate 有权
    音频代码转换器使用编码器生成的侧面信息转码为目标比特率

    公开(公告)号:US08457958B2

    公开(公告)日:2013-06-04

    申请号:US11938194

    申请日:2007-11-09

    IPC分类号: G10L19/02 H04B1/66

    CPC分类号: G10L19/173

    摘要: An audio encoder encodes side information into a compressed audio bitstream containing encoding parameters used by the encoder for one or more encoding techniques, such as a noise-mask-ratio curve used for rate control. A transcoder uses the encoder generated side information to transcode the audio from the original compressed bitstream having an initial bit-rate into a second bitstream having a new bit-rate. Because the side information is derived from the original audio, the transcoder is able to better maintain audio quality of the transcoding. The side information also allows the transcoder to re-encode from an intermediate decoding/encoding stage for faster and lower complexity transcoding.

    摘要翻译: 音频编码器将侧信息编码成包含由编码器使用的编码参数的压缩音频比特流,用于一种或多种编码技术,例如用于速率控制的噪声屏蔽比曲线。 代码转换器使用编码器产生的侧信息将来自具有初始比特率的原始压缩比特流的音频转码为具有新比特率的第二比特流。 因为侧信息是从原始音频导出的,所以代码转换器能够更好地保持转码的音频质量。 侧面信息还允许代码转换器从中间解码/编码级重新编码,以实现更快和更低复杂度的代码转换。

    TRANSCODER USING ENCODER GENERATED SIDE INFORMATION
    2.
    发明申请
    TRANSCODER USING ENCODER GENERATED SIDE INFORMATION 有权
    使用编码器生成侧信息的TRANSCODER

    公开(公告)号:US20090125315A1

    公开(公告)日:2009-05-14

    申请号:US11938194

    申请日:2007-11-09

    IPC分类号: G10L19/00

    CPC分类号: G10L19/173

    摘要: An audio encoder encodes side information into a compressed audio bitstream containing encoding parameters used by the encoder for one or more encoding techniques, such as a noise-mask-ratio curve used for rate control. A transcoder uses the encoder generated side information to transcode the audio from the original compressed bitstream having an initial bit-rate into a second bitstream having a new bit-rate. Because the side information is derived from the original audio, the transcoder is able to better maintain audio quality of the transcoding. The side information also allows the transcoder to re-encode from an intermediate decoding/encoding stage for faster and lower complexity transcoding.

    摘要翻译: 音频编码器将侧信息编码成包含由编码器使用的编码参数的压缩音频比特流,用于一种或多种编码技术,例如用于速率控制的噪声屏蔽比曲线。 代码转换器使用编码器产生的侧信息将来自具有初始比特率的原始压缩比特流的音频转码为具有新比特率的第二比特流。 因为侧信息是从原始音频导出的,所以代码转换器能够更好地保持转码的音频质量。 侧面信息还允许代码转换器从中间解码/编码级重新编码,以实现更快和更低复杂度的代码转换。

    CODING OF SPARSE DIGITAL MEDIA SPECTRAL DATA
    3.
    发明申请
    CODING OF SPARSE DIGITAL MEDIA SPECTRAL DATA 有权
    编码数字媒体光谱数据

    公开(公告)号:US20080312758A1

    公开(公告)日:2008-12-18

    申请号:US11764108

    申请日:2007-06-15

    IPC分类号: G06F17/00

    摘要: An audio encoder/decoder provides efficient compression of spectral transform coefficient data characterized by sparse spectral peaks. The audio encoder/decoder applies a temporal prediction of the frequency position of spectral peaks. The spectral peaks in the transform coefficients that are predicted from those in a preceding transform coding block are encoded as a shift in frequency position from the previous transform coding block and two non-zero coefficient levels. The prediction may avoid coding very large zero-level transform coefficient runs as compared to conventional run length coding. For spectral peaks not predicted from those in a preceding transform coding block, the spectral peaks are encoded as a value trio of a length of a run of zero-level spectral transform coefficients, and two non-zero coefficient levels.

    摘要翻译: 音频编码器/解码器提供以稀疏频谱峰值为特征的频谱变换系数数据的有效压缩。 音频编码器/解码器对频谱峰值的频率位置进行时间预测。 从前一变换编码块中预测的变换系数中的频谱峰值被编码为来自先前变换编码块和两个非零系数电平的频率位置的移位。 与常规游程长度编码相比,预测可以避免编码非常大的零电平变换系数运行。 对于未在前面的变换编码块中预测的频谱峰值,频谱峰值被编码为零电平频谱变换系数的行程的长度和两个非零系数电平的三值。

    Shape and scale parameters for extended-band frequency coding
    7.
    发明授权
    Shape and scale parameters for extended-band frequency coding 有权
    扩展频带编码的形状和缩放参数

    公开(公告)号:US07953604B2

    公开(公告)日:2011-05-31

    申请号:US11336618

    申请日:2006-01-20

    IPC分类号: G10L19/00

    CPC分类号: G10L21/038

    摘要: An audio encoder performs frequency extension coding that comprises determining one or more shape parameters using a displacement vector that corresponds to a displacement of an even number (e.g., an even number of sub-bands between a sub-band in a baseband frequency range and a sub-band in an extended-band frequency range). The shape parameters can be determined on a per-audio-block basis. Restricting a displacement to an even number (in frequency extension coding or in other signal modulation schemes) can improve the quality of reconstructed audio. An audio encoder also can perform frequency extension coding that comprises determining one or more scale parameters at one or more audio blocks, and determining one or more anchor points for interpolating the one or more scale parameters.

    摘要翻译: 音频编码器执行频率扩展编码,其包括使用对应于偶数位移的位移矢量来确定一个或多个形状参数(例如,基带频率范围中的子带和偶数个子带之间的偶数个子带) 子带在扩展频带范围内)。 形状参数可以基于每个音频块来确定。 将位移限制为偶数(在频率扩展编码或其他信号调制方案中)可以提高重构音频的质量。 音频编码器还可以执行频率扩展编码,其包括确定一个或多个音频块处的一个或多个缩放参数,以及确定用于内插一个或多个缩放参数的一个或多个定位点。

    Flexible frequency and time partitioning in perceptual transform coding of audio
    8.
    发明授权
    Flexible frequency and time partitioning in perceptual transform coding of audio 有权
    音频感知变换编码中灵活的频率和时间分割

    公开(公告)号:US07761290B2

    公开(公告)日:2010-07-20

    申请号:US11764134

    申请日:2007-06-15

    IPC分类号: G10L19/02

    CPC分类号: G10L19/0208 G10L19/032

    摘要: An audio encoder/decoder performs band partitioning for vector quantization encoding of spectral holes and missing high frequencies that result from quantization when encoding at low bit rates. The encoder/decoder determines a band structure for spectral holes based on two threshold parameters: a minimum hole size threshold and a maximum band size threshold. Spectral holes wider than the minimum hole size threshold are partitioned evenly into bands not exceeding the maximum band size threshold in size. Such hole filling bands are configured up to a preset number of hole filling bands. The bands for missing high frequencies are then configured by dividing the high frequency region into bands having binary-increasing, linearly-increasing or arbitrarily-configured band sizes up to a maximum overall number of bands.

    摘要翻译: 音频编码器/解码器对以低比特率进行编码的频谱空间矢量量化编码和由量化产生的缺失高频进行频带划分。 编码器/解码器基于两个阈值参数确定频谱孔的频带结构:最小孔尺寸阈值和最大频带尺寸阈值。 比最小孔尺寸阈值更宽的光谱孔被均匀地分割成不超过最大带尺寸阈值的带。 这样的孔填充带被配置成预定数量的填充孔。 然后通过将高频区域划分成具有二进制增加,线性增加或任意配置的频带大小直到最大总带数的频带来配置用于缺失高频的频带。

    Modification of codewords in dictionary used for efficient coding of digital media spectral data
    9.
    发明授权
    Modification of codewords in dictionary used for efficient coding of digital media spectral data 有权
    用于数字媒体频谱数据有效编码的字典中的码字的修改

    公开(公告)号:US07562021B2

    公开(公告)日:2009-07-14

    申请号:US11183084

    申请日:2005-07-15

    IPC分类号: G10L19/00

    CPC分类号: G10L19/038 G10L19/24

    摘要: Coding of spectral data by representing certain portions of the spectral data as a scaled version of a code-vector, where the code-vector is chosen from either a fixed predetermined codebook or a codebook taken from a baseband. Various optional features are described for modifying the code-vectors in the codebook according to some rules which allow the code-vector to better represent the data they are modeling. The code-vector modification comprises a linear or non-linear transform of one or more code-vectors, such as, by exponentiation, negation, reversing, or combining elements from plural code-vectors.

    摘要翻译: 通过将频谱数据的某些部分表示为编码矢量的缩放版本来编码频谱数据,其中从固定预定码本或从基带获取的码本中选择码矢量。 描述了各种可选特征,用于根据允许代码矢量更好地表示其正在建模的数据的一些规则来修改码本中的码矢量。 码矢量修改包括一个或多个码矢量的线性或非线性变换,例如通过取幂,否定,反转或从多个码矢量组合元素。

    Shape and scale parameters for extended-band frequency coding
    10.
    发明申请
    Shape and scale parameters for extended-band frequency coding 有权
    扩展频带编码的形状和缩放参数

    公开(公告)号:US20070174063A1

    公开(公告)日:2007-07-26

    申请号:US11336618

    申请日:2006-01-20

    IPC分类号: G10L19/00

    CPC分类号: G10L21/038

    摘要: An audio encoder performs frequency extension coding that comprises determining one or more shape parameters using a displacement vector that corresponds to a displacement of an even number (e.g., an even number of sub-bands between a sub-band in a baseband frequency range and a sub-band in an extended-band frequency range). The shape parameters can be determined on a per-audio-block basis. Restricting a displacement to an even number (in frequency extension coding or in other signal modulation schemes) can improve the quality of reconstructed audio. An audio encoder also can perform frequency extension coding that comprises determining one or more scale parameters at one or more audio blocks, and determining one or more anchor points for interpolating the one or more scale parameters.

    摘要翻译: 音频编码器执行频率扩展编码,其包括使用对应于偶数位移的位移矢量来确定一个或多个形状参数(例如,基带频率范围中的子带和偶数个子带之间的偶数个子带) 子带在扩展频带范围内)。 形状参数可以基于每个音频块来确定。 将位移限制为偶数(在频率扩展编码或其他信号调制方案中)可以提高重构音频的质量。 音频编码器还可以执行频率扩展编码,其包括确定一个或多个音频块处的一个或多个缩放参数,以及确定用于内插一个或多个缩放参数的一个或多个定位点。