Complex-transform channel coding with extended-band frequency coding
    41.
    发明授权
    Complex-transform channel coding with extended-band frequency coding 有权
    具有扩展频带编码的复变换信道编码

    公开(公告)号:US09105271B2

    公开(公告)日:2015-08-11

    申请号:US12907889

    申请日:2010-10-19

    CPC分类号: G10L21/038 G10L19/008

    摘要: An audio encoder receives multi-channel audio data comprising a group of plural source channels and performs channel extension coding, which comprises encoding a combined channel for the group and determining plural parameters for representing individual source channels of the group as modified versions of the encoded combined channel. The encoder also performs frequency extension coding. The frequency extension coding can comprise, for example, partitioning frequency bands in the multi-channel audio data into a baseband group and an extended band group, and coding audio coefficients in the extended band group based on audio coefficients in the baseband group. The encoder also can perform other kinds of transforms. An audio decoder performs corresponding decoding and/or additional processing tasks, such as a forward complex transform.

    摘要翻译: 音频编码器接收包括一组多个源信道的多声道音频数据,并执行信道扩展编码,其包括对该组的组合信道进行编码,并确定用于表示该组的各个源信道的多个参数,作为编码组合的修改版本 渠道。 编码器还执行频率扩展编码。 频率扩展编码可以包括例如将多声道音频数据中的频带划分成基带组和扩展频带组,并且基于基带组中的音频系数对扩展频带组中的音频系数进行编码。 编码器还可以执行其他类型的转换。 音频解码器执行相应的解码和/或附加处理任务,例如前向复合变换。

    ENTROPY ENCODING AND DECODING USING DIRECT LEVEL AND RUN-LENGTH/LEVEL CONTEXT-ADAPTIVE ARITHMETIC CODING/DECODING MODES
    42.
    发明申请
    ENTROPY ENCODING AND DECODING USING DIRECT LEVEL AND RUN-LENGTH/LEVEL CONTEXT-ADAPTIVE ARITHMETIC CODING/DECODING MODES 有权
    使用直接级和运行长度/级别的入侵编码和解码自适应算术编码/解码模式

    公开(公告)号:US20120069899A1

    公开(公告)日:2012-03-22

    申请号:US13306761

    申请日:2011-11-29

    IPC分类号: H04N7/26 G10L19/00

    摘要: An encoder performs context-adaptive arithmetic encoding of transform coefficient data. For example, an encoder switches between coding of direct levels of quantized transform coefficient data and run-level coding of run lengths and levels of quantized transform coefficient data. The encoder can determine when to switch between coding modes based on a pre-determined switch point or by counting consecutive coefficients having a predominant value (e.g., zero). A decoder performs corresponding context-adaptive arithmetic decoding.

    摘要翻译: 编码器执行变换系数数据的上下文自适应算术编码。 例如,编码器在量化变换系数数据的直接电平的编码和运行长度的运行电平编码和量化的变换系数数据的电平之间切换。 编码器可以基于预定的切换点或通过计算具有主要值(例如,零)的连续系数来确定何时在编码模式之间切换。 解码器执行相应的上下文自适应算术解码。

    Complex-transform channel coding with extended-band frequency coding
    43.
    发明授权
    Complex-transform channel coding with extended-band frequency coding 有权
    具有扩展频带编码的复变换信道编码

    公开(公告)号:US07831434B2

    公开(公告)日:2010-11-09

    申请号:US11336606

    申请日:2006-01-20

    IPC分类号: G10L19/00 H04R5/00

    CPC分类号: G10L21/038 G10L19/008

    摘要: An audio encoder receives multi-channel audio data comprising a group of plural source channels and performs channel extension coding, which comprises encoding a combined channel for the group and determining plural parameters for representing individual source channels of the group as modified versions of the encoded combined channel. The encoder also performs frequency extension coding. The frequency extension coding can comprise, for example, partitioning frequency bands in the multi-channel audio data into a baseband group and an extended band group, and coding audio coefficients in the extended band group based on audio coefficients in the baseband group. The encoder also can perform other kinds of transforms. An audio decoder performs corresponding decoding and/or additional processing tasks, such as a forward complex transform.

    摘要翻译: 音频编码器接收包括一组多个源信道的多声道音频数据,并执行信道扩展编码,其包括对该组的组合信道进行编码,并确定用于表示该组的各个源信道的多个参数,作为编码组合的修改版本 渠道。 编码器还执行频率扩展编码。 频率扩展编码可以包括例如将多声道音频数据中的频带划分为基带组和扩展频带组,并且基于基带组中的音频系数对扩展频带组中的音频系数进行编码。 编码器还可以执行其他类型的转换。 音频解码器执行相应的解码和/或附加处理任务,例如前向复合变换。

    EFFICIENT CODING OF DIGITAL MEDIA SPECTRAL DATA USING WIDE-SENSE PERCEPTUAL SIMILARITY
    44.
    发明申请
    EFFICIENT CODING OF DIGITAL MEDIA SPECTRAL DATA USING WIDE-SENSE PERCEPTUAL SIMILARITY 有权
    数字媒体光谱数据的高效编码使用宽视觉相似度

    公开(公告)号:US20090083046A1

    公开(公告)日:2009-03-26

    申请号:US12324689

    申请日:2008-11-26

    IPC分类号: G10L19/00

    摘要: Traditional audio encoders may conserve coding bit-rate by encoding fewer than all spectral coefficients, which can produce a blurry low-pass sound in the reconstruction. An audio encoder using wide-sense perceptual similarity improves the quality by encoding a perceptually similar version of the omitted spectral coefficients, represented as a scaled version of already coded spectrum. The omitted spectral coefficients are divided into a number of sub-bands. The sub-bands are encoded as two parameters: a scale factor, which may represent the energy in the band; and a shape parameter, which may represent a shape of the band. The shape parameter may be in the form of a motion vector pointing to a portion of the already coded spectrum, an index to a spectral shape in a fixed code-book, or a random noise vector. The encoding thus efficiently represents a scaled version of a similarly shaped portion of spectrum to be copied at decoding.

    摘要翻译: 传统的音频编码器可以通过编码少于所有频谱系数来节省编码比特率,这可以在重建中产生模糊的低通声音。 使用广义感知相似性的音频编码器通过编码被忽略的频谱系数的感知相似版本来提高质量,表示为已编码频谱的缩放版本。 省略的频谱系数被划分为多个子带。 子带被编码为两个参数:比例因子,其可以表示频带中的能量; 以及形状参数,其可以表示带的形状。 形状参数可以是指向已编码频谱的一部分的运动矢量的形式,固定码本中的频谱形状的索引或随机噪声向量。 因此,编码有效地表示在解码时要复制的类似形状的频谱部分的缩放版本。

    Efficient coding of digital media spectral data using wide-sense perceptual similarity
    45.
    发明授权
    Efficient coding of digital media spectral data using wide-sense perceptual similarity 有权
    使用广义感知相似性对数字媒体光谱数据进行高效编码

    公开(公告)号:US07460990B2

    公开(公告)日:2008-12-02

    申请号:US10882801

    申请日:2004-06-29

    IPC分类号: G10L11/04

    摘要: Traditional audio encoders may conserve coding bit-rate by encoding fewer than all spectral coefficients, which can produce a blurry low-pass sound in the reconstruction. An audio encoder using wide-sense perceptual similarity improves the quality by encoding a perceptually similar version of the omitted spectral coefficients, represented as a scaled version of already coded spectrum. The omitted spectral coefficients are divided into a number of sub-bands. The sub-bands are encoded as two parameters: a scale factor, which may represent the energy in the band; and a shape parameter, which may represent a shape of the band. The shape parameter may be in the form of a motion vector pointing to a portion of the already coded spectrum, an index to a spectral shape in a fixed code-book, or a random noise vector. The encoding thus efficiently represents a scaled version of a similarly shaped portion of spectrum to be copied at decoding.

    摘要翻译: 传统的音频编码器可以通过编码少于所有频谱系数来节省编码比特率,这可以在重建中产生模糊的低通声音。 使用广义感知相似性的音频编码器通过编码被忽略的频谱系数的感知相似版本来提高质量,表示为已编码频谱的缩放版本。 省略的频谱系数被划分为多个子带。 子带被编码为两个参数:比例因子,其可以表示频带中的能量; 以及形状参数,其可以表示带的形状。 形状参数可以是指向已编码频谱的一部分的运动矢量的形式,固定码本中的频谱形状的索引或随机噪声向量。 因此,编码有效地表示在解码时要复制的类似形状的频谱部分的缩放版本。

    Efficient coding of digital media spectral data using wide-sense perceptual similarity
    46.
    发明授权
    Efficient coding of digital media spectral data using wide-sense perceptual similarity 有权
    使用广义感知相似性对数字媒体光谱数据进行高效编码

    公开(公告)号:US08645127B2

    公开(公告)日:2014-02-04

    申请号:US12324689

    申请日:2008-11-26

    IPC分类号: G10L11/04

    摘要: Traditional audio encoders may conserve coding bit-rate by encoding fewer than all spectral coefficients, which can produce a blurry low-pass sound in the reconstruction. An audio encoder using wide-sense perceptual similarity improves the quality by encoding a perceptually similar version of the omitted spectral coefficients, represented as a scaled version of already coded spectrum. The omitted spectral coefficients are divided into a number of sub-bands. The sub-bands are encoded as two parameters: a scale factor, which may represent the energy in the band; and a shape parameter, which may represent a shape of the band. The shape parameter may be in the form of a motion vector pointing to a portion of the already coded spectrum, an index to a spectral shape in a fixed code-book, or a random noise vector. The encoding thus efficiently represents a scaled version of a similarly shaped portion of spectrum to be copied at decoding.

    摘要翻译: 传统的音频编码器可以通过编码少于所有频谱系数来节省编码比特率,这可以在重建中产生模糊的低通声音。 使用广义感知相似性的音频编码器通过编码被忽略的频谱系数的感知相似版本来提高质量,表示为已编码频谱的缩放版本。 省略的频谱系数被划分为多个子带。 子带被编码为两个参数:比例因子,其可以表示频带中的能量; 以及形状参数,其可以表示带的形状。 形状参数可以是指向已编码频谱的一部分的运动矢量的形式,固定码本中的频谱形状的索引或随机噪声向量。 因此,编码有效地表示在解码时要复制的类似形状的频谱部分的缩放版本。

    Low complexity decoder for complex transform coding of multi-channel sound
    47.
    发明授权
    Low complexity decoder for complex transform coding of multi-channel sound 有权
    低复杂度解码器,用于多声道声音的复杂变换编码

    公开(公告)号:US08046214B2

    公开(公告)日:2011-10-25

    申请号:US11767457

    申请日:2007-06-22

    IPC分类号: G10L19/00

    CPC分类号: G10L19/008

    摘要: A multi-channel audio decoder provides a reduced complexity processing to reconstruct multi-channel audio from an encoded bitstream in which the multi-channel audio is represented as a coded subset of the channels along with a complex channel correlation matrix parameterization. The decoder translates the complex channel correlation matrix parameterization to a real transform that satisfies the magnitude of the complex channel correlation matrix. The multi-channel audio is derived from the coded subset of channels via channel extension processing using a real value effect signal and real number scaling.

    摘要翻译: 多声道音频解码器提供了一种降低复杂度的处理,从编码比特流重建多声道音频,其中多声道音频被表示为频道的编码子集以及复信道相关矩阵参数化。 解码器将复信道相关矩阵参数化转换为满足复信道相关矩阵幅度的实数变换。 多声道音频通过使用实数值效应信号和实数缩放的信道扩展处理从编码的信道子集导出。

    Selectively using multiple entropy models in adaptive coding and decoding
    50.
    发明申请
    Selectively using multiple entropy models in adaptive coding and decoding 有权
    在自适应编码和解码中选择性地使用多个熵模型

    公开(公告)号:US20070016418A1

    公开(公告)日:2007-01-18

    申请号:US11183266

    申请日:2005-07-15

    IPC分类号: G10L15/00

    摘要: Techniques and tools for selectively using multiple entropy models in adaptive coding and decoding are described herein. For example, for multiple symbols, an audio encoder selects an entropy model from a first model set that includes multiple entropy models. Each of the multiple entropy models includes a model switch point for switching to a second model set that includes one or more entropy models. The encoder processes the multiple symbols using the selected entropy model and outputs results. Techniques and tools for generating entropy models are also described.

    摘要翻译: 本文描述了在自适应编码和解码中选择性地使用多个熵模型的技术和工具。 例如,对于多个符号,音频编码器从包括多个熵模型的第一模型集合中选择熵模型。 多个熵模型中的每一个包括用于切换到包括一个或多个熵模型的第二模型集合的模型切换点。 编码器使用所选择的熵模型处理多个符号并输出结果。 还描述了用于生成熵模型的技术和工具。