Multi-Channel Hole-Filling For Audio Compression
    11.
    发明申请
    Multi-Channel Hole-Filling For Audio Compression 审中-公开
    用于音频压缩的多通道孔填充

    公开(公告)号:US20090210222A1

    公开(公告)日:2009-08-20

    申请号:US12032119

    申请日:2008-02-15

    IPC分类号: G10L19/00

    CPC分类号: G10L19/008 G10L19/0204

    摘要: Multi-channel hole-filling for audio compression is disclosed. Channel dependency groups (CDGs) are explicitly extracted based on channel transform information. Holes are detected within each CDG for each bark, and a CDG hole is identified as requiring filling as a particular section of frequency bandwidth larger than a predetermined hole bandwidth threshold and with all zero-value coefficients in all channels after quantizing. Bark weights are adjusted by multiplying the original bark weights with one calculated scalar so as to remove each detected CDG hole.

    摘要翻译: 公开了用于音频压缩的多通道孔填充。 基于信道变换信息明确提取频道依赖组(CDG)。 在每个CDG内为每个树皮检测孔,并且将CDG孔识别为需要填充作为大于预定孔带宽阈值的频率带宽的特定部分,并且在量化之后具有所有通道中的所有零值系数。 通过将原始树皮重量乘以一个计算标量来调整树皮重量,以便去除每个检测到的CDG孔。

    Efficient coding of digital media spectral data using wide-sense perceptual similarity
    13.
    发明授权
    Efficient coding of digital media spectral data using wide-sense perceptual similarity 有权
    使用广义感知相似性对数字媒体光谱数据进行高效编码

    公开(公告)号:US08645127B2

    公开(公告)日:2014-02-04

    申请号:US12324689

    申请日:2008-11-26

    IPC分类号: G10L11/04

    摘要: Traditional audio encoders may conserve coding bit-rate by encoding fewer than all spectral coefficients, which can produce a blurry low-pass sound in the reconstruction. An audio encoder using wide-sense perceptual similarity improves the quality by encoding a perceptually similar version of the omitted spectral coefficients, represented as a scaled version of already coded spectrum. The omitted spectral coefficients are divided into a number of sub-bands. The sub-bands are encoded as two parameters: a scale factor, which may represent the energy in the band; and a shape parameter, which may represent a shape of the band. The shape parameter may be in the form of a motion vector pointing to a portion of the already coded spectrum, an index to a spectral shape in a fixed code-book, or a random noise vector. The encoding thus efficiently represents a scaled version of a similarly shaped portion of spectrum to be copied at decoding.

    摘要翻译: 传统的音频编码器可以通过编码少于所有频谱系数来节省编码比特率,这可以在重建中产生模糊的低通声音。 使用广义感知相似性的音频编码器通过编码被忽略的频谱系数的感知相似版本来提高质量,表示为已编码频谱的缩放版本。 省略的频谱系数被划分为多个子带。 子带被编码为两个参数:比例因子,其可以表示频带中的能量; 以及形状参数,其可以表示带的形状。 形状参数可以是指向已编码频谱的一部分的运动矢量的形式,固定码本中的频谱形状的索引或随机噪声向量。 因此,编码有效地表示在解码时要复制的类似形状的频谱部分的缩放版本。

    ADAPTIVE BANDWIDTH ESTIMATION
    14.
    发明申请
    ADAPTIVE BANDWIDTH ESTIMATION 有权
    自适应带宽估计

    公开(公告)号:US20130114421A1

    公开(公告)日:2013-05-09

    申请号:US13288968

    申请日:2011-11-04

    IPC分类号: H04L12/26

    摘要: It can be determined whether relative one way delay for data packets in a data stream exceeds a delay threshold. If so, then a delay congestion signal indicating that the relative one way delay exceeds the delay threshold can be generated. The delay congestion signal can be used in calculating an adaptive bandwidth estimate for the data stream. A packet loss rate congestion signal may also be used in calculating the bandwidth estimate. It can be determined whether a data stream of data packets is in a contention state. If the data stream is in the contention state, then an adaptive bandwidth estimate can be calculated for the data stream using a first bandwidth estimation technique. If the data stream is not in the contention state, then the bandwidth estimate for the data stream can be calculated using a second bandwidth estimation technique.

    摘要翻译: 可以确定数据流中的数据分组的相对单向延迟是否超过延迟阈值。 如果是,则可以产生指示相对单向延迟超过延迟阈值的延迟拥塞信号。 延迟拥塞信号可用于计算数据流的自适应带宽估计。 丢包率拥塞信号也可用于计算带宽估计。 可以确定数据包的数据流是否处于争用状态。 如果数据流处于竞争状态,则可以使用第一带宽估计技术对数据流计算自适应带宽估计。 如果数据流不处于竞争状态,则可以使用第二带宽估计技术来计算数据流的带宽估计。

    Low complexity decoder for complex transform coding of multi-channel sound
    15.
    发明授权
    Low complexity decoder for complex transform coding of multi-channel sound 有权
    低复杂度解码器,用于多声道声音的复杂变换编码

    公开(公告)号:US08046214B2

    公开(公告)日:2011-10-25

    申请号:US11767457

    申请日:2007-06-22

    IPC分类号: G10L19/00

    CPC分类号: G10L19/008

    摘要: A multi-channel audio decoder provides a reduced complexity processing to reconstruct multi-channel audio from an encoded bitstream in which the multi-channel audio is represented as a coded subset of the channels along with a complex channel correlation matrix parameterization. The decoder translates the complex channel correlation matrix parameterization to a real transform that satisfies the magnitude of the complex channel correlation matrix. The multi-channel audio is derived from the coded subset of channels via channel extension processing using a real value effect signal and real number scaling.

    摘要翻译: 多声道音频解码器提供了一种降低复杂度的处理,从编码比特流重建多声道音频,其中多声道音频被表示为频道的编码子集以及复信道相关矩阵参数化。 解码器将复信道相关矩阵参数化转换为满足复信道相关矩阵幅度的实数变换。 多声道音频通过使用实数值效应信号和实数缩放的信道扩展处理从编码的信道子集导出。

    Coding of sparse digital media spectral data
    17.
    发明授权
    Coding of sparse digital media spectral data 有权
    稀疏数字媒体光谱数据编码

    公开(公告)号:US07774205B2

    公开(公告)日:2010-08-10

    申请号:US11764108

    申请日:2007-06-15

    IPC分类号: G10L21/04

    摘要: An audio encoder/decoder provides efficient compression of spectral transform coefficient data characterized by sparse spectral peaks. The audio encoder/decoder applies a temporal prediction of the frequency position of spectral peaks. The spectral peaks in the transform coefficients that are predicted from those in a preceding transform coding block are encoded as a shift in frequency position from the previous transform coding block and two non-zero coefficient levels. The prediction may avoid coding very large zero-level transform coefficient runs as compared to conventional run length coding. For spectral peaks not predicted from those in a preceding transform coding block, the spectral peaks are encoded as a value trio of a length of a run of zero-level spectral transform coefficients, and two non-zero coefficient levels.

    摘要翻译: 音频编码器/解码器提供以稀疏频谱峰值为特征的频谱变换系数数据的有效压缩。 音频编码器/解码器对频谱峰值的频率位置进行时间预测。 从前一变换编码块中预测的变换系数中的频谱峰值被编码为来自先前变换编码块和两个非零系数电平的频率位置的移位。 与常规游程长度编码相比,预测可以避免编码非常大的零电平变换系数运行。 对于未在前面的变换编码块中预测的频谱峰值,频谱峰值被编码为零电平频谱变换系数的行程的长度和两个非零系数电平的三值。

    Selectively using multiple entropy models in adaptive coding and decoding
    19.
    发明申请
    Selectively using multiple entropy models in adaptive coding and decoding 有权
    在自适应编码和解码中选择性地使用多个熵模型

    公开(公告)号:US20070016418A1

    公开(公告)日:2007-01-18

    申请号:US11183266

    申请日:2005-07-15

    IPC分类号: G10L15/00

    摘要: Techniques and tools for selectively using multiple entropy models in adaptive coding and decoding are described herein. For example, for multiple symbols, an audio encoder selects an entropy model from a first model set that includes multiple entropy models. Each of the multiple entropy models includes a model switch point for switching to a second model set that includes one or more entropy models. The encoder processes the multiple symbols using the selected entropy model and outputs results. Techniques and tools for generating entropy models are also described.

    摘要翻译: 本文描述了在自适应编码和解码中选择性地使用多个熵模型的技术和工具。 例如,对于多个符号,音频编码器从包括多个熵模型的第一模型集合中选择熵模型。 多个熵模型中的每一个包括用于切换到包括一个或多个熵模型的第二模型集合的模型切换点。 编码器使用所选择的熵模型处理多个符号并输出结果。 还描述了用于生成熵模型的技术和工具。

    Modification of codewords in dictionary used for efficient coding of digital media spectral data
    20.
    发明申请
    Modification of codewords in dictionary used for efficient coding of digital media spectral data 有权
    用于数字媒体频谱数据有效编码的字典中的码字的修改

    公开(公告)号:US20070016414A1

    公开(公告)日:2007-01-18

    申请号:US11183084

    申请日:2005-07-15

    IPC分类号: G10L19/00

    CPC分类号: G10L19/038 G10L19/24

    摘要: Coding of spectral data by representing certain portions of the spectral data as a scaled version of a code-vector, where the code-vector is chosen from either a fixed predetermined codebook or a codebook taken from a baseband. Various optional features are described for modifying the code-vectors in the codebook according to some rules which allow the code-vector to better represent the data they are modeling. The code-vector modification comprises a linear or non-linear transform of one or more code-vectors, such as, by exponentiation, negation, reversing, or combining elements from plural code-vectors.

    摘要翻译: 通过将频谱数据的某些部分表示为编码矢量的缩放版本来编码频谱数据,其中从固定预定码本或从基带获取的码本中选择码矢量。 描述了各种可选特征,用于根据允许代码矢量更好地表示其正在建模的数据的一些规则来修改码本中的码矢量。 码矢量修改包括一个或多个码矢量的线性或非线性变换,例如通过取幂,否定,反转或从多个码矢量组合元素。