Method and apparatus for processing audio frames to transition between different codecs
    1.
    发明授权
    Method and apparatus for processing audio frames to transition between different codecs 有权
    用于处理音频帧以在不同编解码器之间转换的方法和装置

    公开(公告)号:US09043201B2

    公开(公告)日:2015-05-26

    申请号:US13342462

    申请日:2012-01-03

    摘要: A method (700, 800) and apparatus (100, 200) processes audio frames to transition between different codecs. The method can include producing (720), using a first coding method, a first frame of coded output audio samples by coding a first audio frame in a sequence of frames. The method can include forming (730) an overlap-add portion of the first frame using the first coding method. The method can include generating (740) a combination first frame of coded audio samples based on combining the first frame of coded output audio samples with the overlap-add portion of the first frame. The method can include initializing (760) a state of a second coding method based on the combination first frame of coded audio samples. The method can include constructing (770) an output signal based on the initialized state of the second coding method.

    摘要翻译: 方法(700,800)和装置(100,200)处理音频帧以在不同编解码器之间转换。 该方法可以包括:使用第一编码方法,通过对帧序列中的第一音频帧进行编码来产生编码的输出音频样本的第一帧(720)。 该方法可以包括使用第一编码方法形成(730)第一帧的重叠部分。 该方法可以包括:通过将编码的输出音频样本的第一帧与第一帧的重叠相加部分组合来产生(740)编码音频样本的组合第一帧。 该方法可以包括基于编码音频样本的组合第一帧来初始化(760)第二编码方法的状态。 该方法可以包括基于第二编码方法的初始化状态来构造(770)输出信号。

    Method and Apparatus for Processing Audio Frames to Transition Between Different Codecs
    2.
    发明申请
    Method and Apparatus for Processing Audio Frames to Transition Between Different Codecs 有权
    用于处理音频帧以转换不同编解码器的方法和装置

    公开(公告)号:US20130173259A1

    公开(公告)日:2013-07-04

    申请号:US13342462

    申请日:2012-01-03

    IPC分类号: G10L19/00

    摘要: A method (700, 800) and apparatus (100, 200) processes audio frames to transition between different codecs. The method can include producing (720), using a first coding method, a first frame of coded output audio samples by coding a first audio frame in a sequence of frames. The method can include forming (730) an overlap-add portion of the first frame using the first coding method. The method can include generating (740) a combination first frame of coded audio samples based on combining the first frame of coded output audio samples with the overlap-add portion of the first frame. The method can include initializing (760) a state of a second coding method based on the combination first frame of coded audio samples. The method can include constructing (770) an output signal based on the initialized state of the second coding method.

    摘要翻译: 方法(700,800)和装置(100,200)处理音频帧以在不同编解码器之间转换。 该方法可以包括:使用第一编码方法,通过对帧序列中的第一音频帧进行编码来产生编码的输出音频样本的第一帧(720)。 该方法可以包括使用第一编码方法形成(730)第一帧的重叠部分。 该方法可以包括:通过将编码的输出音频样本的第一帧与第一帧的重叠相加部分组合来产生(740)编码音频样本的组合第一帧。 该方法可以包括基于编码音频样本的组合第一帧来初始化(760)第二编码方法的状态。 该方法可以包括基于第二编码方法的初始化状态来构造(770)输出信号。

    Audio signal decoder and method for producing a scaled reconstructed audio signal
    3.
    发明授权
    Audio signal decoder and method for producing a scaled reconstructed audio signal 有权
    音频信号解码器和用于产生缩放的重构音频信号的方法

    公开(公告)号:US08219408B2

    公开(公告)日:2012-07-10

    申请号:US12345117

    申请日:2008-12-29

    IPC分类号: G10L19/00

    CPC分类号: G10L19/24 G10L19/008

    摘要: During operation a multiple channel audio input signal is received and coded to generate a coded audio signal. A balance factor having balance factor components each associated with an audio signal of the multiple channel audio signal is generated. A gain value to be applied to the coded audio signal to generate an estimate of the multiple channel audio signal based on the balance factor and the multiple channel audio signal is determined, with the gain value configured to minimize a distortion value between the multiple channel audio signal and the estimate of the multiple channel audio signal. The representation of the gain value may be output for transmission and/or storage.

    摘要翻译: 在操作期间,接收并编码多声道音频输入信号以产生编码音频信号。 产生具有各自与多声道音频信号的音频信号相关联的平衡因子分量的平衡因子。 确定要应用于编码音频信号以产生基于平衡因子和多声道音频信号的多声道音频信号的估计的增益值,其中增益值被配置为最小化多声道音频之间的失真值 信号和多声道音频信号的估计。 可以输出增益值的表示用于传输和/或存储。

    Encoder that optimizes bit allocation for information sub-parts
    4.
    发明授权
    Encoder that optimizes bit allocation for information sub-parts 有权
    用于优化信息子部分的位分配的编码器

    公开(公告)号:US08207875B2

    公开(公告)日:2012-06-26

    申请号:US12607439

    申请日:2009-10-28

    IPC分类号: H03M7/34

    CPC分类号: H03M7/4006 H03M7/3082

    摘要: A encoder/decoder architecture (200, 300, 700) that uses an arithmetic encoder (220) to encode the MSB portions of the output of a Factorial Pulse Coder (212), that encodes the output of a first-level source encoder (210), e.g., MDCT. Sub-parts (e.g., frequency bands) of portions (e.g., frames) of the signal are suitably sorted in increasing order based on a measure related to signal energy (e.g., signal energy itself). Doing this in a system (100) that overlays Arithmetic Encoding on Factorial Pulse coding results in bits being re-allocated to bands with higher signal energy content, ultimately yielding higher signal quality and higher bit utilization efficiency.

    摘要翻译: 一种编码器/解码器架构(200,300,700),其使用算术编码器(220)对因子脉冲编码器(212)的输出的MSB部分进行编码,其对第一级源编码器(210)的输出进行编码 ),例如MDCT。 基于与信号能量(例如,信号能量本身)相关的测量,以增加的顺序适当地分类信号的部分(例如,帧)的子部分(例如,频带)。 在覆盖因子脉冲编码的算术编码的系统(100)中进行这一操作导致将比特重新分配给具有较高信号能量内容的频带,最终产生更高的信号质量和更高的比特利用效率。

    AUDIO SIGNAL BANDWIDTH EXTENSION IN CELP-BASED SPEECH CODER
    5.
    发明申请
    AUDIO SIGNAL BANDWIDTH EXTENSION IN CELP-BASED SPEECH CODER 有权
    基于CELP的语音编码器中的音频信号带宽扩展

    公开(公告)号:US20120095757A1

    公开(公告)日:2012-04-19

    申请号:US13247129

    申请日:2011-09-28

    IPC分类号: G10L19/12

    CPC分类号: G10L21/038

    摘要: A method for decoding an audio signal having a bandwidth that extends beyond a bandwidth of a CELP excitation signal in an audio decoder including a CELP-based decoder element. The method includes obtaining a second excitation signal having an audio bandwidth extending beyond the audio bandwidth of the CELP excitation signal, obtaining a set of signals by filtering the second excitation signal with a set of bandpass filters, scaling the set of signals using a set of energy-based parameters, and obtaining a composite output signal by combining the scaled set of signals with a signal based on the audio signal decoded by the CELP-based decoder element.

    摘要翻译: 一种用于对包括基于CELP的解码器元件的音频解码器中具有超出CELP激励信号的带宽的带宽的音频信号进行解码的方法。 该方法包括获得具有超出CELP激励信号的音频带宽的音频带宽的第二激励信号,通过用一组带通滤波器对第二激励信号进行滤波来获得一组信号,使用一组 基于能量的参数,并且通过将缩放的信号组合与基于由基于CELP的解码器元素解码的音频信号的信号组合来获得复合输出信号。

    Hybrid arithmetic-combinatorial encoder
    6.
    发明授权
    Hybrid arithmetic-combinatorial encoder 有权
    混合算术组合编码器

    公开(公告)号:US08149144B2

    公开(公告)日:2012-04-03

    申请号:US12651303

    申请日:2009-12-31

    IPC分类号: H03M7/38

    CPC分类号: H03M7/3082 H03M7/4006

    摘要: Hybrid range coding/combinatorial coding (FPC) encoders and decoders are provided. Encoding and decoding can be dynamically switched between range coding and combinatorial according to the ratio of ones to the ratio of bits in a partial remaining sequence in order to reduce the computational complexity of encoding and decoding.

    摘要翻译: 提供混合范围编码/组合编码(FPC)编码器和解码器。 编码和解码可以在范围编码和组合之间根据比例与部分剩余序列中的比特比进行动态切换,以减少编码和解码的计算复杂度。

    Selective scaling mask computation based on peak detection
    7.
    发明授权
    Selective scaling mask computation based on peak detection 有权
    基于峰值检测的选择性缩放掩码计算

    公开(公告)号:US08140342B2

    公开(公告)日:2012-03-20

    申请号:US12345141

    申请日:2008-12-29

    IPC分类号: G10L19/00

    CPC分类号: G10L19/24

    摘要: A set of peaks in a reconstructed audio vector Ŝ of a received audio signal is detected and a scaling mask ψ(Ŝ) based on the detected set of peaks is generated. A gain vector g* is generated based on at least the scaling mask and an index j representative of the gain vector. The reconstructed audio signal is scaled with the gain vector to produce a scaled reconstructed audio signal. A distortion is generated based on the audio signal and the scaled reconstructed audio signal. The index of the gain vector based on the generated distortion is output.

    摘要翻译: 检测接收到的音频信号的重构音频向量中的一组峰值,并且生成基于所检测的峰值组的缩放掩码ψ(Ŝ)。 基于至少缩放掩码和表示增益矢量的索引j生成增益向量g *。 重建音频信号用增益矢量进行缩放以产生缩放的重构音频信号。 基于音频信号和缩放的重构音频信号产生失真。 输出基于产生的失真的增益矢量的索引。

    ENCODER FOR AUDIO SIGNAL INCLUDING GENERIC AUDIO AND SPEECH FRAMES
    8.
    发明申请
    ENCODER FOR AUDIO SIGNAL INCLUDING GENERIC AUDIO AND SPEECH FRAMES 有权
    音频信号编码器,包括一般音频和语音框架

    公开(公告)号:US20110218797A1

    公开(公告)日:2011-09-08

    申请号:US12844199

    申请日:2010-07-27

    IPC分类号: G10L19/00

    摘要: A method for encoding audio frames by producing a first frame of coded audio samples by coding a first audio frame in a sequence of frames, producing at least a portion of a second frame of coded audio samples by coding at least a portion of a second audio frame in the sequence of frames, and producing parameters for generating audio gap filler samples, wherein the parameters are representative of either a weighted segment of the first frame of coded audio samples or a weighted segment of the portion of the second frame of coded audio samples.

    摘要翻译: 一种通过以帧序列编码第一音频帧来产生编码音频样本的第一帧来对音频帧进行编码的方法,通过编码第二音频的至少一部分来产生编码音频采样的第二帧的至少一部分 帧,并产生用于产生音频间隙填充样本的参数,其中参数表示编码音频样本的第一帧的加权段或编码音频样本的第二帧的部分的加权段 。

    Encoder that Optimizes Bit Allocation for Information Sub-Parts
    9.
    发明申请
    Encoder that Optimizes Bit Allocation for Information Sub-Parts 有权
    优化信息子部分的位分配的编码器

    公开(公告)号:US20110096830A1

    公开(公告)日:2011-04-28

    申请号:US12607439

    申请日:2009-10-28

    IPC分类号: H04N7/12

    CPC分类号: H03M7/4006 H03M7/3082

    摘要: A encoder/decoder architecture (200, 300, 700) that uses an arithmetic encoder (220) to encode the MSB portions of the output of a Factorial Pulse Coder (212), that encodes the output of a first-level source encoder (210), e.g., MDCT. Sub-parts (e.g., frequency bands) of portions (e.g., frames) of the signal are suitably sorted in increasing order based on a measure related to signal energy (e.g., signal energy itself). Doing this in a system (100) that overlays Arithmetic Encoding on Factorial Pulse coding results in bits being re-allocated to bands with higher signal energy content, ultimately yielding higher signal quality and higher bit utilization efficiency.

    摘要翻译: 一种编码器/解码器架构(200,300,700),其使用算术编码器(220)对因子脉冲编码器(212)的输出的MSB部分进行编码,其对第一级源编码器(210)的输出进行编码 ),例如MDCT。 基于与信号能量(例如,信号能量本身)相关的测量,以增加的顺序适当地分类信号的部分(例如,帧)的子部分(例如,频带)。 在覆盖因子脉冲编码的算术编码的系统(100)中进行这一操作导致将比特重新分配给具有较高信号能量内容的频带,最终产生更高的信号质量和更高的比特利用效率。

    Method and apparatus for speech coding
    10.
    发明授权
    Method and apparatus for speech coding 有权
    用于语音编码的方法和装置

    公开(公告)号:US07792670B2

    公开(公告)日:2010-09-07

    申请号:US10964861

    申请日:2004-10-14

    IPC分类号: G10L19/00

    CPC分类号: G10L19/09

    摘要: A method and apparatus for prediction in a speech-coding system is provided herein. The method of a 1st order long-term predictor (LTP) filter, using a sub-sample resolution delay, is extended to a multi-tap LTP filter, or, viewed from another vantage point, the conventional integer-sample resolution multi-tap LTP filter is extended to use sub-sample resolution delay. This novel formulation of a multi-tap LTP filter offers a number of advantages over the prior-art LTP filter configurations. Particularly, defining the lag with sub-sample resolution makes it possible to explicitly model the delay values that have a fractional component, within the limits of resolution of the over-sampling factor used by the interpolation filter. The coefficients of such a multi-tap LTP filter are thus largely freed from modeling the effect of delays that have a fractional component. Consequently their main function is to maximize the prediction gain of the LTP filter via modeling the degree of periodicity that is present and by imposing spectral shaping.

    摘要翻译: 本发明提供了语音编码系统中的预测方法和装置。 使用子样本分辨率延迟的1阶长期预测器(LTP)滤波器的方法被扩展到多抽头LTP滤波器,或者从另一个有利位置观察传统的整数样本分辨率多抽头 LTP过滤器扩展到使用子样本分辨率延迟。 多抽头LTP滤波器的这种新颖配方提供了优于现有技术的LTP滤波器配置的许多优点。 特别地,使用子样本分辨率定义滞后使得可以在内插滤波器使用的过采样因子的分辨率的限度内明确地建模具有分数分量的延迟值。 因此,这种多抽头LTP滤波器的系数在很大程度上不会对具有分数分量的延迟的影响进行建模。 因此,它们的主要功能是通过建模存在的周期程度和施加频谱整形来最大化LTP滤波器的预测增益。