Scalable audio encoding and/or decoding method and apparatus
    21.
    发明申请
    Scalable audio encoding and/or decoding method and apparatus 审中-公开
    可扩展音频编码和/或解码方法和装置

    公开(公告)号:US20070040709A1

    公开(公告)日:2007-02-22

    申请号:US11485468

    申请日:2006-07-13

    IPC分类号: H03M7/00

    CPC分类号: G10L19/0208

    摘要: A method and apparatus to scalably encode and/or decode an audio signal includes encoding a specific band signal included in an input signal, encoding a frequency envelope of an excited signal in which the encoded specific band signal is removed from the input signal, encoding a residual signal in which the encoded frequency envelope is removed from the excited signal, and forming a bit-stream by scalably packing the encoded specific band signal, frequency envelop, and residual signal.

    摘要翻译: 一种用于对音频信号进行可缩放编码和/或解码的方法和装置包括编码包括在输入信号中的特定频带信号,对其中去除编码的特定频带信号的激励信号的频率包络进行编码, 残留信号,其中编码的频率包络从激励信号中去除,以及通过可扩展地打包编码的特定频带信号,频率包络和残余信号来形成比特流。

    Speech signal compression and/or decompression method, medium, and apparatus
    22.
    发明申请
    Speech signal compression and/or decompression method, medium, and apparatus 有权
    语音信号压缩和/或解压缩方法,媒体和装置

    公开(公告)号:US20060020453A1

    公开(公告)日:2006-01-26

    申请号:US11128432

    申请日:2005-05-13

    IPC分类号: G10L19/00

    CPC分类号: G10L19/025

    摘要: A speech signal compression and/or decompression method, medium, and apparatus in which the speech signal is transformed into the frequency domain for quantizing and dequantizing information of frequency coefficients. The speech signal compression apparatus includes a transform unit to transform a speech signal into the frequency domain and obtain frequency coefficients, a magnitude quantization unit to transform magnitudes of the frequency coefficients, quantize the transformed magnitudes and obtain magnitude quantization indices, a sign quantization unit to quantize signs of the frequency coefficients and obtain sign quantization indices, and a packetizing unit to generate the magnitude and sign quantization indices as a speech packet.

    摘要翻译: 一种语音信号压缩和/或解压缩方法,介质和装置,其中语音信号被变换成频域以量化和去量化频率系数的信息。 语音信号压缩装置包括将语音信号变换为频域并获得频率系数的变换单元,变换频率系数的幅度量化单位,量化变换幅度并获得幅度量化索引,符号量化单元, 量化频率系数的符号并获得符号量化索引,以及分组单元,用于生成幅度和符号量化索引作为语音分组。

    Method and apparatus to search fixed codebook using tracks of a trellis structure with each track being a union of tracks of an algebraic codebook
    23.
    发明授权
    Method and apparatus to search fixed codebook using tracks of a trellis structure with each track being a union of tracks of an algebraic codebook 有权
    使用网格结构的轨道来搜索固定码本的方法和装置,每个轨道是代数码本的轨道的并集

    公开(公告)号:US08560306B2

    公开(公告)日:2013-10-15

    申请号:US11457251

    申请日:2006-07-13

    摘要: A method and apparatus to search a codebook including pulses that model a predetermined component of a speech signal. The method includes the operations of selecting a predetermined number of paths corresponding to a predetermined number of pulse locations that are most consistent with the predetermined component, from among paths corresponding to pulse locations of a predetermined pulse location set allocated to at least one branch that connects one state of a predetermined Trellis structure to another state, performing the path selecting operation on each of states other than the one state, and selecting a path corresponding to pulse locations that are most consistent with the predetermined component from among paths including the selected paths, wherein each path corresponds to a union of plural tracks of an algebraic codebook. Accordingly, a number of calculations required during a codebook search is reduced.

    摘要翻译: 一种搜索包括对语音信号的预定分量进行建模的脉冲的码本的方法和装置。 该方法包括从对应于分配给连接至少一个分支的预定脉冲位置集的脉冲位置的路径中选择对应于与预定分量最一致的预定数量的脉冲位置的预定数量的路径的操作 将预定网格结构的一个状态转换到另一状态,对除了一个状态之外的每个状态执行路径选择操作,并且从包括所选择的路径的路径中选择与预定分量最一致的脉冲位置对应的路径, 其中每个路径对应于代数码本的多个轨道的并集。 因此,减少了码本搜索期间所需的一些计算。

    FRAME ERASURE CONCEALMENT FOR A MULTI RATE SPEECH AND AUDIO CODEC
    24.
    发明申请
    FRAME ERASURE CONCEALMENT FOR A MULTI RATE SPEECH AND AUDIO CODEC 有权
    多语音和音频编解码器的帧擦除保护

    公开(公告)号:US20120265523A1

    公开(公告)日:2012-10-18

    申请号:US13443204

    申请日:2012-04-10

    IPC分类号: G10L21/00 G10L19/00

    摘要: An audio coding terminal and method is provided. The terminal includes a coding mode setting unit to set an operation mode, from plural operation modes, for input audio coding by a codec, configured to code the input audio based on the set operation mode such that when the set operation mode is a high frame erasure rate (FER) mode the codec codes a current frame of the input audio according to a select frame erasure concealment (FEC) mode of one or more FEC modes. Upon the setting of the operation mode to be the High FER mode, the one FEC mode is selected, from the one or more FEC modes predetermined for the High FER mode, to control the codec by incorporating of redundancy within a coding of the input audio or as separate redundancy information separate from the coded input audio according to the selected one FEC mode.

    摘要翻译: 提供一种音频编码终端和方法。 该终端包括编码模式设置单元,用于根据多个操作模式设置用于通过编解码器进行输入音频编码的操作模式,用于基于所设置的操作模式对输入音频进行编码,使得当所设置的操作模式为高帧时 擦除率(FER)模式,编解码器根据一种或多种FEC模式的选择帧擦除隐藏(FEC)模式对输入音频的当前帧进行编码。 在将操作模式设置为高FER模式时,从针对高FER模式预定的一个或多个FEC模式中选择一个FEC模式,以通过在输入音频的编码中并入冗余来控制编解码器 或者根据所选择的一种FEC模式作为单独的冗余信息与编码的输入音频分离。

    Speech signal compression and/or decompression method, medium, and apparatus
    25.
    发明授权
    Speech signal compression and/or decompression method, medium, and apparatus 有权
    语音信号压缩和/或解压缩方法,媒体和装置

    公开(公告)号:US08019600B2

    公开(公告)日:2011-09-13

    申请号:US11128432

    申请日:2005-05-13

    IPC分类号: G10L19/00 G10L19/02 G10L19/14

    CPC分类号: G10L19/025

    摘要: A speech signal compression and/or decompression method, medium, and apparatus in which the speech signal is transformed into the frequency domain for quantizing and dequantizing information of frequency coefficients. The speech signal compression apparatus includes a transform unit to transform a speech signal into the frequency domain and obtain frequency coefficients, a magnitude quantization unit to transform magnitudes of the frequency coefficients, quantize the transformed magnitudes and obtain magnitude quantization indices, a sign quantization unit to quantize signs of the frequency coefficients and obtain sign quantization indices, and a packetizing unit to generate the magnitude and sign quantization indices as a speech packet.

    摘要翻译: 一种语音信号压缩和/或解压缩方法,介质和装置,其中语音信号被变换成频域以量化和去量化频率系数的信息。 语音信号压缩装置包括将语音信号变换为频域并获得频率系数的变换单元,变换频率系数的幅度量化单位,量化变换幅度并获得幅度量化索引,符号量化单元, 量化频率系数的符号并获得符号量化索引,以及分组单元,用于生成幅度和符号量化索引作为语音分组。

    Method and apparatus to quantize/dequantize frequency amplitude data and method and apparatus to audio encode/decode using the method and apparatus to quantize/dequantize frequency amplitude data
    26.
    发明授权
    Method and apparatus to quantize/dequantize frequency amplitude data and method and apparatus to audio encode/decode using the method and apparatus to quantize/dequantize frequency amplitude data 有权
    使用该方法和装置来量化/去量化频率振幅数据的方法和装置以及方法和装置进行音频编码/解码以量化/去量化频率振幅数据

    公开(公告)号:US07805314B2

    公开(公告)日:2010-09-28

    申请号:US11471635

    申请日:2006-06-21

    IPC分类号: G10L19/00 G10L19/02

    摘要: A method and apparatus to quantize/dequantize frequency amplitude data and a method and apparatus to audio encode/decode using the method and apparatus to quantize/dequantize the frequency amplitude data. The method includes calculating and quantizing power of frequency amplitudes for each of a plurality of bands constituting an audio frame, normalizing frequency amplitude data for each of the bands using the quantized power, and quantizing a first one of even-numbered or odd-numbered data among the normalized frequency amplitude data. The method may further include interpolating frequency amplitude data that corresponds to a second one of the even-numbered or odd-numbered frequency amplitude that is not quantized from among the normalized frequency amplitude data using the quantized first one of the even-numbered or odd-numbered data, and quantizing an interpolation error corresponding to a difference between the second frequency amplitude data that is not quantized and the interpolated frequency amplitude data.

    摘要翻译: 一种用于量化/去量化频率幅度数据的方法和装置以及使用该方法和装置对频率振幅数据进行量化/去量化的音频编码/解码的方法和装置。 该方法包括:计算和量化构成音频帧的多个频带中的每个频带的频率幅度的功率,使用量化功率归一化每个频带的频率振幅数据,以及量化偶数或奇数数据中的第一个 在归一化的频率振幅数据中。 该方法可以进一步包括使用偶数或奇数编号的量化的第一个量化的归一化频率幅度数据中对应于未被量化的偶数或奇数频率振幅中的第二频率振幅数据, 量化与未量化的第二频率振幅数据和内插频率振幅数据之间的差对应的内插误差。

    Method, apparatus, and medium for classifying speech signal and method, apparatus, and medium for encoding speech signal using the same
    28.
    发明申请
    Method, apparatus, and medium for classifying speech signal and method, apparatus, and medium for encoding speech signal using the same 有权
    用于对语音信号进行分类的方法,装置和介质以及使用其编码语音信号的方法,装置和介质

    公开(公告)号:US20070038440A1

    公开(公告)日:2007-02-15

    申请号:US11480449

    申请日:2006-07-05

    IPC分类号: G10L19/00

    CPC分类号: G10L19/22 G10L19/022

    摘要: A method, apparatus, and medium for classifying a speech signal and a method, apparatus, and medium for encoding the speech signal using the same are provided. The method for classifying a speech signal includes calculating classification parameters from an input signal having block units, calculating a plurality of classification criteria from the classification parameters, and classifying the level of the input signal using the plurality of classification criteria. The classification parameters include at least one of an energy parameter of the input signal, a cross-correlation parameter between a specific block of a present frame and the input signal, and an integrated cross-correlation parameter obtained by accumulating the cross-correlation parameter.

    摘要翻译: 提供了一种用于对语音信号进行分类的方法,装置和媒体,以及使用该语音信号编码语音信号的方法,装置和媒体。 用于分类语音信号的方法包括从具有块单位的输入信号计算分类参数,从分类参数计算多个分类标准,以及使用多个分类标准对输入信号的等级进行分类。 分类参数包括输入信号的能量参数,当前帧的特定块与输入信号之间的互相关参数,以及通过累加互相关参数而获得的积分互相关参数中的至少一个。

    Method and apparatus to quantize/dequantize frequency amplitude data and method and apparatus to audio encode/decode using the method and apparatus to quantize/dequantize frequency amplitude data
    29.
    发明申请
    Method and apparatus to quantize/dequantize frequency amplitude data and method and apparatus to audio encode/decode using the method and apparatus to quantize/dequantize frequency amplitude data 有权
    使用该方法和装置来量化/去量化频率振幅数据的方法和装置以及方法和装置进行音频编码/解码以量化/去量化频率振幅数据

    公开(公告)号:US20070016417A1

    公开(公告)日:2007-01-18

    申请号:US11471635

    申请日:2006-06-21

    IPC分类号: G10L19/00

    摘要: A method and apparatus to quantize/dequantize frequency amplitude data and a method and apparatus to audio encode/decode using the method and apparatus to quantize/dequantize the frequency amplitude data. The method includes calculating and quantizing power of frequency amplitudes for each of a plurality of bands constituting an audio frame, normalizing frequency amplitude data for each of the bands using the quantized power, and quantizing a first one of even-numbered or odd-numbered data among the normalized frequency amplitude data. The method may further include interpolating frequency amplitude data that corresponds to a second one of the even-numbered or odd-numbered frequency amplitude that is not quantized from among the normalized frequency amplitude data using the quantized first one of the even-numbered or odd-numbered data, and quantizing an interpolation error corresponding to a difference between the second frequency amplitude data that is not quantized and the interpolated frequency amplitude data.

    摘要翻译: 一种用于量化/去量化频率幅度数据的方法和装置以及使用该方法和装置对频率振幅数据进行量化/去量化的音频编码/解码的方法和装置。 该方法包括:计算和量化构成音频帧的多个频带中的每个频带的频率幅度的功率,使用量化功率归一化每个频带的频率振幅数据,以及量化偶数或奇数数据中的第一个 在归一化的频率振幅数据中。 该方法可以进一步包括使用偶数或奇数编号的量化的第一个量化的归一化频率幅度数据中对应于未被量化的偶数或奇数频率振幅中的第二频率振幅数据, 量化与未量化的第二频率振幅数据和内插频率振幅数据之间的差对应的内插误差。

    Audio coding and decoding apparatuses and methods, and recording mediums storing the methods
    30.
    发明申请
    Audio coding and decoding apparatuses and methods, and recording mediums storing the methods 审中-公开
    音频编码和解码装置和方法以及存储方法的记录介质

    公开(公告)号:US20060206316A1

    公开(公告)日:2006-09-14

    申请号:US11333342

    申请日:2006-01-18

    IPC分类号: G10L11/04

    摘要: Audio coding and decoding apparatuses and methods that can optimize the quality of an audio signal including harmonics, and recording mediums storing the methods. An audio coding apparatus includes: a first harmonic coding module performing first harmonic coding on an input audio signal using a pitch lag of the input audio signal and producing a quantized linear prediction coding coefficient; a first detector detecting a first difference audio signal from a difference between an audio signal output from the first harmonic coding module and the input audio signal; a second harmonic coding module performing harmonic coding on the first difference audio signal using the quantized linear prediction coding coefficient and a previous harmonic coding result; a second detector detecting a second difference audio signal obtained from a difference between an audio signal output from the second harmonic coding module and the first difference audio signal; and a code excited linear prediction (CELP) module CELP coding the second difference audio signal using the quantized linear prediction coding coefficient obtained from the first harmonic coding module.

    摘要翻译: 可以优化包括谐波的音频信号的质量的音频编码和解码装置和方法,以及存储方法的记录介质。 音频编码装置包括:第一谐波编码模块,使用输入音频信号的音调滞后对输入音频信号执行一次谐波编码,并产生量化的线性预测编码系数; 第一检测器,从第一谐波编码模块输出的音频信号与输入音频信号之间的差检测第一差分音频信号; 使用量化线性预测编码系数和先前的谐波编码结果对第一差分音频信号执行谐波编码的二次谐波编码模块; 第二检测器,检测从二次谐波编码模块输出的音频信号与第一差音频信号之间的差获得的第二差分音频信号; 以及使用从第一谐波编码模块获得的量化线性预测编码系数对第二差音频信号进行编码的码激励线性预测(CELP)模块CELP。