Frame erasure concealment for a multi rate speech and audio codec
    1.
    Granted patent
    Frame erasure concealment for a multi rate speech and audio codec [In force]

    Publication No.: US09026434B2

    Publication date: 2015-05-05

    Application No.: US13443204

    Application date: 2012-04-10

    Abstract: An audio coding terminal and method are provided. The terminal includes a coding mode setting unit to set an operation mode, from plural operation modes, for input audio coding by a codec configured to code the input audio based on the set operation mode, such that when the set operation mode is a high frame erasure rate (FER) mode, the codec codes a current frame of the input audio according to a selected frame erasure concealment (FEC) mode of one or more FEC modes. Upon the setting of the operation mode to the High FER mode, the one FEC mode is selected, from the one or more FEC modes predetermined for the High FER mode, to control the codec by incorporating redundancy within a coding of the input audio or as separate redundancy information separate from the coded input audio, according to the selected one FEC mode.
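
    The mode logic in this abstract can be illustrated with a minimal Python sketch. Everything named here is hypothetical: the abstract does not name the FEC modes, say how one is selected, or specify what the redundancy contains. The sketch assumes two FEC modes (redundancy inside the coded frame, or redundancy carried separately) and assumes the redundancy is simply a copy of the previous frame.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class OperationMode(Enum):
    NORMAL = "normal"
    HIGH_FER = "high_fer"


class FecMode(Enum):
    # Hypothetical names; the abstract only distinguishes where redundancy goes.
    IN_BAND = "in_band"    # redundancy incorporated within the coded frame
    SEPARATE = "separate"  # redundancy sent as separate information


@dataclass
class CodedFrame:
    payload: bytes
    redundancy: bytes = b""  # separate redundancy information, if any


def code_frame(
    current: bytes,
    previous: bytes,
    operation_mode: OperationMode,
    fec_mode: Optional[FecMode] = None,
) -> CodedFrame:
    """Toy 'codec': passes bytes through and attaches redundancy in High FER mode.

    Assumption: the redundancy is a plain copy of the previous frame.
    """
    if operation_mode is not OperationMode.HIGH_FER:
        # Outside High FER mode no FEC mode applies.
        return CodedFrame(payload=current)
    if fec_mode is None:
        raise ValueError("High FER mode requires a selected FEC mode")
    if fec_mode is FecMode.IN_BAND:
        return CodedFrame(payload=current + previous)
    return CodedFrame(payload=current, redundancy=previous)
```

    With these assumptions, `code_frame(b"cur", b"prev", OperationMode.HIGH_FER, FecMode.SEPARATE)` keeps the payload untouched and carries the previous frame in the `redundancy` field.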

    FRAME ERASURE CONCEALMENT FOR A MULTI RATE SPEECH AND AUDIO CODEC
    2.
    Patent application
    FRAME ERASURE CONCEALMENT FOR A MULTI RATE SPEECH AND AUDIO CODEC [In force]

    Publication No.: US20120265523A1

    Publication date: 2012-10-18

    Application No.: US13443204

    Application date: 2012-04-10

    IPC classification: G10L21/00 G10L19/00

    Abstract: An audio coding terminal and method are provided. The terminal includes a coding mode setting unit to set an operation mode, from plural operation modes, for input audio coding by a codec configured to code the input audio based on the set operation mode, such that when the set operation mode is a high frame erasure rate (FER) mode, the codec codes a current frame of the input audio according to a selected frame erasure concealment (FEC) mode of one or more FEC modes. Upon the setting of the operation mode to the High FER mode, the one FEC mode is selected, from the one or more FEC modes predetermined for the High FER mode, to control the codec by incorporating redundancy within a coding of the input audio or as separate redundancy information separate from the coded input audio, according to the selected one FEC mode.

    APPARATUS AND METHOD FOR CONCEALING FRAME ERASURE AND VOICE DECODING APPARATUS AND METHOD USING THE SAME
    3.
    Patent application
    APPARATUS AND METHOD FOR CONCEALING FRAME ERASURE AND VOICE DECODING APPARATUS AND METHOD USING THE SAME [In force]

    Publication No.: US20120232888A1

    Publication date: 2012-09-13

    Application No.: US13477461

    Application date: 2012-05-22

    IPC classification: G10L19/04

    Abstract: An apparatus and method for concealing frame erasure, and a voice decoding apparatus and method using the same, are provided. The frame erasure concealment apparatus includes: a parameter extraction unit that determines whether there is an erased frame in a voice packet and extracts an excitation signal parameter and a line spectrum pair parameter of a previous good frame; and an erasure frame concealment unit that, if there is an erased frame, restores the excitation signal and line spectrum pair parameter of the erased frame by using a regression analysis from the excitation signal and line spectrum pair parameter of the previous good frame. By predicting and restoring the parameters of the erased frame through the regression analysis, the method and apparatus can enhance the quality of the restored voice signal and simplify the algorithm.
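
    A rough Python sketch of regression-based concealment follows. It is an illustration under stated assumptions, not the patented method: the abstract does not specify the regression model or how many previous good frames are used. The sketch assumes a straight-line least-squares fit over a short history of good frames and extrapolates each parameter one frame ahead. The function names are hypothetical.

```python
def linear_regression_predict(history: list[float]) -> float:
    """Fit y = a*t + b to history at t = 0..n-1 and return the value at t = n.

    Assumption: a first-order (straight-line) model; the abstract does not
    say which regression model is used.
    """
    n = len(history)
    if n == 0:
        raise ValueError("need at least one previous good frame")
    if n == 1:
        return history[0]  # nothing to fit; repeat the last value
    t_mean = (n - 1) / 2
    y_mean = sum(history) / n
    s_tt = sum((t - t_mean) ** 2 for t in range(n))
    s_ty = sum((t - t_mean) * (y - y_mean) for t, y in enumerate(history))
    slope = s_ty / s_tt
    return y_mean + slope * (n - t_mean)


def conceal_parameters(good_frames: list[list[float]]) -> list[float]:
    """Predict each parameter (e.g. one LSP coefficient) of the erased frame.

    good_frames holds one parameter vector per previous good frame, oldest first.
    """
    if not good_frames:
        raise ValueError("need at least one previous good frame")
    order = len(good_frames[0])
    return [
        linear_regression_predict([frame[i] for frame in good_frames])
        for i in range(order)
    ]
```

    The same helper could be applied to an excitation gain or to each line spectrum pair coefficient, since it treats every parameter track independently.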

    Method and apparatus for recovering line spectrum pair parameter and speech decoding apparatus using same
    4.
    Granted patent
    Method and apparatus for recovering line spectrum pair parameter and speech decoding apparatus using same [In force]

    Publication No.: US08214203B2

    Publication date: 2012-07-03

    Application No.: US12659943

    Application date: 2010-03-25

    CPC classification: G10L19/005 G10L19/07

    Abstract: A method and an apparatus for recovering a line spectrum pair (LSP) parameter of a spectrum region when frame loss occurs during speech decoding and a speech decoding apparatus adopting the same are provided. The method of recovering an LSP parameter in speech decoding includes: if it is determined that a received speech packet has an erased frame, converting an LSP parameter of a previous good frame (PGF) of the erased frame or LSP parameters of the PGF and a next good frame (NGF) of the erased frame into a spectrum region and obtaining a spectrum envelope of the PGF or spectrum envelopes of the PGF and NGF; recovering a spectrum envelope of the erased frame using the spectrum envelope of the PGF or the spectrum envelopes of the PGF and NGF; and converting the recovered spectrum envelope of the erased frame into an LSP parameter of the erased frame. The method and apparatus can improve the quality of a recovered speech signal, be applied to a variety of technologies, and provide a method of recovering an LSP parameter for development of an algorithm for speech decoding.
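
    The middle step (recovering the erased frame's spectrum envelope from the PGF alone, or from the PGF and NGF) might look like the Python sketch below. The conversions between LSP parameters and spectrum envelopes are omitted. The recovery rule itself is an assumption, since the abstract does not give one: the sketch repeats the PGF envelope when no NGF is available and otherwise interpolates in the log-magnitude domain. The function name and the `weight` parameter are hypothetical.

```python
import math
from typing import Optional


def recover_envelope(
    pgf_envelope: list[float],
    ngf_envelope: Optional[list[float]] = None,
    weight: float = 0.5,
) -> list[float]:
    """Estimate the spectrum envelope of an erased frame.

    pgf_envelope / ngf_envelope: positive magnitudes per frequency bin.
    weight: position of the erased frame between PGF (0.0) and NGF (1.0).
    Assumption: log-domain linear interpolation; the source gives no formula.
    """
    if ngf_envelope is None:
        # Only the previous good frame is known: reuse its envelope.
        return list(pgf_envelope)
    if len(pgf_envelope) != len(ngf_envelope):
        raise ValueError("envelopes must have the same number of bins")
    return [
        math.exp((1.0 - weight) * math.log(p) + weight * math.log(n))
        for p, n in zip(pgf_envelope, ngf_envelope)
    ]
```

    With `weight=0.5` each recovered bin is the geometric mean of the two neighbouring envelopes, which is one simple way to avoid the louder frame dominating the estimate.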

    Scalable speech coding/decoding apparatus, method, and medium having mixed structure
    5.
    Patent application
    Scalable speech coding/decoding apparatus, method, and medium having mixed structure [In force]

    Publication No.: US20070033023A1

    Publication date: 2007-02-08

    Application No.: US11490139

    Application date: 2006-07-21

    IPC classification: G10L19/02

    Abstract: Provided are a scalable wide-band speech coding/decoding apparatus, method, and medium. A wide-band speech input signal is first divided into a low-band signal and a high-band signal. The divided low-band signal is then coded using a code excited linear prediction (CELP) method. The divided high-band signal is coded using a harmonic method. A signal representing the difference between a synthetic signal obtained from the low band and the high band and the signal input to the low band and the high band is then coded using a modified discrete cosine transform (MDCT) method. The coded signals are then multiplexed, and the multiplexed signal is output. Accordingly, high quality speech can be achieved for all layers.
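
    The last coding layer described here (an MDCT applied to the difference between the input and the synthesized signal) can be sketched in Python as follows. This is a minimal illustration, assuming the textbook MDCT definition with no window and a direct O(N^2) evaluation; the CELP and harmonic coders, quantization, and multiplexing are not shown, and the function names are hypothetical.

```python
import math


def residual(original: list[float], synthesized: list[float]) -> list[float]:
    """Difference between the input signal and the lower layers' synthesis."""
    if len(original) != len(synthesized):
        raise ValueError("signals must have the same length")
    return [o - s for o, s in zip(original, synthesized)]


def mdct(block: list[float]) -> list[float]:
    """Direct MDCT: 2N input samples -> N coefficients.

    X[k] = sum_n x[n] * cos(pi/N * (n + 0.5 + N/2) * (k + 0.5))
    No window is applied here; a practical codec would window and
    overlap consecutive blocks by 50%.
    """
    two_n = len(block)
    if two_n == 0 or two_n % 2:
        raise ValueError("block length must be a positive even number")
    n = two_n // 2
    return [
        sum(
            x * math.cos(math.pi / n * (i + 0.5 + n / 2) * (k + 0.5))
            for i, x in enumerate(block)
        )
        for k in range(n)
    ]
```

    A caller would compute `mdct(residual(input_block, synthesized_block))` per block and pass the coefficients on to quantization.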

    Apparatus and method for concealing frame erasure and voice decoding apparatus and method using the same
    8.
    Granted patent
    Apparatus and method for concealing frame erasure and voice decoding apparatus and method using the same [In force]

    Publication No.: US08204743B2

    Publication date: 2012-06-19

    Application No.: US11417165

    Application date: 2006-05-04

    IPC classification: G10L11/06

    Abstract: An apparatus and method for concealing frame erasure, and a voice decoding apparatus and method using the same, are provided. The frame erasure concealment apparatus includes: a parameter extraction unit that determines whether there is an erased frame in a voice packet and extracts an excitation signal parameter and a line spectrum pair parameter of a previous good frame; and an erasure frame concealment unit that, if there is an erased frame, restores the excitation signal and line spectrum pair parameter of the erased frame by using a regression analysis from the excitation signal and line spectrum pair parameter of the previous good frame. By predicting and restoring the parameters of the erased frame through the regression analysis, the method and apparatus can enhance the quality of the restored voice signal and simplify the algorithm.

    Method, apparatus, and medium for classifying speech signal and method, apparatus, and medium for encoding speech signal using the same
    9.
    Granted patent
    Method, apparatus, and medium for classifying speech signal and method, apparatus, and medium for encoding speech signal using the same [In force]

    Publication No.: US08175869B2

    Publication date: 2012-05-08

    Application No.: US11480449

    Application date: 2006-07-05

    IPC classification: G10L19/00

    CPC classification: G10L19/22 G10L19/022

    Abstract: A method, apparatus, and medium for classifying a speech signal and a method, apparatus, and medium for encoding the speech signal using the same are provided. The method for classifying a speech signal includes calculating classification parameters from an input signal having block units, calculating a plurality of classification criteria from the classification parameters, and classifying the level of the input signal using the plurality of classification criteria. The classification parameters include at least one of an energy parameter of the input signal, a cross-correlation parameter between a specific block of a present frame and the input signal, and an integrated cross-correlation parameter obtained by accumulating the cross-correlation parameter.
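
    The three classification parameters named in this abstract can be sketched in Python as below. The exact definitions are assumptions: the abstract does not say whether the cross-correlation is normalized, how the lag is chosen, or how the accumulation is windowed. The sketch uses a normalized cross-correlation at a caller-supplied lag and a plain running sum. All function names are hypothetical, and the classification criteria derived from these parameters are not shown.

```python
import math


def energy(block: list[float]) -> float:
    """Energy parameter: sum of squared samples in the block."""
    return sum(x * x for x in block)


def cross_correlation(block: list[float], signal: list[float], lag: int) -> float:
    """Normalized cross-correlation between a block and the input signal.

    The block is compared with the segment of `signal` that starts `lag`
    samples in. Returns 0.0 when either side has zero energy.
    """
    if lag < 0:
        raise ValueError("lag must be non-negative")
    segment = signal[lag:lag + len(block)]
    if len(segment) != len(block):
        raise ValueError("signal too short for this block and lag")
    denom = math.sqrt(energy(block) * energy(segment))
    if denom == 0.0:
        return 0.0
    return sum(a * b for a, b in zip(block, segment)) / denom


def integrated_cross_correlation(values: list[float]) -> list[float]:
    """Running sum of successive cross-correlation values."""
    total = 0.0
    out = []
    for v in values:
        total += v
        out.append(total)
    return out
```

    A strongly periodic (voiced) block would give cross-correlation values near 1.0 at the pitch lag, so the running sum grows steadily, while noise-like blocks keep it near zero.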
