APPARATUS AND METHOD FOR CODING AND DECODING RESIDUAL SIGNAL
    1.
    发明申请
    APPARATUS AND METHOD FOR CODING AND DECODING RESIDUAL SIGNAL 审中-公开
    用于编码和解码残留信号的装置和方法

    公开(公告)号:US20090210219A1

    公开(公告)日:2009-08-20

    申请号:US12420215

    申请日:2009-04-08

    IPC分类号: G10L21/00

    摘要: Provided is a residual signal coding/decoding apparatus and method. The residual signal coding apparatus includes a transformer, a band splitter, a pulse searcher, and a pulse quantizer. The transformer transforms time-domain residual signals into a frequency domain to output transform coefficients. The band splitter splits the transform coefficients into bands to output the transform coefficients. The pulse searcher searches the transform coefficients for the respective bands to select optimal pulses and output parameters of the optimal pulses. The pulse quantizer quantizes the parameters of the optimal pulses.

    摘要翻译: 提供了一种残留信号编码/解码装置和方法。 残余信号编码装置包括变压器,分束器,脉冲搜索器和脉冲量化器。 变压器将时域残差信号变换为频域,输出变换系数。 频带分离器将变换系数分成频带以输出变换系数。 脉冲搜索器搜索各个频带的变换系数,以选择最佳脉冲和最佳脉冲的输出参数。 脉冲量化器对最佳脉冲的参数进行量化。

    MDCT domain post-filtering apparatus and method for quality enhancement of speech
    2.
    发明申请
    MDCT domain post-filtering apparatus and method for quality enhancement of speech 失效
    MDCT域后置滤波装置及语音质量提升方法

    公开(公告)号:US20090150143A1

    公开(公告)日:2009-06-11

    申请号:US12155542

    申请日:2008-06-05

    IPC分类号: G10L21/00 G10L19/00

    CPC分类号: G10L19/26 G10L19/0212

    摘要: A post-filtering apparatus and method for speech enhancement in a modified discrete cosine transform (MDCT) domain are disclosed. In the apparatus and method, previous and current MDCT coefficients are used for obtaining a speech spectrum coefficient similar to a real speech spectrum, and a convex function is used for transforming the speech spectrum coefficient and obtaining a post-filter coefficient so that difference can increase in the case where the speech spectrum coefficient is small but decrease in the case where the coefficient is large. Then, the post-filter coefficient is applied to the MDCT coefficient. With this configuration, both the current and previous MDCT values are used, so that it is possible to obtain a spectrum coefficient similar to the real speech spectrum and to obtain a more accurate filter coefficient. Further, the coefficient is adaptively transformed through the convex function, thereby enhancing speech quality.

    摘要翻译: 公开了一种用于修正的离散余弦变换(MDCT)域中的语音增强的后置滤波装置和方法。 在该装置和方法中,先前和当前的MDCT系数被用于获得类似于真实语音频谱的语音频谱系数,并且使用凸函数来变换语音频谱系数并获得后置滤波器系数,使得差值可以增加 在语音频谱系数小的情况下,在系数大的情况下减少。 然后,将后滤波器系数应用于MDCT系数。 利用该配置,使用当前和先前的MDCT值,使得可以获得与真实语音频谱相似的频谱系数并获得更准确的滤波器系数。 此外,系数通过凸函数自适应地变换,从而提高语音质量。

    ENCODING APPARATUS AND METHOD AND DECODING APPARATUS AND METHOD OF AUDIO/VOICE SIGNAL PROCESSING APPARATUS
    3.
    发明申请
    ENCODING APPARATUS AND METHOD AND DECODING APPARATUS AND METHOD OF AUDIO/VOICE SIGNAL PROCESSING APPARATUS 审中-公开
    编码装置和方法和解码装置和音频/语音信号处理装置的方法

    公开(公告)号:US20110153337A1

    公开(公告)日:2011-06-23

    申请号:US12957027

    申请日:2010-11-30

    IPC分类号: G10L19/00

    CPC分类号: G10L19/10

    摘要: An encoding apparatus is provided. The encoding apparatus includes a track structure determiner determining a track structure using frequency coefficients, a frequency coefficient allocator allocating the frequency coefficients to each track according to the determined track structure, and a quantizer quantizing one or more pulses in each track based on a number of frequency coefficients allocated to a corresponding track. The encoding apparatus can prevent the degradation of sound quality by avoiding the problem faced by most sinusoidal quantization techniques using a fixed track structure, i.e., a failure to quantize all pulses due to mismatches between the pulse distribution of frequency coefficients and a track structure.

    摘要翻译: 提供一种编码装置。 编码装置包括:轨道结构确定器,使用频率系数确定轨迹结构;频率系数分配器,根据所确定的轨道结构将频率系数分配给每个轨道,以及量化器,基于多个轨道结构的数量量化每个轨道中的一个或多个脉冲 分配给相应轨道的频率系数。 编码装置可以通过避免使用固定轨道结构的大多数正弦量化技术所面临的问题,即由于频率系数的脉冲分布与轨道结构之间的不匹配造成的所有脉冲的量化,从而可以防止音质的恶化。

    ENCODING AND DECODING APPARATUSES FOR IMPROVING SOUND QUALITY OF G.711 CODEC
    4.
    发明申请
    ENCODING AND DECODING APPARATUSES FOR IMPROVING SOUND QUALITY OF G.711 CODEC 失效
    编码和解码设备,用于提高G.711编解码器的声音质量

    公开(公告)号:US20100161322A1

    公开(公告)日:2010-06-24

    申请号:US12640745

    申请日:2009-12-17

    IPC分类号: G10L19/00

    CPC分类号: G10L19/24 G10L19/032

    摘要: An encoding apparatus and a decoding apparatus for reducing the quantization error of a G.711 codec and improving sound quality are provided. The encoding apparatus includes a G.711 encoder which generates a G.711 bitstream by encoding an input audio signal; an enhancement-layer encoder which chooses one of a static bit allocation method and a dynamic bit allocation method that can produce less quantization error based on the input audio signal and the G.711 bitstream, and outputs an enhancement-layer bitstream including encoded additional mantissa information obtained by using the chosen bit allocation method; and a multiplexer which multiplexes the G.711 bitstream and the enhancement-layer bitstream. Therefore, it is possible to reduce the quantization error of a G.711 codec and improve sound quality.

    摘要翻译: 提供了一种用于降低G.711编解码器的量化误差并提高声音质量的编码装置和解码装置。 编码装置包括:G.711编码器,通过编码输入音频信号来产生G.711比特流; 增强层编码器,其选择可以基于输入音频信号和G.711比特流产生较少的量化误差的静态比特分配方法和动态比特分配方法中的一种,并且输出包括经编码的附加尾数的增强层比特流 通过使用选择的比特分配方法获得的信息; 以及复用器,用于复用G.711比特流和增强层比特流。 因此,可以降低G.711编解码器的量化误差,提高音质。

    Pitch conversion method for reducing complexity of transcoder
    5.
    发明申请
    Pitch conversion method for reducing complexity of transcoder 审中-公开
    用于降低代码转换器复杂度的间距转换方法

    公开(公告)号:US20060095255A1

    公开(公告)日:2006-05-04

    申请号:US11261348

    申请日:2005-10-27

    IPC分类号: G10L11/04

    CPC分类号: G10L21/003 G10L19/173

    摘要: The present invention provides a pitch conversion method for reducing complexity of a transcoder for optimizing a speech quality and a complexity using characteristics of encoder in a transmitter and decoder in a receiver. The pitch conversion method for reducing complexity of the transcoder includes: classifying plural frames transmitted from a transmitter into frame units, each having a predetermined number of frame; recognizing a transmitting pitch included in the frame units; deciding a pitch estimation range based on the transmitting pitch; estimating at least one candidate pitch in the pitch estimation range by using a open-loop pitch search operation; and searching a final pitch around the estimated candidate pitch by using a closed-loop pitch search operation.

    摘要翻译: 本发明提供了一种音调转换方法,用于降低代码转换器的复杂度,以便使用接收机中的发射机和解码器中的编码器的特性优化语音质量和复杂度。 用于降低代码转换器的复杂度的音调转换方法包括:将从发送器发送的多个帧分为帧单位,每个帧具有预定数量的帧; 识别包括在所述帧单元中的发送音调; 基于发送间距决定音调估计范围; 通过使用开环音调搜索操作来估计音调估计范围中的至少一个候选音调; 并且通过使用闭环音调搜索操作来搜索所估计的候选音调周围的最后音调。

    Fixed codebook search method through iteration-free global pulse replacement and speech coder using the same method
    6.
    发明授权
    Fixed codebook search method through iteration-free global pulse replacement and speech coder using the same method 失效
    固定码本搜索方法通过无迭代全局脉冲替换和语音编码器采用相同的方法

    公开(公告)号:US08249864B2

    公开(公告)日:2012-08-21

    申请号:US12442554

    申请日:2007-04-11

    IPC分类号: G10L19/00 G10L15/00

    CPC分类号: G10L19/12 G10L2019/0013

    摘要: Provided are a fixed codebook search method based on iteration-free global pulse replacement in a speech codec, and a Code-Excited Linear-Prediction (CELP)-based speech codec using the method. The fixed codebook search method based on iteration-free global pulse replacement in a speech codec includes the steps of: (a) determining an initial codevector using a pulse-position likelihood vector or a correlation vector; (b) calculating a fixed-codebook search criterion value for the initial codevector; (c) calculating fixed-codebook search criterion values for respective codevectors obtained by replacing a pulse of the initial codevector each time for respective tracks, and determining a pulse position generating the largest fixed-codebook search criterion value as a candidate pulse position for the respective tracks, respectively; (d) calculating fixed-codebook search criterion values for respective codevectors of all combinations obtained by replacing at least one pulse position of the initial codevector with the candidate pulse positions of the respective tracks, and determining the largest value of the fixed-codebook search criterion values; and (e) comparing the fixed-codebook search criterion value for the initial codevector obtained in step (b) with the largest value determined in step (d) to determine an optimum fixed codevector.

    摘要翻译: 提供了一种基于语音编解码器中基于无迭代全局脉冲替换的固定码本搜索方法,以及使用该方法的基于码激励线性预测(CELP)的语音编解码器。 基于语音编解码器中基于无迭代全局脉冲替换的固定码本搜索方法包括以下步骤:(a)使用脉冲位置似然矢量或相关矢量确定初始码矢量; (b)计算初始码矢量的固定码本搜索标准值; (c)通过每次针对各个磁道替换初始码矢量的脉冲而获得的各个码矢量的固定码本搜索条件值计算,并且确定产生最大固定码本搜索标准值的脉冲位置作为相应磁道的候选脉冲位置 轨道; (d)通过将所述初始码矢量的至少一个脉冲位置替换为各个轨道的候选脉冲位置而获得的所有组合的相应代码矢量的固定码本搜索标准值计算,并且确定固定码本搜索标准的最大值 价值观 和(e)将步骤(b)中获得的初始码矢量的固定码本搜索标准值与步骤(d)中确定的最大值进行比较,以确定最佳固定码矢量。

    FIXED CODEBOOK SEARCH METHOD THROUGH ITERATION-FREE GLOBAL PULSE REPLACEMENT AND SPEECH CODER USING THE SAME METHOD
    7.
    发明申请
    FIXED CODEBOOK SEARCH METHOD THROUGH ITERATION-FREE GLOBAL PULSE REPLACEMENT AND SPEECH CODER USING THE SAME METHOD 失效
    使用相同方法通过无迭代全球脉冲替换和语音编码器的固定代码搜索方法

    公开(公告)号:US20100088091A1

    公开(公告)日:2010-04-08

    申请号:US12442554

    申请日:2007-04-11

    IPC分类号: G10L19/00

    CPC分类号: G10L19/12 G10L2019/0013

    摘要: Provided are a fixed codebook search method based on iteration-free global pulse replacement in a speech codec, and a Code-Excited Linear-Prediction (CELP)-based speech codec using the method. The fixed codebook search method based on iteration-free global pulse replacement in a speech codec includes the steps of: (a) determining an initial codevector using a pulse-position likelihood vector or a correlation vector; (b) calculating a fixed-codebook search criterion value for the initial codevector; (c) calculating fixed-codebook search criterion values for respective codevectors obtained by replacing a pulse of the initial codevector each time for respective tracks, and determining a pulse position generating the largest fixed-codebook search criterion value as a candidate pulse position for the respective tracks, respectively; (d) calculating fixed-codebook search criterion values for respective codevectors of all combinations obtained by replacing at least one pulse position of the initial codevector with the candidate pulse positions of the respective tracks, and determining the largest value of the fixed-codebook search criterion values; and (e) comparing the fixed-codebook search criterion value for the initial codevector obtained in step (b) with the largest value determined in step (d) to determine an optimum fixed codevector.

    摘要翻译: 提供了一种基于语音编解码器中基于无迭代全局脉冲替换的固定码本搜索方法,以及使用该方法的基于码激励线性预测(CELP)的语音编解码器。 基于语音编解码器中基于无迭代全局脉冲替换的固定码本搜索方法包括以下步骤:(a)使用脉冲位置似然矢量或相关矢量确定初始码矢量; (b)计算初始码矢量的固定码本搜索标准值; (c)通过每次针对各个磁道替换初始码矢量的脉冲而获得的各个码矢量的固定码本搜索条件值计算,并且确定产生最大固定码本搜索标准值的脉冲位置作为相应磁道的候选脉冲位置 轨道; (d)通过将所述初始码矢量的至少一个脉冲位置替换为各个轨道的候选脉冲位置而获得的所有组合的相应代码矢量的固定码本搜索标准值计算,并且确定固定码本搜索标准的最大值 价值观 和(e)将步骤(b)中获得的初始码矢量的固定码本搜索标准值与步骤(d)中确定的最大值进行比较,以确定最佳固定码矢量。

    Apparatus and method for coding and decoding residual signal
    8.
    发明申请
    Apparatus and method for coding and decoding residual signal 有权
    剩余信号编码和解码的装置和方法

    公开(公告)号:US20060277040A1

    公开(公告)日:2006-12-07

    申请号:US11441955

    申请日:2006-05-26

    IPC分类号: G10L19/00

    摘要: Provided is a residual signal coding/decoding apparatus and method. The residual signal coding apparatus includes a transformer, an LPC coefficient extractor, an LPC coefficient quantizer, an LP analysis filter, a band splitter, a pulse searcher, and a pulse quantizer. The transformer transforms time-domain residual signals into a frequency domain to output transform coefficients. The LPC coefficient extractor extracts LPC coefficients from the transform coefficients. The LPC coefficient quantizer quantizes the LPC coefficients to output quantized LPC coefficients and corresponding indices. The LP analysis filter performs an LP analysis on the transform coefficients to output LP residual transform coefficients. The band splitter splits the LP residual transform coefficients into bands to output the LP residual transform coefficients. The pulse searcher searches the LP residual transform coefficients for the respective bands to select optimal pulses and output parameters of the optimal pulses. The pulse quantizer quantizes the parameters of the optimal pulses.

    摘要翻译: 提供了一种残留信号编码/解码装置和方法。 残余信号编码装置包括变压器,LPC系数提取器,LPC系数量化器,LP分析滤波器,带分离器,脉冲搜索器和脉冲量化器。 变压器将时域残差信号变换为频域,输出变换系数。 LPC系数提取器从变换系数中提取LPC系数。 LPC系数量化器量化LPC系数以输出量化的LPC系数和相应的索引。 LP分析滤波器对变换系数执行LP分析以输出LP残差变换系数。 频带分离器将LP残差变换系数分解成频带以输出LP残差变换系数。 脉冲搜索器搜索各个频带的LP残差变换系数,以选择最佳脉冲和最佳脉冲的输出参数。 脉冲量化器对最佳脉冲的参数进行量化。

    MDCT domain post-filtering apparatus and method for quality enhancement of speech
    9.
    发明授权
    MDCT domain post-filtering apparatus and method for quality enhancement of speech 失效
    MDCT域后置滤波装置及语音质量提升方法

    公开(公告)号:US08315853B2

    公开(公告)日:2012-11-20

    申请号:US12155542

    申请日:2008-06-05

    CPC分类号: G10L19/26 G10L19/0212

    摘要: A post-filtering apparatus and method for speech enhancement in a modified discrete cosine transform (MDCT) domain are disclosed. In the apparatus and method, previous and current MDCT coefficients are used for obtaining a speech spectrum coefficient similar to a real speech spectrum, and a convex function is used for transforming the speech spectrum coefficient and obtaining a post-filter coefficient so that difference can increase in the case where the speech spectrum coefficient is small but decrease in the case where the coefficient is large. Then, the post-filter coefficient is applied to the MDCT coefficient. With this configuration, both the current and previous MDCT values are used, so that it is possible to obtain a spectrum coefficient similar to the real speech spectrum and to obtain a more accurate filter coefficient. Further, the coefficient is adaptively transformed through the convex function, thereby enhancing speech quality.

    摘要翻译: 公开了一种用于修正的离散余弦变换(MDCT)域中的语音增强的后置滤波装置和方法。 在该装置和方法中,先前和当前的MDCT系数被用于获得类似于真实语音频谱的语音频谱系数,并且使用凸函数来变换语音频谱系数并获得后置滤波器系数,使得差值可以增加 在语音频谱系数小的情况下,在系数大的情况下减少。 然后,将后滤波器系数应用于MDCT系数。 利用该配置,使用当前和先前的MDCT值,使得可以获得与真实语音频谱相似的频谱系数并获得更准确的滤波器系数。 此外,系数通过凸函数自适应地变换,从而提高语音质量。

    Apparatus and method for coding residual signals of audio signals into a frequency domain and apparatus and method for decoding the same
    10.
    发明授权
    Apparatus and method for coding residual signals of audio signals into a frequency domain and apparatus and method for decoding the same 有权
    将音频信号的残余信号编码成频域的装置和方法以及用于对其解码的装置和方法

    公开(公告)号:US07599833B2

    公开(公告)日:2009-10-06

    申请号:US11441955

    申请日:2006-05-26

    IPC分类号: G10L19/00 G10L19/12

    摘要: Provided is a residual signal coding/decoding apparatus and method. The residual signal coding apparatus includes a transformer, an LPC coefficient extractor, an LPC coefficient quantizer, an LP analysis filter, a band splitter, a pulse searcher, and a pulse quantizer. The transformer transforms time-domain residual signals into a frequency domain to output transform coefficients. The LPC coefficient extractor extracts LPC coefficients from the transform coefficients. The LPC coefficient quantizer quantizes the LPC coefficients to output quantized LPC coefficients and corresponding indices. The LP analysis filter performs an LP analysis on the transform coefficients to output LP residual transform coefficients. The band splitter splits the LP residual transform coefficients into bands to output the LP residual transform coefficients. The pulse searcher searches the LP residual transform coefficients for the respective bands to select optimal pulses and output parameters of the optimal pulses. The pulse quantizer quantizes the parameters of the optimal pulses.

    摘要翻译: 提供了一种残留信号编码/解码装置和方法。 残余信号编码装置包括变压器,LPC系数提取器,LPC系数量化器,LP分析滤波器,带分离器,脉冲搜索器和脉冲量化器。 变压器将时域残差信号变换为频域,输出变换系数。 LPC系数提取器从变换系数中提取LPC系数。 LPC系数量化器量化LPC系数以输出量化的LPC系数和相应的索引。 LP分析滤波器对变换系数执行LP分析以输出LP残差变换系数。 频带分离器将LP残差变换系数分解成频带以输出LP残差变换系数。 脉冲搜索器搜索各个频带的LP残差变换系数,以选择最佳脉冲和最佳脉冲的输出参数。 脉冲量化器对最佳脉冲的参数进行量化。