Perceptual masking of residual echo
    46.
    Type: Granted patent
    Status: In force

    Publication No.: US07711107B1

    Publication date: 2010-05-04

    Application No.: US11129450

    Filing date: 2005-05-12

    IPC classification: H04M9/08

    CPC classification: H04B3/234

    Abstract: A method of masking a residual echo signal by an echo canceller is provided. The method comprises receiving a far-end signal, adjusting filter coefficients of an adaptive filter in response to the far-end signal, generating an echo model signal based on the far-end signal using the adaptive filter, receiving a near-end signal, subtracting the echo model signal from the near-end signal to generate an output signal, defining a spectral mask based on the near-end signal, wherein the spectral mask is indicative of near-end spectral peaks and near-end spectral valleys, de-emphasizing the output signal in spectral regions of the near-end spectral peaks, and emphasizing the output signal in spectral regions of the near-end spectral valleys, wherein the de-emphasizing occurs during filter coefficients determination for the adaptive filter. A weighted filter may perform the de-emphasizing and the emphasizing operations, where the weighted filter uses medium-term spectral characteristics of the near-end signal.
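The adaptive-filter steps of this abstract (coefficient adjustment, echo model generation, subtraction) can be illustrated with a short sketch. It assumes a normalized LMS (NLMS) update, which the abstract does not name, and it leaves out the spectral mask and weighting filter entirely. The function name, tap count, step size, and the three-tap echo path are all hypothetical values chosen for the illustration.

```python
import random


def nlms_echo_cancel(far_end, near_end, taps=8, mu=0.5, eps=1e-8):
    """Return (output, weights) for a plain NLMS echo canceller.

    Assumption: NLMS stands in for the unspecified adaptation rule.
    The perceptual weighting described in the abstract is NOT modeled.
    """
    w = [0.0] * taps          # adaptive filter coefficients
    history = [0.0] * taps    # recent far-end samples, newest first
    output = []
    for x, d in zip(far_end, near_end):
        history = [x] + history[:-1]
        # Echo model signal: far-end signal passed through the adaptive filter.
        echo_model = sum(wi * hi for wi, hi in zip(w, history))
        # Output signal: near-end signal minus the echo model.
        e = d - echo_model
        # Normalized coefficient update driven by the output signal.
        norm = sum(h * h for h in history) + eps
        w = [wi + mu * e * hi / norm for wi, hi in zip(w, history)]
        output.append(e)
    return output, w


random.seed(0)
far = [random.uniform(-1.0, 1.0) for _ in range(4000)]
echo_path = [0.6, -0.3, 0.1]  # hypothetical echo path, for the demo only
near = [
    sum(echo_path[k] * far[n - k] for k in range(len(echo_path)) if n - k >= 0)
    for n in range(len(far))
]
out, w = nlms_echo_cancel(far, near)
```

In this demo the near-end signal contains only echo, so the output decays toward zero as the coefficients approach the echo path. With near-end speech present, the output would carry that speech plus the residual echo that the patent's spectral weighting is meant to mask.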


    Temporal Envelope Coding of Energy Attack Signal by Using Attack Point Location
    47.
    Type: Patent application
    Status: In force

    Publication No.: US20100063811A1

    Publication date: 2010-03-11

    Application No.: US12554705

    Filing date: 2009-09-04

    Applicant: Yang Gao

    Inventor: Yang Gao

    IPC classification: G10L19/00

    CPC classification: G10L19/025

    Abstract: A method of transceiving an audio signal is disclosed. An input audio signal is provided. It is determined whether an energy attack signal exists within the input audio signal, and a decision flag is set if it does. A temporal location of the energy attack point in the input audio signal is detected. Energy variations before and after the temporal location of the energy attack point are determined. The energy variations are quantized to produce quantized energy variations, and a peak area energy of the input audio signal is quantized to produce a quantized peak area energy. The decision flag, the temporal location of the energy attack point, the quantized energy variations, and the quantized peak area energy are transmitted.
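The detection steps in this abstract (decision flag, attack point location, energy before and after) can be sketched roughly as follows. The abstract does not say how the attack is detected, so the sub-frame energy ratio test, the sub-frame length, and the threshold below are assumptions made for illustration. Quantization and transmission are not shown.

```python
def find_energy_attack(signal, sub_len=32, ratio_threshold=8.0, floor=1e-9):
    """Return (flag, attack_index, energy_before, energy_after).

    Assumption: an attack is declared where the energy of one sub-frame
    exceeds that of the previous sub-frame by more than ratio_threshold.
    """
    # Energy of each non-overlapping sub-frame.
    energies = [
        sum(s * s for s in signal[i:i + sub_len])
        for i in range(0, len(signal) - sub_len + 1, sub_len)
    ]
    # Locate the largest jump in energy between consecutive sub-frames.
    best_ratio, best_k = 0.0, 0
    for k in range(1, len(energies)):
        ratio = energies[k] / (energies[k - 1] + floor)
        if ratio > best_ratio:
            best_ratio, best_k = ratio, k
    if best_ratio <= ratio_threshold:
        return False, None, None, None  # decision flag not set
    # Mean sub-frame energy before and after the attack point.
    before = sum(energies[:best_k]) / best_k
    after = sum(energies[best_k:]) / (len(energies) - best_k)
    return True, best_k * sub_len, before, after


quiet = [0.01] * 128
loud = [1.0] * 128
flag, pos, before, after = find_energy_attack(quiet + loud)
print(flag, pos)  # True 128
```

A steady signal such as `[0.5] * 256` produces no jump above the threshold, so the flag stays unset and no location is reported.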


    Speech post-processing using MDCT coefficients
    48.
    Type: Patent application
    Status: In force

    Publication No.: US20090287478A1

    Publication date: 2009-11-19

    Application No.: US12460428

    Filing date: 2009-07-17

    Applicant: Yang Gao

    Inventor: Yang Gao

    IPC classification: G10L19/02

    Abstract: There is provided a speech post-processor for enhancing a speech signal divided into a plurality of sub-bands in the frequency domain. The speech post-processor comprises an envelope modification factor generator configured to use frequency domain coefficients representative of an envelope derived from the plurality of sub-bands to generate an envelope modification factor for the envelope derived from the plurality of sub-bands, where the envelope modification factor is generated using FAC = α·ENV/Max + (1−α), where FAC is the envelope modification factor, ENV is the envelope, Max is the maximum envelope, and α is a value between 0 and 1, where α is a different constant value for each speech coding rate. The speech post-processor further comprises an envelope modifier configured to modify the envelope derived from the plurality of sub-bands by the envelope modification factor corresponding to each of the plurality of sub-bands.
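The envelope modification formula in this abstract is simple enough to work through directly. The sketch below computes FAC = α·ENV/Max + (1−α) per sub-band and multiplies each envelope value by its factor. The function name and the sample envelope values are illustrative, and applying the factor by multiplication is an assumption about how the "envelope modifier" operates.

```python
def envelope_modification_factors(envelope, alpha):
    """Compute FAC = alpha * ENV / Max + (1 - alpha) for each sub-band."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie between 0 and 1")
    peak = max(envelope)  # Max: the largest sub-band envelope value
    if peak <= 0.0:
        return [1.0] * len(envelope)  # silent frame: leave the envelope alone
    return [alpha * env / peak + (1.0 - alpha) for env in envelope]


env = [4.0, 2.0, 1.0]  # hypothetical sub-band envelope
fac = envelope_modification_factors(env, alpha=0.5)
modified = [e * f for e, f in zip(env, fac)]  # assumed multiplicative modifier
print(fac)       # [1.0, 0.75, 0.625]
print(modified)  # [4.0, 1.5, 0.625]
```

The strongest sub-band always gets a factor of 1, and weaker sub-bands are attenuated more as α grows, which deepens spectral valleys relative to peaks. With α = 0 every factor is 1 and the envelope is unchanged.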


    Speech gain quantization strategy
    49.
    Type: Patent application
    Status: Under examination (published)

    Publication No.: US20090177464A1

    Publication date: 2009-07-09

    Application No.: US12381036

    Filing date: 2009-03-06

    IPC classification: G10L11/04

    Abstract: A speech encoder analyzes and classifies each frame of speech as being periodic-like speech or non-periodic-like speech and performs a different gain quantization process depending on whether the speech is periodic. If the speech is periodic, the improved speech encoder obtains the pitch gains from the unquantized weighted speech signal and performs a pre-vector quantization of the adaptive codebook gain GP for each subframe of the frame before subframe processing begins and a closed-loop delayed decision vector quantization of the fixed codebook gain GC. If the frame of speech is non-periodic, the speech encoder may use any known method of gain quantization. Quantizing the gains of periodic speech in this manner reduces the number of bits required to represent the quantized gain information and, for periodic speech, makes it possible to use the quantized pitch gain for the current subframe to search the fixed codebook for the fixed codebook excitation vector for the current subframe. Alternatively, the new gain quantization process, which was used only for periodic signals, may be extended to non-periodic signals as well. This second strategy results in a slightly higher bit rate than that for periodic signals that use the new gain quantization strategy, but the bit rate is still lower than that of the prior art. Yet another alternative is to use the new gain quantization process for all speech signals without distinguishing between periodic and non-periodic signals.


    Adaptive tilt compensation for synthesized speech
    50.
    Type: Patent application
    Status: In force

    Publication No.: US20080294429A1

    Publication date: 2008-11-27

    Application No.: US12215649

    Filing date: 2008-06-27

    Applicant: Huan-Yu Su, Yang Gao

    Inventor: Huan-Yu Su, Yang Gao

    IPC classification: G10L19/12

    Abstract: There is provided a method of using an adaptive tilt compensation by a speech decoder. The method comprises receiving a bit stream including a plurality of parameters representative of a speech signal; identifying an adaptive code vector and a fixed code vector using the plurality of parameters; scaling the adaptive code vector and the fixed code vector to generate a scaled adaptive code vector and a scaled fixed code vector; summing the scaled adaptive code vector and the scaled fixed code vector to generate a synthesized output; calculating a first reflection coefficient based on the plurality of parameters representative of the speech signal; multiplying the first reflection coefficient by a factor to generate a tilt factor; and applying the tilt factor to the synthesized output based on an encoding bit rate.
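The last two steps of this abstract (tilt factor = factor × first reflection coefficient, applied according to the bit rate) can be sketched as below. Several things here are assumptions rather than details from the abstract: the first-order filter form y[n] = x[n] + tilt·x[n−1], which follows a convention common in CELP post-filters; the per-rate factor table and its values; and the idea that an unlisted bit rate means no compensation. The first reflection coefficient `k1` is taken as already computed from the decoded parameters.

```python
# Hypothetical mapping from encoding bit rate (bit/s) to scaling factor.
FACTOR_BY_RATE = {4000: 0.5, 8000: 0.25}


def tilt_compensate(synth, k1, bit_rate, factor_by_rate=FACTOR_BY_RATE):
    """Apply an assumed first-order tilt filter to the synthesized output."""
    # Rates missing from the table get factor 0, i.e. no compensation.
    factor = factor_by_rate.get(bit_rate, 0.0)
    tilt = factor * k1  # tilt factor = factor x first reflection coefficient
    out = []
    prev = 0.0
    for x in synth:
        out.append(x + tilt * prev)  # y[n] = x[n] + tilt * x[n-1]
        prev = x
    return out


y = tilt_compensate([1.0, 1.0, 1.0], k1=-0.5, bit_rate=4000)
print(y)  # [1.0, 0.75, 0.75]
```

With a negative `k1` the filter attenuates low frequencies relative to high ones, and the sign convention for reflection coefficients varies between codecs, so the direction of the correction should be checked against the actual decoder.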
