Method and device for frequency-selective pitch enhancement of synthesized speech
    1.
    Patent application
    Legal status: active

    Publication No.: US20050165603A1

    Publication date: 2005-07-28

    Application No.: US10515553

    Filing date: 2003-05-30

    Abstract: In a method and device for post-processing a decoded sound signal to enhance its perceived quality, the decoded sound signal is divided into a plurality of frequency sub-band signals, and post-processing is applied to at least one of the frequency sub-band signals. After post-processing of this at least one frequency sub-band signal, the frequency sub-band signals may be added to produce an output post-processed decoded sound signal. In this manner, the post-processing can be localized to a desired sub-band or sub-bands while leaving other sub-bands virtually unaltered.
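The sub-band idea in this abstract can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the two-band split uses a centered moving average and its complement, the pitch enhancer blends each low-band sample with the average of the samples one pitch lag before and after it, and the names `split_bands`, `enhance_pitch`, and `postprocess` are hypothetical.

```python
def moving_average(x, taps=5):
    """Centered moving average used as a crude zero-phase low-pass filter."""
    half = taps // 2
    out = []
    for n in range(len(x)):
        lo, hi = max(0, n - half), min(len(x), n + half + 1)
        out.append(sum(x[lo:hi]) / (hi - lo))
    return out


def split_bands(x, taps=5):
    """Split x into a low band and a complementary high band (low + high == x)."""
    low = moving_average(x, taps)
    high = [s - l for s, l in zip(x, low)]
    return low, high


def enhance_pitch(band, lag, alpha=0.5):
    """Blend each sample with the mean of its neighbours one pitch lag away.

    Illustrative enhancer only: samples too close to either edge are copied.
    """
    out = []
    for n, s in enumerate(band):
        if n - lag >= 0 and n + lag < len(band):
            avg = 0.5 * (band[n - lag] + band[n + lag])
            out.append((1.0 - alpha) * s + alpha * avg)
        else:
            out.append(s)
    return out


def postprocess(decoded, lag, alpha=0.5, taps=5):
    """Enhance only the low band, then add the untouched high band back."""
    low, high = split_bands(decoded, taps)
    enhanced_low = enhance_pitch(low, lag, alpha)
    return [l + h for l, h in zip(enhanced_low, high)]
```

Because the high band is computed as the complement of the low band, setting `alpha=0.0` returns the decoded signal unchanged up to rounding, which mirrors the claim that untreated sub-bands are left virtually unaltered.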

    Method and device for frequency-selective pitch enhancement of synthesized speech
    2.
    Granted patent
    Legal status: active

    Publication No.: US07529660B2

    Publication date: 2009-05-05

    Application No.: US10515553

    Filing date: 2003-05-30

    IPC classification: G10L19/02 G10L21/02

    Abstract: In a method and device for post-processing a decoded sound signal to enhance its perceived quality, the decoded sound signal is divided into a plurality of frequency sub-band signals, and post-processing is applied to at least one of the frequency sub-band signals. After post-processing of this at least one frequency sub-band signal, the frequency sub-band signals may be added to produce an output post-processed decoded sound signal. In this manner, the post-processing can be localized to a desired sub-band or sub-bands while leaving other sub-bands virtually unaltered.

    Method and device for adaptive bandwidth pitch search in coding wideband signals
    3.
    Granted patent
    Legal status: active

    Publication No.: US08036885B2

    Publication date: 2011-10-11

    Application No.: US12620394

    Filing date: 2009-11-17

    IPC classification: G10L11/04 G10L19/00 G10L19/12

    CPC classification: G10L19/26 G10L2019/0011

    Abstract: A pitch search method and device for digitally encoding a wideband signal, in particular but not exclusively a speech signal, with a view to transmitting, or storing, and synthesizing this wideband sound signal. The new method and device, which achieve efficient modeling of the harmonic structure of the speech spectrum, use several forms of low-pass filters applied to a pitch codevector; the one yielding the highest prediction gain (i.e. the lowest pitch prediction error) is selected, and the associated pitch codebook parameters are forwarded.
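The selection step described in this abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented procedure: the candidate filter taps in `FILTERS` are made-up examples, the filters are applied as centered FIR filters with zero padding, and the function names are hypothetical. For each candidate, the sketch computes the optimal pitch gain and the resulting squared prediction error, then keeps the candidate with the lowest error.

```python
def fir_centered(v, taps):
    """Apply a symmetric FIR filter centered on each sample (zero-padded edges)."""
    half = len(taps) // 2
    out = []
    for n in range(len(v)):
        acc = 0.0
        for k, c in enumerate(taps):
            m = n + k - half
            if 0 <= m < len(v):
                acc += c * v[m]
        out.append(acc)
    return out


def dot(a, b):
    return sum(p * q for p, q in zip(a, b))


def select_filter(target, codevector, filters):
    """Return (filter index, pitch gain, squared error) of the best candidate."""
    best = None
    for index, taps in enumerate(filters):
        y = fir_centered(codevector, taps)
        energy = dot(y, y)
        if energy == 0.0:
            continue  # a silent candidate cannot predict anything
        correlation = dot(target, y)
        gain = correlation / energy  # least-squares optimal gain
        error = dot(target, target) - gain * correlation
        if best is None or error < best[2]:
            best = (index, gain, error)
    return best


# Illustrative candidates (assumed values): no filtering, then two low-pass shapes.
FILTERS = [[1.0], [0.25, 0.5, 0.25], [0.1, 0.2, 0.4, 0.2, 0.1]]
```

In an encoder, the returned index would be transmitted alongside the pitch lag and gain so that the decoder can apply the same filter to its own pitch codevector.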

    Device and Method for Noise Shaping in a Multilayer Embedded Codec Interoperable with the ITU-T G.711 Standard
    4.
    Patent application
    Legal status: pending (published)

    Publication No.: US20110173004A1

    Publication date: 2011-07-14

    Application No.: US12664010

    Filing date: 2007-12-28

    IPC classification: G10L19/00

    Abstract: A device and method for shaping noise during encoding of an input sound signal comprise pre-emphasizing the input signal or a decoded signal from a given sound signal codec to produce a pre-emphasized signal, computing a filter transfer function based on the pre-emphasized signal, and shaping the noise by filtering the noise through the transfer function to produce a shaped noise signal, wherein the noise shaping comprises producing a noise feedback. A device and method for noise shaping in a multilayer codec, including at least Layers 1 and 2, comprise: at an encoder, producing an encoded sound signal in Layer 1 including Layer 1 noise shaping, and producing a Layer 2 enhancement signal; at a decoder, decoding the Layer 1 encoded sound signal to produce a synthesis signal, decoding the enhancement signal, computing a filter transfer function based on the synthesis signal, filtering the enhancement signal through the transfer function to produce a Layer 2 filtered enhancement signal, and adding the filtered enhancement signal to the synthesis signal to produce an output signal including contributions from Layers 1 and 2.

    Perceptual weighting device and method for efficient coding of wideband signals
    6.
    Patent application
    Legal status: pending (published)

    Publication No.: US20050108007A1

    Publication date: 2005-05-19

    Application No.: US10965795

    Filing date: 2004-10-18

    CPC classification: G10L19/26 G10L2019/0011

    Abstract: A perceptual weighting device for producing a perceptually weighted signal in response to a wideband signal comprises a signal pre-emphasis filter, a synthesis filter calculator, and a perceptual weighting filter. The signal pre-emphasis filter enhances the high frequency content of the wideband signal to thereby produce a pre-emphasized signal. The signal pre-emphasis filter has a transfer function of the form $P(z) = 1 - \mu z^{-1}$, wherein $\mu$ is a pre-emphasis factor having a value located between 0 and 1. The synthesis filter calculator is responsive to the pre-emphasized signal for producing synthesis filter coefficients. Finally, the perceptual weighting filter processes the pre-emphasized signal in relation to the synthesis filter coefficients to produce the perceptually weighted signal. The perceptual weighting filter has a transfer function, with fixed denominator, of the form $W(z) = A(z/\gamma_1)/(1 - \gamma_2 z^{-1})$ where 0
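The two transfer functions named in this abstract can be written as time-domain difference equations. The Python sketch below is illustrative only and rests on several assumptions: the linear prediction polynomial is taken as A(z) = 1 + a_1 z^-1 + ... + a_p z^-p with the coefficients supplied by the caller (the synthesis filter calculator is not implemented), and the default values of `mu`, `gamma1`, and `gamma2` are placeholders chosen for the example, since the listing above is cut off before it states the ranges of the weighting factors.

```python
def pre_emphasis(x, mu=0.68):
    """P(z) = 1 - mu * z^-1, i.e. y[n] = x[n] - mu * x[n-1]."""
    out, prev = [], 0.0
    for s in x:
        out.append(s - mu * prev)
        prev = s
    return out


def perceptual_weighting(s, lp_coeffs, gamma1=0.92, gamma2=0.68):
    """W(z) = A(z/gamma1) / (1 - gamma2 * z^-1).

    lp_coeffs holds a_1..a_p of A(z) = 1 + sum(a_i * z^-i) (assumed convention).
    A(z/gamma1) scales each a_i by gamma1**i; the denominator is a fixed
    one-pole recursion, so w[n] = numerator[n] + gamma2 * w[n-1].
    """
    weighted = [1.0] + [a * gamma1 ** (i + 1) for i, a in enumerate(lp_coeffs)]
    out, prev = [], 0.0
    for n in range(len(s)):
        acc = sum(c * s[n - i] for i, c in enumerate(weighted) if n - i >= 0)
        prev = acc + gamma2 * prev
        out.append(prev)
    return out
```

A typical call chain would pre-emphasize the input first and then pass the pre-emphasized signal, together with coefficients derived from it, to `perceptual_weighting`.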

    Method and device for adaptive bandwidth pitch search in coding wideband signals
    7.
    Granted patent
    Legal status: active

    Publication No.: US07260521B1

    Publication date: 2007-08-21

    Application No.: US09830114

    Filing date: 1999-10-27

    IPC classification: G10L19/04

    CPC classification: G10L19/26 G10L2019/0011

    Abstract: An improved pitch search method and device for digitally encoding a wideband signal, in particular but not exclusively a speech signal, with a view to transmitting, or storing, and synthesizing this wideband sound signal. The new method and device, which achieve efficient modeling of the harmonic structure of the speech spectrum, use several forms of low-pass filters applied to a pitch codevector; the one yielding the highest prediction gain (i.e. the lowest pitch prediction error) is selected, and the associated pitch codebook parameters are forwarded.

    High frequency content recovering method and device for over-sampled synthesized wideband signal
    8.
    Granted patent
    Legal status: active

    Publication No.: US07151802B1

    Publication date: 2006-12-19

    Application No.: US09830332

    Filing date: 1999-10-27

    IPC classification: H04L27/00 G10L19/02 G10L11/04

    CPC classification: G10L19/26 G10L2019/0011

    Abstract: In a method and device for recovering the high frequency content of a wideband signal previously down-sampled, and for injecting this high frequency content into an over-sampled synthesized version of the wideband signal to produce a full-spectrum synthesized wideband signal, a random noise generator produces a noise sequence having a given spectrum. A spectral shaping unit spectrally shapes the noise sequence in relation to linear prediction filter coefficients related to the down-sampled wideband signal. A signal injection circuit finally injects the spectrally-shaped noise sequence into the over-sampled synthesized signal version to thereby produce the full-spectrum synthesized wideband signal.
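The three stages in this abstract (noise generation, spectral shaping, injection) can be sketched in Python as below. This is a rough illustration under assumptions, not the patented design: the noise is uniform white noise from a seeded generator, shaping is done with an all-pole filter 1/A(z) where A(z) = 1 + a_1 z^-1 + ... + a_p z^-p, injection is a scaled addition, and any band-limiting of the noise to the high-frequency range is omitted. The function names and the `gain` parameter are hypothetical.

```python
import random


def white_noise(n, seed=0):
    """Random noise generator: n uniform samples in [-1, 1], reproducible by seed."""
    rng = random.Random(seed)
    return [rng.uniform(-1.0, 1.0) for _ in range(n)]


def all_pole(x, lp_coeffs):
    """Spectral shaping: filter x through 1/A(z), A(z) = 1 + sum(a_i * z^-i)."""
    out = []
    for n, s in enumerate(x):
        acc = s
        for i, a in enumerate(lp_coeffs, start=1):
            if n - i >= 0:
                acc -= a * out[n - i]
        out.append(acc)
    return out


def inject_high_band(synth, lp_coeffs, gain=0.1, seed=0):
    """Signal injection: add the shaped, scaled noise to the synthesized signal."""
    shaped = all_pole(white_noise(len(synth), seed), lp_coeffs)
    return [s + gain * w for s, w in zip(synth, shaped)]
```

Keeping the seed fixed makes the output reproducible, which is convenient for testing; a real decoder would more likely keep one generator running across frames.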