VOICING INDEX CONTROLS FOR CELP SPEECH CODING
    41.
    Invention publication
    VOICING INDEX CONTROLS FOR CELP SPEECH CODING (status: under examination, published)

    Publication number: EP1604354A2

    Publication date: 2005-12-14

    Application number: EP04719814.8

    Filing date: 2004-03-11

    Inventor: GAO, Yang

    IPC classification: G10L19/04

    Abstract: An approach for improving the quality of speech synthesized using analysis-by-synthesis (ABS) coders is presented. Perceptual quality in analysis-by-synthesis speech coding (e.g. CELP) may be unstable because the degree of periodicity in a voiced speech signal may vary significantly between segments of the voiced speech. The present invention therefore uses a voicing index, which may indicate the degree of periodicity of the speech signal, to control and improve ABS-type speech coding. The voicing index may be used to improve quality stability by controlling the encoder and/or decoder in: fixed-codebook (301) short-term enhancement including the spectrum tilt; perceptual weighting filter; sub-fixed codebook determination; LPC interpolation (304); fixed-codebook pitch enhancement; post-pitch enhancement; noise injection into the high-frequency band at the decoder; LTP sync window; signal decomposition, etc.
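    The idea of a voicing index can be sketched as follows. This is a minimal illustration, not the patent's method: it assumes the degree of periodicity is measured as a normalized pitch correlation at a known lag and quantized to 3 bits. The function names, the 3-bit width, and the gain mapping are all hypothetical.

```python
import math

def normalized_pitch_correlation(signal, lag):
    """Normalized correlation between the signal and itself delayed by `lag`."""
    x = signal[lag:]
    y = signal[:-lag]
    num = sum(a * b for a, b in zip(x, y))
    den = math.sqrt(sum(a * a for a in x) * sum(b * b for b in y))
    return num / den if den > 0.0 else 0.0

def voicing_index(correlation, bits=3):
    """Quantize a correlation to an integer index (hypothetical 3-bit scale)."""
    levels = (1 << bits) - 1
    clamped = min(max(correlation, 0.0), 1.0)
    return round(clamped * levels)

def pitch_enhancement_gain(index, bits=3, max_gain=0.5):
    """Hypothetical control law: stronger enhancement for more periodic segments."""
    return max_gain * index / ((1 << bits) - 1)

# A perfectly periodic segment with a period of 40 samples.
periodic = [math.sin(2 * math.pi * n / 40) for n in range(160)]
idx = voicing_index(normalized_pitch_correlation(periodic, 40))
gain = pitch_enhancement_gain(idx)
```

    In this sketch a single index drives one control (pitch enhancement gain); the abstract lists several other encoder and decoder stages that could be steered by the same index.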

    600 BPS MIXED EXCITATION LINEAR PREDICTION TRANSCODING
    42.
    Invention publication
    600 BPS MIXED EXCITATION LINEAR PREDICTION TRANSCODING (status: granted)

    Publication number: EP1597721A2

    Publication date: 2005-11-23

    Application number: EP04706439.9

    Filing date: 2004-01-29

    IPC classification: G10L19/00

    CPC classification: G10L19/087 G10L19/173

    Abstract: Vector quantization techniques reduce the effective bit rate to 600 bps while maintaining intelligible speech. Four frames of speech are combined into one frame (104). The system uses mixed excitation linear prediction speech model parameters to quantize the frame and achieve a fixed rate of 600 bps (104). The system allows voice communication over bandwidth-constrained channels.
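    The bit budget implied by combining four frames can be worked out with a short sketch. The 22.5 ms frame duration is an assumption taken from standard MELP and is not stated in the abstract; the helper names are hypothetical.

```python
MELP_FRAME_MS = 22.5        # assumption: standard MELP frame duration
FRAMES_PER_BLOCK = 4        # from the abstract: four frames combined into one
BIT_RATE_BPS = 600          # from the abstract

def bits_per_block(bit_rate_bps, frame_ms, frames):
    """Number of bits available for one combined frame at a fixed bit rate."""
    return bit_rate_bps * frame_ms * frames / 1000.0

def group_frames(frames, size):
    """Group consecutive per-frame parameter sets into blocks of `size`.

    A trailing partial block is dropped in this sketch.
    """
    return [frames[i:i + size] for i in range(0, len(frames) - size + 1, size)]

budget = bits_per_block(BIT_RATE_BPS, MELP_FRAME_MS, FRAMES_PER_BLOCK)
blocks = group_frames(list(range(10)), FRAMES_PER_BLOCK)
```

    Under these assumptions each combined frame spans 90 ms and has 54 bits to describe the parameters of all four underlying frames, which is why joint (vector) quantization across frames is needed.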

    Half-rate vocoder
    44.
    Invention publication
    Half-rate vocoder (status: granted)

    Publication number: EP1465158A3

    Publication date: 2005-09-21

    Application number: EP04251796.1

    Filing date: 2004-03-26

    Inventor: Hardwick, John C.

    IPC classification: G10L19/14

    Abstract: Encoding a sequence of digital speech samples into a bit stream includes dividing the digital speech samples into one or more frames, computing model parameters for a frame, and quantizing the model parameters to produce pitch bits conveying pitch information, voicing bits conveying voicing information, and gain bits conveying signal level information. One or more of the pitch bits are combined with one or more of the voicing bits and one or more of the gain bits to create a first parameter codeword that is encoded with an error control code to produce a first FEC codeword that is included in a bit stream for the frame. The process may be reversed to decode the bit stream.
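    The combination of pitch, voicing, and gain bits into one protected codeword can be illustrated with a small sketch. The abstract does not name the error control code or the bit allocation, so a Hamming(7,4) code and a 2+1+1 bit split are used here purely as stand-in assumptions; all function names are hypothetical.

```python
def pack_parameter_codeword(pitch_bits, voicing_bits, gain_bits):
    """Concatenate selected bits (lists of 0/1) into one parameter codeword."""
    return list(pitch_bits) + list(voicing_bits) + list(gain_bits)

def hamming74_encode(d):
    """Encode 4 data bits into a 7-bit Hamming codeword (p1 p2 d1 p3 d2 d3 d4)."""
    d1, d2, d3, d4 = d
    p1 = d1 ^ d2 ^ d4
    p2 = d1 ^ d3 ^ d4
    p3 = d2 ^ d3 ^ d4
    return [p1, p2, d1, p3, d2, d3, d4]

def hamming74_decode(c):
    """Correct up to one bit error and return the 4 data bits."""
    c = list(c)
    s1 = c[0] ^ c[2] ^ c[4] ^ c[6]
    s2 = c[1] ^ c[2] ^ c[5] ^ c[6]
    s3 = c[3] ^ c[4] ^ c[5] ^ c[6]
    pos = s1 + 2 * s2 + 4 * s3   # 1-based position of the flipped bit, 0 if none
    if pos:
        c[pos - 1] ^= 1
    return [c[2], c[4], c[5], c[6]]

# Hypothetical allocation: 2 pitch bits, 1 voicing bit, 1 gain bit.
param = pack_parameter_codeword([1, 0], [1], [0])
fec = hamming74_encode(param)

# Simulate a single channel bit error and recover the parameter codeword.
received = list(fec)
received[4] ^= 1
recovered = hamming74_decode(received)
```

    Grouping bits from several parameters under one code means a single channel error cannot silently corrupt any of them, at the cost of the redundancy bits added by the code.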

    METHOD FOR MODELING SPEECH HARMONIC MAGNITUDES
    45.
    Invention publication
    METHOD FOR MODELING SPEECH HARMONIC MAGNITUDES (status: granted)

    Publication number: EP1495465A4

    Publication date: 2005-05-18

    Application number: EP03745516

    Filing date: 2003-02-14

    Applicant: MOTOROLA INC

    CPC classification: G10L19/06 G10L19/087

    Abstract: A system or method for modeling a signal, such as a speech signal, wherein harmonic frequencies and amplitudes are identified (106) and the harmonic magnitudes are interpolated (110) to obtain spectral magnitudes at a set of fixed frequencies. An inverse transform is applied (112) to the spectral magnitudes to obtain a pseudo auto-correlation sequence, from which linear prediction coefficients are calculated (114). From the linear prediction coefficients, model harmonic magnitudes are generated by sampling the spectral envelope (118) defined by the linear prediction coefficients. A set of scale factors is then calculated (120) as the ratio of the harmonic magnitudes to the model harmonic magnitudes and interpolated to obtain a second set of scale factors (122) at the set of fixed frequencies. The spectral envelope magnitudes at the set of fixed frequencies (124) are multiplied by the second set of scale factors (126) to obtain new spectral magnitudes, and the process is iterated to obtain final linear prediction coefficients.
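    The iteration described above can be sketched in plain Python. Several details are assumptions that the abstract does not specify: linear interpolation, a uniform frequency grid on [0, pi], an inverse cosine transform of the squared magnitudes as the "inverse transform", and the Levinson-Durbin recursion for the linear prediction step. All names and default values are hypothetical.

```python
import math

def interp(x, xp, fp):
    """Piecewise-linear interpolation of (xp, fp) at x, holding end values flat."""
    if x <= xp[0]:
        return fp[0]
    for j in range(1, len(xp)):
        if x <= xp[j]:
            t = (x - xp[j - 1]) / (xp[j] - xp[j - 1])
            return fp[j - 1] + t * (fp[j] - fp[j - 1])
    return fp[-1]

def levinson(r, order):
    """Levinson-Durbin recursion; returns (a, err) with a[0] == 1."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for m in range(1, order + 1):
        acc = r[m] + sum(a[i] * r[m - i] for i in range(1, m))
        k = -acc / err
        prev = a[:]
        for i in range(1, m):
            a[i] = prev[i] + k * prev[m - i]
        a[m] = k
        err *= 1.0 - k * k
    return a, err

def envelope(a, gain, w):
    """Magnitude of gain / A(e^{jw}) for A(z) = sum a[i] z^-i."""
    re = sum(a[i] * math.cos(i * w) for i in range(len(a)))
    im = -sum(a[i] * math.sin(i * w) for i in range(len(a)))
    return gain / math.hypot(re, im)

def pseudo_autocorrelation(mags, grid, order):
    """Inverse cosine transform of squared magnitudes (trapezoidal weights)."""
    last = len(grid) - 1
    r = []
    for n in range(order + 1):
        s = 0.0
        for i, w in enumerate(grid):
            weight = 0.5 if i in (0, last) else 1.0
            s += weight * mags[i] ** 2 * math.cos(n * w)
        r.append(s / last)
    return r

def fit_lpc(harm_freqs, harm_mags, order=10, grid_size=64, iterations=3):
    """Iteratively fit LPC coefficients to harmonic magnitudes."""
    grid = [math.pi * i / grid_size for i in range(grid_size + 1)]
    mags = [interp(w, harm_freqs, harm_mags) for w in grid]
    for it in range(iterations + 1):
        a, err = levinson(pseudo_autocorrelation(mags, grid, order), order)
        gain = math.sqrt(err)
        if it == iterations:
            break
        # Scale factors at the harmonics, then interpolated onto the fixed grid.
        model = [envelope(a, gain, w) for w in harm_freqs]
        scale = [h / m for h, m in zip(harm_mags, model)]
        mags = [envelope(a, gain, w) * interp(w, harm_freqs, scale) for w in grid]
    return a, gain

# Hypothetical input: 15 harmonics of a 0.2 rad/sample fundamental.
freqs = [0.2 * k for k in range(1, 16)]
amps = [1.0 / (1.0 + 0.3 * k) for k in range(1, 16)]
coeffs, g = fit_lpc(freqs, amps)
```

    Each pass rescales the current envelope toward the measured harmonic magnitudes before refitting, so the final coefficients are intended to match the harmonics more closely than a single fit to the interpolated spectrum would.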

    LPC-HARMONIC VOCODER WITH SUPERFRAME STRUCTURE
    46.
    Invention publication
    LPC-HARMONIC VOCODER WITH SUPERFRAME STRUCTURE (status: granted)

    Publication number: EP1222659A1

    Publication date: 2002-07-17

    Application number: EP00968376.4

    Filing date: 2000-09-20

    IPC classification: G10L19/14

    CPC classification: G10L19/173 G10L19/087

    Abstract: An enhanced low-bit-rate parametric voice coder groups a number of frames from an underlying frame-based vocoder, such as MELP, into a superframe structure. Parameters are extracted from the group of underlying frames and quantized into the superframe, which allows the bit rate of the underlying coding to be reduced without increasing the distortion. The speech data coded in the superframe structure can then be directly synthesized to speech or may be transcoded to a format so that an underlying frame-based vocoder performs the synthesis. The superframe structure includes additional error detection and correction data to reduce the distortion caused by bit errors during transmission.

    FRAME LOSS COMPENSATION PROCESSING METHOD AND APPARATUS

    Publication number: EP3242442A2

    Publication date: 2017-11-08

    Application number: EP17163596.4

    Filing date: 2017-03-29

    IPC classification: H04L12/26

    Abstract: Embodiments of the present invention provide a frame loss compensation processing method and apparatus. The method includes: determining, by using a lost-frame flag bit, whether an i-th frame is a lost frame; and when the i-th frame is a lost frame, estimating a spectrum frequency parameter, a pitch period, and a gain of the i-th frame according to at least one of an inter-frame relationship or an intra-frame relationship among the N frames preceding the i-th frame. The inter-frame relationship includes at least one of correlation or energy stability between those N frames, and the intra-frame relationship includes at least one of inter-subframe correlation or inter-subframe energy stability within those N frames. Because the parameters of the i-th frame are determined from the signal correlation and energy stability between the preceding N frames and within each of them, the relationship between signals is taken into account, which yields a more accurate estimate of the i-th frame's parameters and improves voice signal decoding quality.
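    A rough sketch of the inter-frame part of this idea follows, limited to the gain parameter. The abstract does not give formulas, so the correlation measure, the energy stability measure, and the rule that maps them to a concealed gain are all hypothetical choices made for illustration.

```python
import math

def energy(frame):
    """Sum of squared samples in a frame."""
    return sum(s * s for s in frame)

def normalized_correlation(x, y):
    """Normalized cross-correlation of two equal-length frames."""
    num = sum(a * b for a, b in zip(x, y))
    den = math.sqrt(energy(x) * energy(y))
    return num / den if den > 0.0 else 0.0

def energy_stability(frames):
    """Ratio of the smallest to the largest frame energy (1.0 = fully stable)."""
    energies = [energy(f) for f in frames]
    top = max(energies)
    return min(energies) / top if top > 0.0 else 1.0

def conceal_gain(prev_frames, prev_gains, lost_flag):
    """Estimate the gain of a lost frame from the preceding N frames.

    Returns None when the lost-frame flag is not set.
    """
    if not lost_flag:
        return None
    corr = min(
        (normalized_correlation(a, b) for a, b in zip(prev_frames, prev_frames[1:])),
        default=0.0,
    )
    stab = energy_stability(prev_frames)
    # Hypothetical rule: keep more of the last gain when the history is
    # both well correlated and stable in energy, attenuate otherwise.
    confidence = max(corr, 0.0) * stab
    return prev_gains[-1] * (0.5 + 0.5 * confidence)

# Three identical preceding frames: highly correlated and stable.
history = [[math.sin(0.1 * n) for n in range(80)] for _ in range(3)]
estimated = conceal_gain(history, [2.0, 2.0, 2.0], lost_flag=True)
```

    The same two measures could be computed per subframe to cover the intra-frame relationship, and analogous rules could be applied to the spectrum frequency parameter and pitch period.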

    ESTIMATION OF MIXING FACTORS TO GENERATE HIGH-BAND EXCITATION SIGNAL
    50.
    Invention publication
    ESTIMATION OF MIXING FACTORS TO GENERATE HIGH-BAND EXCITATION SIGNAL (status: granted)

    Publication number: EP3055861A1

    Publication date: 2016-08-17

    Application number: EP14786583.6

    Filing date: 2014-10-09

    Abstract: A method includes generating a high-band residual signal based on a high-band portion of an audio signal. The method also includes generating a harmonically extended signal at least partially based on a low-band portion of the audio signal. The method further includes determining a mixing factor based on the high-band residual signal, the harmonically extended signal, and modulated noise. The modulated noise is at least partially based on the harmonically extended signal and white noise.
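    One way to picture the mixing factor is as a least-squares fit, sketched below. The abstract does not state the criterion, so this assumes a linear mix alpha * harmonic + (1 - alpha) * noise matched to the high-band residual, and it modulates the white noise by the instantaneous magnitude of the harmonically extended signal. Both choices and all names are hypothetical.

```python
import math
import random

def modulated_noise(harmonic, white):
    """Shape white noise by the magnitude of the harmonically extended signal."""
    return [abs(h) * w for h, w in zip(harmonic, white)]

def mixing_factor(residual, harmonic, noise):
    """Least-squares alpha in [0, 1] for residual ~ alpha*harmonic + (1-alpha)*noise."""
    num = sum((r - n) * (h - n) for r, h, n in zip(residual, harmonic, noise))
    den = sum((h - n) ** 2 for h, n in zip(harmonic, noise))
    if den == 0.0:
        return 0.5  # harmonic and noise are identical, so any alpha fits equally
    return min(max(num / den, 0.0), 1.0)

# Synthetic check: build a residual with a known mix and recover the factor.
rng = random.Random(0)
harmonic = [math.sin(0.3 * i) for i in range(320)]
white = [rng.gauss(0.0, 1.0) for _ in range(320)]
noise = modulated_noise(harmonic, white)
residual = [0.7 * h + 0.3 * n for h, n in zip(harmonic, noise)]
alpha = mixing_factor(residual, harmonic, noise)
```

    In this sketch a factor near 1 means the high band is mostly harmonic, while a factor near 0 means it is mostly noise-like.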