Audio signal compression, speech signal compression and speech recognition
    3.
    Invention publication
    Audio signal compression, speech signal compression and speech recognition (status: granted)

    Publication number: EP0907258A3

    Publication date: 2004-01-02

    Application number: EP98118674.5

    Filing date: 1998-10-02

    IPC classification: H04B1/66 G10L7/06 G10L9/00

    CPC classification: H04B1/665 G10L2019/0005

    Abstract: An audio signal compression apparatus for compressively coding an input audio signal comprises a time-to-frequency transformation unit for transforming the input audio signal to a frequency domain signal; a spectrum envelope calculation unit for calculating, from the input audio signal, a spectrum envelope having different resolutions for different frequencies, using a weighting function on frequency based on human auditory characteristics; a normalization unit for normalizing the frequency domain signal using the spectrum envelope to obtain a residual signal; a power normalization unit for normalizing the residual signal by its power; an auditory weighting calculation unit for calculating weighting coefficients on frequency, based on the spectrum of the input audio signal and human auditory characteristics; and a multi-stage quantization means having plural stages of vector quantizers connected in series, to which the normalized residual signal is input, at least one of the vector quantizers quantizing the residual signal using the weighting coefficients. Therefore, a low frequency band, which is perceptually important, can be analyzed with a higher frequency resolution than a high frequency band, whereby efficient signal compression utilizing human auditory characteristics is realized.
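
The multi-stage quantization step in the abstract can be illustrated with a minimal sketch. It assumes tiny hand-made codebooks and a fixed weight vector, and it applies the weights in every stage, whereas the abstract only requires that at least one stage use them. The function names (`weighted_distance`, `multistage_vq`, `reconstruct`) are hypothetical and do not come from the patent.

```python
def weighted_distance(x, c, w):
    """Squared error between x and code vector c, weighted per frequency bin."""
    return sum(wi * (xi - ci) ** 2 for xi, ci, wi in zip(x, c, w))


def multistage_vq(residual, codebooks, weights):
    """Quantize `residual` with vector quantizers connected in series.

    Each stage picks the code vector closest to the error left by the
    previous stage (weighted distance) and passes on the new error.
    Returns the chosen index per stage and the final quantization error.
    """
    indices = []
    error = list(residual)
    for codebook in codebooks:
        best = min(
            range(len(codebook)),
            key=lambda i: weighted_distance(error, codebook[i], weights),
        )
        indices.append(best)
        error = [e - c for e, c in zip(error, codebook[best])]
    return indices, error


def reconstruct(indices, codebooks):
    """Decoder side: sum the selected code vectors of all stages."""
    out = [0.0] * len(codebooks[0][0])
    for idx, codebook in zip(indices, codebooks):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out


# Toy data (assumed for illustration only).
CODEBOOKS = [
    [[0.0, 0.0], [1.0, 1.0], [-1.0, -1.0]],               # coarse stage
    [[0.0, 0.0], [0.25, 0.0], [0.0, 0.25], [-0.25, 0.0]],  # refinement stage
]
WEIGHTS = [2.0, 1.0]  # first (lower-frequency) bin weighted more heavily

indices, error = multistage_vq([1.2, 1.0], CODEBOOKS, WEIGHTS)
print(indices, reconstruct(indices, CODEBOOKS))
```

Only the stage indices need to be transmitted; the decoder rebuilds the residual by summing the selected code vectors.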

    Encoder with optimally selected codebook
    7.
    Invention publication
    Encoder with optimally selected codebook (status: under examination, published)

    Publication number: EP1047198A3

    Publication date: 2004-01-02

    Application number: EP00108532.3

    Filing date: 2000-04-19

    IPC classification: H03M7/40 H03M7/42

    CPC classification: H03M7/42

    Abstract: An encoder of the present invention includes: a number G of storage sections (G is an integer equal to or greater than 1) for storing a number G of groups of data; a Huffman codebook selection section for selecting one of a number H of Huffman codebooks (H is an integer equal to or greater than 1) for each of the groups of data stored in the respective storage sections, each of the Huffman codebooks having a codebook number; a number G of Huffman encoding sections, each Huffman-encoding a corresponding one of the G groups of data by using the Huffman codebook selected for that group by the Huffman codebook selection section; and a codebook number encoding section for encoding the codebook number of each Huffman codebook selected by the Huffman codebook selection section. The Huffman codebook selection section includes a code length calculation section for calculating the code length that would result from Huffman-encoding each of the G groups of data using each Huffman codebook, and a control section for selecting the Huffman codebook suitable for each group of data based on the code length calculated by the code length calculation section. When the selected Huffman codebook is an unsigned codebook, the number of bits required for sign information has already been added to the code length calculated by the code length calculation section.
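
The selection rule in the abstract (compute the code length each codebook would give, then pick a suitable one) can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the two toy codebooks are made up, "suitable" is taken to mean shortest total length, and sign information for an unsigned codebook is assumed to cost one bit per nonzero value. The names `code_length` and `select_codebook` are hypothetical.

```python
# Each toy codebook maps a value to the bit length of its Huffman codeword.
# An unsigned codebook is indexed by |value|; signs are sent separately.
CODEBOOKS = [
    {"signed": False, "lengths": {0: 1, 1: 2, 2: 3}},
    {"signed": True, "lengths": {0: 2, 1: 2, -1: 2, 2: 3, -2: 3}},
]


def code_length(group, codebook):
    """Bits needed to encode `group` with `codebook`, sign bits included.

    Returns None if the codebook cannot represent some value in the group.
    """
    total = 0
    for value in group:
        key = value if codebook["signed"] else abs(value)
        if key not in codebook["lengths"]:
            return None
        total += codebook["lengths"][key]
        if not codebook["signed"] and value != 0:
            total += 1  # assumed cost: one sign bit per nonzero value
    return total


def select_codebook(group, codebooks):
    """Return (codebook number, code length) of the shortest encoding."""
    best = None
    for number, codebook in enumerate(codebooks):
        bits = code_length(group, codebook)
        if bits is not None and (best is None or bits < best[1]):
            best = (number, bits)
    if best is None:
        raise ValueError("no codebook can encode this group")
    return best


print(select_codebook([0, 0, 0, 1], CODEBOOKS))
print(select_codebook([1, -1, 2, -2], CODEBOOKS))
```

Folding the sign bits into the length before comparison keeps signed and unsigned codebooks on an equal footing, which is the point of the last sentence of the abstract.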

    Audio signal compression, speech signal compression and speech recognition
    9.
    Invention publication
    Audio signal compression, speech signal compression and speech recognition (status: granted)
    Audiosignalkompression, Sprachsignalkompression und Spracherkennung

    Publication number: EP0907258A2

    Publication date: 1999-04-07

    Application number: EP98118674.5

    Filing date: 1998-10-02

    IPC classification: H04B1/66 G10L7/06 G10L9/00

    CPC classification: H04B1/665 G10L2019/0005

    Abstract: An audio signal compression apparatus for compressively coding an input audio signal comprises a time-to-frequency transformation unit for transforming the input audio signal to a frequency domain signal; a spectrum envelope calculation unit for calculating, from the input audio signal, a spectrum envelope having different resolutions for different frequencies, using a weighting function on frequency based on human auditory characteristics; a normalization unit for normalizing the frequency domain signal using the spectrum envelope to obtain a residual signal; a power normalization unit for normalizing the residual signal by its power; an auditory weighting calculation unit for calculating weighting coefficients on frequency, based on the spectrum of the input audio signal and human auditory characteristics; and a multi-stage quantization means having plural stages of vector quantizers connected in series, to which the normalized residual signal is input, at least one of the vector quantizers quantizing the residual signal using the weighting coefficients. Therefore, a low frequency band, which is perceptually important, can be analyzed with a higher frequency resolution than a high frequency band, whereby efficient signal compression utilizing human auditory characteristics is realized.
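
The two normalization steps in the abstract (dividing by a spectrum envelope with finer resolution at low frequencies, then dividing by the residual's power) can be illustrated with a rough sketch. It substitutes a simple band-averaged envelope with widening bands for the auditory weighting function the abstract describes, so it shows the data flow only. All names and the band layout (`band_edges`, `first_width`, `growth`) are assumptions.

```python
import math


def band_edges(n_bins, first_width=1, growth=2):
    """Band boundaries whose widths grow with frequency.

    Narrow bands at low frequency give the envelope finer resolution there.
    """
    edges = [0]
    width = first_width
    while edges[-1] < n_bins:
        edges.append(min(n_bins, edges[-1] + width))
        width *= growth
    return edges


def envelope(coeffs, edges):
    """Piecewise-constant envelope: mean absolute value within each band."""
    env = [0.0] * len(coeffs)
    for lo, hi in zip(edges, edges[1:]):
        mean = sum(abs(c) for c in coeffs[lo:hi]) / (hi - lo)
        for k in range(lo, hi):
            env[k] = max(mean, 1e-12)  # guard against division by zero
    return env


def normalize(coeffs, edges):
    """Envelope normalization followed by power normalization.

    Returns the normalized residual, the envelope, and the residual power
    (RMS); a decoder would need the latter two to undo both steps.
    """
    env = envelope(coeffs, edges)
    residual = [c / e for c, e in zip(coeffs, env)]
    power = math.sqrt(sum(r * r for r in residual) / len(residual))
    power = max(power, 1e-12)
    return [r / power for r in residual], env, power


# Toy frequency-domain coefficients (assumed for illustration only).
coeffs = [4.0, 3.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.5]
edges = band_edges(len(coeffs))
normalized, env, power = normalize(coeffs, edges)
print(edges, env, power)
```

After both steps the residual has unit mean-square value, which gives the subsequent vector quantizers a consistent input range.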
