Multistage inverse quantization having a plurality of frequency bands
    1.
    Granted patent (status: in force)

    Publication No.: US07243061B2

    Publication date: 2007-07-10

    Application No.: US10954589

    Filing date: 2004-10-01

    IPC classification: G10L19/02

    Abstract: With respect to audio signal coding and decoding apparatuses, there is provided a coding apparatus that enables a decoding apparatus to reproduce an audio signal even though it does not use all of the data from the coding apparatus, and a decoding apparatus corresponding to the coding apparatus. A quantization unit constituting a coding apparatus includes a first sub-quantization unit comprising sub-quantization units for low-band, intermediate-band, and high-band; a second sub-quantization unit for quantizing quantization errors from the first sub-quantization unit; and a third sub-quantization unit for quantizing quantization errors which have been processed by the first sub-quantization unit and the second sub-quantization unit.
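
    The staged structure in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not say which quantizers are used, so uniform scalar quantizers stand in for the sub-quantization units, and the band edges, step sizes, and the names encode and decode are all hypothetical. The point it shows is that each stage quantizes the error left by the earlier stages, so a decoder that receives only the first layers still reconstructs a coarser signal.

```python
# Illustrative only: scalar quantizers stand in for the sub-quantization units.
BAND_EDGES = (0, 4, 8, 12)      # low / intermediate / high band boundaries (assumed)
BAND_STEPS = (0.25, 0.5, 1.0)   # stage-1 step size per band (assumed)
RESIDUAL_STEPS = (0.2, 0.05)    # step sizes for stages 2 and 3 (assumed)


def stage1_steps():
    """Expand the per-band step sizes into one step per coefficient."""
    steps = []
    for lo, hi, step in zip(BAND_EDGES[:-1], BAND_EDGES[1:], BAND_STEPS):
        steps.extend([step] * (hi - lo))
    return steps


def encode(spectrum):
    """Return three layers of integer indices, one layer per stage."""
    per_coef = stage1_steps()
    # Stage 1: each band is quantized with its own step size.
    layer = [round(x / s) for x, s in zip(spectrum, per_coef)]
    layers = [layer]
    recon = [i * s for i, s in zip(layer, per_coef)]
    # Stages 2 and 3: quantize the error left by all earlier stages.
    for step in RESIDUAL_STEPS:
        error = [x - r for x, r in zip(spectrum, recon)]
        layer = [round(e / step) for e in error]
        layers.append(layer)
        recon = [r + i * step for r, i in zip(recon, layer)]
    return layers


def decode(layers):
    """Reconstruct from however many leading layers were received (at least 1)."""
    per_coef = stage1_steps()
    recon = [i * s for i, s in zip(layers[0], per_coef)]
    for layer, step in zip(layers[1:], RESIDUAL_STEPS):
        recon = [r + i * step for r, i in zip(recon, layer)]
    return recon


if __name__ == "__main__":
    spectrum = [3.1, -2.7, 1.9, 0.8, -0.6, 0.45, 0.3, -0.2, 0.15, -0.1, 0.05, 0.02]
    layers = encode(spectrum)
    for k in (1, 2, 3):
        approx = decode(layers[:k])
        print(k, "layer(s): max error", max(abs(x - a) for x, a in zip(spectrum, approx)))
```

    Decoding layers[:1] corresponds to a decoder that ignores the second and third stages; each extra layer tightens the bound on the reconstruction error to half of that stage's step size.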

Method and an apparatus for speech detection for determining whether an input signal is speech or nonspeech
    2.
    Granted patent (status: expired)

    Publication No.: US5611019A

    Publication date: 1997-03-11

    Application No.: US246346

    Filing date: 1994-05-19

    Abstract: The speech detection apparatus comprises: a reference model maker for extracting a plurality of parameters for a speech detection from training data, and for making a reference model based on the parameters; a parameter extractor for extracting the plurality of parameters from each frame of an input audio signal; and a decision device for deciding whether or not the audio signal is speech, by comparing the parameters extracted from the input audio signal with the reference model. The reference model maker makes the reference model for each phoneme. The decision device includes: a similarity computing unit for comparing the parameters extracted from each frame of the input audio signal with the reference model, and for computing a similarity of the frame with respect to the reference model; a phoneme decision unit for deciding a phoneme of each frame of the input audio signal based on the similarity computed for each phoneme; and a final decision unit for deciding whether or not a specific period of the input audio signal including a plurality of frames is speech, based on the result of the phoneme decision for the plurality of frames.
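
    The pipeline in this abstract (per-phoneme reference models, per-frame similarity, per-frame phoneme decision, final decision over a period) can be sketched roughly as follows. Everything concrete here is an assumption made for illustration: the abstract does not define the parameters, the similarity measure, or the final decision rule, so the sketch uses per-dimension means as models, negative squared Euclidean distance as similarity, and a simple threshold-and-ratio rule. All function names and thresholds are hypothetical.

```python
def make_reference_models(training):
    """training maps phoneme -> list of parameter vectors; model = per-dimension mean."""
    models = {}
    for phoneme, vectors in training.items():
        dim = len(vectors[0])
        models[phoneme] = [sum(v[d] for v in vectors) / len(vectors) for d in range(dim)]
    return models


def similarity(params, model):
    """Assumed similarity: negative squared Euclidean distance (higher is closer)."""
    return -sum((p - m) ** 2 for p, m in zip(params, model))


def decide_phoneme(params, models):
    """Return (best phoneme, its similarity) for one frame."""
    scored = ((phoneme, similarity(params, model)) for phoneme, model in models.items())
    return max(scored, key=lambda item: item[1])


def is_speech(frames, models, min_similarity=-1.0, min_ratio=0.5):
    """Assumed final rule: the period is speech if enough frames match some phoneme well."""
    hits = sum(1 for frame in frames if decide_phoneme(frame, models)[1] >= min_similarity)
    return hits / len(frames) >= min_ratio


if __name__ == "__main__":
    # Toy two-dimensional parameters; real systems would use spectral features.
    training = {"a": [[1.0, 0.0], [1.2, 0.2]], "i": [[0.0, 1.0], [0.2, 1.2]]}
    models = make_reference_models(training)
    print(is_speech([[1.0, 0.1], [0.1, 1.0], [1.1, 0.0]], models))
    print(is_speech([[5.0, 5.0], [6.0, -4.0], [1.0, 0.1]], models))
```

    The two-level structure is the part taken from the abstract: decisions are made per frame first and only then aggregated over the period, so a single badly matched frame does not flip the overall result.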

    Audio signal compression method, audio signal compression apparatus, speech signal compression method, speech signal compression apparatus, speech recognition method, and speech recognition apparatus
    4.
    Granted patent (status: in force)

    Publication No.: US06477490B2

    Publication date: 2002-11-05

    Application No.: US09892745

    Filing date: 2001-06-28

    IPC classification: G10L19/06

    CPC classification: H04B1/665 G10L2019/0005

    Abstract: An audio signal compression apparatus for compressively coding an input audio signal comprises a time-to-frequency transformation unit for transforming the input audio signal to a frequency domain signal; a spectrum envelope calculation unit for calculating a spectrum envelope having different resolutions for different frequencies, from the input audio signal, using a weighting function on frequency based on human auditory characteristics; a normalization unit for normalizing the frequency domain signal using the spectrum envelope to obtain a residual signal; a power normalization unit for normalizing the residual signal by the power; an auditory weighting calculation unit for calculating weighting coefficients on frequency, based on the spectrum of the input audio signal and human auditory characteristics; and a multi-stage quantization device having plural stages of vector quantizers connected in series, to which the normalized residual signal is input, and at least one of the vector quantizers quantizing the residual signal using the weighting coefficients. Therefore, a low frequency band, which is auditively important, can be analyzed with a higher frequency resolution as compared with a high frequency band, whereby efficient signal compression utilizing human auditory characteristics is realized.

    Speech recognition method and apparatus using frequency warping of linear prediction coefficients
    5.
    Granted patent (status: in force)

    Publication No.: US06311153B1

    Publication date: 2001-10-30

    Application No.: US09165297

    Filing date: 1998-10-02

    IPC classification: G01L21/00

    CPC classification: H04B1/665 G10L2019/0005

    Abstract: An audio signal compression apparatus for compressively coding an input audio signal comprises a time-to-frequency transformation unit for transforming the input audio signal to a frequency domain signal; a spectrum envelope calculation unit for calculating a spectrum envelope having different resolutions for different frequencies, from the input audio signal, using a weighting function on frequency based on human auditory characteristics; a normalization unit for normalizing the frequency domain signal using the spectrum envelope to obtain a residual signal; a power normalization unit for normalizing the residual signal by the power; an auditory weighting calculation unit for calculating weighting coefficients on frequency, based on the spectrum of the input audio signal and human auditory characteristics; and a multi-stage quantization device having plural stages of vector quantizers connected in series, to which the normalized residual signal is input, and at least one of the vector quantizers quantizing the residual signal using the weighting coefficients. Therefore, a low frequency band, which is auditively important, can be analyzed with a higher frequency resolution as compared with a high frequency band, whereby efficient signal compression utilizing human auditory characteristics is realized.

    Multistage inverse quantization having the plurality of frequency bands
    6.
    Patent application (status: in force)

    Publication No.: US20050060147A1

    Publication date: 2005-03-17

    Application No.: US10954589

    Filing date: 2004-10-01

    IPC classification: G10L19/00 G10L21/02

    Abstract: With respect to audio signal coding and decoding apparatuses, there is provided a coding apparatus that enables a decoding apparatus to reproduce an audio signal even though it does not use all of the data from the coding apparatus, and a decoding apparatus corresponding to the coding apparatus. A quantization unit constituting a coding apparatus includes a first sub-quantization unit comprising sub-quantization units for low-band, intermediate-band, and high-band; a second sub-quantization unit for quantizing quantization errors from the first sub-quantization unit; and a third sub-quantization unit for quantizing quantization errors which have been processed by the first sub-quantization unit and the second sub-quantization unit.

Apparatus for expanding narrowband speech to wideband speech by codebook correspondence of linear mapping functions
    7.
    Granted patent (status: in force)

    Publication No.: US5978759A

    Publication date: 1999-11-02

    Application No.: US157419

    Filing date: 1998-09-21

    Abstract: Apparatus for expanding the bandwidth of speech signals such that a narrowband speech signal is input and digitized, the spectral envelope information and residual information are extracted from the digitized signal by linear predictive coding analysis, the spectral envelope information is expanded into wideband information by a spectral envelope converter, the residual information is expanded into wideband information by a residual converter, the converted spectral envelope information and residual information are combined to produce a wideband speech signal, frequency information not contained in the input signal is extracted from the obtained wideband speech signal by a filter, and the resulting signal is added to the original digitized input signal, and the obtained signal is converted into an analog signal as the output signal of the apparatus. The apparatus comprises a linear mapping function codebook used for converting spectral parameters, and a weights calculator and an adder for weighting and summing function outputs.
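
    The last sentence of this abstract (a codebook of linear mapping functions whose outputs are weighted and summed) can be pictured with the short Python sketch below. It is a guess at the general shape only: the abstract does not say how the weights are calculated, so normalized inverse squared distance to a set of codebook centroids is assumed, and the names weights, expand, centroids, and mappings are hypothetical.

```python
def mat_vec(matrix, vec):
    """Apply one linear mapping function (a matrix) to a parameter vector."""
    return [sum(a * x for a, x in zip(row, vec)) for row in matrix]


def weights(narrow, centroids, eps=1e-9):
    """Assumed weights calculator: normalized inverse squared distance to each centroid."""
    inverse = [
        1.0 / (sum((x - c) ** 2 for x, c in zip(narrow, centroid)) + eps)
        for centroid in centroids
    ]
    total = sum(inverse)
    return [w / total for w in inverse]


def expand(narrow, centroids, mappings):
    """Weight and sum the outputs of every linear mapping in the codebook."""
    w = weights(narrow, centroids)
    outputs = [mat_vec(mapping, narrow) for mapping in mappings]
    dim = len(outputs[0])
    return [sum(wi * out[d] for wi, out in zip(w, outputs)) for d in range(dim)]


if __name__ == "__main__":
    # Toy codebook: two entries, each mapping 2 narrowband to 3 wideband parameters.
    centroids = [[0.0, 0.0], [10.0, 10.0]]
    mappings = [
        [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]],
        [[2.0, 0.0], [0.0, 2.0], [0.0, 0.0]],
    ]
    print(expand([0.1, 0.0], centroids, mappings))
```

    Blending the mapping outputs with soft weights, instead of picking the single nearest codebook entry, keeps the converted parameters continuous as the input moves between codebook regions.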

    Audio decoding apparatus, audio coding apparatus, and system comprising the apparatuses
    8.
    Granted patent (status: in force)

    Publication No.: US08688442B2

    Publication date: 2014-04-01

    Application No.: US13433063

    Filing date: 2012-03-28

    IPC classification: G10L19/02

    Abstract: An audio decoding apparatus comprises: a plurality of decoding units; a band replicating unit which processes a decoded signal obtained when a corresponding decoding unit decodes a coded signal, according to a scheme specified by transmitted information; and an information transmitting unit which transmits, to a signal processing unit, information identifying the corresponding decoding unit from among the plurality of decoding units.

    HYBRID SOUND SIGNAL DECODER, HYBRID SOUND SIGNAL ENCODER, SOUND SIGNAL DECODING METHOD, AND SOUND SIGNAL ENCODING METHOD
    9.
    Patent application (status: under examination, published)

    Publication No.: US20140058737A1

    Publication date: 2014-02-27

    Application No.: US13996644

    Filing date: 2012-10-24

    IPC classification: G10L19/002

    Abstract: A hybrid sound signal decoder decodes a bitstream including audio frames encoded by an audio encoding process using a low delay filter bank and speech frames encoded by a speech encoding process using linear prediction coefficients. When a current frame to be decoded is an ith frame which is an initial speech frame after switching from an audio frame to a speech frame, the hybrid sound signal decoder generates sub-frames which are a signal corresponding to an i−1th frame before being encoded, using a sub-frame which is a signal generated using a signal of the i−1th frame before being encoded, the signal of the i−1th frame being obtained by decoding the ith frame.

    Signal processing device
    10.
    Granted patent (status: in force)

    Publication No.: US08284961B2

    Publication date: 2012-10-09

    Application No.: US11995571

    Filing date: 2006-07-10

    IPC classification: H04B1/00

    Abstract: A signal processing device includes a generation unit that generates a second signal from a first signal that is obtained by down mixing two signals; a mixing coefficient determination unit that determines, based on a value L and a value θ, a mixing degree for mixing the first signal and the second signal; and a mixing unit that mixes the first signal and the second signal based on the mixing degree determined by the mixing coefficient determination unit. The generation unit includes a first filter that generates a low frequency band signal in the second signal, from a low frequency band signal in the first signal; and a second filter that generates a high frequency band signal in the second signal, from a high frequency band signal in the first signal. The first filter is a filter unit which, for a complex-number signal, de-correlates an input signal and adds a reverberation component by using a delay unit and an all pass filter, and the processing unit is a filter unit different from the first filter.
