Voiced/unvoiced decision based on frequency band ratio
    1.
    发明授权
    Voiced/unvoiced decision based on frequency band ratio 失效
    基于频带比的发声/清音决定

    公开(公告)号:US5960388A

    公开(公告)日:1999-09-28

    申请号:US871335

    申请日:1997-06-09

    摘要: Input audio signal is divided on a block-by-block basis. Frequency domain conversion is done on each of the blocks. Voiced bands of the frequency domain data for one of the blocks are searched for a voiced band B.sub.VH with the highest center frequency if it is decided that there are one or more shift points of voiced (V)/unvoiced (UV) decision data of all the bands. The number N.sub.V of voiced bands having center frequency less than that of the band B.sub.VH is found, so as to decide whether a proportion of the voiced bands is equal to or higher than a predetermined threshold N.sub.th, thereby deciding one V/UV boundary point. Thus, it is possible to replace the V/UV decision data for each band by information on one demarcation in all bands, thereby reducing data volume and bit rate.

    摘要翻译: 输入音频信号是逐块分割的。 每个块都进行频域转换。 如果确定存在所有的有声(V)/无声(UV)判定数据的一个或多个移位点,则针对一个块的频域数据的有声频带搜索具有最高中心频率的有声波段BVH 乐队。 找到具有小于频带BVH的中心频率的有声频带的数量NV,以便确定有声频带的比例是否等于或高于预定阈值Nth,从而确定一个V / UV边界点。 因此,可以通过关于所有频带中的一个分界的信息来替换每个频带的V / UV判定数据,从而减少数据量和比特率。

    Voice encoding method and voice decoding method
    2.
    发明授权
    Voice encoding method and voice decoding method 失效
    语音编码方式和语音解码方式

    公开(公告)号:US5473727A

    公开(公告)日:1995-12-05

    申请号:US146580

    申请日:1993-11-01

    摘要: A compressed digital speech signal is encoded to provide a transmission error-resistant transmission signal. The compressed speech signal is derived from a digital speech signal by performing a pitch search on a block obtained by dividing the speech signal in time to provide pitch information for the block. The block of the speech signal is orthogonally transformed to provide spectral data, which is divided by frequency into plural bands in response to the pitch information. A voiced/unvoiced sound discrimination generates voiced/-unvoiced (V/UV) information indicating whether the spectral data in each of the plural bands represents a voiced or an unvoiced sound. The spectral data in the plural bands are interpolated to provide spectral amplitudes for a predetermined number of bands, independent of the pitch. Hierarchical vector quantizing is applied to the spectral amplitudes to generate upper-layer indices, representing an overview of the spectral amplitudes, and lower-layer indices, representing details of the spectral amplitudes. CRC error detection coding is applied to the upper-layer indices, the pitch information, and the V/UV information to generate CRC codes. Convolution coding for error correction is applied to the upper-layer indices, the higher-order bits of the lower-layer indices, the pitch information, the V/UV information, and the CRC codes. The convolution-coded quantities from two blocks of the speech signal are then interleaved in a frame of the transmission signal, together with the lower-order bits of the respective lower-layer indices.

    摘要翻译: 对压缩的数字语音信号进行编码,以提供传输错误传输信号。 通过对通过对语音信号进行时间分割获得的块来执行音调搜索,从数字语音信号导出压缩语音信号,以提供块的音调信息。 语音信号的块被正交变换以提供频谱数据,频谱数据响应于音调信息被频率划分成多个频带。 有声/无声的声音识别产生指示多个频带中的每一个中的频谱数据是否表示有声或无声的声音/未发音(V / UV)信息。 对多个频带中的频谱数据进行插值,以提供与频率无关的预定数量频带的频谱幅度。 层次矢量量化被应用于频谱幅度以产生表示频谱幅度的概览的上层索引,以及表示频谱幅度细节的下层索引。 将CRC错误检测编码应用于上层索引,音调信息和V / UV信息以产生CRC码。 用于纠错的卷积编码被应用于上层索引,下层索引的高阶比特,音调信息,V / UV信息和CRC码。 然后,将来自语音信号的两个块的卷积编码量与发送信号的帧一起与各个下层索引的低位比特进行交织。

    Voice analysis-synthesis method using noise having diffusion which
varies with frequency band to modify predicted phases of transmitted
pitch data blocks
    3.
    发明授权
    Voice analysis-synthesis method using noise having diffusion which varies with frequency band to modify predicted phases of transmitted pitch data blocks 失效
    使用具有随频带变化的扩散的噪声的语音分析合成方法来修改所传输的音调数据块的预测相位

    公开(公告)号:US5878388A

    公开(公告)日:1999-03-02

    申请号:US871812

    申请日:1997-06-09

    摘要: A high efficiency encoding method for encoding data on frequency axis obtained by dividing an input audio signal on block-by-block basis and converting the signal onto the frequency axis, wherein V bands are searched for a band B.sub.VH with the highest center frequency if it is decided that there are one or more shift points of voiced (V)/unvoiced (UV) decision data of all bands on the frequency axis, and wherein the number of V bands N.sub.V up to the band B.sub.VH is found, so as to decide whether proportion of the V bands is equal to or higher than a predetermined threshold N.sub.th, thereby deciding one V/UV boundary point. Thus, it is possible to replace the V/UV decision data for each band by information on one demarcation in all bands, thereby to reduce data volume and to reduce bit rate. Also, by using two-stage hierarchical vector quantization in quantizing the data on the frequency axis, operation volume for codebook search and memory capacity of the codebook are reduced.

    摘要翻译: 一种高效率编码方法,用于对通过逐块分割输入音频信号而获得的频率轴上的数据进行编码,并将该信号转换到频率轴上,其中对于具有最高中心频率的频带BVH搜索V频带,如果 确定在频率轴上存在所有频带的有声(V)/无声(UV))判决数据的一个或多个移位点,并且其中找到直到频带BVH的V频带NV的数量,以便决定 V带的比例是否等于或高于预定阈值Nth,从而确定一个V / UV边界点。 因此,可以通过关于所有频带中的一个分界的信息来替换每个频带的V / UV判定数据,从而减少数据量并降低比特率。 此外,通过在量化频率轴上的数据时使用两级分层矢量量化,减少码本搜索的操作量和码本的存储容量。

    High efficiency encoding method
    4.
    发明授权
    High efficiency encoding method 失效
    高效编码方法

    公开(公告)号:US5765127A

    公开(公告)日:1998-06-09

    申请号:US150082

    申请日:1993-12-06

    摘要: A high efficiency encoding method for encoding data on frequency axis obtained by dividing an input audio signal on block-by-block basis and converting the signal onto the frequency axis, wherein V bands are searched for a band B.sub.VH with the highest center frequency if it is decided that there are one or more shift points of voiced (V)/unvoiced (UV) decision data of all bands on the frequency axis, and wherein the number of V bands N.sub.V up to the band B.sub.VH is found, so as to decide whether proportion of the V bands is equal to or higher than a predetermined threshold N.sub.th, thereby deciding one V/UV boundary point. Thus, it is possible to replace the V/UV decision data for each band by information on one demarcation in all bands, thereby to reduce data volume and to reduce bit rate. Also, by using two-stage hierarchical vector quantization in quantizing the data on the frequency axis, operation volume for codebook search and memory capacity of the codebook are reduced.

    摘要翻译: PCT No.PCT / JP93 / 00323 Sec。 371日期:1993年12月6日 102(e)日期1993年12月6日PCT提交1993年2月18日PCT公布。 第WO93 / 19459号公报 日期1993年9月30日一种用于编码通过逐块分割输入音频信号而获得的频率轴上的数据的高效编码方法,并将信号转换到频率轴上,其中V带被搜索带BVH, 如果确定频率轴上所有频带的有声(V)/清音(UV)判定数据都有一个或多个位移点,则发现最高中心频率,并且其中找到频带BV的数量,直到频带BVH 以确定V波段的比例是否等于或高于预定阈值Nth,从而确定一个V / UV边界点。 因此,可以通过关于所有频带中的一个分界的信息来替换每个频带的V / UV判定数据,从而减少数据量并降低比特率。 此外,通过在量化频率轴上的数据时使用两级分层矢量量化,减少码本搜索的操作量和码本的存储容量。

    Method and apparatus for speech encoding and decoding by sinusoidal analysis and waveform encoding with phase reproducibility
    5.
    发明授权
    Method and apparatus for speech encoding and decoding by sinusoidal analysis and waveform encoding with phase reproducibility 失效
    通过正弦分析和具有相位再现性的波形编码进行语音编码和解码的方法和装置

    公开(公告)号:US07454330B1

    公开(公告)日:2008-11-18

    申请号:US08736546

    申请日:1996-10-24

    IPC分类号: G10L19/14

    摘要: A speech encoding method and apparatus in which an input speech signal is divided in terms of blocks or frames as encoding units and encoded in terms of the encoding units, whereby explosive and fricative consonants can be impeccably reproduced, while there is an attenuation of the occurrence of foreign sounds being generated at a transient portion between voiced (V) and unvoiced (UV) portions, so that the speech with high clarity devoid of “stuffed” feeling may be produced. The encoding apparatus includes a first encoding unit for finding residuals of linear predictive coding (LPC) of an input speech signal for performing harmonic coding and a second encoding unit for encoding the input speech signal by waveform coding. The first encoding unit and the second encoding unit are used for encoding a voiced (V) portion and an unvoiced (UV) portion of the input signal, respectively. Code excited linear prediction (CELP) encoding employing vector quantization by a closed loop search of an optimum vector using an analysis-by-synthesis method is used for the second encoding unit. A corresponding decoding method and apparatus is also provided.

    摘要翻译: 一种语音编码方法和装置,其中输入语音信号以块或帧为单位编码,并以编码单位编码,由此可以无可挑剔地复制爆炸和摩擦辅音,同时存在衰减的发生 在V(V)和无声(UV)部分之间的瞬态部分产生外来声音,从而可能产生具有高“透明度”感的语音。 编码装置包括:第一编码单元,用于求出用于执行谐波编码的输入语音信号的线性预测编码(LPC)的残差;以及第二编码单元,用于通过波形编码对输入的语音信号进行编码。 第一编码单元和第二编码单元分别用于对输入信号的有声(V)部分和无声(UV)部分进行编码。 第二编码单元使用通过使用合成分析法的最佳向量的闭环搜索采用矢量量化的码激励线性预测(CELP)编码。 还提供了相应的解码方法和装置。

    Apparatus and method for encoding a signal as well as apparatus and method for decoding signal
    9.
    发明授权
    Apparatus and method for encoding a signal as well as apparatus and method for decoding signal 有权
    用于编码信号的装置和方法以及用于解码信号的装置和方法

    公开(公告)号:US06484140B2

    公开(公告)日:2002-11-19

    申请号:US09935881

    申请日:2001-08-23

    IPC分类号: G01L1914

    摘要: An apparatus and a method for encoding an input signal on the time base through orthogonal transform involves removing the correlation of signal waveform based on parameters obtained by linear predictive coding (LPC) analysis and pitch analysis of the input signal on the time base prior to the orthogonal transform. A normalization circuit section removes the correlation of the signal waveform and takes out the residue by an LPC inverse filter and pitch inverse filter and sends the residue to an orthogonal transform circuit section. The LPC parameters and the pitch parameters are sent to a bit allocation calculation circuit. A coefficient quantization section quantizes the coefficients from the orthogonal transform circuit section according to the number of allocated bits from the bit allocation calculation section.

    摘要翻译: 通过正交变换在时基上对输入信号进行编码的装置和方法包括基于在线性预测编码(LPC)分析获得的参数和基于时间基准的输入信号的音调分析来消除信号波形的相关性 正交变换。 归一化电路部分去除信号波形的相关性,并通过LPC逆滤波器和音调反向滤波器取出残差,并将其发送到正交变换电路部分。 LPC参数和音调参数被发送到比特分配计算电路。 系数量化部根据来自比特分配计算部的分配比特数,对来自正交变换电路部的系数进行量化。

    Apparatus and method for encoding a signal as well as apparatus and method for decoding a signal
    10.
    发明授权
    Apparatus and method for encoding a signal as well as apparatus and method for decoding a signal 有权
    用于编码信号的装置和方法以及用于对信号进行解码的装置和方法

    公开(公告)号:US06353808B1

    公开(公告)日:2002-03-05

    申请号:US09422250

    申请日:1999-10-21

    IPC分类号: G10L1904

    摘要: An apparatus and a method for encoding an input signal on the time base through orthogonal transform involves removing the correlation of signal waveform on the basis of the parameters obtained by means of linear predictive coding (LPC) analysis and pitch analysis of the input signal on the time base prior to the orthogonal transform. The time base input signal from input terminal is sent to a normalization circuit section and a LPC analysis circuit. The normalization circuit section removes the correlation of the signal waveform and takes out the residue by an LPC inverse filter and pitch inverse filter and sends the residue to an orthogonal transform circuit section. The LPC parameters from the LPC analysis circuit and the pitch parameters from the pitch analysis circuit are sent to a bit allocation calculation circuit. A coefficient quantization section quantizes the coefficients from the orthogonal transform circuit section according to the number of allocated bits from the bit allocation calculation section.

    摘要翻译: 通过正交变换在时基上对输入信号进行编码的装置和方法包括基于通过线性预测编码(LPC)分析获得的参数和对输入信号的音调分析来去除信号波形的相关性 在正交变换之前的时基。 来自输入端子的时基输入信号被发送到归一化电路部分和LPC分析电路。 归一化电路部分去除信号波形的相关性,并通过LPC逆滤波器和音调反向滤波器取出残差,并将其发送到正交变换电路部分。 来自LPC分析电路的LPC参数和来自音调分析电路的音调参数被发送到比特分配计算电路。 系数量化部根据来自比特分配计算部的分配比特数,对来自正交变换电路部的系数进行量化。