Method and apparatus for non-speech activity reduction of a low bit rate digital voice message
    1.
    发明授权
    Method and apparatus for non-speech activity reduction of a low bit rate digital voice message 失效
    用于低比特率数字语音消息的非语音活动减少的方法和装置

    公开(公告)号:US06370500B1

    公开(公告)日:2002-04-09

    申请号:US09409187

    申请日:1999-09-30

    IPC分类号: G10L1900

    CPC分类号: G10L19/012 G10L25/78

    摘要: A technique is used in a speech encoder (107) that reduces non-speech activity of a low bit rate digital voice message. Speech model parameters that include quantized speech spectral parameter vectors are generated in a sequence of frames. A determination is made as to which frames of the sequence of frames are voiced frames and which frames are unvoiced frames. A consecutive sequence of frames of unvoiced frames is identified (2330) as an unvoiced burst when a length, NUV, of the consecutive sequence of frames exceeds a predetermined length, Ns. A non-speech activity portion of the unvoiced burst is identified (2335-2365) and removed.

    摘要翻译: 在语音编码器(107)中使用技术来减少低比特率数字语音消息的非语音活动。 包括量化语音频谱参数矢量的语音模型参数在帧序列中生成。 确定帧序列的哪些帧是浊音帧,哪些帧是清音帧。 当连续帧序列的长度NUV超过预定长度Ns时,确定无声帧的连续序列(2330)为无声突发。 确定清音突发的非语音活动部分(2335-2365)并移除。

    Method and apparatus for transferring low bit rate digital voice messages using incremental messages
    2.
    发明授权
    Method and apparatus for transferring low bit rate digital voice messages using incremental messages 失效
    用于使用增量消息传送低比特率数字语音消息的方法和装置

    公开(公告)号:US06772126B1

    公开(公告)日:2004-08-03

    申请号:US09410006

    申请日:1999-09-30

    IPC分类号: G10L2104

    CPC分类号: G10L19/24

    摘要: A system controller (106) is for transferring a low bit rate digital voice message. The system controller generates from an analog voice signal representing the voice message a set of speech model parameters, and generates a first derived set of speech model parameters from a first subset of the set of speech model parameters, the first derived set encoding the voice signal at a second voice quality and second vocoder rate that are less, respectively, than a first voice quality and vocoder rate. The system controller transmits (3610) the low bit rate-digital voice message comprising the first derived set of speech model parameters to a communication receiver (114). The communication receiver requests (3640) an incremental message when the quality of the voice message is unsatisfactory. The system controller generates and transmits (3555, 3650) an incremental message-and the communication receiver uses (3660) the incremental message to generate a higher quality voice message.

    摘要翻译: 系统控制器(106)用于传送低比特率数字话音消息。 系统控制器从表示语音消息的模拟语音信号产生一组语音模型参数,并且从语音模型参数集合的第一子集生成第一导出的语音模型参数集,编码语音信号的第一导出集合 分别具有比第一语音质量和声码率更小的第二语音质量和第二声码器速率。 系统控制器将包括第一导出的语音模型参数集合的低比特率数字话音消息(3610)发送到通信接收机(114)。 当语音消息的质量不令人满意时,通信接收器请求(3640)增量消息。 系统控制器生成并发送增量消息(3555,3650),通信接收器使用增量消息(3660)生成更高质量的语音消息。

    Method and apparatus for dynamic segmentation of a low bit rate digital voice message
    3.
    发明授权
    Method and apparatus for dynamic segmentation of a low bit rate digital voice message 失效
    低比特率数字语音消息的动态分割方法和装置

    公开(公告)号:US06418405B1

    公开(公告)日:2002-07-09

    申请号:US09410140

    申请日:1999-09-30

    IPC分类号: G10L1900

    CPC分类号: G10L15/04 G10L19/0018

    摘要: A system controller (106) includes a speech encoder (107) that dynamically segments frames of a low bit rate digital voice message. Speech model parameters have been generated in a sequence of frames. The speech model parameters include quantized speech spectral parameter vectors. The speech encoder selects (1820) a first quantized speech spectral parameter vector as a current anchor vector, selects (1820, 1830) a second quantized speech spectral parameter vector located a predetermined number of frames (LMAX) from the current anchor vector as a target speech parameter vector, and perturbs (1840) the target speech parameter vector to derive a plurality (K) of perturbed speech parameter vectors.

    摘要翻译: 系统控制器(106)包括动态地分段低比特率数字语音消息的帧的语音编码器(107)。 语音模型参数已经在一系列帧中生成。 语音模型参数包括量化语音频谱参数向量。 语音编码器选择(1820)作为当前锚矢量的第一量化语音频谱参数矢量,选择(1820,1830)从位于当前锚矢量的预定数量帧(LMAX)的第二量化语音频谱参数矢量作为目标 语音参数向量,并扰动(1840)目标语音参数向量,以导出多个(K)扰动语音参数向量。

    Method and apparatus for encoding and decoding frames of voice model parameters into a low bit rate digital voice message
    4.
    发明授权
    Method and apparatus for encoding and decoding frames of voice model parameters into a low bit rate digital voice message 失效
    用于将语音模型参数帧编码和解码为低比特率数字话音消息的方法和装置

    公开(公告)号:US06496798B1

    公开(公告)日:2002-12-17

    申请号:US09409183

    申请日:1999-09-30

    IPC分类号: G10L1900

    CPC分类号: G10L19/167

    摘要: A system controller (106) includes a speech encoder (107) that encodes a low bit rate digital voice message. The speech encoder sets values of words of a header of the encoded message. The values of the words define a quantity of frames in the voice message, N, and define a vocoder rate used for the encoded message. The speech encoder sets a state of each indicator in each frame status field of N frame status fields that are transmitted after the header of the encoded message. The speech encoder assembles N frame data fields, wherein each of the frame data fields comprises a set of data words. The N frame data fields follow the N frame status fields. Each set of data words conforms to at least one of the vocoder rate and the states of the indicators. A decoder (3310) decodes the encoded low bit rate digital message.

    摘要翻译: 系统控制器(106)包括编码低比特率数字语音消息的语音编码器(107)。 语音编码器设置编码消息的报头的字的值。 这些字的值定义了语音消息N中的帧数量,并且定义了用于编码消息的声码器速率。 语音编码器在编码消息的头部之后发送的N个帧状态字段的每个帧状态字段中设置每个指示符的状态。 语音编码器组合N个帧数据字段,其中每个帧数据字段包括一组数据字。 N帧数据字段遵循N帧状态字段。 每组数据字符合至少一个声码器速率和指示器的状态。 解码器(3310)对编码的低比特率数字消息进行解码。

    Method and apparatus for pitch determination of a low bit rate digital voice message
    5.
    发明授权
    Method and apparatus for pitch determination of a low bit rate digital voice message 失效
    用于音高确定低比特率数字语音消息的方法和装置

    公开(公告)号:US06418407B1

    公开(公告)日:2002-07-09

    申请号:US09410007

    申请日:1999-09-30

    IPC分类号: G10L1104

    CPC分类号: G10L25/90 G10L19/09 G10L19/10

    摘要: A pitch determiner (931) of a system controller (106) that generates a smoothed pitch value for a current frame of a low bit rate voice message includes a pitch function generator (955) that generates a pitch detection function (PDF) for each frame of digital samples of a voice signal, a pitch candidate selector (960) that selects a future frame pitch candidate from a pitch detection function (PDF), and a pitch adjuster (978) that generates the smoothed pitch value. The pitch adjuster includes a subharmonic pitch corrector (965) that determines a future frame pitch value by performing pitch subharmonic correction of a future frame pitch candidate using a roughness factor of the frequency transformed window.

    摘要翻译: 产生低比特率语音消息的当前帧的平滑的音调值的系统控制器(106)的音调确定器(931)包括:产生用于每个帧的音调检测功能(PDF)的音调函数发生器(955) 语音信号的数字样本,从音调检测功能(PDF)中选择未来的帧间距候选的音调候选选择器(960)以及产生平滑的音调值的音调调节器(978)。 音调调节器包括次谐波音调校正器(965),其通过使用频率变换窗口的粗糙度因子执行对未来帧音调候选的音调次谐波校正来确定未来帧音调值。

    Very low bit rate voice messaging system using variable rate backward
search interpolation processing
    6.
    发明授权
    Very low bit rate voice messaging system using variable rate backward search interpolation processing 失效
    使用可变速率反向搜索插值处理的非常低比特率语音消息系统

    公开(公告)号:US5682462A

    公开(公告)日:1997-10-28

    申请号:US528033

    申请日:1995-09-14

    IPC分类号: G10L19/00 G10L19/06 G10L9/00

    CPC分类号: G10L19/06

    摘要: A method and apparatus is provided for a low bit rate speech transmission. Speech spectral parameter vectors are generated from a voice message and stored in a sequence of speech spectral parameter vectors within a speech spectral parameter matrix. A first index identifying a first speech parameter template corresponding to a first speech spectral parameter vector of the sequence of speech spectral parameter vectors is transmitted. A subsequent speech spectral parameter vector of the sequence is selected and a subsequent speech parameter template is determined having a subsequent index. One or more intervening interpolated speech parameter templates are interpolated between the first speech parameter template and the subsequent speech parameter template. The one or more intervening speech spectral parameter vectors are compared to the corresponding one or more intervening interpolated speech parameter templates to derive a distance. The subsequent index is transmitted when the distance derived is less than or equal to a predetermined distance.

    摘要翻译: 提供了一种用于低比特率语音传输的方法和装置。 语音频谱参数矢量从语音消息生成并存储在语音频谱参数矩阵内的语音频谱参数矢量序列中。 发送识别对应于语音频谱参数矢量序列的第一语音频谱参数向量的第一语音参数模板的第一索引。 选择该序列的后续语音频谱参数矢量,并且确定随后的语音参数模板具有后续索引。 在第一语音参数模板和随后的语音参数模板之间插入一个或多个插入的内插语音参数模板。 将一个或多个中间语音频谱参数矢量与相应的一个或多个插入的内插语音参数模板进行比较以导出距离。 当所导出的距离小于或等于预定距离时,传送随后的索引。

    Pitch determiner for a speech analyzer
    7.
    发明授权
    Pitch determiner for a speech analyzer 失效
    语音分析仪的音调确定器

    公开(公告)号:US6018706A

    公开(公告)日:2000-01-25

    申请号:US999171

    申请日:1997-12-29

    IPC分类号: G10L11/04 G10L19/12 G10L7/06

    摘要: A pitch determiner (414) for use with a speech analyzer includes a pitch function generator (414) which generates a plurality of pitch components representing a pitch function for one or more sequential segments of speech. which are represented by a predetermined number of digitized speech samples. A pitch enhancer (1116) enhances the pitch function of a current segment of speech utilizing the pitch function of one or more sequential segments of speech to generate a plurality of enhanced pitch components. A pitch detector (1118) detects the pitch of the current segment of speech by determining the pitch of an enhanced pitch component having a largest amplitude of the plurality of enhanced pitch components.

    摘要翻译: 与语音分析器一起使用的音调确定器(414)包括音调函数发生器(414),其生成表示用于一个或多个连续语音段的音调函数的多个音调分量。 其由预定数量的数字化语音样本表示。 音调增强器(1116)利用一个或多个连续语音段的音调函数来增强当前语音段的音调函数,以产生多个增强音调分量。 音高检测器(1118)通过确定具有多个增强音调分量的最大振幅的增强音调分量的音调来检测当前语音段的音高。

    Method and apparatus for minimal redundancy error detection and
correction of voice spectrum parameters
    8.
    发明授权
    Method and apparatus for minimal redundancy error detection and correction of voice spectrum parameters 失效
    用于语音频谱参数的最小冗余错误检测和校正的方法和装置

    公开(公告)号:US5636231A

    公开(公告)日:1997-06-03

    申请号:US523578

    申请日:1995-09-05

    IPC分类号: G10L19/00 G10L19/06 H03M13/00

    CPC分类号: G10L19/07 G10L19/005

    摘要: Error detection and correction of a received message, such as a digitized voice message is achieved by generating (318) interpolated vectors for each error vector corresponding to a codebook index in a sequence of codebook indexes representing parameters of portions of the message. A plurality of error corrected candidate vectors for the vector corresponding to the codebook index in error, are generated (322,324,326) by flipping one bit in a sequence of bits representing the codebook index in error. The error corrected candidate vector which has a minimal difference from its corresponding interpolated vector is used (338) to replace the error vector. In the case of digital voice, the vectors are spectral vectors which represent spectral information for a time sample of a voice message. An ordering property of vector components is exploited to detect errors in a received codebook index without parity bits.

    摘要翻译: 通过对表示信息部分参数的代码簿索引序列中的码本索引生成每个误差向量的内插向量来实现对诸如数字化语音消息的接收消息的错误检测和校正。 通过在代表码本索引的位的序列中翻转一位来产生用于与错误码本索引相对应的矢量的多个纠错候选向量(322,324,326)。 使用与其对应的内插向量具有最小差异的误差校正候选向量(338)来替换误差向量。 在数字语音的情况下,矢量是表示语音消息的时间采样的频谱信息的频谱矢量。 利用矢量分量的排序属性来检测接收到的码本索引中没有奇偶校验位的错误。

    MBE synthesizer for very low bit rate voice messaging systems
    9.
    发明授权
    MBE synthesizer for very low bit rate voice messaging systems 失效
    用于非常低比特率语音消息系统的MBE合成器

    公开(公告)号:US5684926A

    公开(公告)日:1997-11-04

    申请号:US592252

    申请日:1996-01-26

    CPC分类号: G10L19/16 G10L19/09 G10L19/10

    摘要: An MBE synthesizer (116) for generating a segment of speech from compressed speech data received by a receiver (2004). The compressed speech data includes one or more indexes (2240, 2242) and pitch data (2248). The MBE synthesizer (116) includes the following: an excitation generator (2222) utilizing a transform function for generating transformed excitation components responsive to the pitch data (2248). A memory (3006) for storing a table of predetermined spectral vectors (2205) and associated predetermined voicing vectors (2203). A harmonic amplitude estimator (2209) that is responsive to the one or more predetermined spectra/vectors identified by the indexes (2240, 2242) received, that generates harmonic amplitude control signals. The harmonic amplitude estimator (2209) which includes a peak detector (2503), a peak enhancer (2505), a valley detector (2507), a valley enhancer (2509). A multi-band voicing controller (2214), responsive to the predetermined voicing vectors which are associated with the one or more predetermined spectral vectors identified, for controlling a selection of the excitation components.

    摘要翻译: 一种用于从由接收机接收的压缩语音数据产生语音段的MBE合成器(116)。 压缩语音数据包括一个或多个索引(2240,2242)和音调数据(2248)。 MBE合成器(116)包括以下:利用变换函数的激励发生器(2222),用于响应于音调数据(2248)产生变换的激励分量。 一种用于存储预定光谱向量(2205)和相关联的预定发声矢量(2203)的表的存储器(3006)。 响应于由所接收的指标(2240,2242)所识别的一个或多个预定光谱/矢量的谐波振幅估计器(2209),其产生谐波幅度控制信号。 谐波振幅估计器(2209)包括峰值检测器(2503),峰值增强器(2505),谷值检测器(2507),谷值增强器(2509)。 多频带发声控制器(2214)响应于与所识别的一个或多个预定频谱矢量相关联的预定语音向量,用于控制激励分量的选择。

    Apparatus and method for coding excitation parameters in a very low bit
rate voice messaging system
    10.
    发明授权
    Apparatus and method for coding excitation parameters in a very low bit rate voice messaging system 失效
    用于在非常低比特率语音消息系统中编码激励参数的装置和方法

    公开(公告)号:US5666350A

    公开(公告)日:1997-09-09

    申请号:US603677

    申请日:1996-02-20

    IPC分类号: G10L19/02 H04J3/17

    CPC分类号: G10L19/0212 H04J3/17

    摘要: An apparatus codes excitation parameters for very low bit rate voice messaging using a method that processes a voice message to generating speech parameters. The speech parameters are separated (316) to produce a first group of energy parameters and a second group of pitch and voicing parameters. Subsequently, the first group of energy parameters are encoded and compressed using a non-uniform root-mean-square scalar process (318) to create a first plurality of encoded data. Additionally, the second group of pitch and voicing parameters are compressed, encoded, and combined into a single parameter using a three slope vector encoding process (320) that creates a second plurality of encoded data. Finally, the first and second plurality of encoded data are multiplexed (322) to create a multiplexed signal for transmission, the multiplexed signal representing the voice message.

    摘要翻译: 一种装置使用处理语音消息以产生语音参数的方法来编码用于非常低比特率语音消息的激励参数。 语音参数被分离(316)以产生第一组能量参数和第二组音调和发音参数。 随后,使用非均匀均方根标量过程(318)对第一组能量参数进行编码和压缩,以创建第一多个编码数据。 另外,使用创建第二多个编码数据的三斜率矢量编码处理(320),第二组音调和发声参数被压缩,编码和组合成单个参数。 最后,第一和第二多个编码数据被多路复用(322)以产生用于发送的复用信号,表示语音消息的复用信号。