Speech compression system and method
    1.
    发明授权
    Speech compression system and method 有权
    语音压缩系统及方法

    公开(公告)号:US07593852B2

    公开(公告)日:2009-09-22

    申请号:US11700481

    申请日:2007-01-30

    IPC分类号: G10L15/20

    摘要: The invention improves the encoding and decoding of speech by focusing the encoding on the perceptually important characteristics of speech. The system analyzes selected features of an input speech signal, and first performing a common frame based speech coding of an input speech signal. The system then performs a speech coding based on either a first speech coding mode or a second speech coding mode. The selection of a mode is based on characteristics of the input speech signal. The first speech coding mode uses a first framing structure and the second speech coding mode uses a second framing structure.

    摘要翻译: 本发明通过将编码聚焦在语音的重要特征上来改进语音的编码和解码。 该系统分析输入语音信号的所选特征,并且首先对输入语音信号进行基于公共帧的语音编码。 然后,该系统基于第一语音编码模式或第二语音编码模式执行语音编码。 模式的选择基于输入语音信号的特性。 第一语音编码模式使用第一成帧结构,第二语音编码模式使用第二帧结构。

    Codebook tables for multi-rate encoding and decoding with pre-gain and delayed-gain quantization tables
    2.
    发明授权
    Codebook tables for multi-rate encoding and decoding with pre-gain and delayed-gain quantization tables 有权
    用于具有预增益和延迟增益量化表的多速率编码和解码的码表

    公开(公告)号:US06757649B1

    公开(公告)日:2004-06-29

    申请号:US10409404

    申请日:2003-04-08

    IPC分类号: G10L1912

    摘要: A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.

    摘要翻译: 公开了能够将语音信号编码为比特流以进行后续解码以产生合成语音的语音压缩系统。 语音压缩系统通过将期望的平均比特率与重构语音的感知质量进行平衡来优化比特流消耗的带宽。 语音压缩系统包括全速率编解码器,半速率编解码器,四分之一速率编解码器和八速率编解码器。 基于速率选择来选择性地激活编解码器。 此外,基于类型分类,全速率和半速率编解码器被选择性地激活。 选择性地激活每个编解码器以以强调语音信号的不同方面的不同比特率对语音信号进行编码和解码,以增强合成语音的整体质量。

    System of encoding and decoding speech signals

    公开(公告)号:US06604070B1

    公开(公告)日:2003-08-05

    申请号:US09663734

    申请日:2000-09-15

    IPC分类号: G10L1912

    摘要: A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.

    Speech coding system and method using bi-directional mirror-image predicted pulses
    5.
    发明申请
    Speech coding system and method using bi-directional mirror-image predicted pulses 有权
    使用双向镜像预测脉冲的语音编码系统和方法

    公开(公告)号:US20090043574A1

    公开(公告)日:2009-02-12

    申请号:US12284623

    申请日:2008-09-23

    IPC分类号: G10L19/12 G10L19/00

    摘要: There is provided a method of decoding speech data generated from a speech signal. The method comprises receiving the speech data having at least one main pulse in a subframe of the speech data; generating a first predicted pulse, based on the at least one main pulse, on one side of the main pulse in the subframe of the speech data, wherein the first predicted pulse has a lower gain than the main pulse; generating a second predicted pulse, as a mirror image of the first predicted pulse on a reverse time scale, on the other side of the main pulse in the subframe of the speech data; reconstructing the speech signal using the at least one main pulse, the first predicted pulse and the second predicted pulse.

    摘要翻译: 提供了一种对从语音信号产生的语音数据进行解码的方法。 该方法包括:接收语音数据的子帧中具有至少一个主脉冲的语音数据; 基于所述至少一个主脉冲在所述语音数据的子帧中的所述主脉冲的一侧产生第一预测脉冲,其中所述第一预测脉冲具有比所述主脉冲更低的增益; 在语音数据的子帧中的主脉冲的另一侧上产生第二预测脉冲作为反时限上的第一预测脉冲的镜像; 使用所述至少一个主脉冲,所述第一预测脉冲和所述第二预测脉冲来重构所述语音信号。

    Conference bridge processing of speech in a packet network environment
    7.
    发明授权
    Conference bridge processing of speech in a packet network environment 有权
    会议桥处理语音在分组网环境中

    公开(公告)号:US06463414B1

    公开(公告)日:2002-10-08

    申请号:US09547832

    申请日:2000-04-12

    IPC分类号: G10L1102

    CPC分类号: G10L19/173

    摘要: There is provided a conference bridge or transcoder configured to intelligently handle multiple speech channels in the contest of a packet network, wherein various speech channels may adhere to variety of speech encoding standards. For example, the conference bridge establishes framing and alignment of multiple incoming speech channels associated with multiple participants, extracts parameters from the speech samples, mixes the parameters, and re-encodes the resulting speech samples for transmission to the participants. In one aspect, a speech processing method comprises decoding a first bitstream according to a first coding scheme to generate first speech samples and a first side information; generating second speech samples and a second side information using the first speech samples and the first side information, for use according to a second coding scheme; and creating a second bitstream, encoded based on the second coding scheme, using the second speech samples and the second side information.

    摘要翻译: 提供了一种配置成在分组网络的比赛中智能地处理多个语音信道的会议桥或代码转换器,其中各种语音信道可以遵循各种语音编码标准。 例如,会议桥建立与多个参与者相关联的多个输入语音信道的成帧和对准,从语音样本中提取参数,混合参数,并对所得到的语音样本进行重新编码以传输给参与者。 一方面,语音处理方法包括根据第一编码方案对第一比特流进行解码,以产生第一语音样本和第一侧信息; 使用第一语音样本和第一侧信息生成第二语音样本和第二侧信息,以便根据第二编码方案使用; 以及使用所述第二语音样本和所述第二侧信息来创建基于所述第二编码方案编码的第二比特流。

    Encoding and decoding speech signals variably based on signal classification
    8.
    发明授权
    Encoding and decoding speech signals variably based on signal classification 有权
    基于信号分类对语音信号进行编码和解码

    公开(公告)号:US06735567B2

    公开(公告)日:2004-05-11

    申请号:US10409430

    申请日:2003-04-08

    IPC分类号: G10L1304

    摘要: A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.

    摘要翻译: 公开了能够将语音信号编码为比特流以进行后续解码以产生合成语音的语音压缩系统。 语音压缩系统通过将期望的平均比特率与重构语音的感知质量进行平衡来优化比特流消耗的带宽。 语音压缩系统包括全速率编解码器,半速率编解码器,四分之一速率编解码器和八速率编解码器。 基于速率选择来选择性地激活编解码器。 此外,基于类型分类,全速率和半速率编解码器被选择性地激活。 选择性地激活每个编解码器以以强调语音信号的不同方面的不同比特率对语音信号进行编码和解码,以增强合成语音的整体质量。

    Bitstream protocol for transmission of encoded voice signals
    9.
    发明授权
    Bitstream protocol for transmission of encoded voice signals 有权
    用于传输编码语音信号的比特流协议

    公开(公告)号:US06581032B1

    公开(公告)日:2003-06-17

    申请号:US09662828

    申请日:2000-09-15

    IPC分类号: G10L1912

    摘要: A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.

    摘要翻译: 公开了能够将语音信号编码为比特流以进行后续解码以产生合成语音的语音压缩系统。 语音压缩系统通过将期望的平均比特率与重构语音的感知质量进行平衡来优化比特流消耗的带宽。 语音压缩系统包括全速率编解码器,半速率编解码器,四分之一速率编解码器和八速率编解码器。 基于速率选择来选择性地激活编解码器。 此外,基于类型分类,全速率和半速率编解码器被选择性地激活。 选择性地激活每个编解码器以以强调语音信号的不同方面的不同比特率对语音信号进行编码和解码,以增强合成语音的整体质量。

    Speech compression system and method
    10.
    发明授权
    Speech compression system and method 有权
    语音压缩系统及方法

    公开(公告)号:US07191122B1

    公开(公告)日:2007-03-13

    申请号:US11112394

    申请日:2005-04-22

    IPC分类号: G10L19/12

    摘要: The invention improves the encoding and decoding of speech by focusing the encoding on the perceptually important characteristics of speech. The system analyzes selected features of an input speech signal, and first performing a common frame based speech coding of an input speech signal. The system then performs a speech coding based on either a first speech coding mode or a second speech coding mode. The selection of a mode is based on characteristics of the input speech signal. The first speech coding mode uses a first framing structure and the second speech coding mode uses a second framing structure.

    摘要翻译: 本发明通过将编码聚焦在语音的重要特征上来改进语音的编码和解码。 该系统分析输入语音信号的所选特征,并且首先对输入语音信号进行基于公共帧的语音编码。 然后,该系统基于第一语音编码模式或第二语音编码模式执行语音编码。 模式的选择基于输入语音信号的特性。 第一语音编码模式使用第一成帧结构,第二语音编码模式使用第二帧结构。