Speech coding and decoding apparatus
    31.
    再颁专利
    Speech coding and decoding apparatus 失效
    语音编解码装置

    公开(公告)号:USRE36721E

    公开(公告)日:2000-05-30

    申请号:US561751

    申请日:1995-11-22

    IPC分类号: G10L9/00

    摘要: A speech signal is input to an excitation signal generating section, a prediction filter and a prediction parameter calculator. The prediction parameter calculator calculates a predetermined number of prediction parameters (LPC parameter or reflection coefficient) by an autocorrelation method or covariance method, and supplies the acquired prediction parameters to a prediction parameter coder. The codes of the prediction parameters are sent to a decoder and a multiplexer. The decoder sends decoded values of the codes of the prediction parameters to the prediction filter and the excitation signal generating section. The prediction filter calculates a prediction residual signal, which is the difference between the input speech signal and the decoded prediction parameter, and sends it to the excitation signal generating section. The excitation signal generating section calculates the pulse interval and amplitude for each of a predetermined number of subframes based on the input speech signal, the prediction residual signal and the quantized value of the prediction parameter, and sends them to the multiplexer. The multiplexer combines these codes and the codes of the prediction parameters, and send the results as an output signal of a coding apparatus to a transmission path or the like.

    摘要翻译: 语音信号被输入到激励信号产生部分,预测滤波器和预测参数计算器。 预测参数计算器通过自相关方法或协方差方法计算预定数量的预测参数(LPC参数或反射系数),并将所获取的预测参数提供给预测参数编码器。 预测参数的代码被发送到解码器和多路复用器。 解码器将预测参数的代码的解码值发送到预测滤波器和激励信号生成部。 预测滤波器计算作为输入语音信号和解码预测参数之间的差的预测残差信号,并将其发送到激励信号生成部。 激励信号生成部基于输入的语音信号,预测残差信号和预测参数的量化值,计算预定数量的子帧中的每一个的脉冲间隔和幅度,并将其发送到多路复用器。 多路复用器组合这些代码和预测参数的代码,并将结果作为编码装置的输出信号发送到传输路径等。

    Method and apparatus for adjusting a spectrum shape of a speech signal
    32.
    发明授权
    Method and apparatus for adjusting a spectrum shape of a speech signal 失效
    用于调整语音信号的频谱形状的方法和装置

    公开(公告)号:US5864798A

    公开(公告)日:1999-01-26

    申请号:US714260

    申请日:1996-09-17

    IPC分类号: G10L19/14 G10L9/00

    CPC分类号: G10L19/26 G10L25/12

    摘要: Adjusting the shape of a spectrum of a speech signal includes the steps of using a first filter with pole-zero transfer function A(z)/B(z) for subjecting a speech signal to a spectrum envelop emphasis and a second filter cascade-connected with the first filter, for compensating for a spectral tilt due to the first filter, independently deriving two filter coefficients used in the second filter for compensating for the spectral tilt from the pole-zero transfer function, and compensating for the spectral tilt corresponding to the pole-zero transfer function according to the derived filter coefficients.

    摘要翻译: 调整语音信号的频谱的形状包括以下步骤:使用具有极零传递函数A(z)/ B(z)的第一滤波器来对语音信号进行频谱包络加强,并且第二滤波器级联连接 利用第一滤波器,用于补偿由于第一滤波器引起的频谱倾斜,独立地导出在第二滤波器中使用的两个滤波器系数,用于补偿来自极 - 零传递函数的频谱倾斜,以及补偿对应于 根据派生滤波系数的极零传递函数。

    Vector quantizing apparatus
    33.
    发明授权
    Vector quantizing apparatus 失效
    矢量量化装置

    公开(公告)号:US5677986A

    公开(公告)日:1997-10-14

    申请号:US451174

    申请日:1995-05-26

    CPC分类号: H03M7/3082 G06T9/008

    摘要: A vector quantizing apparatus includes a first search section for obtaining an approximate vector X1 which is approximated to a desired vector R, a residual vector calculator for calculating a residual vector Rv from the desired vector R and the approximate vector X1, a weighting section for obtaining weighted vectors X2 to XN of code vectors x2 to xN, and a second search section for calculating an estimation value which is the magnitude of a projection vector of the residual vector Rv with respect to the vector space formed by the approximate vector X1 and the weighted vectors X2 to XN, and searching a code vector which maximizes this estimation value.

    摘要翻译: 矢量量化装置包括用于获得近似于期望矢量R的近似矢量X1的第一搜索部分,用于从期望矢量R和近似矢量X1计算残差矢量Rv的残差矢量计算器,用于获得的加权部分 以及第二搜索部分,用于计算相对于由近似矢量X1形成的向量空间的残差矢量Rv的投影矢量的大小的估计值,以及加权矢量X 2的加权矢量 向量X2到XN,并且搜索最大化该估计值的码矢量。

    Speech coding and decoding apparatus
    34.
    发明授权
    Speech coding and decoding apparatus 失效
    语音编解码装置

    公开(公告)号:US5265167A

    公开(公告)日:1993-11-23

    申请号:US13551

    申请日:1992-11-19

    IPC分类号: G10L19/04 G10L19/10 G10L9/00

    CPC分类号: G10L19/113

    摘要: A speech signal is input to an excitation signal generating section, a prediction filter and a prediction parameter calculator. The prediction parameter calculator calculates a predetermined number of prediction parameters (LPC parameter or reflection coefficient) by an autocorrelation method or covariance method, and supplies the acquired prediction parameters to a prediction parameter coder. The codes of the prediction parameters are sent to a decoder and a multiplexer. The decoder sends decoded values of the codes of the prediction parameters to the prediction filter and the excitation signal generating section. The prediction filter calculates a prediction residual signal, which is the difference between the input speech signal and the decoded prediction parameter, and sends it to the excitation signal generating section. The excitation signal generating section calculates the pulse interval and amplitude for each of a predetermined number of subframes based on the input speech signal, the prediction residual signal and the quantized value of the prediction parameter, and sends them to the multiplexer. The multiplexer combines these codes and the codes of the prediction parameters, and send the results as an output signal of a coding apparatus to a transmission path or the like.

    摘要翻译: 语音信号被输入到激励信号产生部分,预测滤波器和预测参数计算器。 预测参数计算器通过自相关方法或协方差方法计算预定数量的预测参数(LPC参数或反射系数),并将所获取的预测参数提供给预测参数编码器。 预测参数的代码被发送到解码器和多路复用器。 解码器将预测参数的代码的解码值发送到预测滤波器和激励信号生成部。 预测滤波器计算作为输入语音信号和解码预测参数之间的差的预测残差信号,并将其发送到激励信号生成部。 激励信号生成部基于输入的语音信号,预测残差信号和预测参数的量化值,计算预定数量的子帧中的每一个的脉冲间隔和幅度,并将其发送到多路复用器。 多路复用器组合这些代码和预测参数的代码,并将结果作为编码装置的输出信号发送到传输路径等。

    SPEECH SYNTHESIZER, SPEECH SYNTHESIZING METHOD AND PROGRAM PRODUCT
    36.
    发明申请
    SPEECH SYNTHESIZER, SPEECH SYNTHESIZING METHOD AND PROGRAM PRODUCT 失效
    语音合成器,语音合成方法和程序产品

    公开(公告)号:US20120089402A1

    公开(公告)日:2012-04-12

    申请号:US13271321

    申请日:2011-10-12

    IPC分类号: G10L13/08

    CPC分类号: G10L13/10

    摘要: According to one embodiment, a speech synthesizer includes an analyzer, a first estimator, a selector, a generator, a second estimator, and a synthesizer. The analyzer analyzes text and extracts a linguistic feature. The first estimator selects a first prosody model adapted to the linguistic feature and estimates prosody information that maximizes a first likelihood representing probability of the selected first prosody model. The selector selects speech units that minimize a cost function determined in accordance with the prosody information. The generator generates a second prosody model that is a model of the prosody information of the speech units. The second estimator estimates prosody information that maximizes a third likelihood calculated on the basis of the first likelihood and a second likelihood representing probability of the second prosody model. The synthesizer generates synthetic speech by concatenating the speech units on the basis of the prosody information estimated by the second estimator.

    摘要翻译: 根据一个实施例,语音合成器包括分析器,第一估计器,选择器,发生器,第二估计器和合成器。 分析仪分析文本并提取语言特征。 第一估计器选择适合于语言特征的第一韵律模型,并且估计使表示所选择的第一韵律模型的概率的第一似然最大化的韵律信息。 选择器选择使根据韵律信息确定的成本函数最小化的语音单元。 发生器产生作为语音单元的韵律信息的模型的第二韵律模型。 第二估计器估计使基于第一可能性计算的第三似然最大化的韵律信息和表示第二韵律模型的概率的第二似然。 合成器基于由第二估计器估计的韵律信息来连接语音单元来产生合成语音。

    Prosody-pattern generating apparatus, speech synthesizing apparatus, and computer program product and method thereof
    37.
    发明申请
    Prosody-pattern generating apparatus, speech synthesizing apparatus, and computer program product and method thereof 有权
    韵律图案生成装置,语音合成装置及其计算机程序产品及其方法

    公开(公告)号:US20080243508A1

    公开(公告)日:2008-10-02

    申请号:US12068600

    申请日:2008-02-08

    IPC分类号: G10L13/08

    CPC分类号: G10L13/10

    摘要: Normalization parameters are generated at a normalization-parameter generating unit by calculating the mean values and the standard deviations of an initial prosody pattern and a prosody pattern of a training sentence of a speech corpus. Then, the variance range or variance width of the initial prosody pattern is normalized at the prosody-pattern normalizing unit in accordance with the normalization parameters. As a result, a prosody pattern similar to speech of human beings and improved in naturalness can be generated with a small amount of calculation.

    摘要翻译: 归一化参数通过计算语料库的训练句的初始韵律模式和韵律模式的平均值和标准偏差在标准化参数生成单元处产生。 然后,根据归一化参数,在韵律模式归一化单元处对初始韵律模式的方差范围或方差宽度进行归一化。 结果,可以通过少量的计算产生与人的言语和自然性相似的韵律模式。

    Feature-vector compensating apparatus, feature-vector compensating method, and computer product
    38.
    发明申请
    Feature-vector compensating apparatus, feature-vector compensating method, and computer product 审中-公开
    特征向量补偿装置,特征向量补偿方法和计算机产品

    公开(公告)号:US20070276662A1

    公开(公告)日:2007-11-29

    申请号:US11713801

    申请日:2007-03-05

    IPC分类号: G10L15/00

    摘要: A feature extracting unit extracts a feature vector of an input speech. A similarity calculating unit calculates degrees of similarity for each of a plurality of noise environments, based on the feature vector. A compensation-vector calculating unit acquires a first compensation vector from a storing unit, calculates a second compensation vector based on the first compensation vector, and calculates a third compensation vector by weighting and summing the second compensation vector with the degree of similarity as weights. A compensating unit compensates the feature vector based on the third compensation vector.

    摘要翻译: 特征提取单元提取输入语音的特征向量。 相似度计算单元基于特征向量来计算多个噪声环境中的每一个的相似度。 补偿矢量计算单元从存储单元获取第一补偿向量,基于第一补偿向量计算第二补偿向量,并通过对具有相似度的第二补偿向量进行加权和求和来计算第三补偿向量。 补偿单元基于第三补偿向量补偿特征向量。

    Feature-vector compensating apparatus, feature-vector compensating method, and computer program product
    39.
    发明申请
    Feature-vector compensating apparatus, feature-vector compensating method, and computer program product 有权
    特征向量补偿装置,特征向量补偿方法和计算机程序产品

    公开(公告)号:US20070260455A1

    公开(公告)日:2007-11-08

    申请号:US11723410

    申请日:2007-03-19

    IPC分类号: G10L15/20

    CPC分类号: G10L15/20

    摘要: A noise-environment storing unit stores therein a compensation vector for compensating a feature vector of a speech. A feature-vector extracting unit extracts the feature vector of the speech in each of a plurality of frames. A noise-environment-series estimating unit estimates a noise-environment series based on a feature-vector series and a degree of similarity. A calculating unit obtains a compensation vector corresponding to each noise environment in estimated noise-environment series based on the compensation vector present in the noise-environment storing unit. A compensating unit compensates the extracted feature vector of the speech based on obtained compensation vector.

    摘要翻译: 噪声环境存储单元在其中存储用于补偿语音的特征向量的补偿矢量。 特征矢量提取单元在多个帧中的每一帧中提取语音的特征向量。 噪声环境系列估计单元基于特征向量序列和相似度来估计噪声环境系列。 计算单元基于噪声环境存储单元中存在的补偿矢量,获得与估计的噪声环境系列中的每个噪声环境相对应的补偿矢量。 补偿单元基于获得的补偿向量来补偿提取的语音特征向量。

    Speech encoding method, apparatus and program
    40.
    发明授权
    Speech encoding method, apparatus and program 失效
    语音编码方法,装置和程序

    公开(公告)号:US07191120B2

    公开(公告)日:2007-03-13

    申请号:US10675947

    申请日:2003-10-02

    IPC分类号: G10L19/00 G10L11/04

    CPC分类号: G10L25/93 G10L19/09 G10L25/78

    摘要: A speech encoding method, apparatus and program wherein an input speech signal is divided into a plurality of frames each having a predetermined length, each of the frames is subdivided into a plurality of subframes, a predictive pitch period of a subframe in a to-be-encoded current frame is obtained by using pitch periods of at least two frames of the current frame and past and future frames with respect to the current frame; a pitch period of a subframe in the current frame is obtained by using the predictive pitch period, a relative pitch pattern codebook storing a plurality of relative pitch patterns representing fluctuations in pitch periods of a plurality of subframes is prepared, and a change in pitch period of plural subframes is expressed with one relative pitch pattern selected from the relative pitch pattern codebook.

    摘要翻译: 一种语音编码方法,装置和程序,其中输入语音信号被划分为具有预定长度的多个帧,每个帧被细分为多个子帧,子帧的预测音调周期 通过使用当前帧的至少两帧的音调周期和相对于当前帧的过去和未来帧来获得编码的当前帧; 通过使用预测音调周期来获得当前帧中的子帧的音调周期,准备存储表示多个子帧的音调周期的波动的多个相对音调模式的相对音调模式码本,并且音调周期的变化 由相对的音调图案码本中选择的一个相对音调图形来表示多个子帧。