-
Publication No.: USRE43209E1
Publication date: 2012-02-21
Application No.: US12695917
Filing date: 2010-01-28
Applicant: Hirohisa Tasaki, Tadashi Yamaura
Inventor: Hirohisa Tasaki, Tadashi Yamaura
IPC classification: G10L19/00
Abstract: A speech coding apparatus comprises a repetition period pre-selecting unit for generating a plurality of candidates for the repetition period of a driving excitation source by multiplying the repetition period of an adaptive excitation source by a plurality of constant numbers, respectively, and for pre-selecting a predetermined number of candidates from all the candidates generated. A driving excitation source coding unit provides both excitation source location information and excitation source polarity information that minimize a coding distortion, for each of the predetermined number of candidates, and provides an evaluation value associated with the minimum coding distortion for each of the predetermined number of candidates. A repetition period coding unit compares the evaluation values provided for the predetermined number of candidates with one another, selects one candidate from the predetermined number of candidates according to the comparison result, and furnishes selection information indicating the selection result, an excitation source location code, and a polarity code.
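The pre-selection and selection steps described in the abstract above can be sketched roughly as follows. This is only an illustration under stated assumptions: the function names, the usability rule for a candidate period, and the convention that a larger evaluation value is better are hypothetical and are not taken from the patent.

```python
def preselect_periods(adaptive_period, multipliers, frame_len, keep):
    """Multiply the adaptive excitation period by each constant and
    keep at most `keep` usable candidates, in the order generated."""
    candidates = []
    for m in multipliers:
        period = round(adaptive_period * m)
        # Hypothetical usability rule: at least one sample, shorter than
        # the frame, and not already in the candidate list.
        if 1 <= period < frame_len and period not in candidates:
            candidates.append(period)
    return candidates[:keep]


def select_period(candidates, evaluate):
    """Compare the evaluation values of all candidates and return
    (selection index, selected period). Assumes larger is better."""
    scores = [evaluate(p) for p in candidates]
    best = max(range(len(candidates)), key=lambda i: scores[i])
    return best, candidates[best]


# Example: adaptive period of 40 samples, constants 1/2, 1 and 2.
periods = preselect_periods(40, [0.5, 1.0, 2.0], frame_len=64, keep=2)
# Stand-in evaluation function; a real coder would derive this value
# from the minimum coding distortion found for each candidate.
index, period = select_period(periods, lambda p: -abs(p - 38))
```

The returned index plays the role of the selection information that would be sent alongside the location and polarity codes.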
-
Publication No.: US07006966B2
Publication date: 2006-02-28
Application No.: US10083556
Filing date: 2002-02-27
Applicant: Tadashi Yamaura, Hirohisa Tasaki
Inventor: Tadashi Yamaura, Hirohisa Tasaki
IPC classification: G10L19/12
CPC classification: G10L19/12, G10L2019/0007
Abstract: The present invention comprises: first periodicity providing means for emphasizing periodicity of a fixed code vector output from at least one fixed excitation code book by use of a first periodicity emphasis coefficient adaptively determined based on a predetermined rule; and second periodicity providing means for emphasizing periodicity of a fixed code vector output from at least one fixed excitation code book by use of a predetermined second periodicity emphasis coefficient.
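One common way to emphasize the periodicity of a fixed code vector is a one-tap recursive comb filter, y[n] = x[n] + b * y[n - T]. The sketch below uses that filter as an assumption; the abstract does not specify the filter form. The clipping rule in `adaptive_coefficient` is likewise a hypothetical stand-in for the "predetermined rule" that sets the first coefficient.

```python
def emphasize_periodicity(code_vector, period, coeff):
    """Apply y[n] = x[n] + coeff * y[n - period] to a fixed code vector."""
    out = list(code_vector)
    for n in range(period, len(out)):
        out[n] += coeff * out[n - period]
    return out


def adaptive_coefficient(adaptive_gain, lo=0.2, hi=0.8):
    """Hypothetical rule: clip the adaptive codebook gain into [lo, hi]."""
    return min(max(adaptive_gain, lo), hi)


FIXED_COEFF = 0.5  # hypothetical predetermined second coefficient

pulse = [1.0, 0.0, 0.0, 0.0, 0.0, 0.0]
# First means: coefficient adapted per frame; second means: constant.
first = emphasize_periodicity(pulse, 2, adaptive_coefficient(1.3))
second = emphasize_periodicity(pulse, 2, FIXED_COEFF)
```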
-
Publication No.: US06496796B1
Publication date: 2002-12-17
Application No.: US09620564
Filing date: 2000-07-20
Applicant: Hirohisa Tasaki, Tadashi Yamaura
Inventor: Hirohisa Tasaki, Tadashi Yamaura
IPC classification: G10L19/04
CPC classification: G10L19/10, G10L2019/0008
Abstract: A drive sound source coding (or decoding) means has a plurality of algebraic sound source coding (or decoding) means whose sound source position tables differ in the bias of the distribution of sound source position candidates within a frame. Each algebraic sound source coding means references spectrum envelope information and codes the sound source of an input voice based on a polarity and a sound source position selected from among the sound source position candidates in its sound source position table. A selection means selects the algebraic sound source coding means with the smallest coding distortion from among the plurality of algebraic sound source coding means and outputs code representing the drive sound source position and polarity output by the selected algebraic sound source coding means.
-
Publication No.: US07454328B2
Publication date: 2008-11-18
Application No.: US10433354
Filing date: 2001-04-26
Applicant: Tadashi Yamaura, Hirohisa Tasaki
Inventor: Tadashi Yamaura, Hirohisa Tasaki
CPC classification: G10L19/12, G10L2019/0005
Abstract: A speech encoding apparatus calculates encoding distortion of a noise-like fixed code vector and multiplies the encoding distortion by a fixed weight corresponding to the noise-like degree of the noise-like fixed code vector, calculates encoding distortion of a non-noise-like fixed code vector and multiplies the encoding distortion by a fixed weight corresponding to the non-noise-like fixed code vector, and selects the fixed excitation code associated with the smaller multiplication result.
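The weighted comparison in the abstract above can be illustrated with a minimal sketch. The function name, the (code, distortion) tuple layout, and the example weights are assumptions made for illustration only.

```python
def select_fixed_code(noise_like, non_noise_like, w_noise, w_non_noise):
    """Each candidate is a (code, distortion) pair. Multiply every
    distortion by the weight of its codebook class and return the
    code whose weighted distortion is smallest."""
    weighted = [(d * w_noise, code) for code, d in noise_like]
    weighted += [(d * w_non_noise, code) for code, d in non_noise_like]
    return min(weighted, key=lambda item: item[0])[1]


# The noise-like vector has the lower raw distortion (1.0 vs 1.1),
# but its larger weight (1.2) makes the non-noise-like code win.
chosen = select_fixed_code([("n0", 1.0)], [("p0", 1.1)], 1.2, 1.0)
```

Weighting the distortion rather than comparing it directly lets the encoder bias the choice toward one class of code vector without changing the search itself.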
-
Publication No.: US07047184B1
Publication date: 2006-05-16
Application No.: US09706813
Filing date: 2000-11-07
Applicant: Hirohisa Tasaki, Tadashi Yamaura
Inventor: Hirohisa Tasaki, Tadashi Yamaura
IPC classification: G10L19/00
CPC classification: G10L19/107
Abstract: A speech coding apparatus comprises a repetition period pre-selecting unit for generating a plurality of candidates for the repetition period of a driving excitation source by multiplying the repetition period of an adaptive excitation source by a plurality of constant numbers, respectively, and for pre-selecting a predetermined number of candidates from all the candidates generated. A driving excitation source coding unit provides both excitation source location information and excitation source polarity information that minimize a coding distortion, for each of the predetermined number of candidates, and provides an evaluation value associated with the minimum coding distortion for each of the predetermined number of candidates. A repetition period coding unit compares the evaluation values provided for the predetermined number of candidates with one another, selects one candidate from the predetermined number of candidates according to the comparison result, and furnishes selection information indicating the selection result, an excitation source location code, and a polarity code.
-
Publication No.: USRE43190E1
Publication date: 2012-02-14
Application No.: US12695942
Filing date: 2010-01-28
Applicant: Hirohisa Tasaki, Tadashi Yamaura
Inventor: Hirohisa Tasaki, Tadashi Yamaura
IPC classification: G10L19/00
Abstract: A speech coding apparatus comprises a repetition period pre-selecting unit for generating a plurality of candidates for the repetition period of a driving excitation source by multiplying the repetition period of an adaptive excitation source by a plurality of constant numbers, respectively, and for pre-selecting a predetermined number of candidates from all the candidates generated. A driving excitation source coding unit provides both excitation source location information and excitation source polarity information that minimize a coding distortion, for each of the predetermined number of candidates, and provides an evaluation value associated with the minimum coding distortion for each of the predetermined number of candidates. A repetition period coding unit compares the evaluation values provided for the predetermined number of candidates with one another, selects one candidate from the predetermined number of candidates according to the comparison result, and furnishes selection information indicating the selection result, an excitation source location code, and a polarity code.
-
Publication No.: US6052661A
Publication date: 2000-04-18
Application No.: US777874
Filing date: 1996-12-31
CPC classification: G10L19/08
Abstract: A speech encoding apparatus is capable of averting the deterioration of synthesis speech quality in encoding the input speech and of generating a high-quality synthesis output speech through small quantities of computation. The apparatus includes a target speech generation part for generating from the input speech a target speech vector of a vector length corresponding to a delay parameter; an adaptive codebook for generating from previously generated excitation signals an adaptive vector of the vector length corresponding to the delay parameter; an adaptive code search part for evaluating the distortion of a synthesis vector obtained from the adaptive vector with respect to the target speech vector so as to search for the adaptive vector conducive to the least distortion; and a frame code generation part for generating an excitation signal of a frame length from the adaptive vector conducive to the least distortion.
-
Publication No.: US08724828B2
Publication date: 2014-05-13
Application No.: US13878621
Filing date: 2011-01-19
Applicant: Satoru Furuta, Takashi Sudo, Hirohisa Tasaki
Inventor: Satoru Furuta, Takashi Sudo, Hirohisa Tasaki
IPC classification: H04B15/00
CPC classification: H04R3/002, G10L21/0232
Abstract: A correction spectrum calculation unit 6 obtains a correction spectrum by smoothing an estimated noise spectrum in accordance with the degree of its variations, and a suppression quantity limiting coefficient calculation unit 7 decides a suppression quantity limiting coefficient from the correction spectrum. A suppression quantity calculation unit 9 obtains a suppression coefficient based on the suppression quantity limiting coefficient, and a spectrum suppression unit 10 carries out amplitude suppression of spectral components of an input signal.
-
Publication No.: US20130003987A1
Publication date: 2013-01-03
Application No.: US13581544
Filing date: 2010-03-09
Applicant: Satoru Furuta, Hirohisa Tasaki
Inventor: Satoru Furuta, Hirohisa Tasaki
IPC classification: H04B15/00
CPC classification: G10L21/0208, G10L21/0232, G10L25/78, G10L2021/02163
Abstract: A band separating unit 5 carries out a band division of a plurality of power spectra into which an input signal is converted by a time-to-frequency converting unit 2 to combine power spectra into each subband, and a band representative component generating unit 6 defines a power spectrum having a maximum among the plurality of power spectra within each subband as a representative power spectrum. A noise suppression amount generating unit 7 calculates an amount of noise suppression for each subband by using the representative power spectrum and a noise spectrum, and a noise suppressing unit 9 suppresses the amplitudes of the power spectra according to the amount of noise suppression.
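The subband processing in the abstract above can be sketched as follows. Taking the maximum power in each subband as its representative follows the abstract. The gain rule (a spectral-subtraction-style gain with a floor) and all function names are assumptions, since the abstract does not state how the suppression amount is computed.

```python
def subband_representatives(power, band_edges):
    """Representative power per subband: the maximum bin in the band."""
    bands = zip(band_edges[:-1], band_edges[1:])
    return [max(power[lo:hi]) for lo, hi in bands]


def suppression_gains(rep_power, noise_power, floor=0.1):
    """Hypothetical rule: gain = max(1 - noise / representative, floor)."""
    gains = []
    for p, n in zip(rep_power, noise_power):
        gains.append(max(1.0 - n / p, floor) if p > 0 else floor)
    return gains


def suppress(power, band_edges, gains):
    """Scale every bin by the gain of the subband it belongs to."""
    out = list(power)
    bands = zip(band_edges[:-1], band_edges[1:])
    for (lo, hi), g in zip(bands, gains):
        for k in range(lo, hi):
            out[k] = power[k] * g
    return out


power = [1.0, 4.0, 2.0, 8.0]   # power spectrum of one frame
edges = [0, 2, 4]              # two subbands: bins 0-1 and bins 2-3
reps = subband_representatives(power, edges)
gains = suppression_gains(reps, noise_power=[2.0, 2.0])
cleaned = suppress(power, edges, gains)
```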
-
Publication No.: US06643618B2
Publication date: 2003-11-04
Application No.: US09842095
Filing date: 2001-04-26
Applicant: Bunkei Matsuoka, Hirohisa Tasaki
Inventor: Bunkei Matsuoka, Hirohisa Tasaki
IPC classification: G10L21/02
CPC classification: G10L19/012, G10L2019/0012
Abstract: A speech decoding unit estimates coding parameters of a speech pause by applying a smoothing algorithm to the coding parameters, using a coding parameter xref, which constitutes far-end talker background noise information extracted by a parameter extracting circuit 12, and a coding parameter xn used for synthesizing the previous background noise.
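The abstract above does not state which smoothing algorithm is used. As one plausible illustration, the sketch below assumes first-order exponential smoothing that moves the previous parameter xn toward the reference parameter xref; the function name and the factor `alpha` are hypothetical.

```python
def smooth_parameters(x_prev, x_ref, alpha=0.1):
    """Assumed update rule: x_next = (1 - alpha) * x_prev + alpha * x_ref,
    applied element-wise to a vector of coding parameters."""
    return [(1.0 - alpha) * p + alpha * r for p, r in zip(x_prev, x_ref)]


# Previous background-noise parameters drift halfway toward the
# far-end reference when alpha is 0.5.
x_next = smooth_parameters([0.0, 2.0], [1.0, 2.0], alpha=0.5)
```

A small `alpha` makes the synthesized background noise change slowly between pauses, while a value near 1 tracks the reference almost immediately.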