Speech spurt detecting apparatus and method with threshold adapted by
noise and speech statistics
    1.
    发明授权
    Speech spurt detecting apparatus and method with threshold adapted by noise and speech statistics 失效
    语音突发检测装置和方法,具有噪声和语音统计的阈值

    公开(公告)号:US6044342A

    公开(公告)日:2000-03-28

    申请号:US978481

    申请日:1997-11-25

    CPC分类号: G10L25/78

    摘要: A speech spurt detecting apparatus for detecting speech spurts in a voice signal has a storage for storing an input voice signal. A decision portion determines speech spurt sections and mute sections using a threshold value and sets one of the mute sections at a latter part of a hangover time. A mute level statistical processor estimates the noise distribution of a signal in the mute sections. A speech spurt detecting threshold value decision portion receives the average and the variance of the noise distribution from the mute level statistical processor and approximates the noise distribution to a gamma distribution to decide a speech spurt detecting threshold. A speech spurt transmitting portion outputs the voice signal in the speech spurt sections from the storage. A speech spurt level statistical processor carries out statistical processing of the speech spurt sections. The speech spurt detecting threshold value decision portion detects an error of the speech spurt detecting threshold value using the speech spurt level statistical processor and the mute level statistical processor and resets the speech spurt detecting threshold value to its initial value if the error exceeds a predetermined value. The speech spurt detecting threshold value decision portion increases the speech spurt detecting threshold value at a fixed rate in each of the speech spurt sections, and computes (the average).sup.2 /(the variance) to obtain an adjusting coefficient and computes (the adjusting coefficient).times.(the average) to obtain the speech spurt detecting threshold value.

    摘要翻译: 用于检测语音信号中的语音喷发的语音突发检测装置具有用于存储输入语音信号的存储器。 决定部分使用阈值确定语音突发部分和静音部分,并且在宿醉时间的后半部分设置静音部分中的一个。 静音级统计处理器估计静音部分中的信号的噪声分布。 语音突发检测阈值判定部分从静音级统计处理器接收噪声分布的平均值和方差,并将噪声分布近似为伽马分布,以决定语音突发检测阈值。 话音突发发送部分从存储器输出语音突发部分中的语音信号。 语音突发等级统计处理器执行语音突发部分的统计处理。 语音突发检测阈值判定部使用语音突发级统计处理器和静音级统计处理器来检测语音突发检测阈值的误差,并且如果误差超过预定值则将语音突发检测阈值重置为其初始值 。 语音突发检测阈值判定部分在每个语音突发部分中以固定速率增加语音突起检测阈值,并且计算(平均)2 /(方差)以获得调整系数并计算(调整系数 )x(平均),以获得语音突发检测阈值。

    Method and apparatus for extracting speech spurts from voice and
reproducing voice from extracted speech spurts
    2.
    发明授权
    Method and apparatus for extracting speech spurts from voice and reproducing voice from extracted speech spurts 失效
    从提取的语音喷嘴中提取话音和再现语音的方法和装置

    公开(公告)号:US6078882A

    公开(公告)日:2000-06-20

    申请号:US93926

    申请日:1998-06-09

    CPC分类号: G10L19/012

    摘要: Identification information of a speech spurt, hangover and pause is used to indicate that a digital voice signal is the speech spurt, hangover or pause. While the identification information of a speech spurt, hangover and pause is indicative of the speech spurt, a voice level adjuster does not attenuate the digital voice signal, and the voice signal/third signal combiner mixes it with a third signal which undergoes the maximum attenuation through a third signal level adjuster. While the identification information of a speech spurt, hangover and pause is indicative of the hangover, the voice level adjuster gradually attenuates the digital voice signal. This is because the level of the voice signal is expected to be high in the first half of the hangover period, but to decay in its latter half to such a level that it is dispensable for speech recognition. A third signal (noise), on the other hand, is gradually increased in the latter half of the hangover period to preserve the continuity in the transition from the speech spurt to a pause, thus achieving smooth transition to the pause. This makes it possible to reduce as much as possible the unnaturalness involved in switching between speech spurts and pauses, thereby improving the quality of the reproduced voice.

    摘要翻译: 语音突发,宿醉和暂停的识别信息用于指示数字语音信号是语音突发,宿醉或暂停。 虽然语音突发,宿醉和暂停的识别信息指示语音突发,语音电平调节器不会衰减数字语音信号,并且语音信号/第三信号组合器与经历最大衰减的第三信号混合 通过第三个信号电平调节器。 虽然语音突发,宿醉和暂停的识别信息表示宿醉,语音电平调节器逐渐衰减数字语音信号。 这是因为语音信号的水平在宿醉期的上半期预计会很高,但在下半年衰减到这样的水平,这对于语音识别是不必要的。 另一方面,第三个信号(噪声)在宿醉期的后半段逐渐增加,以保持从语音突发转变为暂停的连续性,从而实现平稳过渡到暂停。 这使得可以尽可能地减少在语音喷射和暂停之间切换所涉及的不自然,从而提高再现语音的质量。