COMPLEX ACOUSTIC RESONANCE SPEECH ANALYSIS SYSTEM
    1.
    发明申请
    COMPLEX ACOUSTIC RESONANCE SPEECH ANALYSIS SYSTEM 有权
    复合声学谐音语音分析系统

    公开(公告)号:US20110131039A1

    公开(公告)日:2011-06-02

    申请号:US12629006

    申请日:2009-12-01

    IPC分类号: G10L21/00

    CPC分类号: G10L25/15

    摘要: A method and apparatus are provided for determining an instantaneous frequency and an instantaneous bandwidth of a speech resonance of a speech signal. The method includes receiving a speech signal having a real component; filtering the speech signal so as to generate a plurality of filtered signals such that the real component and an imaginary component of the speech signal are reconstructed; and generating a first estimated frequency and a first estimated bandwidth of a speech resonance of the speech signal based on both a first filtered signal of the plurality of filtered signals and a single-lag delay of the first filtered signal.

    摘要翻译: 提供了一种用于确定语音信号的语音共振的瞬时频率和瞬时带宽的方法和装置。 该方法包括接收具有真实分量的语音信号; 对所述语音信号进行滤波,以产生多个经滤波的信号,使得重构所述语音信号的实数分量和虚分量; 以及基于所述多个滤波信号的第一滤波信号和所述第一滤波信号的单滞后延迟来产生所述语音信号的语音共振的第一估计频率和第一估计带宽。

    METHOD OF AND SYSTEM FOR PROVIDING ADAPTIVE RESPONDENT TRAINING IN A SPEECH RECOGNITION APPLICATION
    2.
    发明申请
    METHOD OF AND SYSTEM FOR PROVIDING ADAPTIVE RESPONDENT TRAINING IN A SPEECH RECOGNITION APPLICATION 有权
    在语音识别应用中提供自适应对策训练的方法和系统

    公开(公告)号:US20110231190A1

    公开(公告)日:2011-09-22

    申请号:US13052412

    申请日:2011-03-21

    IPC分类号: G10L15/06

    摘要: A system for conducting a telephonic speech recognition application includes an automated telephone device for making telephonic contact with a respondent and a speech recognition device which, upon the telephonic contact being made, presents the respondent with at least one introductory prompt for the respondent to reply to; receives a spoken response from the respondent; and performs a speech recognition analysis on the spoken response to determine a capability of the respondent to complete the application. If the speech recognition device, based on the spoken response to the introductory prompt, determines that the respondent is capable of competing the application, the speech recognition device presents at least one application prompt to the respondent. If the speech recognition device, based on the spoken response to the introductory prompt, determines that the respondent is not capable of completing the application, the speech recognition system presents instructions on completing the application to the respondent.

    摘要翻译: 用于进行电话语音识别应用的系统包括用于与受访者进行电话联系的自动电话设备和语音识别设备,所述语音识别设备在进行电话联系时,向受访者呈现至少一个介绍性提示,以便答复者回复 ; 收到答辩人的口头答复; 并对口语响应执行语音识别分析,以确定答辩人完成该应用的能力。 如果语音识别装置基于对介绍性提示的口头响应,确定答辩人能够竞争应用,则语音识别装置向被访者呈现至少一个应用提示。 如果语音识别装置基于对介绍性提示的口头响应,确定回答者不能完成应用,则语音识别系统向被访者呈现完成该应用的指令。

    Method of and system for providing adaptive respondent training in a speech recognition application
    3.
    发明授权
    Method of and system for providing adaptive respondent training in a speech recognition application 有权
    在语音识别应用中提供自适应受访者训练的方法和系统

    公开(公告)号:US09578169B2

    公开(公告)日:2017-02-21

    申请号:US13052412

    申请日:2011-03-21

    摘要: A system for conducting a telephonic speech recognition application includes an automated telephone device for making telephonic contact with a respondent and a speech recognition device which, upon the telephonic contact being made, presents the respondent with at least one introductory prompt for the respondent to reply to; receives a spoken response from the respondent; and performs a speech recognition analysis on the spoken response to determine a capability of the respondent to complete the application. If the speech recognition device, based on the spoken response to the introductory prompt, determines that the respondent is capable of competing the application, the speech recognition device presents at least one application prompt to the respondent. If the speech recognition device, based on the spoken response to the introductory prompt, determines that the respondent is not capable of completing the application, the speech recognition system presents instructions on completing the application to the respondent.

    摘要翻译: 用于进行电话语音识别应用的系统包括用于与受访者进行电话联系的自动电话设备和语音识别设备,所述语音识别设备在进行电话联系时,向受访者呈现至少一个介绍性提示,以便答复者回复 ; 收到答辩人的口头答复; 并对口语响应执行语音识别分析,以确定答辩人完成该应用的能力。 如果语音识别装置基于对介绍性提示的口头响应,确定答辩人能够竞争应用,则语音识别装置向被访者呈现至少一个应用提示。 如果语音识别装置基于对介绍性提示的口头响应,确定回答者不能完成应用,则语音识别系统向被访者呈现完成该应用的指令。

    Fast and accurate extraction of formants for speech recognition using a plurality of complex filters in parallel
    4.
    发明授权
    Fast and accurate extraction of formants for speech recognition using a plurality of complex filters in parallel 有权
    使用并行的多个复杂滤波器快速准确地提取用于语音识别的共振峰

    公开(公告)号:US08311812B2

    公开(公告)日:2012-11-13

    申请号:US12629006

    申请日:2009-12-01

    IPC分类号: G10L15/02 G10L19/02

    CPC分类号: G10L25/15

    摘要: A method and apparatus are provided for determining an instantaneous frequency and an instantaneous bandwidth of a speech resonance of a speech signal. The method includes receiving a speech signal having a real component; filtering the speech signal so as to generate a plurality of filtered signals such that the real component and an imaginary component of the speech signal are reconstructed; and generating a first estimated frequency and a first estimated bandwidth of a speech resonance of the speech signal based on both a first filtered signal of the plurality of filtered signals and a single-lag delay of the first filtered signal.

    摘要翻译: 提供了一种用于确定语音信号的语音共振的瞬时频率和瞬时带宽的方法和装置。 该方法包括接收具有真实分量的语音信号; 对所述语音信号进行滤波,以产生多个经滤波的信号,使得重构所述语音信号的实数分量和虚分量; 以及基于所述多个滤波信号的第一滤波信号和所述第一滤波信号的单滞后延迟来产生所述语音信号的语音共振的第一估计频率和第一估计带宽。

    DIGITAL PROCESSOR BASED COMPLEX ACOUSTIC RESONANCE DIGITAL SPEECH ANALYSIS SYSTEM
    5.
    发明申请
    DIGITAL PROCESSOR BASED COMPLEX ACOUSTIC RESONANCE DIGITAL SPEECH ANALYSIS SYSTEM 有权
    基于数字处理器的复合声学谐振数字语音分析系统

    公开(公告)号:US20140122067A1

    公开(公告)日:2014-05-01

    申请号:US13665486

    申请日:2012-10-31

    IPC分类号: G10L25/15

    CPC分类号: G10L25/15

    摘要: A speech analysis system uses one or more digital processors to reconstruct a speech signal by accurately extracting speech formants from a digitized version of the speech signal. The system extracts the formants by determining an estimated instantaneous frequency and an estimated instantaneous bandwidth of speech resonances of the digital version of the speech signal in real time. The system digitally filters the digital speech signal using a plurality of complex digital filters in parallel having overlapping bandwidths to ensure that substantially all of the bandwidth of the speech signal is covered. This virtual chain of overlapping complex digital filters produces a corresponding plurality of complex filtered signals. A first estimated frequency and a first estimated bandwidth is generated for each of the filtered signals, and speech resonances of the input speech signal are identified therefrom.

    摘要翻译: 语音分析系统使用一个或多个数字处理器通过从语音信号的数字化版本精确地提取语音共振峰来重建语音信号。 该系统通过实时估计语音信号的数字版本的语音共振的估计瞬时频率和估计的瞬时带宽来提取共振峰。 该系统使用具有重叠带宽并行的多个复数数字滤波器对数字语音信号进行数字滤波,以确保覆盖语音信号的基本上所有的带宽。 这个重叠的复数数字滤波器虚拟链产生相应的多个复数滤波信号。 为每个滤波信号产生第一估计频率和第一估计带宽,并且从其识别输入语音信号的语音谐振。

    Speech-recognition circuitry employing nonlinear processing, speech
element modeling and phoneme estimation
    6.
    发明授权
    Speech-recognition circuitry employing nonlinear processing, speech element modeling and phoneme estimation 失效
    使用非线性处理的语音识别电路,语音元素建模和语音估计

    公开(公告)号:US5168524A

    公开(公告)日:1992-12-01

    申请号:US395449

    申请日:1989-08-17

    IPC分类号: G10L11/00 G10L15/02 G10L15/10

    CPC分类号: G10L15/02

    摘要: A phoneme estimator in a speech-recognition system includes energy detect circuitry for detecting the segments of a speech signal that should be analyzed for phoneme content. Speech-element processors then process the speech signal segments, calculating nonlinear (powers and products) representations of the segments. The nonlinear representation data is applied to speech-element modeling circuitry which reduces the data through speech element specific modeling. The reduced data are then subjected to further nonlinear processing. The results of the further nonlinear processing are again applied to speech-element modeling circuitry, producing phoneme isotype estimates. The phoneme isotype estimates are rearranged and consolidated, that is, the estimates are uniformly labeled and duplicate estimates are consolidated, forming estimates of words or phrases containing minimal numbers of phonemes. The estimates may then be compared with stored words or phrases to determine what was spoken.

    Speech-recognition circuitry employing phoneme estimation
    7.
    发明授权
    Speech-recognition circuitry employing phoneme estimation 失效
    使用音素估计的语音识别电路

    公开(公告)号:US5027408A

    公开(公告)日:1991-06-25

    申请号:US36380

    申请日:1987-04-09

    CPC分类号: G10L15/10

    摘要: A phoneme estimator (12) in a speech-recognition system (10) includes trigger circuitry (18, 22) for identifying the segments of speech that should be analyzed for phoneme content. Speech-element processors (24, 26, and 28) calculate the likelihoods that currently received speech contains individual phonemes, but they operate only when the trigger circuitry identifies such segments. The computation-intensive processing for determining phoneme likelihoods is thus performed on only a small subset of the received speech segments. The accuracy of the speech-element processors (24, 26, and 28) is enhanced because these processors operate by recognition of patterns not only in elements of the data-reduced representations of the received speech but also in higher-ordered products of those elements; that is, these circuits employ non-linear modeling for phoneme identification.

    Method of and system for providing adaptive respondent training in a speech recognition application based upon the inherent response of the respondent
    8.
    发明授权
    Method of and system for providing adaptive respondent training in a speech recognition application based upon the inherent response of the respondent 有权
    基于被访者的固有响应,在语音识别应用中提供自适应受理训练的方法和系统

    公开(公告)号:US07933775B2

    公开(公告)日:2011-04-26

    申请号:US11273528

    申请日:2005-11-14

    IPC分类号: G10L21/00 G10L15/00

    摘要: A system for conducting a telephonic speech recognition application includes an automated telephone device for making telephonic contact with a respondent and a speech recognition device which, upon the telephonic contact being made, presents the respondent with at least one introductory prompt for the respondent to reply to; receives a spoken response from the respondent; and performs a speech recognition analysis on the spoken response to determine a capability of the respondent to complete the application. If the speech recognition device, based on the spoken response to the introductory prompt, determines that the respondent is capable of competing the application, the speech recognition device presents at least one application prompt to the respondent. If the speech recognition device, based on the spoken response to the introductory prompt, determines that the respondent is not capable of completing the application, the speech recognition system presents instructions on completing the application to the respondent.

    摘要翻译: 用于进行电话语音识别应用的系统包括用于与受访者进行电话联系的自动电话设备和语音识别设备,所述语音识别设备在进行电话联系时,向受访者呈现至少一个介绍性提示,以便答复者回复 ; 收到答辩人的口头答复; 并对口语响应执行语音识别分析,以确定答辩人完成该应用的能力。 如果语音识别装置基于对介绍性提示的口头响应,确定答辩人能够竞争应用,则语音识别装置向被访者呈现至少一个应用提示。 如果语音识别装置基于对介绍性提示的口头响应,确定回答者不能完成应用,则语音识别系统向被访者呈现完成该应用的指令。

    Speech recognition circuitry employing nonlinear processing speech
element modeling and phoneme estimation
    9.
    发明授权
    Speech recognition circuitry employing nonlinear processing speech element modeling and phoneme estimation 失效
    采用非线性处理语音元素建模和音素估计的语音识别电路

    公开(公告)号:US5369726A

    公开(公告)日:1994-11-29

    申请号:US15299

    申请日:1993-02-09

    CPC分类号: G10L15/02

    摘要: A phoneme estimator in a speech-recognition system includes energy detect circuitry for detecting the segments of a speech signal that should be analyzed for phoneme content. Speech-element processors then process the speech signal segments, calculating nonlinear representations of the segments. The nonlinear representation data is applied to speech-element modeling circuitry which reduces the data through speech element specific modeling. The reduced data are then subjected to further nonlinear processing. The results of the further nonlinear processing are again applied to speech-element modeling circuitry, producing phoneme isotype estimates. The phoneme isotype estimates are rearranged and consolidated, that is, the estimates are uniformly labeled and duplicate estimates are consolidated, forming estimates of words or phrases containing minimal numbers of phonemes. The estimates may then be compared with stored words or phrases to determine what was spoken.

    摘要翻译: 语音识别系统中的音素估计器包括能量检测电路,用于检测应分析音素内容的语音信号的片段。 然后,语音元件处理器处理语音信号段,计算段的非线性表示。 非线性表示数据被应用于通过语音元素特定建模减少数据的语音元件建模电路。 然后将减小的数据进行进一步的非线性处理。 进一步非线性处理的结果再次应用于语音元素建模电路,产生音素同种型估计。 音素同种型估计被重新排列和合并,也就是说,估计被统一标记,重复的估计被合并,形成包含最小数量的音素的单词或短语的估计。 然后可以将估计与存储的单词或短语进行比较,以确定所说的内容。