Method and apparatus for disambiguating lists of elements for speech interfaces
    1.
    发明授权
    Method and apparatus for disambiguating lists of elements for speech interfaces 有权
    用于消除语音界面元素列表的方法和装置

    公开(公告)号:US06523004B1

    公开(公告)日:2003-02-18

    申请号:US09356144

    申请日:1999-07-19

    IPC分类号: G10L1504

    CPC分类号: G10L15/04 G10L2015/085

    摘要: In a computer system having a list based natural discourse application adapted for speech recognition. In response to a first user element request, the system searches a list of elements to generate a list of matches which contain elements which satisfy the element request. The system calculates the time required to read out the match list common levels, the time required to read out all matches, and the time required to iteratively query the user as to which matches of one of said common levels to read out. The system then reads out the match list using the method having the lowest calculated time.

    摘要翻译: 在具有适用于语音识别的基于列表的自然话语应用的计算机系统中。 响应于第一用户元素请求,系统搜索元素列表以生成包含满足元素请求的元素的匹配列表。 系统计算读出匹配列表公共级别所需的时间,读取所有匹配所需的时间,以及迭代地向用户查询所读出的所述共同级别之一的匹配所需的时间。 然后系统使用计算时间最短的方法读出匹配列表。

    Method of managing a speech cache
    2.
    发明授权
    Method of managing a speech cache 有权
    管理语音缓存的方法

    公开(公告)号:US06741963B1

    公开(公告)日:2004-05-25

    申请号:US09598603

    申请日:2000-06-21

    IPC分类号: G10L1500

    CPC分类号: G10L15/285 G10L15/22

    摘要: A method and a system for improving recall of speech data in a computer speech system can include a plurality of speech cache management steps including providing a speech cache, receiving a speech system input and identifying a speech event in the received speech system input, the speech event comprising speech data. Subsequently, the speech data can be compared to pre-determined speech cache entry criteria; and, if the speech data meets one of the pre-determined entry criteria, at least one entry can be added to the speech cache, the at least one entry corresponding to the speech data. Additionally, the speech data can be compared to pre-determined speech cache exit criteria; and, if the speech data meets one of the pre-determined exit criteria, at least one entry can be purged from the speech cache, the at least one entry corresponding to the speech data. The entry criteria can include frequently used speech data, recently used speech data, and important speech data. Similarly, the exit criteria can include least frequently used speech data associated with each entry in the speech cache, least recently used speech data associated with each entry in the speech cache and least important speed data associated with each entry in the speech cache.

    摘要翻译: 一种用于改善计算机语音系统中的语音数据的回忆的方法和系统可以包括多个语音高速缓存管理步骤,包括提供语音高速缓存,接收语音系统输入和识别接收到的语音系统输入中的语音事件,语音 事件包括语音数据。 随后,语音数据可以与预定的语音高速缓存入口标准进行比较; 以及如果所述语音数据满足所述预定条目标准之一,则至少一个条目可以被添加到所述语音高速缓存,所述至少一个条目对应于所述语音数据。 另外,语音数据可以与预定语音高速缓存退出标准进行比较; 并且如果语音数据满足预定的退出准则之一,则可以从语音高速缓存中清除至少一个条目,该对应于该语音数据的至少一个条目。 入门标准可以包括经常使用的语音数据,最近使用的语音数据和重要的语音数据。 类似地,退出标准可以包括与语音高速缓存中的每个条目相关联的最少频繁使用的语音数据,与语音高速缓存中的每个条目相关联的最近最少使用的语音数据以及与语音高速缓存中的每个条目相关联的最小重要速度数据。

    Method and apparatus for improving speech recognition accuracy
    3.
    发明授权
    Method and apparatus for improving speech recognition accuracy 失效
    提高语音识别精度的方法和装置

    公开(公告)号:US06675142B2

    公开(公告)日:2004-01-06

    申请号:US09960826

    申请日:2001-09-21

    IPC分类号: G10L1500

    CPC分类号: G10L15/22 G10L2015/0638

    摘要: A transcription system (100) includes a computer (102), a monitor (104), and a microphone (110). Via the microphone, a user of the system provides input speech that is received and transcribed (204) by the system. The system monitors (205) the accuracy of the transcribed speech during transcription. The system also determines (210) whether the accuracy of the transcribed speech is sufficient and, if not, automatically activates (214) a speech recognition improvement tool and alerts (212) the user that the tool has been activated.

    摘要翻译: 转录系统(100)包括计算机(102),监视器(104)和麦克风(110)。 通过麦克风,系统的用户提供由系统接收和转录(204)的输入语音。 系统在转录过程中监视(205)转录语言的准确性。 系统还确定(210)转录语音的准确性是否足够,如果不是,则自动激活(214)语音识别改进工具并且向用户警告(212)该工具已被激活。

    Transcription system for multiple speakers, using and establishing identification
    4.
    发明授权
    Transcription system for multiple speakers, using and establishing identification 有权
    多个扬声器的转录系统,使用和建立识别

    公开(公告)号:US06332122B1

    公开(公告)日:2001-12-18

    申请号:US09337392

    申请日:1999-06-23

    IPC分类号: G10L1100

    CPC分类号: G10L17/00 G10L15/26

    摘要: A method and apparatus for transcribing text from multiple speakers in a computer system having a speech recognition application. The system receives speech from one of a plurality of speakers through a single channel, assigns a speaker ID to the speaker, transcribes the speech into text, and associates the speaker ID with the speech and text. In order to detect a speaker change, the system monitors the speech input through the channel for a speaker change.

    摘要翻译: 一种用于在具有语音识别应用的计算机系统中从多个扬声器转录文本的方法和装置。 系统通过单个频道从多个扬声器中的一个接收语音,向说话者分配扬声器ID,将语音转录成文本,并将扬声器ID与语音和文本相关联。 为了检测扬声器变化,系统通过通道来监视通过通道输入的语音以进行扬声器改变。

    System and method for concurrent presentation of multiple audio information sources
    5.
    发明授权
    System and method for concurrent presentation of multiple audio information sources 失效
    同时呈现多个音频信息源的系统和方法

    公开(公告)号:US06757656B1

    公开(公告)日:2004-06-29

    申请号:US09594397

    申请日:2000-06-15

    IPC分类号: G10L2106

    CPC分类号: G10L15/22

    摘要: A method for concurrent presentation of multiple audio information sources. In the method, audio information from at least two audio information sources is concurrently presented, and a user speech selection of one of the audio information sources is accepted. At least one of the audio information sources can then be reconfigured. The reconfiguration audibly distinguishes the user selected audio information source from other audio information sources.

    摘要翻译: 一种用于并发呈现多个音频信息源的方法。 在该方法中,同时呈现来自至少两个音频信息源的音频信息,并且接受音频信息源之一的用户语音选择。 然后可以重新配置至少一个音频信息源。 重新配置可听见地将用户选择的音频信息源与其他音频信息源区分开。

    Method and apparatus for correcting misinterpreted voice commands in a speech recognition system
    7.
    发明授权
    Method and apparatus for correcting misinterpreted voice commands in a speech recognition system 有权
    用于在语音识别系统中校正误解的语音命令的方法和装置

    公开(公告)号:US06327566B1

    公开(公告)日:2001-12-04

    申请号:US09333698

    申请日:1999-06-16

    IPC分类号: G10L1504

    摘要: An efficient method and system, particularly well-suited for correcting natural language understanding (NLU) commands, corrects spoken commands misinterpreted by a speech recognition system. The method involves a series of steps, including: receiving the spoken command from a user; parsing the command to identify a paraphrased command; displaying the paraphrased command; and accepting corrections of the paraphrased command from the user. The paraphrased command is segmented according to command language categories, which include a command action category, an action object category, and an action and/or object modifying category. The paraphrased command is displayed in a user interface window segmented into these command language categories. The user interface window also contains alternative commands for each segment of the paraphrased command.

    摘要翻译: 一种特别适合用于校正自然语言理解(NLU)命令的有效方法和系统来校正由语音识别系统误解的语音命令。 该方法涉及一系列步骤,包括:从用户接收口令命令; 解析命令来识别一个释义的命令; 显示释义的命令; 并接受来自用户的释义命令的更正。 释义的命令根据命令语言类别进行分段,其中包括命令操作类别,操作对象类别以及操作和/或对象修改类别。 释义的命令显示在分为这些命令语言类别的用户界面窗口中。 用户界面窗口还包含替代命令的每个段的替代命令。

    Method, system, and apparatus for limiting available selections in a speech recognition system
    8.
    发明授权
    Method, system, and apparatus for limiting available selections in a speech recognition system 有权
    用于限制语音识别系统中的可用选择的方法,系统和装置

    公开(公告)号:US07010490B2

    公开(公告)日:2006-03-07

    申请号:US09770577

    申请日:2001-01-26

    IPC分类号: G10L15/00

    CPC分类号: G10L15/26 G10L2015/228

    摘要: A method and system for completing user input in a speech recognition system. The method can include a series of steps which can include receiving a user input. The user input can specify an attribute of a selection. The method can include comparing the user input with a set of selections in the speech recognition system. Also, the method can include limiting the set of selections to an available set of selections which can correspond to the received user input. The step of matching a received user spoken utterance with the selection in the available set of selections also can be included.

    摘要翻译: 一种在语音识别系统中完成用户输入的方法和系统。 该方法可以包括可以包括接收用户输入的一系列步骤。 用户输入可以指定选择的属性。 该方法可以包括将用户输入与语音识别系统中的一组选择进行比较。 此外,该方法可以包括将选择集合限制为可以对应于所接收的用户输入的可用选择集合。 可以包括将接收到的用户口令话语与可用选择集合中的选择进行匹配的步骤。

    Method and apparatus for improving speech recognition accuracy
    9.
    发明授权
    Method and apparatus for improving speech recognition accuracy 有权
    提高语音识别精度的方法和装置

    公开(公告)号:US06370503B1

    公开(公告)日:2002-04-09

    申请号:US09345071

    申请日:1999-06-30

    IPC分类号: G10L1526

    CPC分类号: G10L15/22 G10L2015/0638

    摘要: A transcription system (100) includes a computer (102), a monitor (104), and a microphone (110). Via the microphone, a user of the system provides input speech that is received and transcribed (204) by the system. The system monitors (205) the accuracy of the transcribed speech during transcription. The system also determines (210) whether the accuracy of the transcribed speech is sufficient and, if not, automatically activates (214) a speech recognition improvement tool and alerts (212) the user that the tool has been activated. This tool could also be manually activated (206) by the user. The type of recognition problem is identified (216) by the user or automatically by the system, and the system provides (218) possible solution steps for enabling the user to adjust (219) system parameters or modify user behavior in order to alleviate the recognition problem. The system also provides the user the ability to test (222) the transcription process in order to determine whether the solution has improved the recognition accuracy.

    摘要翻译: 转录系统(100)包括计算机(102),监视器(104)和麦克风(110)。 通过麦克风,系统的用户提供由系统接收和转录(204)的输入语音。 系统在转录过程中监视(205)转录语言的准确性。 系统还确定(210)转录语音的准确性是否足够,如果不是,则自动激活(214)语音识别改进工具并且向用户警告(212)该工具已被激活。 该工具也可以由用户手动激活(206)。 识别问题的类型由用户或系统自动识别(216),并且系统提供(218)可能的解决方案步骤,以使用户能够调整(219)系统参数或修改用户行为以减轻识别 问题。 该系统还为用户提供测试(222)转录过程的能力,以确定解决方案是否提高了识别精度。