Method and system for proofreading and correcting dictated text
    2.
    发明授权
    Method and system for proofreading and correcting dictated text 有权
    用于校对和纠正指定文本的方法和系统

    公开(公告)号:US06611802B2

    公开(公告)日:2003-08-26

    申请号:US09330668

    申请日:1999-06-11

    IPC分类号: G10L1300

    摘要: A method of proofreading and correcting dictated text contained in an electronic document comprises the steps of: selecting proofreading criteria for identifying textual errors contained in the electronic document; playing back each word contained in the electronic document; and, marking as a textual error each played back word in nonconformity with at least one of the proofreading criteria. The method can further comprise the step of editing each the marked textual error identified in the marking step. In particular, the editing step can include reviewing each the marked textual error identified in the marking step; accepting user specified changes to each marked textual error reviewed in the reviewing step; and, unmarking each marked textual error corrected by the user in the accepting step. Also, the reviewing step can include highlighting each the word in the electronic document corresponding to the marked textual error marked in the marking step; and, displaying an explanation for each marked textual error in a user interface. Moreover, the reviewing step can further include suggesting a recommended change to the marked textual error; displaying the recommended change in the user interface; and, accepting a user specified preference to substitute the recommended change for the marked textual error.

    摘要翻译: 一种校正和纠正电子文档中包含的指定文本的方法包括以下步骤:选择用于识别电子文档中包含的文本错误的校对标准; 播放电子文档中包含的每个单词; 并且标记为文本错误,每个都使用至少一个校对标准在不合格中播放单词。 该方法还可以包括编辑在标记步骤中识别的每个标记的文本错误的步骤。 特别地,编辑步骤可以包括查看在标记步骤中识别的标记的文本错误; 在审查步骤中审查的每个标记的文字错误接受用户指定的更改; 并且在接受步骤中取消标记由用户校正的每个标记的文本错误。 此外,审查步骤可以包括突出显示与标记步骤中标记的标记的文本错误相对应的电子文档中的单词; 并且在用户界面中显示每个标记的文本错误的说明。 此外,审查步骤还可以包括建议对标记的文字错误的建议更改; 显示用户界面中推荐的更改; 并且接受用户指定的首选项以替代所标记的文本错误的推荐更改。

    Method and apparatus for improving speech recognition accuracy
    3.
    发明授权
    Method and apparatus for improving speech recognition accuracy 失效
    提高语音识别精度的方法和装置

    公开(公告)号:US06675142B2

    公开(公告)日:2004-01-06

    申请号:US09960826

    申请日:2001-09-21

    IPC分类号: G10L1500

    CPC分类号: G10L15/22 G10L2015/0638

    摘要: A transcription system (100) includes a computer (102), a monitor (104), and a microphone (110). Via the microphone, a user of the system provides input speech that is received and transcribed (204) by the system. The system monitors (205) the accuracy of the transcribed speech during transcription. The system also determines (210) whether the accuracy of the transcribed speech is sufficient and, if not, automatically activates (214) a speech recognition improvement tool and alerts (212) the user that the tool has been activated.

    摘要翻译: 转录系统(100)包括计算机(102),监视器(104)和麦克风(110)。 通过麦克风,系统的用户提供由系统接收和转录(204)的输入语音。 系统在转录过程中监视(205)转录语言的准确性。 系统还确定(210)转录语音的准确性是否足够,如果不是,则自动激活(214)语音识别改进工具并且向用户警告(212)该工具已被激活。

    Method and apparatus for improving speech recognition accuracy
    4.
    发明授权
    Method and apparatus for improving speech recognition accuracy 有权
    提高语音识别精度的方法和装置

    公开(公告)号:US06370503B1

    公开(公告)日:2002-04-09

    申请号:US09345071

    申请日:1999-06-30

    IPC分类号: G10L1526

    CPC分类号: G10L15/22 G10L2015/0638

    摘要: A transcription system (100) includes a computer (102), a monitor (104), and a microphone (110). Via the microphone, a user of the system provides input speech that is received and transcribed (204) by the system. The system monitors (205) the accuracy of the transcribed speech during transcription. The system also determines (210) whether the accuracy of the transcribed speech is sufficient and, if not, automatically activates (214) a speech recognition improvement tool and alerts (212) the user that the tool has been activated. This tool could also be manually activated (206) by the user. The type of recognition problem is identified (216) by the user or automatically by the system, and the system provides (218) possible solution steps for enabling the user to adjust (219) system parameters or modify user behavior in order to alleviate the recognition problem. The system also provides the user the ability to test (222) the transcription process in order to determine whether the solution has improved the recognition accuracy.

    摘要翻译: 转录系统(100)包括计算机(102),监视器(104)和麦克风(110)。 通过麦克风,系统的用户提供由系统接收和转录(204)的输入语音。 系统在转录过程中监视(205)转录语言的准确性。 系统还确定(210)转录语音的准确性是否足够,如果不是,则自动激活(214)语音识别改进工具并且向用户警告(212)该工具已被激活。 该工具也可以由用户手动激活(206)。 识别问题的类型由用户或系统自动识别(216),并且系统提供(218)可能的解决方案步骤,以使用户能够调整(219)系统参数或修改用户行为以减轻识别 问题。 该系统还为用户提供测试(222)转录过程的能力,以确定解决方案是否提高了识别精度。

    Managing voice commands in speech applications
    5.
    发明授权
    Managing voice commands in speech applications 失效
    在语音应用程序中管理语音命令

    公开(公告)号:US06182046B2

    公开(公告)日:2001-01-30

    申请号:US09048714

    申请日:1998-03-26

    IPC分类号: G10L1522

    CPC分类号: G10L15/22 G10L2015/228

    摘要: A method for managing a What Can I Say (WCIS) function in an application having a plurality of commands which can be voice activated comprises the steps of: storing a set of substantially all voice activatable commands associated with the application; identifying those of the commands in the set which are displayable by the application; and, in response to a user input, displaying in a graphical user interface (GUI) a subset of the voice activatable commands which are not displayable by the application. Moreover, the method includes displaying in the GUI, in response to a user input, a list of the stored set of substantially all voice activatable commands associated with the application; displaying in the GUI a pull down menu identifying different categories by which the commands can be viewed in the list; and, displaying the GUI with a pull down menu identifying commands that can be performed against a voice command.

    摘要翻译: 一种用于管理具有可以被语音激活的多个命令的应用中的“可以说什么(WCIS)”功能的方法包括以下步骤:存储与应用相关联的基本上所有可语音激活的命令的集合; 识别集合中可由应用程序显示的命令的那些; 并且响应于用户输入,在图形用户界面(GUI)中显示无法由应用程序显示的语音可激活命令的子集。 此外,该方法包括在GUI中显示响应于用户输入的存储的与应用相关联的基本上所有语音激活命令的集合的列表; 在GUI中显示识别不同类别的下拉菜单,通过该菜单可以在列表中查看命令; 并且用下拉菜单显示GUI,以识别针对语音命令可执行的命令。

    Method and apparatus for correcting misinterpreted voice commands in a speech recognition system
    6.
    发明授权
    Method and apparatus for correcting misinterpreted voice commands in a speech recognition system 有权
    用于在语音识别系统中校正误解的语音命令的方法和装置

    公开(公告)号:US06327566B1

    公开(公告)日:2001-12-04

    申请号:US09333698

    申请日:1999-06-16

    IPC分类号: G10L1504

    摘要: An efficient method and system, particularly well-suited for correcting natural language understanding (NLU) commands, corrects spoken commands misinterpreted by a speech recognition system. The method involves a series of steps, including: receiving the spoken command from a user; parsing the command to identify a paraphrased command; displaying the paraphrased command; and accepting corrections of the paraphrased command from the user. The paraphrased command is segmented according to command language categories, which include a command action category, an action object category, and an action and/or object modifying category. The paraphrased command is displayed in a user interface window segmented into these command language categories. The user interface window also contains alternative commands for each segment of the paraphrased command.

    摘要翻译: 一种特别适合用于校正自然语言理解(NLU)命令的有效方法和系统来校正由语音识别系统误解的语音命令。 该方法涉及一系列步骤,包括:从用户接收口令命令; 解析命令来识别一个释义的命令; 显示释义的命令; 并接受来自用户的释义命令的更正。 释义的命令根据命令语言类别进行分段,其中包括命令操作类别,操作对象类别以及操作和/或对象修改类别。 释义的命令显示在分为这些命令语言类别的用户界面窗口中。 用户界面窗口还包含替代命令的每个段的替代命令。

    Transcription system for multiple speakers, using and establishing identification
    7.
    发明授权
    Transcription system for multiple speakers, using and establishing identification 有权
    多个扬声器的转录系统,使用和建立识别

    公开(公告)号:US06332122B1

    公开(公告)日:2001-12-18

    申请号:US09337392

    申请日:1999-06-23

    IPC分类号: G10L1100

    CPC分类号: G10L17/00 G10L15/26

    摘要: A method and apparatus for transcribing text from multiple speakers in a computer system having a speech recognition application. The system receives speech from one of a plurality of speakers through a single channel, assigns a speaker ID to the speaker, transcribes the speech into text, and associates the speaker ID with the speech and text. In order to detect a speaker change, the system monitors the speech input through the channel for a speaker change.

    摘要翻译: 一种用于在具有语音识别应用的计算机系统中从多个扬声器转录文本的方法和装置。 系统通过单个频道从多个扬声器中的一个接收语音,向说话者分配扬声器ID,将语音转录成文本,并将扬声器ID与语音和文本相关联。 为了检测扬声器变化,系统通过通道来监视通过通道输入的语音以进行扬声器改变。

    Method and apparatus for providing an event-based “What-Can-I-Say?” window
    8.
    发明授权
    Method and apparatus for providing an event-based “What-Can-I-Say?” window 有权
    提供基于事件的“我可以说什么”的方法和装置? 窗口

    公开(公告)号:US06308157B1

    公开(公告)日:2001-10-23

    申请号:US09328095

    申请日:1999-06-08

    IPC分类号: G10L1514

    CPC分类号: G10L15/26 G10L2015/228

    摘要: A method and system efficiently identifies voice commands for a user of a speech recognition system. The method involves a series of steps including: receiving input from a user; monitoring the computer system to log system events and ascertain a current system state; predicting a probable next event according to the current system state and logged events; and identifying acceptable voice commands to perform the next event. The system events include commands, system control activities, timed activities, and application activation. These events are statistically analyzed in light of the current system state to determine the probable next event. The voice commands for performing the probable next event are displayed to the user.

    摘要翻译: 方法和系统有效地识别用于语音识别系统的用户的语音命令。 该方法涉及一系列步骤,包括:从用户接收输入; 监视计算机系统以记录系统事件并确定当前的系统状态; 根据当前的系统状态和记录的事件预测可能的下一个事件; 并且识别可接受的语音命令以执行下一个事件。 系统事件包括命令,系统控制活动,定时活动和应用程序激活。 根据当前系统状态对这些事件进行统计分析,以确定可能的下一个事件。 用于执行可能的下一个事件的语音命令被显示给用户。

    Method for correcting frequently misrecognized words or command in
speech application
    9.
    发明授权
    Method for correcting frequently misrecognized words or command in speech application 失效
    在语音应用中纠正频繁误认的单词或命令的方法

    公开(公告)号:US5970451A

    公开(公告)日:1999-10-19

    申请号:US60122

    申请日:1998-04-14

    摘要: A method for correcting frequently misrecognized words and commands in a speech application. According to the method, when a need for correcting a frequently misrecognized word/command spoken by a user is detected, a recording is made of the misrecognized word/command in isolation. Subsequently, an in-isolation base form for the misrecognized word/command is established from the in-isolation recording. The in-isolation base form is then saved and the misrecognized word/command in recorded in context. Next, an in-context base form is established for the misrecognized word/command from the context recording and a comparison is made between the in-isolation and in-context base forms. The in-context base form is saved only if the in-isolation and in-context base forms are markedly different from one another. A sentence is displayed using the frequently misrecognized word/command in context and the user is prompted to speak the sentence. The sentence is then recognized using the speech application and the user is prompted to confirm whether or not the frequently misrecognized word/command was properly recognized. The method is terminated if the frequently misrecognized word/command was properly recognized.

    摘要翻译: 一种在语音应用程序中纠正频繁错误识别的单词和命令的方法。 根据该方法,当检测到需要校正用户所说出的经常被错误识别的字/命令时,隔离地记录错误识别的字/命令。 随后,从隔离记录建立了用于误识别的字/命令的隔离基础形式。 然后保存隔离基本形式,并记录在上下文中的错误识别的字/命令。 接下来,针对上下文记录中的错误识别的字/命令建立了上下文基础形式,并且在隔离和上下文基础形式之间进行比较。 只有在隔离和上下文基础形式彼此明显不同的情况下,内存基础形式才被保存。 在上下文中使用经常被误认的字/命令显示一个句子,并且提示用户说出这个句子。 然后使用语音应用来识别该句子,并且提示用户确认是否正确识别了经常被错误识别的字/命令。 如果正确识别出经常被错误识别的字/命令,则该方法将被终止。

    Speech recognition correction for devices having limited or no display
    10.
    发明授权
    Speech recognition correction for devices having limited or no display 有权
    具有有限或不显示的设备的语音识别校正

    公开(公告)号:US07200555B1

    公开(公告)日:2007-04-03

    申请号:US09610061

    申请日:2000-07-05

    IPC分类号: G10L15/26

    CPC分类号: G10L15/22

    摘要: A novel apparatus and method for correcting speech recognized text in a predominantly speech-only environment for use with a device having only a limited or no display device available. The method is preferably implemented by a machine readable storage mechanism having stored thereon a computer program, the method comprising the following steps. First, audio speech input can be received and speech-to-text converted to speech recognized text. Second, a first speech correction command for performing a correction operation on speech recognized text stored in a text buffer can be detected in the speech recognized text. Third, if a speech correction command is not detected in the speech recognized text, the speech recognized text can be added to the text buffer. Fourth, if a speech command is detected in the speech recognized text, the detected correction speech command can be performed on speech recognized text stored in the text buffer.

    摘要翻译: 一种新颖的装置和方法,用于在仅主要是仅限语音环境中校正语音识别的文本,以便与仅具有有限或不显示设备的设备一起使用。 该方法优选地由其上存储有计算机程序的机器可读存储机构来实现,该方法包括以下步骤。 首先,可以接收音频语音输入并将语音到文本转换成语音识别的文本。 第二,可以在语音识别文本中检测用于对存储在文本缓冲器中的语音识别文本执行校正操作的第一语音校正命令。 第三,如果在语音识别文本中没有检测到语音校正命令,则可以将语音识别的文本添加到文本缓冲器。 第四,如果在语音识别文本中检测到语音命令,则可以对存储在文本缓冲器中的语音识别文本执行检测到的校正语音命令。