Abstract:
A method of proofreading and correcting dictated text contained in an electronic document can include selecting proofreading criteria for identifying textual errors contained in the electronic document; playing back each word contained in the electronic document; and, marking as a textual error each played back word in nonconformity with at least one of the proofreading criteria.
Abstract:
A method of proofreading and correcting dictated text contained in an electronic document comprises the steps of: selecting proofreading criteria for identifying textual errors contained in the electronic document; playing back each word contained in the electronic document; and, marking as a textual error each played back word in nonconformity with at least one of the proofreading criteria. The method can further comprise the step of editing each marked textual error identified in the marking step. In particular, the editing step can include reviewing each marked textual error identified in the marking step; accepting user specified changes to each marked textual error reviewed in the reviewing step; and, unmarking each marked textual error corrected by the user in the accepting step. Also, the reviewing step can include highlighting each word in the electronic document corresponding to a textual error marked in the marking step; and, displaying an explanation for each marked textual error in a user interface. Moreover, the reviewing step can further include suggesting a recommended change to the marked textual error; displaying the recommended change in the user interface; and, accepting a user specified preference to substitute the recommended change for the marked textual error.
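The mark, review, and unmark steps described above can be sketched as follows. This is a minimal illustration only: the `Criterion` shape, the `homophone_criterion` rule, and the function names are assumptions made for the example and are not taken from the abstract, and audio playback of each word is omitted.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional

# Hypothetical criterion shape: returns an explanation string when the word
# violates the criterion, or None when the word conforms.
Criterion = Callable[[str], Optional[str]]


@dataclass
class MarkedError:
    index: int         # position of the word in the document
    word: str          # the word as dictated
    explanation: str   # shown to the user during review


def homophone_criterion(word: str) -> Optional[str]:
    """Example criterion (an assumption): flag commonly confused homophones."""
    homophones = {"their", "there", "to", "too", "two"}
    if word.lower() in homophones:
        return "commonly confused homophone"
    return None


def mark_errors(words: List[str], criteria: List[Criterion]) -> List[MarkedError]:
    """Mark each word that fails at least one selected criterion."""
    marked = []
    for index, word in enumerate(words):
        for criterion in criteria:
            explanation = criterion(word)
            if explanation is not None:
                marked.append(MarkedError(index, word, explanation))
                break  # one failing criterion is enough to mark the word
    return marked


def accept_change(words: List[str], marked: List[MarkedError],
                  error: MarkedError, replacement: str) -> None:
    """Apply a user specified change, then unmark the corrected error."""
    words[error.index] = replacement
    marked.remove(error)


words = "send it too there office".split()
marked = mark_errors(words, [homophone_criterion])
accept_change(words, marked, marked[0], "to")
```

After the accepted change, one marked error remains for the user to review.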
Abstract:
A transcription system (100) includes a computer (102), a monitor (104), and a microphone (110). Via the microphone, a user of the system provides input speech that is received and transcribed (204) by the system. The system monitors (205) the accuracy of the transcribed speech during transcription. The system also determines (210) whether the accuracy of the transcribed speech is sufficient and, if not, automatically activates (214) a speech recognition improvement tool and alerts (212) the user that the tool has been activated.
Abstract:
A transcription system (100) includes a computer (102), a monitor (104), and a microphone (110). Via the microphone, a user of the system provides input speech that is received and transcribed (204) by the system. The system monitors (205) the accuracy of the transcribed speech during transcription. The system also determines (210) whether the accuracy of the transcribed speech is sufficient and, if not, automatically activates (214) a speech recognition improvement tool and alerts (212) the user that the tool has been activated. This tool could also be manually activated (206) by the user. The type of recognition problem is identified (216) by the user or automatically by the system, and the system provides (218) possible solution steps for enabling the user to adjust (219) system parameters or modify user behavior in order to alleviate the recognition problem. The system also provides the user the ability to test (222) the transcription process in order to determine whether the solution has improved the recognition accuracy.
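The monitoring and automatic activation steps might look like the sketch below. The abstract does not say how accuracy is measured; the rolling window of per-word correctness flags, the `threshold` value, and the class name `AccuracyMonitor` are all assumptions for illustration.

```python
from collections import deque


class AccuracyMonitor:
    """Tracks recent recognition accuracy over a rolling window (assumed metric)."""

    def __init__(self, window: int = 50, threshold: float = 0.9):
        self.results = deque(maxlen=window)  # True = word recognized correctly
        self.threshold = threshold
        self.tool_active = False

    def accuracy(self) -> float:
        # With no data yet, treat accuracy as sufficient.
        if not self.results:
            return 1.0
        return sum(self.results) / len(self.results)

    def record(self, word_was_correct: bool) -> bool:
        """Record one word; return True when the tool has just been activated,
        which is the moment the caller should alert the user."""
        self.results.append(word_was_correct)
        if self.accuracy() < self.threshold and not self.tool_active:
            self.tool_active = True
            return True
        return False

    def activate_manually(self) -> None:
        """The user can also activate the improvement tool directly."""
        self.tool_active = True
```

Returning a flag from `record` keeps the alert separate from the measurement, so the same monitor could drive a dialog, a status icon, or a log entry.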
Abstract:
A method for managing a What Can I Say (WCIS) function in an application having a plurality of commands which can be voice activated comprises the steps of: storing a set of substantially all voice activatable commands associated with the application; identifying those of the commands in the set which are displayable by the application; and, in response to a user input, displaying in a graphical user interface (GUI) a subset of the voice activatable commands which are not displayable by the application. Moreover, the method includes displaying in the GUI, in response to a user input, a list of the stored set of substantially all voice activatable commands associated with the application; displaying in the GUI a pull down menu identifying different categories by which the commands can be viewed in the list; and, displaying the GUI with a pull down menu identifying commands that can be performed against a voice command.
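The core of the WCIS selection is a set difference: take every stored voice command and drop those the application already displays. A minimal sketch, with the function name and the sample commands chosen only for illustration:

```python
from typing import Iterable, List


def wcis_subset(all_commands: Iterable[str], displayable: Iterable[str]) -> List[str]:
    """Return the voice commands that the application does not itself display."""
    return sorted(set(all_commands) - set(displayable))


# Hypothetical command sets for an editor application.
stored = ["File", "Edit", "scroll down", "go to sleep"]
shown_by_app = ["File", "Edit"]
hidden = wcis_subset(stored, shown_by_app)
```

Here `hidden` holds only the commands a user could not discover by looking at the application's own menus.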
Abstract:
An efficient method and system, particularly well-suited for correcting natural language understanding (NLU) commands, corrects spoken commands misinterpreted by a speech recognition system. The method involves a series of steps, including: receiving the spoken command from a user; parsing the command to identify a paraphrased command; displaying the paraphrased command; and accepting corrections of the paraphrased command from the user. The paraphrased command is segmented according to command language categories, which include a command action category, an action object category, and an action and/or object modifying category. The paraphrased command is displayed in a user interface window segmented into these command language categories. The user interface window also contains alternative commands for each segment of the paraphrased command.
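One way to picture the segmented paraphrase is as a list of segments, each carrying its category, its current value, and the alternatives offered in the window. The `Segment` type, the category labels, and the sample command below are assumptions for illustration, not structures named in the abstract.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    category: str            # "action", "object", or "modifier" (assumed labels)
    value: str               # the currently displayed interpretation
    alternatives: List[str]  # other choices offered for this segment


def paraphrase(segments: List[Segment]) -> str:
    """Join the segment values into the displayed paraphrased command."""
    return " ".join(segment.value for segment in segments)


def correct(segments: List[Segment], category: str, choice: str) -> None:
    """Replace one segment's value with an alternative picked by the user."""
    for segment in segments:
        if segment.category == category:
            if choice not in segment.alternatives:
                raise ValueError(f"{choice!r} is not offered for {category}")
            segment.value = choice
            return
    raise ValueError(f"no segment with category {category!r}")


command = [
    Segment("action", "move", ["copy", "remove"]),
    Segment("object", "paragraph", ["page"]),
    Segment("modifier", "to the end", ["to the top"]),
]
correct(command, "action", "copy")
```

Because each segment is corrected independently, the user fixes only the misinterpreted part instead of repeating the whole command.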
Abstract:
A method and apparatus for transcribing text from multiple speakers in a computer system having a speech recognition application. The system receives speech from one of a plurality of speakers through a single channel, assigns a speaker ID to the speaker, transcribes the speech into text, and associates the speaker ID with the speech and text. The system monitors the speech input through the channel to detect a speaker change.
Abstract:
A method and system efficiently identify voice commands for a user of a speech recognition system. The method involves a series of steps including: receiving input from a user; monitoring the computer system to log system events and ascertain a current system state; predicting a probable next event according to the current system state and logged events; and identifying acceptable voice commands to perform the next event. The system events include commands, system control activities, timed activities, and application activation. These events are statistically analyzed in light of the current system state to determine the probable next event. The voice commands for performing the probable next event are displayed to the user.
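The abstract does not specify the statistical analysis, so the sketch below assumes the simplest possible one: count how often each event has been logged in each system state and predict the most frequent. The class name, state labels, and command phrases are hypothetical.

```python
from collections import Counter, defaultdict
from typing import Dict, List, Optional


class NextEventPredictor:
    """Predicts the probable next event from per-state event frequencies."""

    def __init__(self, voice_commands: Dict[str, List[str]]):
        self.voice_commands = voice_commands  # event -> acceptable phrases
        self.counts = defaultdict(Counter)    # state -> Counter of logged events

    def log_event(self, state: str, event: str) -> None:
        """Log one system event observed in the given state."""
        self.counts[state][event] += 1

    def predict(self, state: str) -> Optional[str]:
        """Return the most frequently logged event for this state, if any."""
        events = self.counts.get(state)
        if not events:
            return None
        return events.most_common(1)[0][0]

    def commands_to_display(self, state: str) -> List[str]:
        """Return the voice commands that perform the predicted event."""
        event = self.predict(state)
        return self.voice_commands.get(event, [])


predictor = NextEventPredictor({
    "save": ["save file", "save document"],
    "print": ["print this"],
})
predictor.log_event("editor_open", "save")
predictor.log_event("editor_open", "save")
predictor.log_event("editor_open", "print")
suggested = predictor.commands_to_display("editor_open")
```

A real system would likely weight recent events more heavily or condition on event sequences; plain counts are used here only to keep the example short.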
Abstract:
A method for correcting frequently misrecognized words and commands in a speech application. According to the method, when a need for correcting a frequently misrecognized word/command spoken by a user is detected, a recording is made of the misrecognized word/command in isolation. Subsequently, an in-isolation base form for the misrecognized word/command is established from the in-isolation recording. The in-isolation base form is then saved and the misrecognized word/command is recorded in context. Next, an in-context base form is established for the misrecognized word/command from the context recording and a comparison is made between the in-isolation and in-context base forms. The in-context base form is saved only if the in-isolation and in-context base forms are markedly different from one another. A sentence is displayed using the frequently misrecognized word/command in context and the user is prompted to speak the sentence. The sentence is then recognized using the speech application and the user is prompted to confirm whether or not the frequently misrecognized word/command was properly recognized. The method is terminated if the frequently misrecognized word/command was properly recognized.
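The decision to keep the in-context base form only when it differs markedly from the in-isolation one can be sketched as below. The abstract does not define "markedly different"; this example assumes base forms are phoneme sequences and uses a similarity ratio from Python's `difflib` with an arbitrary `threshold`, both of which are illustrative choices.

```python
from difflib import SequenceMatcher
from typing import List


def markedly_different(isolation: List[str], context: List[str],
                       threshold: float = 0.7) -> bool:
    """Treat two base forms as markedly different when their similarity
    ratio falls below the (assumed) threshold."""
    ratio = SequenceMatcher(None, isolation, context).ratio()
    return ratio < threshold


def base_forms_to_save(isolation: List[str], context: List[str],
                       threshold: float = 0.7) -> List[List[str]]:
    """Always save the in-isolation form; save the in-context form only
    when it is markedly different."""
    saved = [isolation]
    if markedly_different(isolation, context, threshold):
        saved.append(context)
    return saved


# Hypothetical phoneme sequences for the word "the".
in_isolation = ["DH", "AH"]
in_context = ["DH", "IY"]
saved_forms = base_forms_to_save(in_isolation, in_context)
```

Skipping a near-identical in-context form avoids storing redundant pronunciations for the same word.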
Abstract:
A novel apparatus and method for correcting speech recognized text in a predominantly speech-only environment for use with a device having only a limited or no display device available. The method is preferably implemented by a machine readable storage mechanism having stored thereon a computer program, the method comprising the following steps. First, audio speech input can be received and converted by speech-to-text into speech recognized text. Second, a first speech correction command for performing a correction operation on speech recognized text stored in a text buffer can be detected in the speech recognized text. Third, if a speech correction command is not detected in the speech recognized text, the speech recognized text can be added to the text buffer. Fourth, if a speech correction command is detected in the speech recognized text, the detected speech correction command can be performed on speech recognized text stored in the text buffer.
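The detect-or-append loop in the four steps above can be sketched as follows. The two command phrases and the function name `process` are hypothetical; the abstract does not list the actual correction commands, and speech-to-text conversion is assumed to have already happened.

```python
from typing import List


def process(recognized: str, buffer: List[str]) -> List[str]:
    """Handle one chunk of speech recognized text.

    If the text is a correction command (hypothetical phrases below),
    perform it on the buffer; otherwise append the text to the buffer.
    """
    text = recognized.strip().lower()
    if text == "delete last word":
        if buffer:
            buffer.pop()       # remove the most recently added word
    elif text == "clear buffer":
        buffer.clear()         # discard everything dictated so far
    else:
        buffer.extend(recognized.split())
    return buffer


text_buffer: List[str] = []
process("call me tomorrow", text_buffer)
process("delete last word", text_buffer)
```

After these two calls the buffer holds "call me", all without any display being involved.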