Abstract:
A method for concurrent presentation of multiple audio information sources. In the method, audio information from at least two audio information sources is concurrently presented, and a user speech selection of one of the audio information sources is accepted. At least one of the audio information sources can then be reconfigured. The reconfiguration audibly distinguishes the user selected audio information source from other audio information sources.
Abstract:
A method and apparatus for transcribing text from multiple speakers in a computer system having a speech recognition application. The system receives speech from one of a plurality of speakers through a single channel, assigns a speaker ID to the speaker, transcribes the speech into text, and associates the speaker ID with the speech and text. To detect a speaker change, the system monitors the speech input through the channel.
Abstract:
A transcription system (100) includes a computer (102), a monitor (104), and a microphone (110). Via the microphone, a user of the system provides input speech that is received and transcribed (204) by the system. The system monitors (205) the accuracy of the transcribed speech during transcription. The system also determines (210) whether the accuracy of the transcribed speech is sufficient and, if not, automatically activates (214) a speech recognition improvement tool and alerts (212) the user that the tool has been activated. This tool could also be manually activated (206) by the user. The type of recognition problem is identified (216) by the user or automatically by the system, and the system provides (218) possible solution steps for enabling the user to adjust (219) system parameters or modify user behavior in order to alleviate the recognition problem. The system also provides the user the ability to test (222) the transcription process in order to determine whether the solution has improved the recognition accuracy.
Abstract:
An efficient method and system, particularly well-suited for correcting natural language understanding (NLU) commands, correct spoken commands misinterpreted by a speech recognition system. The method involves a series of steps, including: receiving the spoken command from a user; parsing the command to identify a paraphrased command; displaying the paraphrased command; and accepting corrections of the paraphrased command from the user. The paraphrased command is segmented according to command language categories, which include a command action category, an action object category, and an action and/or object modifying category. The paraphrased command is displayed in a user interface window segmented into these command language categories. The user interface window also contains alternative commands for each segment of the paraphrased command.
Abstract:
A method and system efficiently identify voice commands for a user of a speech recognition system. The method involves a series of steps including: receiving input from a user; monitoring the computer system to log system events and ascertain a current system state; predicting a probable next event according to the current system state and logged events; and identifying acceptable voice commands to perform the next event. The system events include commands, system control activities, timed activities, and application activation. These events are statistically analyzed in light of the current system state to determine the probable next event. The voice commands for performing the probable next event are displayed to the user.
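The prediction step can be illustrated with a minimal sketch. It assumes the statistical analysis is a simple frequency count of which event followed each (state, event) pair; the abstract does not specify the analysis, and the class name, event names, and `VOICE_COMMANDS` table are all hypothetical.

```python
from collections import Counter, defaultdict


class NextEventPredictor:
    """Counts which event followed each logged (state, event) pair."""

    def __init__(self):
        self._counts = defaultdict(Counter)
        self._last = None  # most recently logged (state, event) pair

    def log(self, state, event):
        # Record that `event` followed the previously logged pair.
        if self._last is not None:
            self._counts[self._last][event] += 1
        self._last = (state, event)

    def predict(self):
        # Most frequent follower of the latest pair, or None if none was seen.
        followers = self._counts.get(self._last)
        if not followers:
            return None
        return followers.most_common(1)[0][0]


# Hypothetical mapping from events to the voice commands that perform them.
VOICE_COMMANDS = {
    "save_file": ["save", "save document"],
    "print_file": ["print", "print document"],
}


def commands_for_next_event(predictor):
    """Voice commands to display for the predicted next event, if any."""
    return VOICE_COMMANDS.get(predictor.predict(), [])
```

In this sketch the current system state is folded into the lookup key, so the same event can lead to different predictions in different states.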
Abstract:
In a computer system having a list-based natural discourse application adapted for speech recognition, the system responds to a first user element request by searching a list of elements to generate a match list containing the elements that satisfy the request. The system calculates the time required to read out the common levels of the match list, the time required to read out all matches, and the time required to iteratively query the user as to which matches of one of the common levels to read out. The system then reads out the match list using the method with the lowest calculated time.
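The final selection step amounts to taking the minimum over the three time estimates. The sketch below covers only that step and assumes the estimates have already been calculated; the method names and the numbers in the usage example are hypothetical, since the abstract does not say how each time is computed.

```python
def choose_readout_method(estimated_seconds):
    """Return the name of the readout method with the lowest estimated time.

    `estimated_seconds` maps a method name to its estimated readout time.
    """
    if not estimated_seconds:
        raise ValueError("at least one readout method is required")
    return min(estimated_seconds, key=estimated_seconds.get)


# Hypothetical estimates for one match list, in seconds.
estimates = {
    "read_common_levels": 12.0,
    "read_all_matches": 24.0,
    "iterative_query": 8.0,
}
best = choose_readout_method(estimates)
```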
Abstract:
A computer system has a notification manager for playing a message to a user by selecting one of a plurality of audio notifications. The method includes the step of setting a priority level for each notification arriving into a queue. The notification is inserted into a position in the queue based upon the priority level of the notification, such that the audio notifications at the queue top have a generally higher priority than audio notifications at the queue bottom. The notification at the top of the queue can be selected if the priority level of the notification is greater than a predetermined gate level. Once a notification is selected, a message corresponding to the selected notification is played to the user.
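A minimal sketch of the queue described above, assuming numeric priority levels where a larger number means higher priority and first-in, first-out ordering among equal priorities (the abstract says only that the top is "generally" higher priority). The class and method names are hypothetical.

```python
import bisect
import itertools


class NotificationQueue:
    """Priority-ordered queue with a gate level for selection."""

    def __init__(self, gate_level):
        self.gate_level = gate_level
        self._items = []  # kept sorted so the highest priority is at index 0
        self._arrival = itertools.count()

    def insert(self, priority, message):
        # Negating the priority makes ascending order put high priority first;
        # the arrival counter keeps equal priorities in arrival order.
        bisect.insort(self._items, (-priority, next(self._arrival), message))

    def select(self):
        # Return the top message only if its priority exceeds the gate level;
        # otherwise leave the queue unchanged and return None.
        if self._items and -self._items[0][0] > self.gate_level:
            return self._items.pop(0)[2]
        return None
```

A caller would play the message returned by `select()` and treat `None` as "nothing to play right now".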
Abstract:
A method of proofreading and correcting dictated text contained in an electronic document comprises the steps of: selecting proofreading criteria for identifying textual errors contained in the electronic document; playing back each word contained in the electronic document; and marking as a textual error each played-back word in nonconformity with at least one of the proofreading criteria. The method can further comprise the step of editing each marked textual error identified in the marking step. In particular, the editing step can include reviewing each marked textual error identified in the marking step; accepting user-specified changes to each marked textual error reviewed in the reviewing step; and unmarking each marked textual error corrected by the user in the accepting step. Also, the reviewing step can include highlighting each word in the electronic document corresponding to a textual error marked in the marking step; and displaying an explanation for each marked textual error in a user interface. Moreover, the reviewing step can further include suggesting a recommended change to the marked textual error; displaying the recommended change in the user interface; and accepting a user-specified preference to substitute the recommended change for the marked textual error.
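The marking step can be sketched as follows, assuming each proofreading criterion is a predicate that returns True when a word conforms. Audio playback is stood in for by iterating over the words, and the two example criteria and the vocabulary are hypothetical.

```python
def mark_errors(words, criteria):
    """Map the index of each nonconforming word to the criteria it failed.

    `criteria` maps a criterion name to a predicate that returns True
    when a word conforms to that criterion.
    """
    marked = {}
    for index, word in enumerate(words):
        failed = [name for name, conforms in criteria.items() if not conforms(word)]
        if failed:
            marked[index] = failed
    return marked


# Hypothetical criteria chosen by the user before proofreading starts.
VOCABULARY = {"the", "cat"}
criteria = {
    "in_vocabulary": lambda w: w.lower() in VOCABULARY,
    "no_digits": lambda w: not any(c.isdigit() for c in w),
}
marked = mark_errors(["the", "teh", "cat", "c4t"], criteria)
```

Keeping the names of the failed criteria alongside each mark gives the review step something to display as the explanation for that error.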
Abstract:
A method of identifying excess noise in a computer system includes: first, recording a silence sample; second, recording an isolated noise sample while operating a computer system component in isolation from other computer system components; comparing signal characteristics of the silence sample with signal characteristics of the isolated noise sample; and attributing the isolated noise sample to the isolated computer component when the signal characteristics of the silence sample differ by a preset threshold from the signal characteristics of the isolated noise sample. The method can further include logging the signal characteristics of the silence sample and the isolated noise sample; reporting excess noise identified in the identifying step; and suggesting a remedy for the identified excess noise.
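The comparison and attribution steps can be sketched as below. The abstract does not name the signal characteristics, so this example assumes a single characteristic, the root-mean-square (RMS) level, and treats "differ by a preset threshold" as "differ by at least the threshold". Function names and sample values are hypothetical.

```python
import math


def rms(samples):
    """Root-mean-square level of a non-empty sequence of audio samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def attribute_noise(silence, isolated, component, threshold):
    """Return `component` if its isolated sample differs enough from silence.

    The RMS levels of the two samples are compared; a difference of at
    least `threshold` attributes the noise to the component.
    """
    if abs(rms(isolated) - rms(silence)) >= threshold:
        return component
    return None


# Hypothetical recordings, as normalized amplitudes.
silence = [0.0, 0.01, -0.01, 0.0]
fan_running = [0.2, -0.2, 0.2, -0.2]
source = attribute_noise(silence, fan_running, "fan", threshold=0.05)
```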