Abstract:
A system may use multiple speech interface devices to interact with a user by speech. Some or all of the speech interface devices may detect a user utterance and initiate speech processing to determine the meaning or intent of the utterance. During speech processing, arbitration selects one of the speech interface devices to respond to the user utterance. Arbitration may be based in part on metadata that directly or indirectly indicates the proximity of the user to the devices, and the device deemed nearest the user may be selected to respond.
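The arbitration step described above can be illustrated with a minimal sketch. The abstract does not specify which metadata is used, so the fields `signal_energy` and `wake_word_confidence` are assumptions chosen as plausible indirect proximity indicators, and `UtteranceReport` and `arbitrate` are hypothetical names.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class UtteranceReport:
    """Metadata one device sends alongside a detected utterance (hypothetical)."""
    device_id: str
    signal_energy: float         # assumed proximity proxy: higher suggests a nearer user
    wake_word_confidence: float  # assumed tie-breaker in the range [0, 1]


def arbitrate(reports: List[UtteranceReport]) -> Optional[str]:
    """Return the id of the device selected to respond, or None if none reported."""
    if not reports:
        return None
    # Prefer the strongest signal; break ties on wake-word confidence.
    best = max(reports, key=lambda r: (r.signal_energy, r.wake_word_confidence))
    return best.device_id


reports = [
    UtteranceReport("kitchen", 0.42, 0.91),
    UtteranceReport("living_room", 0.77, 0.88),
    UtteranceReport("hallway", 0.15, 0.95),
]
chosen = arbitrate(reports)
```

Here `chosen` identifies the device with the highest assumed proximity score; a real system might combine several metadata sources or arbitrate incrementally as reports arrive.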
Abstract:
Methods and systems for managing multiple tasks using a dialog are presented. In some embodiments, a processor may parse a first natural language user input received at a user device to extract task-related information from the input. In response to identifying that the first natural language user input comprises a request to perform a first task, the processor may initiate execution of the first task. After execution of the first task has been initiated, the user device may receive a second natural language user input that requests execution of a second task. The processor may initiate execution of the second task before execution of the first task is complete.
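The overlapping execution described above might be sketched with Python's `asyncio`, purely as an illustration. Natural language parsing is omitted; the task names, durations, and the helper names `run_task` and `dialog` are assumptions, not part of the described system.

```python
import asyncio
from typing import List


async def run_task(name: str, duration: float, log: List[str]) -> None:
    """Simulate a task that takes `duration` seconds, logging start and finish."""
    log.append(f"start {name}")
    await asyncio.sleep(duration)
    log.append(f"finish {name}")


async def dialog(log: List[str]) -> None:
    # First user input: a request for a long-running task.
    first = asyncio.create_task(run_task("first", 0.05, log))
    await asyncio.sleep(0)  # yield once so the first task actually begins

    # Second user input arrives while the first task is still running.
    second = asyncio.create_task(run_task("second", 0.01, log))

    # Both tasks proceed concurrently; wait for both to complete.
    await asyncio.gather(first, second)


log: List[str] = []
asyncio.run(dialog(log))
```

After the run, `log` shows the second task starting before the first one finishes, which is the behavior the abstract describes.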
Abstract:
Techniques for generating summaries and action items associated with speech are described. Disclosed are techniques for presenting a first audio signal including a portion of a first audio stream at a loudspeaker, identifying data representing an audiolink associated with the first audio stream, and determining data representing a cue and data representing a second audio stream associated with the audiolink. A second audio signal including the cue may be presented, and a third audio signal including a portion of the second audio stream may be presented at the loudspeaker.
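The presentation order described above (first stream, then cue, then second stream) can be sketched as a simple queue builder. The abstract does not define how an audiolink is represented, so the `Audiolink` record, the lookup table, and the string labels standing in for audio data are all assumptions.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Audiolink:
    """Hypothetical record tying a source stream to a cue and a target stream."""
    cue: str            # label of the cue sound announcing the link
    target_stream: str  # label of the second audio stream


def build_playback_queue(first_stream: str,
                         audiolinks: Dict[str, Audiolink]) -> List[str]:
    """Return the ordered audio segments to present at the loudspeaker."""
    queue = [first_stream]           # first audio signal
    link = audiolinks.get(first_stream)
    if link is not None:
        queue.append(link.cue)            # second audio signal: the cue
        queue.append(link.target_stream)  # third audio signal: the linked stream
    return queue


links = {"news_intro": Audiolink(cue="chime", target_stream="weather_report")}
queue = build_playback_queue("news_intro", links)
```

A stream with no associated audiolink simply plays on its own in this sketch.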
Abstract:
A graphical user interface allows a speech recognition system user to browse available grammars and their topics. A dialog box interface displays the currently active grammar, the grammar searching mode, and the current input. A list of valid phrases, each of at least one word, is also generated and displayed. Using the interface, a user may additionally select an active grammar and a method of searching and displaying valid examples from the grammar based on the current input.
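The phrase lookup behind such an interface could be sketched as follows. The grammar contents, the two search modes (`"prefix"` and `"contains"`), and the function name `valid_phrases` are assumptions for illustration; the abstract does not name the available searching modes.

```python
from typing import Dict, List

# Hypothetical grammars, keyed by name, each listing its valid phrases.
GRAMMARS: Dict[str, List[str]] = {
    "navigation": ["go back", "go forward", "open file", "close window"],
    "dictation": ["new paragraph", "new line", "delete that"],
}


def valid_phrases(grammar_name: str, current_input: str,
                  mode: str = "prefix") -> List[str]:
    """List phrases from the active grammar that match the current input."""
    phrases = GRAMMARS[grammar_name]
    needle = current_input.lower().strip()
    if mode == "prefix":
        matches = [p for p in phrases if p.startswith(needle)]
    elif mode == "contains":
        matches = [p for p in phrases if needle in p]
    else:
        raise ValueError(f"unknown search mode: {mode}")
    return sorted(matches)


prefix_hits = valid_phrases("navigation", "go")
contains_hits = valid_phrases("navigation", "window", mode="contains")
```

Switching `grammar_name` corresponds to selecting a different active grammar, and `mode` corresponds to choosing how examples are searched and displayed.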
Abstract:
A voice recognition system includes a microphone for receiving speech from a user and processing electronics. The processing electronics are in communication with the microphone and are configured to use a plurality of rules to evaluate user interactions with the voice recognition system. The processing electronics automatically determine and set an expertise level in response to and based on the evaluation. The processing electronics are configured to automatically adjust at least one setting of the voice recognition system in response to the set expertise level.
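A rule-based expertise evaluation of this kind might look like the sketch below. The specific rules, thresholds, level names, and the prompt-style setting are all assumptions; the abstract only states that a plurality of rules is evaluated and that at least one setting is adjusted in response.

```python
from dataclasses import dataclass


@dataclass
class InteractionStats:
    """Hypothetical summary of a user's interactions with the system."""
    commands_issued: int
    recognition_errors: int
    help_requests: int


# Each rule returns True when the interaction history suggests an experienced user.
RULES = [
    lambda s: s.commands_issued >= 20,
    lambda s: s.commands_issued > 0
    and s.recognition_errors / s.commands_issued < 0.1,
    lambda s: s.help_requests == 0,
]

# Assumed setting adjusted per level: how verbose the system's prompts are.
PROMPT_STYLE = {"novice": "verbose", "intermediate": "standard", "expert": "terse"}


def expertise_level(stats: InteractionStats) -> str:
    """Map the number of satisfied rules to an expertise level."""
    score = sum(1 for rule in RULES if rule(stats))
    if score == len(RULES):
        return "expert"
    if score == len(RULES) - 1:
        return "intermediate"
    return "novice"


def prompt_style_for(stats: InteractionStats) -> str:
    """Pick the prompt style automatically from the evaluated level."""
    return PROMPT_STYLE[expertise_level(stats)]


style = prompt_style_for(InteractionStats(30, 1, 0))
```

In this sketch a frequent, accurate user who never asks for help is classified as an expert and receives terse prompts.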