摘要:
A numbering scheme is disclosed for implementation in the context of an application display. A user is able to select an item on the display by speaking a number corresponding to a desired control item. In some cases, the screen can include so many numbers that the user loses context and is unable to identify which number they want to select. For this reason, in one embodiment, a temporal switching mechanism is implemented wherein periodic switches (e.g., second-long intervals) occur between showing numbered items and showing a non-numbered screen. In one embodiment, an optional secondary confirmation step is implemented wherein the user sees only the item they just selected and has the chance to (a) learn the programmatic name of the item they selected and/or (b) either confirm and proceed with their selection, or cancel. In one embodiment, the optional secondary confirmation step is omitted if the user speaks a number followed by a predetermined command word.
摘要:
A system and method for positioning a software User Interface (UI) window on a display screen is provided, wherein the method includes displaying the software UI window on the display screen and identifying at least one suitable location on the display screen responsive to an active target window area of a target application UI window. The method further includes determining whether the software UI window is disposed at the at least one suitable location on the display screen and if the software UI window is disposed in a location other than the at least one suitable location on the display screen, positioning the software UI window at the at least one suitable location on the display screen.
摘要:
A portable wireless-enabled system includes an input transducer (for example, a microphone), an output transducer (for example, a speaker) and a wireless transceiver system. Also included, is a memory having a programmable user speech profile. A processor system controls operation of the input transducer, the output transducer, the wireless transceiver system and the memory.
摘要:
A method of interacting with a speech recognition (SR)-enabled personal computer (PC) is provided in which a user SR profile is transferred from a wireless-enabled device to the SR-enabled PC. Interaction with SR applications, on the SR-enabled PC, is carried out by transmitting speech signals wirelessly to the SR-enabled PC. The transmitted speech signals are recognized with the help of the transferred user SR profile.
摘要:
Upon selection of a displayed word, a list of alternatives for the selected word is displayed. Each alternative in the list has an associated symbol. A speech signal is then decoded to identify a list of possible words and the list of possible words is displayed with each possible word having an associated symbol.
摘要:
Semantically distinct items are extracted from a single utterance by repeatedly recognizing the same utterance using constraints provided by semantic items already recognized. User feedback for selection or correction of partially recognized utterance may be used in a hierarchical, multi-modal, or single step manner. An accuracy of recognition is preserved while the less structured and more natural single utterance recognition form is allowed to be used.
摘要:
Semantically distinct items are extracted from a single utterance by repeatedly recognizing the same utterance using constraints provided by semantic items already recognized. User feedback for selection or correction of partially recognized utterance may be used in a hierarchical, multi-modal, or single step manner. An accuracy of recognition is preserved while the less structured and more natural single utterance recognition form is allowed to be used.
摘要:
A method, program storage device and mobile device provide speech disambiguation. Audio for speech recognition processing is transmitted by the mobile device. Results representing alternates identified to match the transmitted audio are received. The alternates are displayed in a disambiguation dialog screen for making corrections to the alternates. Corrections are made to the alternates using the disambiguation dialog screen until a correct result is displayed. The correct result is selected. Content associated with the selected correct result is received in parallel with the receiving of the results representing alternates identified to match the transmitted audio.
摘要:
A method, program storage device and mobile device provide speech disambiguation. Audio for speech recognition processing is transmitted by the mobile device. Results representing alternates identified to match the transmitted audio are received. The alternates are displayed in a disambiguation dialog screen for making corrections to the alternates. Corrections are made to the alternates using the disambiguation dialog screen until a correct result is displayed. The correct result is selected. Content associated with the selected correct result is received in parallel with the receiving of the results representing alternates identified to match the transmitted audio.
摘要:
The present invention combines speech recognition tutorial training with speech recognizer voice training. The system prompts the user for speech data and simulates, with predefined screenshots, what happens when speech commands are received. At each step in the tutorial process, when the user is prompted for an input, the system is configured such that only a predefined set (which may be one) of user inputs will be recognized by the speech recognizer. When a successful recognition is being made, the speech data is used to train the speech recognition system.