摘要:
Input context for a statistical dialog manager may be provided. Upon receiving a spoken query from a user, the query may be categorized according to at least one context clue. The spoken query may then be converted to text according to a statistical dialog manager associated with the category of the query and a response to the spoken query may be provided to the user.
摘要:
A method and system for manipulating characters displayed on a display screen is provided wherein the method includes identifying a selected word, wherein the selected word includes at least one character to be modified. The method further includes correlating each of the at least one character with a unique numerical value and receiving a selection command and a modification command, wherein the selection command is the unique numerical value corresponding to a selected character. Furthermore, the method includes modifying the selected character responsive to the modification command to generate a modified word.
摘要:
A system and method for creating a mnemonics Language Model for use with a speech recognition software application, wherein the method includes generating an n-gram Language Model containing a predefined large body of characters, wherein the n-gram Language Model includes at least one character from the predefined large body of characters, constructing a new language Model (LM) token for each of the at least one character, extracting pronunciations for each of the at least one character responsive to a predefined pronunciation dictionary to obtain a character pronunciation representation, creating at least one alternative pronunciation for each of the at least one character responsive to the character pronunciation representation to create an alternative pronunciation dictionary and compiling the n-gram Language Model for use with the speech recognition software application, wherein compiling the Language Model is responsive to the new Language Model token and the alternative pronunciation dictionary.
摘要:
A system and method for use in computing systems that employ speech recognition capabilities is provided. Where recognized speech can be dictation and commands, one or more buttons may be used to change modes of said computing systems to accept spoken words as dictation, or to accept spoken words as commands, as well as activate a microphone used for the speech recognition. The change in mode may occur responsive to the manner in which a button is pressed, where the manner may include such depressions as taps, press and holds, thumbwheel slides, and other forms of button manipulation.
摘要:
The present invention provides an application-independent and engine-independent middleware layer between applications and engines. The middleware provides speech-related services to both applications and engines, thereby making it far easier for application vendors and engine vendors to bring their technology to consumers.
摘要:
Upon selection of a displayed word, a list of alternatives for the selected word is displayed. Each alternative in the list has an associated symbol. A speech signal is then decoded to identify a list of possible words and the list of possible words is displayed with each possible word having an associated symbol.
摘要:
The present invention relates to interfacing with electronic ink. Ink is stored in a data structure that permits later retrieval by applications. The ink includes stroke information and may include property information. Through various programming interfaces, one may interact with the ink through methods and setting or retrieving properties. Other objects and collections may be used as well in conjunction with the ink objects.
摘要:
The present invention provides a method and computer-readable medium for searching for programming objects on a computer system. Under one aspect of the invention, optional search attributes are used to order a list of references to found programming objects. Under a second aspect of the invention, object attributes that are stored outside of a static attribute storage area are inspected during the search for programming objects. Under a third aspect of the invention, different sets of object data are allowed to reference the same programming object class, and different objects of a single programming object class may be initialized in different ways so that they exhibit different attributes.
摘要:
The present invention provides an object model that exposes speech-related functionality to applications that target a managed code environment. In one embodiment, the object model and associated interfaces are implemented consistently with other non-speech related interfaces supported across a platform.
摘要:
The present invention provides a method and computer-readable medium for searching for programming objects on a computer system. Under the invention, object attributes that are stored outside of a static attribute storage area are inspected during the search for programming objects.