摘要:
A method of correcting speech recognition mode errors in a document is disclosed. A computer-readable medium having computer-executable instructions for correcting speech recognition mode errors in a document is also disclosed. Further, an apparatus for correcting speech recognition mode errors in a document is disclosed.
摘要:
A computer-implemented method for providing a candidate list of alternatives for a text selection containing text from multiple input sources, each of which can be stochastic (such as a speech recognition unit, handwriting recognition unit, or input method editor) or non-stochastic (such as a keyboard and mouse). A text component of the text selection may be the result of data processed through a series of stochastic input sources, such as speech input that is converted to text by a speech recognition unit before being used as input into an input method editor. To determine alternatives for the text selection, a stochastic input combiner parses the text selection into text components from different input sources. For each stochastic text component, the combiner retrieves a stochastic model containing alternatives for the text component. If the stochastic text component is the result of a series of stochastic input sources, the combiner derives a stochastic model that accurately reflects the probabilities of the results of the entire series. The combiner creates a list of alternatives for the text selection by combining the stochastic models retrieved. The combiner may revise the list of alternatives by applying natural language principles to the text selection as a whole. The list of alternatives for the text selection is then presented to the user. If the user chooses one of the alternatives, then the word processor replaces the text selection with the chosen candidate.
摘要:
A computer-implemented method for providing a candidate list of alternatives for a text selection containing text from multiple input sources, each of which can be stochastic (such as a speech recognition unit, handwriting recognition unit, or input method editor) or non-stochastic (such as a keyboard and mouse). A text component of the text selection may be the result of data processed through a series of stochastic input sources, such as speech input that is converted to text by a speech recognition unit before being used as input into an input method editor. To determine alternatives for the text selection, a stochastic input combiner parses the text selection into text components from different input sources. For each stochastic text component, the combiner retrieves a stochastic model containing alternatives for the text component. If the stochastic text component is the result of a series of stochastic input sources, the combiner derives a stochastic model that accurately reflects the probabilities of the results of the entire series. The combiner creates a list of alternatives for the text selection by combining the stochastic models retrieved. The combiner may revise the list of alternatives by applying natural language principles to the text selection as a whole. The list of alternatives for the text selection is then presented to the user. If the user chooses one of the alternatives, then the word processor replaces the text selection with the chosen candidate.
摘要:
A background audio recovery system displays an inactive status indicator for a speech recognition program module in an application program. To prevent losses of dictated speech when a speech recognition program module is inadvertently assigned to an inactive mode, the background audio recovery system determines whether an audio input device is receiving audio. If audio is being received by the audio input device, the background audio recovery system stores the audio data for later retrieval by the user. When a user issues a command to activate the speech recognition program module, the background audio recovery system initiates a background audio program module for manipulating the stored audio data that was recorded while the speech recognition program module was assigned to an inactive mode.
摘要:
A multi-source input and playback utility that accepts inputs from various sources, transcribes the inputs as text, and plays aloud user-selected portions of the text is disclosed. The user may select a portion of the text and request audio playback thereof. The utility examines each transcribed word in the selected text. If stored audio data is associated with a given word, that audio data is retrieved and played. If no audio data is associated, then a textto-speech entry or series of entries is retrieved and played instead.
摘要:
Systems and methods are provided for delivering customized versions of web pages to users. In one implementation, a method is provided for customizing a delivered version of a web page to reflect a current time-of-day at a geographic location of the user. According to the method, a request for a web page is received from a client device of the user. The request for the web page includes an IP address of the client device. Based on the IP address, a current time is determined for the received request. Thereafter, a version of the requested web page corresponding to the current time is generated, and the generated version of the requested web page is delivered to the client device.
摘要:
A fact repository stores objects. Each object includes a collection of facts, where a fact comprises an attribute and a value. A set of objects from the fact repository are designated for analysis. The presentation engine presents the facts of the objects in a user interface (UI) having a table. Through manipulation of the UI, an end-user can add or remove facts from the table, and sort the table based on the values of particular facts. The presentation engine also presents the facts of the objects in a UI having a graph. Through manipulation of the UI, the end-user can add or remove facts from the graph, and can sort the facts shown in the graph based on values that are shown, or not shown, in the graph. The presentation engine can further present the facts of the objects in UIs including maps and timelines.
摘要:
Disclosed herein is a method, a system and a computer product for generating a snippet for an entity, wherein each snippet comprises a plurality of sentiments about the entity. One or more textual reviews associated with the entity is selected. A plurality of sentiment phrases are identified based on the one or more textual reviews, wherein each sentiment phrase comprises a sentiment about the entity. One or more sentiment phrases from the plurality of sentiment phrases are selected to generate a snippet.
摘要:
A system and method for providing namespace related information. A namespace library operating in a computer provides a central source of namespace related information for handling XML documents. The namespace related information may be used by other computer application programs operating in the computer. The namespace related information provided by the namespace library is indexed by namespace. Many types of namespace related information may be associated with each namespace. The computer application programs may obtain namespace related information by querying the namespace library using a particular namespace.
摘要:
A system and method for generating a user interface for a speech recognition program module which provides user feedback by inserting a place mark or bar into the text of the document at the insertion point. The place mark indicates to the user that the speech recognition program module has recorded the dictated speech string and is in the process of translating the speech string. The place mark consists of a string of characters, such as a string of ellipses. The place mark has a length that is proportional in length to the expected length of the text that the user has dictated. The length of the place mark is based on the elapsed time of the speech string dictated by the user. When the speech recognition engine has completed the translation of the speech string into text, the final text replaces the place mark in the document. The place mark may be highlighted in different colors or the characters rendered in different colors to indicate to the user the volume level of the speech string being translated.