摘要:
Systems and methods for analyzing the content of audio/video files using speech recognition and data mining technologies are provided. As it can generally be assumed that a user's interest is highly correlated with an audio/video clip or television program the user may be watching, methods and systems for utilizing the results of speech recognition and data mining technology implementation to retrieve relevant advertising content for display are also provided.
摘要:
A mechanism for recognizing and inputting handwritten mathematical expressions into a computer by providing part of a multi-path framework is described. The part of the multi-path framework includes a subscript/superscript analysis and character determination component that is designed to identify subscript and superscript elements. A method for analyzing a handwritten mathematical expression includes receiving a symbols corresponding to handwritten mathematical expression input strokes, identifying subscript and/or superscript structures, and determining a character for each symbol of the set. A graph of vertexes and edges may be created based upon the set of symbols and the graph may be searched to determine optimized candidates.
摘要:
A mechanism for recognizing and inputting handwritten mathematical expressions into a computer by providing part of a multi-path framework is described. The part of the multi-path framework includes a subscript/superscript analysis and character determination component that is designed to identify subscript and superscript elements. A method for analyzing a handwritten mathematical expression includes receiving a symbols corresponding to handwritten mathematical expression input strokes, identifying subscript and/or superscript structures, and determining a character for each symbol of the set. A graph of vertexes and edges may be created based upon the set of symbols and the graph may be searched to determine optimized candidates.
摘要:
A mechanism for recognizing and inputting handwritten mathematical expressions into a computer by providing a multi-path framework is described. The framework may include symbol grouping and recognition, tabular structure analysis, subordinate sub-expression analysis, subscript/superscript analysis and character determination, and semantic structure analysis components. A method for recognizing a handwritten mathematical expression includes receiving a plurality of input strokes corresponding to a handwritten mathematical expression and providing a candidate list of recognized candidate expressions based upon the input strokes. Input strokes are grouped into symbols, tabular structures are determined, dominant symbol candidates and subordinate symbols are determined, and subscript and superscript structures are determined.
摘要:
Systems, methods, and computer-readable media for processing electronic ink receive an electronic ink input; convert the electronic ink input to a first machine-generated object using handwriting recognition; display the first machine-generated object on a display; receive speech input; convert the speech input to a second machine-generated object using speech recognition; generate a list of machine-generated objects based on the electronic ink input, the list including the first machine-generated object and alternative machine-generated objects and functioning as a dictionary for converting the speech input; and replace the first machine-generated object with the second machine-generated object. The machine-generated objects may correspond to words, lines, and/or other groupings of machine-generated text. A user may confirm that the second machine-generated object should replace the first machine-generated object and the system will perform the replacement. The systems and methods may generate a list of alternative machine-generated object candidates to the first machine-generated object based on handwriting recognition of the electronic ink input alone or in combination with a statistical language model.
摘要:
A mechanism for recognizing and inputting handwritten mathematical expressions into a computer by providing a multi-path framework is described. The framework may include symbol grouping and recognition, tabular structure analysis, subordinate sub-expression analysis, subscript/superscript analysis and character determination, and semantic structure analysis components. A method for recognizing a handwritten mathematical expression includes receiving a plurality of input strokes corresponding to a handwritten mathematical expression and providing a candidate list of recognized candidate expressions based upon the input strokes. Input strokes are grouped into symbols, tabular structures are determined, dominant symbol candidates and subordinate symbols are determined, and subscript and superscript structures are determined.
摘要:
Systems, methods, and computer-readable media for processing electronic ink receive an electronic ink input; convert the electronic ink input to a first machine-generated object using handwriting recognition; display the first machine-generated object on a display; receive speech input; convert the speech input to a second machine-generated object using speech recognition; generate a list of machine-generated objects based on the electronic ink input, the list including the first machine-generated object and alternative machine-generated objects and functioning as a dictionary for converting the speech input; and replace the first machine-generated object with the second machine-generated object. A user may confirm that the second machine-generated object should replace the first machine-generated object. The systems and methods may generate a list of alternative machine-generated object candidates to the first machine-generated object based on handwriting recognition of the electronic ink input alone or in combination with a statistical language model.
摘要:
An opinion system infers the opinion of a sentence of a product review based on a probability that the sentence contains certain sequences of parts of speech that are commonly used to express an opinion as indicated by the training data and the probabilities of the training data. When provided with the sentence, the opinion system identifies possible sequences of parts of speech of the sentence that are commonly used to express an opinion and the probability that the sequence is the correct sequence for the sentence. For each sequence, the opinion system then retrieves a probability derived from the training data that the sequence contains an opinion word that expresses an opinion. The opinion system then retrieves a probability from the training data that the opinion words of the sentence are used to express an opinion. The opinion system then combines the probabilities to generate an overall probability that the sentence with that sequence expresses an opinion.
摘要:
An opinion system infers the opinion of a sentence of a product review based on a probability that the sentence contains certain sequences of parts of speech that are commonly used to express an opinion as indicated by the training data and the probabilities of the training data. When provided with the sentence, the opinion system identifies possible sequences of parts of speech of the sentence that are commonly used to express an opinion and the probability that the sequence is the correct sequence for the sentence. For each sequence, the opinion system then retrieves a probability derived from the training data that the sequence contains an opinion word that expresses an opinion. The opinion system then retrieves a probability from the training data that the opinion words of the sentence are used to express an opinion. The opinion system then combines the probabilities to generate an overall probability that the sentence with that sequence expresses an opinion.
摘要:
Measurement of Kullback-Leibler Divergence (KLD) between hidden Markov models (HMM) of acoustic units utilizes an unscented transform to approximate KLD between Gaussian mixtures. Dynamic programming equalizes the number of states between HMMs having a different number of states, while the total KLD of the HMMs is obtained by summing individual KLDs calculated by state pair by state pair comparisons.