摘要:
The invention performs handwriting recognition using mixtures of Bayesian networks. A mixture of Bayesian networks (MBN) consists of plural hypothesis-specific Bayesian networks (HSBNs) having possibly hidden and observed variables. A common external hidden variable is associated with the MBN, but is not included in any of the HSBNs. Each HSBN models the world under the hypothesis that the common external hidden variable is in a corresponding one of its states. The MBNs encode the probabilities of observing the sets of visual observations corresponding to a handwritten character. Each of the HSBNs encodes the probabilities of observing the sets of visual observations corresponding to a handwritten character and given a hidden common variable being in a particular state.
摘要:
The present invention leverages curve fitting data techniques to provide automatic detection of data anomalies in a “data tube” from a data perspective, allowing, for example, detection of data anomalies such as on-screen, drill down, and drill across data anomalies in, for example, pivot tables and/or OLAP cubes. It determines if data substantially deviates from a predicted value established by a curve fitting process such as, for example, a piece-wise linear function applied to the data tube. A threshold value can also be employed by the present invention to facilitate in determining a degree of deviation necessary before a data value is considered anomalous. The threshold value can be supplied dynamically and/or statically by a system and/or a user via a user interface. Additionally, the present invention provides an indication to a user of the type and location of a detected anomaly from a top level data perspective.
摘要:
Generation of a model for raw variables from a model for cooked variables. In one embodiment, a first data model for a plurality of cooked transactional variables is input. The cooked transactional variables have been abstracted from raw transactional variables, where the latter variables are based on a data set comprising a plurality of records, each record having a value for each raw transactional variables. A type of the first model is determined, and a second data model, for the plurality of raw transactional variables, is generated based on the first data model and the type of the first data model. The second data model is then output.
摘要:
Automated methods and apparatus for synchronizing audio and text data, e.g., in the form of electronic files, representing audio and text expressions of the same work or information are described. Also described are automated methods of detecting errors and other discrepancies between the audio and text versions of the same work. A speech recognition operation is performed on the audio data initially using a speaker independent acoustic model. The recognized text in addition to audio time stamps are produced by the speech recognition operation. The recognized text is compared to the text in text data to identify correctly recognized words. The acoustic model is then retrained using the correctly recognized text and corresponding audio segments from the audio data transforming the initial acoustic model into a speaker trained acoustic model. The retrained acoustic model is then used to perform an additional speech recognition operation on the audio data. The audio and text data are synchronized using the results of the updated acoustic model. In addition, one or more error reports based on the final recognition results are generated showing discrepancies between the recognized words and the words included in the text. By retraining the acoustic model in the above described manner, improved accuracy is achieved.
摘要:
A general event composing and monitoring system that allows high-level events to be created from combinations of low-level events. An event specification tool allows for rapid development of a general event processor that creates high-level events from combinations of user actions. The event system, in combination with a reasoning system, is able to monitor and perform inference about several classes of events for a variety of purposes. The various classes of events include the current context, the state of key data structures in a program, general sequences of user inputs, including actions with a mouse-controlled cursor while interacting with a graphical user interface, words typed in free-text queries for assistance, visual information about users, such as gaze and gesture information, and speech information. Additionally, a method is provided for building an intelligent user interface system by constructing a reasoning model to compute the probability of alternative user's intentions, goals, or informational needs through analysis of information about a user's actions, program state, and words. The intelligent user interface system monitors user interaction with a software application and applies probabilistic reasoning to sense that the user may need assistance in using a particular feature or to accomplish a specific task. The intelligent user interface also accepts a free-text query from the user asking for help and combines the inference analysis of user actions and program state with an inference analysis of the free-text query. The inference system accesses a rich, updatable user profile system to continually check for competencies and changes assistance that is given based on user competence.
摘要:
An event composing and monitoring system that allows high-level events to be created from combinations of low-level events. An event specification tool, contained in the system, allows for rapidly developing a general event processor that creates high-level events from combinations of user actions. An event system, in combination with an inference system, monitors and infers, for various purposes, about several classes of events including: current program context; state of key data structures; user input sequences, including actions with a mouse-controlled cursor while interacting with a graphical user interface; words typed in free-text help queries; visual user information, such as gaze and gesture information; and user speech information. Additionally, an intelligent user interface is provided by constructing a reasoning model that computes probability of alternative user intentions, goals or information needs through analyzing information regarding program state, and that user's actions and free-text query words. Specifically, the interface monitors user interaction with a program and probabilistically reasons to sense that a user may need assistance in using a particular feature or to accomplish a specific task. This interface accepts a free-text help query from the user and combines the inference analysis of user actions and the program state with an inference analysis of the query. The inference system, using an updateable user profile, continually checks for user competencies and, based on such competencies, changes assistance that is offered.
摘要:
An improved method and system for performing case-based reasoning is provided. A belief network is utilized by the preferred case-based reasoning system for assisting a user in problem resolution. After resolving a problem of a user, the preferred embodiment of the present invention updates the probabilities in the belief network so as to provide for a more accurate problem resolution upon the next invocation of the preferred embodiment. The belief network of the preferred embodiment contains six data types relating to a problem resolution scenario. The data types utilized by the belief network of the preferred embodiment include: issues, causes, resolutions, symptoms, terms, and alternates.
摘要:
The disclosed system provides an improved collaborative filtering system by utilizing a belief network, which is sometimes known as a Bayesian network. The disclosed system learns a belief network using both prior knowledge obtained from an expert in a given field of decision making and a database containing empirical data obtained from many people. The empirical data contains attributes of users as well as their preferences in the field of decision making. After initially learning the belief network, the belief network is relearned at various intervals when additional attributes are identified as having a causal effect on the preferences and data for these additional attributes can be gathered. This relearning allows the belief network to improve its accuracy at predicting preferences of a user. Upon each iteration of relearning, a cluster model is automatically generated that best predicts the data in the database. After relearning the belief network a number of times, the belief network is used to predict the preferences of a user using probabilistic inference. In performing probabilistic inference, the known attributes of a user are received and the belief network is accessed to determine the probability of the unknown preferences of the user given the known attributes. Based on these probabilities, the preference most likely to be desired by the user can be predicted.
摘要:
An improved free text query method and system is provided as part of an improved on-line help system. In a preferred embodiment of the present invention, the on-line help system provides a free text query system that performs partial analysis. The partial analysis performed by the preferred embodiment includes identifying keywords within input provided by the user, performing disambiguation analysis, performing definiteness analysis, performing capitalization analysis, and generating a ranked list of candidates according to a probability associated with each candidate. In addition, the preferred embodiment of the present invention is internationalizable. That is, the present invention is easily ported between different languages.
摘要:
Shift invariant predictors are described herein. By way of example, a system for predicting binding information relating to a binding of a protein and a ligand can include a trained binding model and a prediction component. The trained binding model can include a hidden variable representing an unknown alignment of the ligand at a binding site of the protein. The prediction component can be configured to predict the binding information by employing information about the protein's sequence, the ligand's sequence and the trained binding model.