摘要:
In one embodiment, an event impact signature detector may analyze a time series with external events. A data interface 250 may receive a data set 310 representing the time series with external events. A processor 220 may fit the data set 310 into a baseline time series model 330. The processor 220 may iteratively determine each event location 352 for multiple external events 350 affecting the baseline time series model 330. The processor 220 may iteratively solve for each event impact 354 of the multiple external events 350 factoring in interactions between the multiple external events 350.
摘要:
In one embodiment, an event impact signature detector may analyze a time series with external events. A data interface 250 may receive a data set 310 representing the time series with external events. A processor 220 may fit the data set 310 into a baseline time series model 330. The processor 220 may iteratively determine each event location 352 for multiple external events 350 affecting the baseline time series model 330. The processor 220 may iteratively solve for each event impact 354 of the multiple external events 350 factoring in interactions between the multiple external events 350.
摘要:
A multi-modal search system (and corresponding methodology) is provided. The system employs text, speech, touch and gesture input to establish a search query. Additionally, a subset of the modalities can be used to obtain search results based upon exact or approximate matches to a search result. For example, wildcards, which can either be triggered by the user or inferred by the system, can be employed in the search.
摘要:
The present invention utilizes a cross-prediction scheme to predict values of discrete and continuous time observation data, wherein conditional variance of each continuous time tube variable is fixed to a small positive value. By allowing cross-predictions in an ARMA based model, values of continuous and discrete observations in a time series are accurately predicted. The present invention accomplishes this by extending an ARMA model such that a first time series “tube” is utilized to facilitate or “cross-predict” values in a second time series tube to form an “ARMAxp” model. In general, in the ARMAxp model, the distribution of each continuous variable is a decision graph having splits only on discrete variables and having linear regressions with continuous regressors at all leaves, and the distribution of each discrete variable is a decision graph having splits only on discrete variables and having additional distributions at all leaves.
摘要:
The present invention utilizes a discriminative density model selection method to provide an optimized density model subset employable in constructing a classifier. By allowing multiple alternative density models to be considered for each class in a multi-class classification system and then developing an optimal configuration comprised of a single density model for each class, the classifier can be tuned to exhibit a desired characteristic such as, for example, high classification accuracy, low cost, and/or a balance of both. In one instance of the present invention, error graph, junction tree, and min-sum propagation algorithms are utilized to obtain an optimization from discriminatively selected density models.
摘要:
The present invention utilizes generic and user-specific features of handwriting samples to provide adaptive handwriting recognition with a minimum level of user-specific enrollment data. By allowing generic and user-specific classifiers to facilitate in a recognition process, the features of a specific user's handwriting can be exploited to quickly ascertain characteristics of handwriting characters not yet entered by the user. Thus, new characters can be recognized without requiring a user to first enter that character as enrollment or “training” data. In one instance of the present invention, processing of generic features is accomplished by a generic classifier trained on multiple users. In another instance of the present invention, a user-specific classifier is employed to modify a generic classifier's classification as required to provide user-specific handwriting recognition.
摘要:
Match criteria are provided to specify when advertisements will be shown, for instance in a search environment. Input such as search queries can be represented in a simplified form such as an implicit and/or explicit wildcard expression. Advertisers or other entities can bid on terms such that advertisements or similar content are presented when the terms match an expansion of a simplified input. Matching ads can subsequently be displayed alone or in combination with query expansion suggestions and/or query results.
摘要:
Determining the near-optimal block size for incremental-type expectation maximization (EM) algorithms is disclosed. Block size is determined based on the novel insight that the speed increase resulting from using an incremental-type EM algorithm as opposed to the standard EM algorithm is roughly the same for a given range of block sizes. Furthermore, this block size can be determined by an initial version of the EM algorithm that does not reach convergence. For a current block size, the speed increase is determined, and if the speed increase is the greatest determined so far, the current block size is set as the target block size. This process is repeated for new block sizes, until no new block sizes can be determined.
摘要:
One aspect of the invention is the construction of mixtures of Bayesian networks. Another aspect of the invention is the use of such mixtures of Bayesian networks to perform inferencing. A mixture of Bayesian networks (MBN) consists of plural hypothesis-specific Bayesian networks (HSBNs) having possibly hidden and observed variables. A common external hidden variable is associated with the MBN, but is not included in any of the HSBNs. The number of HSBNs in the MBN corresponds to the number of states of the common external hidden variable, and each HSBN is based upon the hypothesis that the common external hidden variable is in a corresponding one of those states. In one mode of the invention, the MBN having the highest MBN score is selected for use in performing inferencing. In another mode of the invention, some or all of the MBNs are retained as a collection of MBNs which perform inferencing in parallel, their outputs being weighted in accordance with the corresponding MBN scores and the MBN collection output being the weighted sum of all the MBN outputs. In one application of the invention, collaborative filtering may be performed by defining the observed variables to be choices made among a sample of users and the hidden variables to be the preferences of those users.
摘要:
The subject invention leverages standard probabilistic inference techniques to determine a log-likelihood for a conditional Gaussian graphical model of a data set with at least one continuous variable and with data not observed for at least one of the variables. This provides an efficient means to compute gradients for CG models with continuous variables and incomplete data observations. The subject invention allows gradient-based optimization processes to employ gradients to iteratively adapt parameters of models in order to improve incomplete data log-likelihoods and identify maximum likelihood estimates (MLE) and/or local maxima of the incomplete data log-likelihoods. Conditional Gaussian local gradients along with conditional multinomial local gradients determined by the subject invention can be utilized to facilitate in providing parameter gradients for full conditional Gaussian models.