Abstract:
Some embodiments are directed to a system for identifying clusters from a plurality of users using cloud services. A behavior collection module is configured to obtain user preferences for the plurality of users, and an EM module to configured estimate at least one parameter of a distance-based model by the Expectation-Maximization (EM) algorithm for various values of G (number of clusters). A selection module is configured to compute Bayesian Information Criteria (BIC) with the at least one estimated parameter obtained from the EM module for various values of G, compare BICs obtained for various values of G, select the model with the highest BIC as the best model (best model including the plurality of clusters) and use estimated latent variables of the best model to build a classifier. A characterization module is configured to classify each user into a cluster of the best model using the classifier, and to determine ranking preference of each cluster.
Abstract:
Disclosed are the embodiments for creating a model capable of identifying one or more clusters in a financial data. An input is received pertaining to a range of numbers. Each number in the range of numbers is representative of a number of clusters in the financial data. For a cluster, one or more first parameters of a distribution associated with the cluster are estimated. Thereafter, a threshold value is determined based on the one or more first parameters. An inverse cumulative distribution of each of one or more n-dimensional variables in the financial data is determined. The one or more first parameters are updated to generate one or more second parameters based on the estimated inverse cumulative distribution. A model is created for each number in the range of numbers based on the one or more second parameters.
Abstract:
The disclosed embodiments illustrate methods and systems for scheduling a batch of tasks on one or more crowdsourcing platforms. The method includes generating one or more forecast models for each of the one or more crowdsourcing platforms based on historical data associated with each of the one or more crowdsourcing platforms and a robustness parameter. Thereafter, for a forecast model, from the one or more forecast models, associated with each of the one or more crowdsourcing platforms, a schedule is generated based on the forecast model and one or more parameters associated with the batch of tasks. Further, the schedule is executed on each of the one or more forecasts models associated with the one or more crowdsourcing platforms to determine a performance score of the schedule on each of the one or more forecast models. Finally, the schedule is recommended to a requestor based on the performance score.
Abstract:
A method and a system for predicting admission of a human subject to a first ward in a medical center are disclosed. A patient dataset is generated based on at least a measure of one or more physiological parameters associated with one or more first human subjects and a first information pertaining to the admission of each of the one or more first human subjects to the first ward. For a first human subject of the one or more first human subjects, a first score at each of the one or more first time instants is determined. Further, one or more second time instants from the one or more first time instants are identified. Further, a second score at each of the one or more second time instants is determined. In an embodiment, the first classifier is trained based on at least the second score, and the first information.
Abstract:
A method and system for data classification using machine learning comprises collecting a dataset with a data collection module, receiving the dataset at a classification module configured for machine learning, dividing the dataset into a plurality of vectors, transforming the plurality of vectors into a plurality of variables wherein each variable is assigned a label, and classifying the variables.
Abstract:
A method and a system are provided for prediction of an outcome of a stroke event associated with a first human subject. The method receives a first score, one or more first observations, and one or more second observations associated with the first human subject. The method predicts one or more second scores at the second time instant based on a training of a probabilistic model. The method further selects a second score from the one or more second scores at the second time instant. The second score corresponds to the outcome of the stroke event associated with the first human subject. The second score corresponds to the highest value from the one or more second scores.
Abstract:
Methods and systems for creating one or more statistical classifiers. A first set of performance parameters, corresponding to the one or more applications and the one or more computing infrastructures, is extracted from a historical data pertaining to the execution of the one or more applications on the one or more computing infrastructures. Further, a set of application-specific and a set of infrastructure-specific parameters are selected, from the first set of performance parameters, based on one or more statistical techniques. A similarity between each pair of the applications, each pair of the computing infrastructures, and each pair of possible combinations of an application and a computing infrastructure is determined. One or more statistical classifiers are created, based on the determined similarity.