Abstract:
A system, method, and computer program product for automatically selecting from a plurality of analytic algorithms a best performing analytic algorithm to apply to a dataset is provided. The automatically selecting from the plurality of analytic algorithms the best performing analytic algorithm to apply to the dataset enables a training a plurality of analytic algorithms on a plurality of subsets of the dataset. Then, a corresponding prediction accuracy trend is estimated across the subsets for each of the plurality of analytic algorithms to produce a plurality of accuracy trends. Next, the best performing analytic algorithm is selected and outputted from the plurality of analytic algorithms based on the corresponding prediction accuracy trend with a highest value from the plurality of accuracy trends.
Abstract:
A method of meta-learning includes receiving a prediction objective, extracting a plurality of subsets of data from a distributed dataset, generating a plurality of local predictions, wherein each local prediction is based on a different subset of the plurality of subsets of data and the prediction objective, combining the plurality of local predictions, and generating a final prediction based on the combined local predictions.
Abstract:
A method of meta-learning includes receiving a prediction objective, extracting a plurality of subsets of data from a distributed dataset, generating a plurality of local predictions, wherein each local prediction is based on a different subset of the plurality of subsets of data and the prediction objective, combining the plurality of local predictions, and generating a final prediction based on the combined local predictions.
Abstract:
Techniques for scenario planning are provided. In one example, a computer-implemented method can comprise analyzing, by a device operatively coupled to a processor, content using a topic model. The content can be associated with a defined source and is related to one or more current events. The computer-implemented method can also comprise determining, by the device, one or more portions of the analyzed content that are relevant to one or more key risk drivers using a risk driver model. The computer-implemented method can also comprise aggregating, by the device, the determined one or more portions into one or more emerging storylines based on values of one or more attributes of the topic model.
Abstract:
A method of analyzing physiological data streams. According to the method, physiological data is received into a computerized machine. The physiological data comprises numerical data and medical symptoms of a patient. Features are extracted from the physiological data based on development of the physiological data over a period of time. The features are converted into a textual representation using natural language generation. Input terms for an information retrieval system operating on the computerized machine are automatically generated based on the features. The input terms are input to the information retrieval system. A corpus of data is automatically searched to retrieve results to the input terms using the information retrieval system.
Abstract:
Embodiments of the invention include method, systems and computer program products for providing ease-of-drive driving directions. The computer-implemented method includes receiving, by a processor, a request for a route from a starting point to a destination point. The processor calculates one or more routes from the starting point to the destination point. The processor scores the one or more calculated routes according to ease-of-drive driving criteria. The processor presents at least one of the scored calculated routes that are below a predetermined threshold.
Abstract:
A method of analyzing physiological data streams. According to the method, physiological data is received into a computerized machine. The physiological data comprises numerical data and medical symptoms of a patient. Features are extracted from the physiological data based on development of the physiological data over a period of time. The features are converted into a textual representation using natural language generation. Input terms for an information retrieval system operating on the computerized machine are automatically generated based on the features. The input terms are input to the information retrieval system. A corpus of data is automatically searched to retrieve results to the input terms using the information retrieval system.
Abstract:
The present invention describes a method and system for optimizing a test flow within each ATE (Automated Test Equipment) station. The test flow includes a plurality of test blocks. A test block includes a plurality of individual tests. A computing system schedule the test flow based one or more of: a test failure model, test block duration and a yield model. The failure model determines an order or sequence of the test blocks. There are at least two failure models: independent failure model and dependant failure model. The yield model describes whether a semiconductor chip is defective or not. Upon completing the scheduling, the ATE station conducts tests according to the scheduled test flow. The present invention can also be applied to software testing.
Abstract:
Embodiments relate to analyzing dataset. A method of analyzing data is provided. The method obtains a description of a dataset. The method automatically generates a plurality of analysis options from the description of the dataset. The method generates a plurality of queries based on the analysis options. The method deploys the queries on the dataset to build a plurality of statistical models from the dataset.
Abstract:
Embodiments for accurate temporal event predictive modeling by a processor. An average reverse event delay may be determined from one or more event delays in a time-series window. A time-series event may be predicted by applying the average reverse event delay in conjunction with one or more weighted factors in a predictive model.