Abstract:
A computer implemented method and computer program product for automatically building semantic associations within a database of unstructured information includes an algorithm for mapping data within the unstructured information and iteratively improving semantic labels for association with the data, until such point as associations pass a convergence test and then the semantic associations are made.
Abstract:
Access is obtained to call data records (or other data) for a plurality of mobile telephony users. The call data records (or other data) include location-time sequences. For each of the mobile telephony users, corresponding ones of the location-time sequences are segmented into trips; a home-work-school classification model is used to determine a home location and a work-school location, based on the trips; and a trip purpose classification model is used to assign each of the trips a purpose label, based at least on comparing at least one of a start location and an end location for each one of the trips to at least one of the home location and the work-school location determined using the home-work-school classification model. The trips are aggregated into purpose based origin-destination categories based at least on the purpose labels, time ranges of the location-time sequences, and the start and end locations.
Abstract:
A GPS-enabled cellular electronic device is operated in an indoor mode. An increase in strength of a cellular signal is detected at the GPS-enabled cellular electronic device. Responsive at least to the increase in cellular signal strength, the GPS-enabled cellular electronic device is transitioned to an outdoor testing mode. Detecting is carried out to determine whether movement of the GPS-enabled cellular electronic device occurs during the outdoor testing mode. If so, the GPS-enabled cellular electronic device is transitioned to an outdoor mode.
Abstract:
A cross descriptor learning system, method and program product therefor. The system extracts descriptors from unlabeled exemplars. For each unlabeled exemplar, a cross predictor uses each descriptor to generate labels for other descriptor. An automatic label generator also generates labels for the same unlabeled exemplars or, optionally, for labeled exemplars. A label predictor results for each descriptor by combining labels from the cross predictor with labels from the automatic label generator.
Abstract:
A system, method, and program product for tracking content are described. Aspects of invention allow bodies of content, whether from a common channel or from different channels, to be compared for relatedness. Comparison of different bodies of content involves analyzing both the actual content, characteristics of the source(s) of the content, and optionally, elapsed time between their respective broadcasts/communications. To this extent, a content similarity value, a source characteristic value and an optional temporal value for the portions of content are determined, and then used to compute a relatedness value of the (bodies of) content.
Abstract:
A method for implementing information quality assessment includes receiving a value relating to a predicted outcome of an event from each of an entity and at least one other entity and a reputation score for the entity and the other entity. The method also includes generating a collective prediction value for the event that reflects a collective prediction for the event. The collective prediction value is generated from predicted outcome values from the entity and the other entity. The collective prediction value is calculated as a function of reputation scores corresponding to the entity and the other entity.
Abstract:
Utility meter readings generated at low sampling rates are disaggregated to identify consumer usage activities. Time intervals between readings can include a plurality of consumer usage activities. By employing a model which recognizes associations among consumer usage activities, effective disaggregation is possible using only aggregated consumption data and interval start times. Consumers and utility managers can design and assess conservation programs based on the disaggregated consumption usage activities.
Abstract:
A computer readable medium embodies a program of machine-readable instructions executable by a processing apparatus to perform operations including determining information corresponding to a number of differences in distances between ending points of journeys taken by a vehicle and starting points of consecutive journeys taken by the vehicle, and transmitting one or more representations of the information. Another computer readable medium tangibly includes instructions for, for each of a number of vehicles, receiving one or more scores corresponding to a vehicle, and determining a ranked list containing at least a portion of the scores, and outputting the ranked list. Another computer readable medium includes instructions for, for each of a number of vehicles, receiving information corresponding to a vehicle, using one or more metrics, calculating one or more values from the received information, and communicating one or more representations of the one or more value to one or more entities.
Abstract:
A method and apparatus is provided for automatically classifying a multimedia artifact based on scoring, and selecting the appropriate set of ontologies from among all possible sets of ontologies, preferably using a recursive routing selection technique. The semantic tagging of the multimedia artifact is enhanced by applying only classifiers from the selected ontology, for use in classifying the multimedia artifact, wherein the classifiers are selected based on the context of the multimedia artifact. One embodiment of the invention, directed to a method for classifying a multimedia artifact, uses a specified criteria to select one or more ontologies, wherein the specified criteria indicates the comparative similarity between specified characteristics of the multimedia artifact and each ontology. The method further comprises scoring and selecting one or more classifiers from a plurality of classifiers that respectively correspond to semantic element of the selected ontologies, and evaluating the multimedia artifact using the selected classifiers to determine a classification for the multimedia artifact.
Abstract:
An autonomous classification device which enables the creation of autonomous classifiers that are easy to deploy, adapt and optimize in the environment in which they are used. The classifier is autonomous in that it can perform three functions that define autonomic systems: automatically configure itself in an environment, optimize its performance using the environment and mechanisms for performance, and continually adapting to improve performance and heal itself in a changing environment.