摘要:
Techniques are disclosed for capturing, storing, querying and analyzing provenance data for automatic discovery of enterprise process information. For example, a computer-implemented method for managing a process associated with an enterprise comprises the following steps. Data associated with an actual end-to-end execution of an enterprise process is collected. Provenance data is generated based on at least a portion of the collected data, wherein the provenance data is indicative of a lineage of one or more data items. A provenance graph that provides a visual representation of the generated provenance data is generated, wherein nodes of the graph represent records associated with the collected data and edges of the graph represent relations between the records. The generated provenance graph is stored in a repository for use in analyzing the enterprise process.
摘要:
Methods and apparatus for optimizing resource allocation among data analysis functions in a classification system are provided. Each of the data analysis functions is characterized as a set of operating points in accordance with at least one of resource requirements and analysis quality. An operating point for each of the data analysis functions is selected in accordance with one or more constraints. The analysis functions are applied at selected operating points to optimize resource allocation among the data analysis functions in the classification system.
摘要:
Systems, methods and articles of manufacture are disclosed for building and executing analytics solutions. Such a solution may provide a comprehensive analytics solution (e.g., a risk assessment, fraud detection solution, dynamic operational risk evaluations, regulatory compliance assessments, etc.). The analytics solution may perform an analytics task using operational data distributed across a variety of independently created and governed data repositories in different departments of an organization. A framework is disclosed which allows a user (e.g., a risk analyst) to compose analytical tools that can access data from a variety of sources (both internal and external to an enterprise) and perform a variety of analytic functions.
摘要:
The invention provides a system adapted to enable contextual collaboration within a computer network. The invention provides a collaboration manager that supports manipulation of collaboration spaces. The collaboration spaces contain one or more collaboration elements that have potentially different collaboration modalities. The context manager maintains resources and resource interrelationships within contexts. The resources can include collaboration elements, processes, tasks, business objects, users, and/or roles. A view generator selects a set of the resources to be displayed to a user of the computer network. The invention also includes an interface adapted to allow applications to specify commands to manipulate collaboration spaces, commands to maintain resources and resource interrelationships within contexts, and queries for context-sensitive views.
摘要:
A method and apparatus for presenting customized presentation offers to a consumer having a shopping device prior to the consumer's check out in a retail environment is set forth. Using a keyword-driven rule-based method, the present invention identifies certain information regarding the consumer, items selected for purchase, retail environment and the like, and processes keywords relevant to the gathered information in view of rule types and available offers to select and prioritize for presentation to the consumer offers pertinent to the consumer's interests, locations and activities while in the store. The personalized offer presentation is made available to the consumer preferably through an in-store server communicating to the shopping device used by the consumer so as to provide the consumer with in-store offers prior to the consumer completing their shopping experience at check out.
摘要:
An invention for generating standard description records from multimedia information. The invention utilizes fundamental entity-relation models for the Generic AV DS that classify the entities, the entity attributes, and the relationships in relevant types to describe visual data. It also involves classification of entity attributes into syntactic and semantic attributes. Syntactic attributes can be categorized into different levels: type/technique, global distribution, local structure, and global composition. Semantic attributes can be likewise discretely categorized: generic object, generic scene, specific object, specific scene, abstract object, and abstract scene. The invention further classifies entity relationships into syntactic/semantic categories. Syntactic relationship categories include spatial, temporal, and visual categories. Semantic relationship categories include lexical and predicative categories. Spatial and temporal relationships can be topological or directional; visual relationships can be global, local, or composition; lexical relationships can be synonymy, antonymy, hyponymy/hypernymy, or meronymy/holonymy; and predicative relationships can be actions (events) or states.
摘要:
A method for translating data from one representation or schema to another representation or schema. Example data encoded in both the schemas is used to generate a translator. This translator is then used for automatically translating data from one schema to another. The translator is computed by finding corresponding paths for matched data elements. When new data is presented in one schema, the translator then gives the translation for the paths of data elements in the data. A translated data is then constructed by using these translated paths. Possible applications in the Internet domain, include but are not limited to: EDI; search engines; content ingestion; content customization; data delivery; and data retrieval. Specific examples are shown for generating a translator and translating data between various schema including HTML, XML and extensions thereto such as SpeechML.
摘要:
Linear optimization queries, which usually arise in various decision support and resource planning applications, are queries that retrieve top N data records (where N is an integer greater than zero) which satisfy a specific optimization criterion. The optimization criterion is to either maximize or minimize a linear equation. The coefficients of the linear equation are given at query time. Methods and apparatus are disclosed for constructing, maintaining and utilizing a multidimensional indexing structure of database records to improve the execution speed of linear optimization queries. Database records with numerical attributes are organized into a number of layers and each layer represents a geometric structure called convex hull. Such linear optimization queries are processed by searching from the outer-most layer of this multi-layer indexing structure inwards. At least one record per layer will satisfy the query criterion and the number of layers needed to be searched depends on the spatial distribution of records, the query-issued linear coefficients, and N, the number of records to be returned. When N is small compared to the total size of the database, answering the query typically requires searching only a small fraction of all relevant records, resulting in a tremendous speedup as compared to linearly scanning the entire dataset.
摘要:
A system and method for uniformly describing space and frequency views of multi-dimensional lattice data, including regions, tilings and hierarchical decompositions of image, video, audio content, and time series data in space, time, frequency and resolution. The space and frequency view description scheme provides a way to specify regions in space, time, frequency and resolution in term of space and frequency views. This allows specification of concepts such as “half resolution”, “upper right quadrant” or “high pass band”, such as when referring to views of an image. The space and frequency view description scheme also provides for SFTilings comprising non-redundant sets of views and SFPartitionings which are SFTilings which completely cover the space, as well as SFHierarchical decompositions comprising hierarchies of SFViews where there are processing dependencies among views. The inventive system and method also provide a way to index these view, which is important in applications of progressive retrieval in which a client application can send requests for views of images stored at a server in terms of an index value. The space and frequency view description scheme handles the details concerning the access and relationship of views with different resolutions, spatial locations and sizes.
摘要:
An apparatus and method for approximating the data stored in a databases by generating multiple projections and representations from the database such that the OLAP queries for the original database (such as aggregation and histogram operations) may be applied to the approximated version of the database, which can be much smaller than the original databases. Other aspects optimize a mapping, via a mapping (or dimension) table, of non-numeric or numeric attributes to other numeric attributes such that the error incurred on applying queries to the approximated version of the database is minimized. Still further aspects define boundaries of approximations so that the boundaries are preserved when approximated versions of the databases are generated.