摘要:
According to one aspect, an index that is built on a table having a plurality of rows comprises an index entry that contains a key value for a sub-part of a row and that includes data for locating a row and data for locating, within the row, a value associated with the sub-part of the row. In one embodiment, the sub-part of the row is an array, with each member of the array represented in the row.
摘要:
This invention addresses the need for dealing with complex planning calculations based on data warehouse or Planning Data Repository (PDR) data where some aggregated data or forecast data might be changed without directly manipulating the underlying data, and where there may be several relationships linking the data. The system is able to deal with complex relationships along more than one axis or dimension. A number of iterations are typically used involving both back-solving and nullforward-solvingnull. The subset of cells that needs to be recalculated is identified before steps of back-solving and/or forward-solving using parent/child tables. The scanning of these tables looking for potential dependencies is much simpler and faster than to looking at the actual formulae or functions relating the cells. The step of creating the parent/child tables is carried out in advance of the actual calculation by parsing all the relationships (formulae and functions) and summarising the dependencies between cells in the parent/child tables.
摘要:
A method for processing requests includes receiving, from a requestor, a first read request to read a portion of a seismic cube. The first read request includes a virtual location of the portion. The method further includes querying a seismic cube index to obtain a mapping parameter and a storage location of a section including the portion of the seismic cube. The mapping parameter maps virtual locations in the seismic cube with data locations in the section. The section is identified using the virtual location of the portion. The method further includes calculating, on a computer processor and using the mapping parameter, a data location in the section corresponding to the virtual location of the portion, and transmitting a second read request to the storage location of the section. The second read request includes the data location. The requestor receives the portion from the storage location.
摘要:
A large OLAP cube includes data and dimensions not required for a particular query, or for a collection of queries. A response to a query is generated by using the large OLAP cube rather than using an OLAP cube tailored to the query, and rather than querying the relational database directly. The large OLAP cube may be created after identifying a set of frequently sought data and/or filters used in queries to the relational database. Data may be staged by creating a smaller OLAP cube with data extracted from the large OLAP cube. The cubes may then be queried, browsed, and used to prepare reports or generate flat files.
摘要:
A computer readable storage medium comprises executable instructions to generate a stream cube to store data received from multiple data sources. A plurality of multi-dimensional data streams are generated to represent the received data. A materialization strategy is repeatedly determined for the stream cube. The stream cube is materialized according to the materialization strategy to record the multi-dimensional data streams.
摘要:
A definition manager independently defines a plurality of dimensional members of a data source. An operation manager receives user input selecting a set of dimensional members of interest and at least one operator to manipulate the selected set of members based on the selected at least one operator. The set of dimensional members of interest comprise a first dimensional member of interest having a given dimension and a given level in a hierarchy of the given dimension and a second dimensional member of interest having a different dimension from the given dimension or a different level in the hierarchy from the given level in the hierarchy. A display manager places the manipulated set of dimensional members in a report so that the manipulated set of dimensional members are displayed adjacently based on the operator to facilitate comparative analysis.
摘要:
Apriori algorithms are popular data mining techniques for extracting association rules from a body of data. The computational complexity of these algorithms is reduced by representing itemset information at cells of a hypercube. The cells encode associations between the items of each transaction. Direct computation of a cell as a lexicographic combination of items accelerates the computation of itemsets, and thereby improves the computational runtime complexity of the apriori algorithm that discovers association rules. Even faster computation is achieved by a user selected cardinality that limits the maximum size of the itemsets.
摘要:
This invention addresses the need for dealing with complex planning calculations based on data warehouse or Planning Data Repository (PDR) data where some aggregated data or forecast data might be changed without directly manipulating the underlying data, and where there may be several relationships linking the data. The system is able to deal with complex relationships along more than one axis or dimension. A number of iterations are typically used involving both back-solving and ‘forward-solving’. The subset of cells that needs to be recalculated is identified before steps of back-solving and/or forward-solving using parent/child tables. The scanning of these tables looking for potential dependencies is much simpler and faster than to looking at the actual formulae or functions relating the cells. The step of creating the parent/child tables is carried out in advance of the actual calculation by parsing all the relationships (formulae and functions) and summarising the dependencies between cells in the parent/child tables.
摘要:
The present invention leverages curve fitting data techniques to provide automatic detection of data anomalies in a “data tube” from a data perspective, allowing, for example, detection of data anomalies such as on-screen, drill down, and drill across data anomalies in, for example, pivot tables and/or OLAP cubes. It determines if data substantially deviates from a predicted value established by a curve fitting process such as, for example, a piece-wise linear function applied to the data tube. A threshold value can also be employed by the present invention to facilitate in determining a degree of deviation necessary before a data value is considered anomalous. The threshold value can be supplied dynamically and/or statically by a system and/or a user via a user interface. Additionally, the present invention provides an indication to a user of the type and location of a detected anomaly from a top level data perspective.
摘要:
A computer-implemented method is provided for defining interesting portions of a workflow of a business or other type of process. Using a tracking profile editor, a portion of a given workflow is selected and associated with a named process part. Items of payload data to be used in reports are selected from a message schema associated with the given workflow. A profile is generated based on the given workflow and the selected items of payload data. A tracking profile compiler receives the created tracking profile and generates a star schema, a flattened view and OLAP cube based on the tracking profile. An interceptor extracts monitored workflow events and stores event information to a tracking stream, based on the tracking profile. A tracking service extracts information pertaining to the monitored events from the tracking stream and stores the information in the queryable database. The OLAP cube and the flattened view are updated based on the information written to the queryable database.