Abstract:
A method for estimating a runtime of a query of a data repository may abstract, extract, or otherwise generate one or more parameters of the query. Thereafter, the abstracted parameters may be associated with one or more statistical query records. From these statistical query records, a query runtime may be estimated. Related apparatuses, computer program products, and computer systems are also described.
Abstract:
In one aspect, in a computer-implemented method may make data anonymous, so that the data may be used during testing. The method may include receiving, from a user interface, an indication of a type of shuffling to be performed on data. Moreover, the data may be shuffled based on the received indication of the shuffling type. The shuffling may rearrange the data to make the data anonymous. The shuffled data may be provided to an application. Related systems, apparatus, methods, and/or articles are also described.
Abstract:
A method for estimating a runtime of a query of a data repository may abstract, extract, or otherwise generate one or more parameters of the query. Thereafter, the abstracted parameters may be associated with one or more statistical query records. From these statistical query records, a query runtime may be estimated. Related apparatuses, computer program products, and computer systems are also described
Abstract:
In one embodiment the present invention includes a method comprising receiving a data filter for filtering a collection of data, wherein the collection of data is configured as a star schema including a fact table and dimension tables. The data filter is applied against the dimension tables to generate a modified dimension table. The modified dimension tables are applied against the fact table to produce a modified fact table. The data filter is then applied against the modified fact table to generate a second modified fact table, which is the output of the process.
Abstract:
A system and method for improving a query on a relational database in a business intelligence system is provided. A multidimensional data table is provided in the database. A sampling dimension is appended to the data table. The sampling dimension includes a number of cells, and wherein each cell includes a sampling value corresponding to the data of at least one of the dimensions of the data table. The data table is then clustered in at least one of the dimensions based on the associated sampling value in the sampling dimension. A query for a subset of data can then be executed on the clustered data table based on the sampling values.
Abstract:
A system and method for improving a query on a relational database in a business intelligence system is provided. A multidimensional data table is provided in the database. A sampling dimension is appended to the data table. The sampling dimension includes a number of cells, and wherein each cell includes a sampling value corresponding to the data of at least one of the dimensions of the data table. The data table is then clustered in at least one of the dimensions based on the associated sampling value in the sampling dimension. A query for a subset of data can then be executed on the clustered data table based on the sampling values.
Abstract:
A method determines whether a structural indicator of a first table of results generated from a first data repository query matches a corresponding structural indicator of a second table of results generated from a substantially identical second data repository query. If the first and second table structural indicators match, then values from at least two corresponding cells from each of the first and second tables are aggregated. If the aggregated values from the first and second tables are identical, then an indication is provided indicating that the first data repository query results are accurate. Related apparatuses, computer program products, and computer systems are also described.
Abstract:
A selection of a plurality of data sources which characterize characteristics and key figures are received. Thereafter, a multiprovider is dynamically generated that is based on the selection and which includes characteristics and key figures from each of the selected data sources. Queries may then be run on top of the multiprovider and reported. Related apparatus, systems, methods, and articles are also described.
Abstract:
Systems and techniques for mapping data structures in a data processing environment to help ensure the accessibility of stored information. In one implementation, an article includes a machine-readable medium storing instructions. The instructions are operable to cause one or more machines to perform operations. The operations include receiving a first data storage schema in which a characteristic in a first dimension table is mapped by a first table and a second table and generating a second data storage schema. The first table maps the characteristic to a first object that include attributes to which time information is irrelevant to data processing activities and the second table maps the characteristic to a second object that include attributes to which time information is relevant to data processing activities. The second data storage schema includes a fact table including at least some facts drawn from the first data storage schema and a second dimension table that includes at least some characteristics drawn from at least one of the first object and the second object.
Abstract:
A method determines whether a structural indicator of a first table of results generated from a first data repository query matches a corresponding structural indicator of a second table of results generated from a substantially identical second data repository query. If the first and second table structural indicators match, then values from at least two corresponding cells from each of the first and second tables are aggregated. If the aggregated values from the first and second tables are identical, then an indication is provided indicating that the first data repository query results are accurate. Related apparatuses, computer program products, and computer systems are also described.