摘要:
A query syntax analysis and postprocessing system and method receives a query request specifying values of a data type directed to a database without native support for the queried data type. A query engine or process for receiving a query request defined by a query syntax traverses the query request for a specification of an unsupported value or expression. In a large, distributed database environment, the query request may implicate multiple physical data stores, each having specific formats and recognized syntax. Values of unsupported or non-native types, or expressions that evaluate to an unsupported or non-native type, are identified and replaced with an expression that is recognized by the legacy database.
摘要:
A query syntax analysis and postprocessing system and method receives a query request specifying values of a data type directed to a database without native support for the queried data type. A query engine or process for receiving a query request defined by a query syntax traverses the query request for a specification of an unsupported value or expression. In a large, distributed database environment, the query request may implicate multiple physical data stores, each having specific formats and recognized syntax. Values of unsupported or non-native types, or expressions that evaluate to an unsupported or non-native type, are identified and replaced with an expression that is recognized by the legacy database.
摘要:
A multidimensional database query engine processes a query request by forming a logical plan of subqueries for retrieving and assembling the data called for by the query request. A multi-pass analysis identifies a granularity of facts needed to fulfill the query request. A recursive analysis parses the query request and identifies components comprising the full query request. The analysis derives a subquery from each component, and identifies dependencies on other subqueries. The subqueries are arranged in a tree structure based on the dependencies. The tree represents subqueries as nodes, with query operations denoted by parent nodes for the dependent subqueries. The result is a hierarchical tree of subqueries associated based on operations between the subqueries and dependent subqueries descending from their parent subqueries.
摘要:
The present disclosure provides a detailed description of techniques used in systems, methods, and in computer program products for dynamic aggregate generation and updating for high performance querying of large datasets. Certain embodiments are directed to technological solutions for determining at least one aggregate of selected virtual cube attributes (e.g., measures, dimensions, etc.) describing a virtual multidimensional data model of a subject database, and generating an aggregate table and a set of aggregate metadata for the aggregate. In some embodiments, an aggregate database statement configured to operate on the subject database can be issued to generate the aggregate table and/or aggregate metadata. Further, the aggregate can be dynamically determined responsive to receiving a database statement configured to operate on the virtual multidimensional data model representing the subject database. Also, the aggregate table can comprise one or more partitions in an aggregate view to facilitate aggregate management and/or quality.
摘要:
The present disclosure provides a detailed description of techniques used in systems, methods, and in computer program products for dynamic aggregate generation and updating for high performance querying of large datasets. Certain embodiments are directed to technological solutions for determining at least one aggregate of selected virtual cube attributes (e.g., measures, dimensions, etc.) describing a virtual multidimensional data model of a subject database, and generating an aggregate table and a set of aggregate metadata for the aggregate. In some embodiments, an aggregate database statement configured to operate on the subject database can be issued to generate the aggregate table and/or aggregate metadata. Further, the aggregate can be dynamically determined responsive to receiving a database statement configured to operate on the virtual multidimensional data model representing the subject database. Also, the aggregate table can comprise one or more partitions in an aggregate view to facilitate aggregate management and/or quality.
摘要:
The present disclosure provides a detailed description of techniques used in systems, methods, and in computer program products for dynamic aggregate generation and updating for high performance querying of large datasets. Certain embodiments are directed to technological solutions for determining at least one aggregate of selected virtual cube attributes (e.g., measures, dimensions, etc.) describing a virtual multidimensional data model of a subject database, and generating an aggregate table and a set of aggregate metadata for the aggregate. In some embodiments, an aggregate database statement configured to operate on the subject database can be issued to generate the aggregate table and/or aggregate metadata. Further, the aggregate can be dynamically determined responsive to receiving a database statement configured to operate on the virtual multidimensional data model representing the subject database. Also, the aggregate table can comprise one or more partitions in an aggregate view to facilitate aggregate management and/or quality.
摘要:
The present disclosure provides a detailed description of techniques used in systems, methods, and in computer program products for dynamic aggregate generation and updating for high performance querying of large datasets. Certain embodiments are directed to technological solutions for determining at least one aggregate of selected virtual cube attributes (e.g., measures, dimensions, etc.) describing a virtual multidimensional data model of a subject database, and generating an aggregate table and a set of aggregate metadata for the aggregate. In some embodiments, an aggregate database statement configured to operate on the subject database can be issued to generate the aggregate table and/or aggregate metadata. Further, the aggregate can be dynamically determined responsive to receiving a database statement configured to operate on the virtual multidimensional data model representing the subject database. Also, the aggregate table can comprise one or more partitions in an aggregate view to facilitate aggregate management and/or quality.
摘要:
Techniques are presented for rule-based selection of alternate data sources for multidimensional data statements. A virtual multidimensional data model is implemented to represent datasets that are accessed at various data sources. Derivative cubes generated from the virtual multidimensional data model are structured to have a respective data source metadata layer that is populated at data statement execution time to identify a target data source. Data source selection rules are established to map the attributes of data statements to target data sources. The data source selection rules are evaluated subject to data statement attributes derived from detected data statements to dynamically select target data sources for the data statements. The derivative cubes codify, in their respective data source metadata layers, a unique set of data source attributes identifying the target data sources. The derivative cubes are accessed to facilitate execution of the data statements on datasets at the target data sources.
摘要:
Techniques are presented for performing enhanced real-time data statement monitoring and control without requiring modifications to data analysis client applications or data storage environments. A data statement processing layer is implemented between the data analysis client applications and the data storage environments. A data statement management agent interacts with an event bus in the data statement processing layer. The data statement management agent accesses event objects on the event bus that are generated during the processing of data statements. Rules are applied to select particular event objects. The selected event objects are transformed into event messages that are delivered to the client applications to monitor the processing of the data statements. Control commands received from the client application are transformed into event objects that are issued to the event bus to control the processing of the data statement.
摘要:
A preferred aggregate store for a multidimensional database receives and stores intermediate data generated as part of a larger query for avoiding repetitive or redundant data retrievals incurring costly data paths. Aggregates include intermediate tables and sets of data, often generated from a join of two or more distinct data tables, combined for generating a query result. In the event an aggregate is called for, certain storage locations may be preferable to others due to network distance, fees for storage and processing, as in the case of cloud storage, or write accessibility. Designation of a preferred aggregate store ensures that aggregates called for by the query processing are stored in a beneficial location for cost and efficiency.