-
公开(公告)号:US20180011906A1
公开(公告)日:2018-01-11
申请号:US15711026
申请日:2017-09-21
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Felix O. Beier , Thomas F. Boehme , Andreas Brodt , Oliver Schiller
IPC: G06F17/30
CPC classification number: G06F17/30501 , G06F17/30404
Abstract: The method may include providing a plurality of synopsis techniques for determining a plurality of attribute value information indicative of the at least one attribute. The method may include determining a data characteristic describing the plurality of data rows of the current data block. The method may include selecting, based on the determined data characteristic, at least one synopsis technique of the provided plurality of synopsis techniques suitable for generating the plurality of attribute value information for the at least one attribute of the current data block. The method may include determining the plurality of attribute value information for the at least one attribute of the plurality of data rows of the current data block using the at least one selected synopsis technique. The method may include storing the determined plurality of attribute value information for the current data block to be used for query processing against the data table.
-
公开(公告)号:US20170337245A1
公开(公告)日:2017-11-23
申请号:US15161396
申请日:2016-05-23
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Felix O. Beier , Thomas F. Boehme , Andreas Brodt , Oliver Schiller
IPC: G06F17/30
CPC classification number: G06F17/30501 , G06F17/30404
Abstract: The method may include providing a plurality of synopsis techniques for determining a plurality of attribute value information indicative of the at least one attribute. The method may include determining a data characteristic describing the plurality of data rows of the current data block. The method may include selecting, based on the determined data characteristic, at least one synopsis technique of the provided plurality of synopsis techniques suitable for generating the plurality of attribute value information for the at least one attribute of the current data block. The method may include determining the plurality of attribute value information for the at least one attribute of the plurality of data rows of the current data block using the at least one selected synopsis technique. The method may include storing the determined plurality of attribute value information for the current data block to be used for query processing against the data table.
-
公开(公告)号:US20170270160A1
公开(公告)日:2017-09-21
申请号:US15073890
申请日:2016-03-18
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Andreas Brodt , Oliver Schiller , Marc Schwind , Mathias Trumpp
IPC: G06F17/30
CPC classification number: G06F17/30463 , G06F17/30339 , G06F17/30469 , G06F17/30536
Abstract: The present disclosure provides a computer implemented method and system for processing queries. The first data table comprises a set of data blocks. Each of the set of data blocks may be assigned respective attribute value information. A query involving a query condition on at least a first attribute of the first data table may be received. And a subset of the set of data blocks to be accessed may be selected based on the query condition and using the attribute value information. Furthermore, a guaranteed bound may be determined for a statistical metric on the first attribute based on at least one of the number of data blocks of the subset of data blocks and the attribute value information of the subset of data blocks. The guaranteed bound for the statistical metric may be used when determining a query execution plan for the received query.
-
公开(公告)号:US20170262500A1
公开(公告)日:2017-09-14
申请号:US15607807
申请日:2017-05-30
Applicant: International Business Machines Corporation
Inventor: Florian Braun , Andreas Brodt , Daniel Martin , Oliver Schiller
CPC classification number: G06F16/24534 , G06F16/219 , G06F16/24532 , G06F16/24553 , G06F16/24569 , G06F21/76
Abstract: A method for processing a database query on a database system includes receiving the database query at the database system, determining a sequence of database operations for processing the database query, estimating a number of incoming data elements of the database system to be processed for each of the database operations, estimating a workload of a programmable integrated circuit of the database system for each of the database operations based on the estimated number of incoming data elements, assuming each of the database operations is performed in a single instantiation, selecting a number of instantiations Ni for a database operation i of the sequence of database operations based on the estimated workload of the programmable integrated circuit for the database operation i, and configuring the programmable integrated circuit to execute the determined sequence of database operations, the database operation i is instantiated Ni times.
-
公开(公告)号:US20170139617A1
公开(公告)日:2017-05-18
申请号:US15231858
申请日:2016-08-09
Applicant: International Business Machines Corporation
Inventor: Thomas F. Boehme , Andreas Brodt , Namik Hrle , Oliver Schiller
IPC: G06F3/06
CPC classification number: G06F3/0619 , G06F3/065 , G06F3/067 , G06F7/36 , G06F11/00 , G06F11/1435 , G06F11/1453 , G06F17/30117 , G06F17/30312 , G06F17/30333 , G06F17/30424 , G06F17/30575 , G06F17/30592 , G06F2201/84
Abstract: Sorting and storing a dataset, the dataset comprising at least one attribute. The method includes defining a set of data blocks and assigning to each data block a predefined maximum number of entries or a predefined maximum amount of storage, dividing the dataset into a sequence of multiple sub-datasets each having one value or a range of values of the attribute, wherein each pair of successive sub-datasets of the sequence are non-overlapping or overlapping at their respective extremum value of the attribute, for each sub-dataset of the multiple sub-datasets: in case the sub-dataset fully or partially fits into a data block of the defined data blocks storing the sub-dataset into at least the data block, the sub-dataset that partially fits into the data block comprising a number of entries that is smaller than a predefined maximum threshold.
-
公开(公告)号:US20220179861A1
公开(公告)日:2022-06-09
申请号:US17114786
申请日:2020-12-08
Applicant: International Business Machines Corporation
Inventor: Thomas Friedhelm Böhme , Oliver Schiller , James Liam Finnie , Ingo Schuster
IPC: G06F16/2453
Abstract: A method, a computer program product, and a system for scheduling a first query execution plan for executing a first query for accessing first data sets of data of a relational database system (RDBS) and a second query execution plan for executing a second query for accessing second data sets of the data. The data is divided into data blocks of data sets of the data, wherein metadata of the RDBS comprises respective information about at least one property of the data sets of a respective data block of the data blocks. The scheduling may be performed on the basis of an examination if there is a common partition between a first partition of data blocks which may fulfill a first condition according to the first query and a second partition of data blocks which may fulfill a second condition according to the second query.
-
公开(公告)号:US20220075774A1
公开(公告)日:2022-03-10
申请号:US17455917
申请日:2021-11-21
Applicant: International Business Machines Corporation
Inventor: Thomas F. Boehme , Andreas Brodt , Daniel Martin , Oliver Schiller
IPC: G06F16/248 , G06F16/27 , G06F16/2453 , G06F16/2455
Abstract: Embodiments of the present invention provide a method and system for processing a query on a set of data blocks in analytical databases. The query is on a set of data blocks, having at least one attribute and specifies at least one selection condition on the attribute. The selection condition is associated with at least one selection expression. Attribute value information on each attribute is generated for each data block. Next, a condition is generated on each attribute to negate the selection expression, if the selection expression has a negation operator. Additional conditions are generated for each selection expression that does not contain a negation operation. The attribute value is used to select the positive and negative subsets of data blocks for each condition. Next, a negative subset that does not require processing to evaluate the query is skipped and the positive subsets and the non-skipped negative subsets are processed.
-
公开(公告)号:US11176123B2
公开(公告)日:2021-11-16
申请号:US16708936
申请日:2019-12-10
Applicant: International Business Machines Corporation
Inventor: Daniel Martin , Andreas Brodt , Oliver Schiller , Felix Beier , Knut Stolze
Abstract: The present disclosure relates to a method for enforcing constraints on data in a data processing system. The method comprises providing a set of constraints on the data. A first data update request may be received at the transactional engine and executes on the first dataset. A second data update request associated with the received data update request is determined and sent by the transactional engine to the analytical engine. The analytical engine executes the second data, resulting in a set of changes in the second dataset. The transactional engine commits the update of the first dataset before or after receiving the results of the checking of the set of constraints. The update on the first dataset is aborted by the transactional engine in response to receiving the results of the checking of the set of constraints, wherein the results indicate that the set of constraints are not met.
-
公开(公告)号:US10929400B2
公开(公告)日:2021-02-23
申请号:US15335082
申请日:2016-10-26
Applicant: International Business Machines Corporation
Inventor: Felix O. Beier , Andreas Brodt , Namik Hrle , Oliver Schiller
IPC: G06F16/00 , G06F16/2455 , G06F16/23
Abstract: A method, a computer program product and a computer system are provided. Attribute value information contains at least a minimum value representing a smallest value of a first attribute and a maximum value representing a largest value of the first attribute, thereby defining a first range of values of the first attribute. A received query against a data table requests one or more values of at least the first attribute that are covered by the first range of values. The attribute value information may be used for selecting a data block of the data table as a candidate potentially including at least part of the requested one or more values and scanning the data block. In response to determining that the data block does not include the one or more requested values, the attribute value information may be updated accordingly.
-
50.
公开(公告)号:US10671592B2
公开(公告)日:2020-06-02
申请号:US15880826
申请日:2018-01-26
Applicant: International Business Machines Corporation
Inventor: Thomas F. Boehme , Andreas Brodt , Alexander Lang , Oliver Schiller
IPC: G06F16/23
Abstract: A mechanism is provided for updating attribute value information for a data table. The data table involves a set of attributes. The data table has multiple records associated with multiple data blocks. Attribute value information is associated with each data block of the multiple data blocks. The attribute value information is descriptive of at least one attribute of the set of attributes. A first data block may be marked as modified in response to a delete process for deleting from a first data block of the multiple data blocks a record having at least one attribute value that is used for generating the attribute value information of the first data block. In response to scanning the marked first data block, the attribute value information associated with the first data block may be updated based on the scan results.
-
-
-
-
-
-
-
-
-