Value range synopsis in column-organized analytical databases

    公开(公告)号:US10346403B2

    公开(公告)日:2019-07-09

    申请号:US15147983

    申请日:2016-05-06

    Abstract: The method may include providing, in accordance with a column-oriented storage technique, the data table as columns corresponding to the plurality of attributes, whereby each column includes a plurality of separate data blocks. The method may also include determining the plurality of records of the provided data table for which a plurality of attribute values of at least one selected column is contained in a plurality of predetermined data blocks. The method may further include determining, for each column of at least a part of the plurality of columns within the determined plurality of records, a plurality of attribute value information descriptive of an associated attribute within the column and providing an indication of the one or more data blocks for which the plurality of attribute value information is determined. The method may also include storing the determined plurality of attribute value information for enabling query processing.

    Dynamic column synopsis for analytical databases

    公开(公告)号:US10331662B2

    公开(公告)日:2019-06-25

    申请号:US15157441

    申请日:2016-05-18

    Abstract: A method, computer program product, and system for processing attribute value information for a data set. The method, computer program product, and system includes receiving a first data query on the data set. The first data query includes a condition on at least one attribute of the data set. While processing the first data query, data blocks containing records of the data set may be scanned. The data blocks contain first data blocks that are full. Attribute value information may be generated for the at least one attribute for the first data blocks. The attribute value information may be stored and a second data query involving a condition on at least one of the at least one attribute may be processed using the stored attribute value information.

    LOW LATENCY CONSTRAINT ENFORCEMENT IN HYBRID DBMS

    公开(公告)号:US20190057126A1

    公开(公告)日:2019-02-21

    申请号:US15679193

    申请日:2017-08-17

    Abstract: A method for enforcing constraints on a database table. The method includes: storing the database table on an analytical engine of a data processing system. An index is created on the database table on a transactional engine of the data processing system, wherein the database table stored on the analytical engine is scanned for creating the index. A data manipulation statement associated with a set of constraints on the database table is provided. The transactional engine enforces the set of constraints based on the index. In response to the set of constraints being successfully enforced: executing the data manipulation statement on the database table in the analytical engine, and updating the index in the transactional engine accordingly to the data manipulation statement.

    METHOD FOR STORING A DATASET
    24.
    发明申请

    公开(公告)号:US20180293250A1

    公开(公告)日:2018-10-11

    申请号:US16005839

    申请日:2018-06-12

    Abstract: Sorting and storing a dataset, the dataset comprising at least one attribute. The method includes defining a set of data blocks and assigning to each data block a predefined maximum number of entries or a predefined maximum amount of storage, dividing the dataset into a sequence of multiple sub-datasets each having one value or a range of values of the attribute, wherein each pair of successive sub-datasets of the sequence are non-overlapping or overlapping at their respective extremum value of the attribute, for each sub-dataset of the multiple sub-datasets: in case the sub-dataset fully or partially fits into a data block of the defined data blocks storing the sub-dataset into at least the data block, the sub-dataset that partially fits into the data block comprising a number of entries that is smaller than a predefined maximum threshold.

    CONSISTENT EXECUTION OF PARTIAL QUERIES IN HYBRID DBMS

    公开(公告)号:US20180046643A1

    公开(公告)日:2018-02-15

    申请号:US15234009

    申请日:2016-08-11

    CPC classification number: G06F17/30174 G06F17/30477 G06F17/30522

    Abstract: The disclosure relates to a system comprising a first and a second database engine and a replication module. The replication module replicates changes from a first database maintained by the first engine to a second database maintained by the second engine. The system maps state times and sync times. The state times are specified in a first temporal reference system used by the first engine and the sync times are specified in a second temporal reference system used by the second engine. Each mapped state time and sync time pair indicate an identical state of the first and second database or parts thereof. The system executes a query partially by the first and partially by the second engine and uses the mapping to execute the query by the first and/or second engine on data of a single defined state.

    VALUE RANGE SYNOPSIS IN COLUMN-ORGANIZED ANALYTICAL DATABASES

    公开(公告)号:US20180011923A1

    公开(公告)日:2018-01-11

    申请号:US15711216

    申请日:2017-09-21

    CPC classification number: G06F16/2455 G06F16/221 G06F16/289

    Abstract: The method may include providing, in accordance with a column-oriented storage technique, the data table as columns corresponding to the plurality of attributes, whereby each column includes a plurality of separate data blocks. The method may also include determining the plurality of records of the provided data table for which a plurality of attribute values of at least one selected column is contained in a plurality of predetermined data blocks. The method may further include determining, for each column of at least a part of the plurality of columns within the determined plurality of records, a plurality of attribute value information descriptive of an associated attribute within the column and providing an indication of the one or more data blocks for which the plurality of attribute value information is determined. The method may also include storing the determined plurality of attribute value information for enabling query processing.

    PROCESSING A DATABASE QUERY IN A DATABASE SYSTEM

    公开(公告)号:US20170116265A1

    公开(公告)日:2017-04-27

    申请号:US14919777

    申请日:2015-10-22

    Abstract: A method for processing a database query on a database system includes receiving the database query at the database system, determining a sequence of database operations for processing the database query, estimating a number of incoming data elements of the database system to be processed for each of the database operations, estimating a workload of a programmable integrated circuit of the database system for each of the database operations based on the estimated number of incoming data elements, assuming each of the database operations is performed in a single instantiation, selecting a number of instantiations Ni for a database operation i of the sequence of database operations based on the estimated workload of the programmable integrated circuit for the database operation i, and configuring the programmable integrated circuit to execute the determined sequence of database operations, the database operation i is instantiated Ni times.

    Extreme value computation
    30.
    发明授权
    Extreme value computation 有权
    极值计算

    公开(公告)号:US09501534B1

    公开(公告)日:2016-11-22

    申请号:US15008960

    申请日:2016-01-28

    CPC classification number: G06F17/3053

    Abstract: Determining an extremum value of an attribute of a first data table. The first data table includes a set of data blocks. Assigning each data block of the set of data blocks an attribute value information indicative of a range of values of the attribute. Assigning the attribute value information results in an extremum set of indicative maximum and minimum values of the attribute. Determining a reference value as the infimum or supremum of the extremum set. If the extremum is a minimum, selecting a subset of the set of data blocks, each having a respective minimum value of the attribute smaller than the reference value. If the extremum is a maximum, selecting a subset of the set of data blocks, each having a respective maximum value of the attribute higher than the reference value. Processing at least part of the subset of data blocks for identifying the extremum value.

    Abstract translation: 确定第一个数据表的属性的极值。 第一数据表包括一组数据块。 为数据块集合中的每个数据块分配表示属性值的范围的属性值信息。 分配属性值信息导致属性的指示性最大值和最小值的极值集合。 确定参考值作为极值集的最小值或最大值。 如果极值是最小的,则选择数据块集合的子集,每个数据块的子集具有小于参考值的属性的相应最小值。 如果极值是最大值,则选择该组数据块的子集,每个数据块的属性的相应最大值高于参考值。 处理用于识别极值的数据块子集的至少一部分。

Patent Agency Ranking