Equi-joins between split tables
    1.
    发明授权
    Equi-joins between split tables 有权
    拆分表之间的连接

    公开(公告)号:US09135282B2

    公开(公告)日:2015-09-15

    申请号:US14297223

    申请日:2014-06-05

    申请人: SAP SE

    IPC分类号: G06F17/30

    摘要: A join operation between split data tables includes providing reduction data from first partitions to each partition among second partitions. The reduction data serves to identify actual values in one of the second partitions that also occur in one of the first partitions. Global IDs are assigned. Translation lists including the global IDs are sent to the first partitions. Each first partition and each second partition create globalized lists which can then be combined to generate respective first and second compiled lists. The join operation can then be conducted on the first and second compiled lists.

    摘要翻译: 分割数据表之间的连接操作包括从第一分区向第二分区中的每个分区提供缩减数据。 还原数据用于识别也在第一分区中的一个中发生的第二分区之一中的实际值。 全局ID被分配。 包括全局ID的翻译列表将发送到第一个分区。 每个第一分区和每个第二分区创建全局化列表,然后可以将其组合以生成相应的第一和第二编译列表。 然后可以在第一和第二编译列表上进行加入操作。

    VALUE-ID-BASED SORTING IN COLUMN-STORE DATABASES

    公开(公告)号:US20180150494A1

    公开(公告)日:2018-05-31

    申请号:US15363274

    申请日:2016-11-29

    申请人: SAP SE

    IPC分类号: G06F17/30

    摘要: Innovations in performing sort operations for dictionary-compressed values of columns in a column-store database using value identifiers (“IDs”) are described. For example, a database system includes a data store and an execution engine. The data store stores values at positions of a column A dictionary maps distinct values to corresponding value IDs. An inverted index stores, for each of the corresponding value IDs, a list of those of the positions that contain the associated distinct value. The execution engine processes a request to sort values at an input set of the positions and identify an output set of the positions for sorted values. In particular, the execution engine iterates through positions stored in the lists of the inverted index. For a given position, the execution engine checks if the given position is one of the input set and, if so, adds the given position to the output set.

    Optimal offset pushdown for multipart sorting

    公开(公告)号:US10789249B2

    公开(公告)日:2020-09-29

    申请号:US15603282

    申请日:2017-05-23

    申请人: SAP SE

    IPC分类号: G06F16/2453 G06F16/2455

    摘要: In some aspects, a method, computer program product, and system are provided for query optimization. In some implementations, there may be provided a method including detecting a select query as a candidate for optimization, the select query including an order by, an offset value, and a limit value; determining, in response to the detecting, a modified offset value as the difference between the offset value and a size of a smaller dataset; and performing the select query to form a result set, the select query executed based on at least the modified offset value. Related systems, methods, and articles of manufacture are also described.

    Value-ID-based sorting in column-store databases

    公开(公告)号:US10762071B2

    公开(公告)日:2020-09-01

    申请号:US15363274

    申请日:2016-11-29

    申请人: SAP SE

    IPC分类号: G06F7/00 G06F16/22

    摘要: Innovations in performing sort operations for dictionary-compressed values of columns in a column-store database using value identifiers (“IDs”) are described. For example, a database system includes a data store and an execution engine. The data store stores values at positions of a column A dictionary maps distinct values to corresponding value IDs. An inverted index stores, for each of the corresponding value IDs, a list of those of the positions that contain the associated distinct value. The execution engine processes a request to sort values at an input set of the positions and identify an output set of the positions for sorted values. In particular, the execution engine iterates through positions stored in the lists of the inverted index. For a given position, the execution engine checks if the given position is one of the input set and, if so, adds the given position to the output set.

    Equi-Joins Between Split Tables
    5.
    发明申请

    公开(公告)号:US20180129682A1

    公开(公告)日:2018-05-10

    申请号:US15811527

    申请日:2017-11-13

    申请人: SAP SE

    IPC分类号: G06F17/30

    摘要: A join operation between split data tables includes providing reduction data from first partitions to each partition among second partitions. The reduction data serves to identify actual values in one of the second partitions that also occur in one of the first partitions. Global IDs are assigned. Translation lists including the global IDs are sent to the first partitions. Each first partition and each second partition create globalized lists which can then be combined to generate respective first and second compiled lists. The join operation can then be conducted on the first and second compiled lists.

    Equi-Joins Between Split Tables
    6.
    发明申请
    Equi-Joins Between Split Tables 审中-公开
    分离表之间的连接

    公开(公告)号:US20150347514A1

    公开(公告)日:2015-12-03

    申请号:US14823943

    申请日:2015-08-11

    申请人: SAP SE

    IPC分类号: G06F17/30

    摘要: A join operation between split data tables includes providing value IDs. For each of the value IDs, a unique global ID may be associated with the value ID when the actual value represented by the value ID occurs among actual values comprising the second attribute of the second partition. For each identified unique global ID, the identified unique global ID may be paired with a document ID of a data record contained in a second partition stored at the second server in which the actual value in the data record is represented by the value ID associated with the identified unique global ID.

    摘要翻译: 分割数据表之间的连接操作包括提供值ID。 对于每个值ID,当由值ID表示的实际值出现在包括第二分区的第二属性的实际值中时,唯一的全局ID可以与值ID相关联。 对于每个识别的唯一全局ID,所识别的唯一全局ID可以与包含在第二服务器中存储的第二分区中的数据记录的文档ID配对,其中数据记录中的实际值由与 识别的唯一全局ID。

    Equi-joins between split tables
    7.
    发明授权

    公开(公告)号:US10810170B2

    公开(公告)日:2020-10-20

    申请号:US15811527

    申请日:2017-11-13

    申请人: SAP SE

    摘要: A join operation between split data tables includes providing reduction data from first partitions to each partition among second partitions. The reduction data serves to identify actual values in one of the second partitions that also occur in one of the first partitions. Global IDs are assigned. Translation lists including the global IDs are sent to the first partitions. Each first partition and each second partition create globalized lists which can then be combined to generate respective first and second compiled lists. The join operation can then be conducted on the first and second compiled lists.

    Synchronization mechanism for serialized data log replay in database systems

    公开(公告)号:US10762107B2

    公开(公告)日:2020-09-01

    申请号:US15409234

    申请日:2017-01-18

    申请人: SAP SE

    IPC分类号: G06F16/27

    摘要: A computer implemented method for updating a secondary database includes receiving transaction log information from a primary database system at a secondary database system. The transaction log information comprises a first marker identifying a beginning of an operation comprising a plurality of sub-operations and a second marker identifying an end of the operation. The transaction log information is sequentially parsed, a first marker is identified, and a lock state is created after replaying the first marker to ensure proper serialization of the plurality of sub-operations. The lock state is terminated after replaying the second marker. Related apparatus, systems, techniques and articles are also described.

    OPTIMAL OFFSET PUSHDOWN FOR MULTIPART SORTING

    公开(公告)号:US20180341676A1

    公开(公告)日:2018-11-29

    申请号:US15603282

    申请日:2017-05-23

    申请人: SAP SE

    IPC分类号: G06F17/30

    摘要: In some aspects, a method, computer program product, and system are provided for query optimization. In some implementations, there may be provided a method including detecting a select query as a candidate for optimization, the select query including an order by, an offset value, and a limit value; determining, in response to the detecting, a modified offset value as the difference between the offset value and a size of a smaller dataset; and performing the select query to form a result set, the select query executed based on at least the modified offset value. Related systems, methods, and articles of manufacture are also described.