Hash-join in parallel computation environments
    12.
    发明授权
    Hash-join in parallel computation environments 有权
    哈希加入并行计算环境

    公开(公告)号:US09177025B2

    公开(公告)日:2015-11-03

    申请号:US13742034

    申请日:2013-01-15

    Applicant: SAP SE

    CPC classification number: G06F17/30466 G06F17/3033 G06F17/30445

    Abstract: According to some embodiments, a system and method for a parallel join of relational data tables may be provided by calculating, by a plurality of concurrently executing execution threads, hash values for join columns of a first input table and a second input table; storing the calculated hash values in a set of disjoint thread-local hash maps for each of the first input table and the second input table; merging the set of thread-local hash maps of the first input table, by a second plurality of execution threads operating concurrently, to produce a set of merged hash maps; comparing each entry of the merged hash maps to each entry of the set of thread-local hash maps for the second input table to determine whether there is a match, according to a join type; and generating an output table including matches as determined by the comparing.

    Abstract translation: 根据一些实施例,可以通过由多个并发执行执行线程计算第一输入表和第二输入表的连接列的散列值来提供用于关系数据表的并行连接的系统和方法; 将所计算的散列值存储在所述第一输入表和所述第二输入表中的每一个的一组不相交的线程局部散列图中; 通过并行操作的第二多个执行线程来合并第一输入表的一组线程局部散列图,以产生一组合并的散列图; 将合并的散列映射的每个条目与第二输入表的线程局部散列映射集合的每个条目进行比较,以根据连接类型确定是否存在匹配; 以及生成包括通过比较确定的匹配的输出表。

    Throughput-optimized schema-flexible storage with transactional properties

    公开(公告)号:US12210512B2

    公开(公告)日:2025-01-28

    申请号:US17752066

    申请日:2022-05-24

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for transferring data using a failsafe storage system. In some embodiments, a database receives a request to insert records of data. The records of data are to be transmitted from the source system to the target system. The database allocates a partition in the database based on a size of the plurality of records of data. The database inserts one or more records of data in each segment. The database inserts a create timestamp in each record header of the plurality of records. The create timestamp indicates a date and time when an insert of the one or more records of data is committed in a respective segment. The database inserts a delete timestamp in each segment of records of data, as each respective record of data is transmitted to the target system.

    Storing and querying knowledge graphs in column stores using a global dictionary

    公开(公告)号:US12019608B2

    公开(公告)日:2024-06-25

    申请号:US17893608

    申请日:2022-08-23

    Applicant: SAP SE

    CPC classification number: G06F16/2282 G06F16/213 G06F16/2255 G06F16/2456

    Abstract: Various embodiments for a triple integration and querying system with dictionary compression are described herein. An embodiment operates by identifying a table of a database with four or more columns with triple formatted data including one subject column, one predicate column, and two or more object columns. It is determined that a master dictionary is to be generated for the both the subject column and the predicate column based on an identical datatype being used for both columns. A subject data dictionary and a predicate data dictionary are generated. A unique value is assigned a same unique identifier a in both the object data dictionary and the subject data dictionary. A master dictionary including both the unique values from the subject data dictionary and the predicate data dictionary is generated. Values in the subject column and the predicate column are replaced based on the unique values from the master dictionary.

    Notification framework for document store

    公开(公告)号:US11609934B2

    公开(公告)日:2023-03-21

    申请号:US15961075

    申请日:2018-04-24

    Applicant: SAP SE

    Abstract: Provided are systems and methods for implementing a database notification and statistics framework. In one example, the method may include receiving a message published by a database, the message including information about a database event performed by the database, determining a publication type of the message from among a plurality of publication types based on attributes included in the message, identifying one or more database components that have subscribed to the identified publication type of the message, and transferring the information about the database event included in the message to the one or more identified database components that have subscribed to the publication type of the message.

    Synopsis based advanced partition elimination

    公开(公告)号:US11468093B2

    公开(公告)日:2022-10-11

    申请号:US15796365

    申请日:2017-10-27

    Applicant: SAP SE

    Abstract: Disclosed herein are system, method, and computer program product embodiments for determining a constraint data statistics object is consistent. An embodiment operates by receiving a query for a database table. The embodiment determines a constraint data statistic object associated with a partition of the database table. The embodiment determines the constraint data statistics object is consistent with a data state of the partition of the database table based on consistency metadata of the constraint data statistics object. The embodiment processes the query for the database table based on the determining the constraint data statistics object is consistent with the data state of the partition of the database table.

    Multi-master with ownership transfer

    公开(公告)号:US11106698B2

    公开(公告)日:2021-08-31

    申请号:US16437852

    申请日:2019-06-11

    Applicant: SAP SE

    Abstract: A method, a system and a computer program product for executing management of ownership of data. An index in a plurality of indexes is selected. The index corresponds to a plurality of ranges of data values stored in a plurality of database slices of a database. The index further corresponds to a partitioning structure that includes a plurality of hierarchically arranged nodes. Each node corresponds to a range of data values stored in at least one database slice. The structure is replicated across a plurality of computing systems. A computing system executes an update to one or more ranges of data values. The system replicates at least one of a database slice including the updated ranges of data values and a node that includes the updated ranges of data values to another computing system for storage of a replicate of the updated ranges of data values.

    Table placement in distributed databases

    公开(公告)号:US10984002B2

    公开(公告)日:2021-04-20

    申请号:US16245520

    申请日:2019-01-11

    Applicant: SAP SE

    Abstract: A node type of a plurality of distributed nodes to which a table to be added to a distributed database should be assigned can be identified by applying a set of placement rules defined for the table. The set of placement rules can also be applied to determine whether the table should be partitioned into more than one partition. A table group name associated with the table can be obtained and used in conjunction with the node type and determination of whether to partition the table to store the table in the distributed database on at least one node of the plurality of nodes as one or more partitions.

    Encoding and decoding files for a document store

    公开(公告)号:US10769214B2

    公开(公告)日:2020-09-08

    申请号:US15344187

    申请日:2016-11-04

    Applicant: SAP SE

    Abstract: A system for encoding and/or decoding documents for a document store is provided. In some implementations, the system performs operations comprising receiving a query related to at least a portion of a plurality of stored documents and retrieving one or more documents from the plurality of stored documents based on the query. The operations can further include generating a client-side dictionary including a plurality of definitions corresponding to a plurality of parameters (e.g., reference values for corresponding parameters) from the plurality of stored documents. The operations can also include providing the client-side dictionary and the one or more documents in response to the query. One or more of the operations can be implemented by a database server. Related systems, methods, and articles of manufacture are also described.

Patent Agency Ranking