Executing database queries for joining tables using channel based flow control

    公开(公告)号:US12067058B2

    公开(公告)日:2024-08-20

    申请号:US17886383

    申请日:2022-08-11

    Inventor: Adam Szymański

    Abstract: A database system generates an execution plan including multiple operators for processing a database query, for example, a join query or a group by query. The database system allocates a set of threads. Threads communicate with other threads via blocking channels. A blocking channel includes a buffer of a fixed capacity. The database system processes the database query by streaming data through operators of the execution plan. A thread sends data generated by an operator to another thread via the blocking channel if the buffer of the blocking channel has available capacity to store the data, or else the thread blocks until the buffer has capacity to store the data. Similarly, a thread receives data generated by an operator of another thread via the blocking channel if the buffer of the blocking channel has available data, or else the thread blocks until the buffer has data.

    Computerized system and method for performing parameterization of columns in a virtual semantic layer

    公开(公告)号:US12056113B2

    公开(公告)日:2024-08-06

    申请号:US17488363

    申请日:2021-09-29

    CPC classification number: G06F16/2365 G06F16/221 G06F16/24537

    Abstract: The disclosed systems and methods provide a novel framework that parameterizes columns in a templated virtual semantic layer. The disclosed framework enables inter- and cross-column relationships between stored data within a SQL database to be determined and stored, and then leveraged at query time to enable an optimized search of the data within the database. The disclosed framework enables a database to be construed, modified and/or configured according to a hierarchy, and/or with types of metadata, that corresponds to the determined column relationships. This not only reduces the amount of data stored and reduces the time to process queries, but also enables a more streamlined approach for searches to be constructed and executed, which can increase the performance of the database and the operations of systems that are backed by the database.

    UNIFIED DATA ACCESS AND QUERYING
    4.
    发明公开

    公开(公告)号:US20240193170A1

    公开(公告)日:2024-06-13

    申请号:US18076768

    申请日:2022-12-07

    Applicant: SAP SE

    CPC classification number: G06F16/248 G06F16/24537 G06F16/24542 G06F16/24553

    Abstract: The present disclosure relates to computer-implemented methods, software, and systems for generating intelligent data reports based on data of different data source type(s). Metadata is obtained from data sources to support querying. A request is received by the compound database and from a query panel to generate a report associated with at least one data source. The request includes one or more data manipulation operations, wherein at least one of the data manipulation operations is not natively supported by a data source of the at least one data source. A query is generated based on the request. The query is split based on a number of data sources associated with the request to generate the report. The query is transformed into at least one sub-queries corresponding to each of the at least one data source and a report is generated based on their execution.

    Build-side skew handling for hash-partitioning hash joins

    公开(公告)号:US12001428B2

    公开(公告)日:2024-06-04

    申请号:US18047872

    申请日:2022-10-19

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/24537 G06F16/2255

    Abstract: Provided herein are systems and methods for handling build-side skew. For example, a method includes computing a plurality of hash values for a join operation. The join operation uses a corresponding plurality of row sets. The plurality of hash values are sampled to detect a frequent hash value. A build-side row set is partitioned using the frequent hash value to generate a partitioned build-side row set. The build-side row set is selected from the plurality of row sets. The partitioned build-side row set is distributed to a plurality of hash-join-build (HJB) instances executing at a corresponding plurality of servers.

    ARBITRARY DIRECTORY DATA PROPERTIES IN SQL-BASED DIRECTORY SYSTEMS

    公开(公告)号:US20240152514A1

    公开(公告)日:2024-05-09

    申请号:US17984184

    申请日:2022-11-09

    CPC classification number: G06F16/24537 G06F16/24542

    Abstract: The techniques disclosed herein enhance the efficiency and functionality of directory systems. This is accomplished by augmenting a primary table with an extended table for storing properties of an associated entity (e.g., a user, a company). A table serves to organize directory data and comprises rows and columns. Each row of the primary table stores an entity with each column storing an associated property. In contrast, the primary table is configured with an extended table in which each row of the extended table stores a property for an associated entity while each column stores additional information for the property such as timestamps and metadata. Storing specific properties in the extended table eliminates empty spaces in the primary table thereby reducing the volume of stored data. Moreover, by including additional data for properties, the extended table enables property-specific features such as search, synchronization, extensibility, and lifecycle management.

    Information processing apparatus, method and storage medium

    公开(公告)号:US11954107B2

    公开(公告)日:2024-04-09

    申请号:US17893470

    申请日:2022-08-23

    CPC classification number: G06F16/2456 G06F16/24537

    Abstract: According to one embodiment, an information processing apparatus includes a processor. The processor is configured to receive, from a client terminal, a first query to request a result of a join operation process on first and second tables, create a second query to a first database management device based on the first query, receive the first table from the first database management device by performing a process on the second query, create a third query to a second database management device based on the first query and the first table, receive a part of a second table from the second database management device by performing a process on the third query, and transmit, to the client terminal, a response to the first query based on the part of the second table.

    COLUMN-BASED UNION PRUNING
    10.
    发明公开

    公开(公告)号:US20240095243A1

    公开(公告)日:2024-03-21

    申请号:US17933212

    申请日:2022-09-19

    Applicant: SAP SE

    CPC classification number: G06F16/24537 G06F11/3419 G06F16/244

    Abstract: In one aspect, a method may include receiving a query associated with a plurality of data sources, wherein the query includes a first attribute; identifying that a query operator, which is associated with execution of the query and the first attribute, includes a first input from a first data source of the plurality of data sources and a second input from a second data source of the plurality of data sources; determining that the first attribute at the second data source corresponds to null; pruning, based on the determined null, the second input from the second data source to inhibit a select from the second data source; and in response to the pruning, performing the query operator by selecting, from the first data source, a column corresponding to the first attribute. Related systems, methods, and articles or manufacture are also disclosed.

Patent Agency Ranking