SCALABLE QUERY PROCESSING
    161.
    发明申请

    公开(公告)号:US20220414097A1

    公开(公告)日:2022-12-29

    申请号:US17823572

    申请日:2022-08-31

    Applicant: Snowflake Inc.

    Abstract: Embodiments of the present disclosure may provide a dynamic query execution model. This query execution model may provide acceleration by scaling out parallel parts of a query (also referred to as a fragment) to additional computing resources, for example computing resources leased from a pool of computing resources. Execution of the parts of the query may be coordinated by a parent query coordinator, where the query originated, and a fragment query coordinator.

    INLINE COMPILATION OF USER DEFINED FUNCTIONS

    公开(公告)号:US20220413816A1

    公开(公告)日:2022-12-29

    申请号:US17823621

    申请日:2022-08-31

    Applicant: Snowflake Inc.

    Abstract: Embodiments described herein provide techniques for in-line compiling of UDFs in other programming languages. These techniques enable faster programming iterations because it allows users to compile directly in the cloud processing system. Moreover, it allows the UDFs to tie into existing libraries. The compiled results are treated as sensitive and handled with appropriate security policies, as with any other user data in the system.

    Implementing multidimensional two-sided interval joins using sampling-based input-domain demarcation

    公开(公告)号:US11537614B2

    公开(公告)日:2022-12-27

    申请号:US17454899

    申请日:2021-11-15

    Applicant: Snowflake Inc.

    Abstract: In an embodiment, a data platform receives a query that includes a two-sided N dimensional interval join of first and second input relations. The data platform samples, with respect to each of one or more of the N dimensions, one or both of the first input relation and the second input relation with respect to an interval size of an interval determined from the input relation. The data platform demarcates the N dimensional input domain into non-overlapping N dimensional input-domain cells based on the sampling. The data platform implements the interval join using a query-execution plan that includes an equality join that is keyed on input-domain-cell identifiers of input-domain cells that at least partially overlap bounding polygons of the intervals determined from the first and second input relations. The equality join is followed in the query-execution plan by a filter that applies the interval-join predicate. The data platform obtains results of the query by executing the query-execution plan.

    Merge small file consolidation
    165.
    发明授权

    公开(公告)号:US11537613B1

    公开(公告)日:2022-12-27

    申请号:US17514084

    申请日:2021-10-29

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives a query plan corresponding to a query. The subject technology executes the query based at least in part on the query plan, the executing including: filtering a first set of files that are to be modified by a merge statement, performing a split operation to send information related to a second set of files to a scan set builder operation in a first portion of the query plan and scan back operation in a second portion of the query plan, performing the scan set builder operation to remove the second set of files from the first set of files, performing a table scan operation based on a third set of files, and performing a first union all operation to combine the first set of data with a second set of data as a first set of combined data.

    SYSTEM AND METHOD FOR DISJUNCTIVE JOINS

    公开(公告)号:US20220391390A1

    公开(公告)日:2022-12-08

    申请号:US17879615

    申请日:2022-08-02

    Applicant: SNOWFLAKE INC.

    Abstract: Joining data using a disjunctive operator is described. An example computer-implemented method can include generating a query plan for a query, wherein there is a join operator expression for each of a plurality of disjunctive predicates and each join operator expression includes at least a conjunctive predicate and a disjunctive operator. The method may also include generating a bloom filter for each of the plurality of disjunctive operators. The method may further include evaluating each of the plurality of join operator expressions using a corresponding one of the plurality of disjunctive operators and bloom filter for each of the plurality of disjunctive predicates to generate a result set.

    QUERY PROCESSING USING A DISTRIBUTED STOP OPERATOR

    公开(公告)号:US20220382782A1

    公开(公告)日:2022-12-01

    申请号:US17815389

    申请日:2022-07-27

    Applicant: Snowflake Inc.

    Abstract: A global and local row count limit associated with a limit query are received by a stop operator of a first execution node among a set of execution nodes that are assigned to process the limit query. Local distributed row count data is generated based on a local row count corresponding to a number of rows output by the first execution node in processing the query. Based on determining the local row count satisfies the local limit, the first execution node buffers rows produced in processing the query. The local distributed row count data is updated based on remote distributed row count data received from a second execution node. A stopping condition is detected based on determining the global limit is satisfied based on updated local distributed row count data and query processing by the first execution node based on detecting the stopping condition.

    Client side certificate revocation service

    公开(公告)号:US11516023B1

    公开(公告)日:2022-11-29

    申请号:US17520041

    申请日:2021-11-05

    Applicant: Snowflake Inc.

    Abstract: A proxy revocation service provides a reliable service for performing revocation checks. The proxy revocation service queries public certificate authorities for the revocation status of a set of digital certificates and maintains a database of the revocation statuses. The proxy revocation service provides a singular endpoint that is Application Protocol Interface (API) accessible to web clients. Web clients communicate with the proxy revocation service through use of API message to perform revocation checks, rather than communicating with the public certificate authorities using an online certificate status protocol (OCSP). Use of the proxy revocation service provides both a reliable service for performing revocation checks as well as shifts the complexity away from the web clients.

    Resource provisioning in database systems

    公开(公告)号:US11514064B2

    公开(公告)日:2022-11-29

    申请号:US17663248

    申请日:2022-05-13

    Applicant: Snowflake Inc.

    Abstract: Resource provisioning systems and methods are described. In an embodiment, a system includes a plurality of shared storage devices collectively storing database data, an execution platform, and a compute service manager. The compute service manager is configured to determine a task to be executed in response to a trigger event and determine a query plan for executing the task, wherein the query plan comprises a plurality of discrete subtasks. The compute service manager is further configured to assign the plurality of discrete subtasks to one or more nodes of a plurality of nodes of the execution platform, determine whether execution of the task is complete, and in response to determining the execution of the task is complete, store a record in the plurality of shared storage devices indicating the task was completed.

Patent Agency Ranking