DISTRIBUTED IN-DATABASE VECTORIZED OPERATIONS USING USER DEFINED TABLE FUNCTIONS

    公开(公告)号:US20250139076A1

    公开(公告)日:2025-05-01

    申请号:US18496734

    申请日:2023-10-27

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives first data. The subject technology transforms first data to a wide format, the wide format comprising a second table. The subject technology splits a set of rows into a set of shards of rows. The subject technology, for each shard from the set of shards, sends a particular user defined table function (UDTF), including a particular shard of rows, to a different execution node to perform a set of operations for determining a rolling correlation over a window size. The subject technology provides a set of output values of each particular UDTF corresponding to each shard from the set of shards in a second UDTF. The subject technology sends the second UDTF to a particular execution node to perform an aggregate operation. The subject technology receives a value of the rolling correlation from the particular execution node based on the aggregate operation.

    DISTRIBUTED IN-DATABASE VECTORIZED OPERATIONS USING USER DEFINED TABLE FUNCTIONS

    公开(公告)号:US20240256515A1

    公开(公告)日:2024-08-01

    申请号:US18461959

    申请日:2023-09-06

    Applicant: Snowflake Inc.

    CPC classification number: G06F16/2291

    Abstract: The subject technology receives a set of values for a data set corresponding to a request for determining a Pearson correlation. The subject technology determines a number of rows from the data set. The subject technology determines a number of execution nodes to execute the request. The subject technology determines a set of shards of rows from the data set. The subject technology, for each shard from the set of shards, sends a particular user defined table function (UDTF), including a particular shard of rows, to a different execution node to perform a set of operations. The subject technology provides a set of output values of each particular UDTF corresponding to each shard from the set of shards in a second UDTF. The subject technology sends the second UDTF to a particular execution node to perform an aggregate operation using the set of output values of each particular UDTF.

    Distributed in-database vectorized operations using user defined table functions

    公开(公告)号:US12197413B2

    公开(公告)日:2025-01-14

    申请号:US18461959

    申请日:2023-09-06

    Applicant: Snowflake Inc.

    Abstract: The subject technology receives a set of values for a data set corresponding to a request for determining a Pearson correlation. The subject technology determines a number of rows from the data set. The subject technology determines a number of execution nodes to execute the request. The subject technology determines a set of shards of rows from the data set. The subject technology, for each shard from the set of shards, sends a particular user defined table function (UDTF), including a particular shard of rows, to a different execution node to perform a set of operations. The subject technology provides a set of output values of each particular UDTF corresponding to each shard from the set of shards in a second UDTF. The subject technology sends the second UDTF to a particular execution node to perform an aggregate operation using the set of output values of each particular UDTF.

Patent Agency Ranking