NOISY AGGREGATES IN A QUERY PROCESSING SYSTEM

    公开(公告)号:US20240362355A1

    公开(公告)日:2024-10-31

    申请号:US18647728

    申请日:2024-04-26

    Applicant: Snowflake Inc.

    CPC classification number: G06F21/6227 G06F16/24556 G06F16/24565

    Abstract: A noisy aggregation constraint system receives a query for a shared dataset, where the query identifies an operation. The noisy aggregation constraint system accesses a set of data from the shared dataset to perform the operation, the set of data comprises data accessed from a table of the shared dataset. The system determines that an aggregation constraint policy is attached to the table, the policy restricts output of data values stored in the table. Based on the context of the query, the system determines that the aggregation constraint policy should be enforced in relation to the query. The system assigns a specified noise level to the shared dataset and generates an output based on the set of data and the operation; the output comprises data values added to the table based on the specified noise level.

    COLUMN CLASSIFICATION MODEL
    2.
    发明公开

    公开(公告)号:US20240330412A1

    公开(公告)日:2024-10-03

    申请号:US18192243

    申请日:2023-03-29

    Applicant: Snowflake Inc.

    CPC classification number: G06F18/241 G06F16/221

    Abstract: Systems and methods for classifying columns using a model are provided. The systems and methods access a table associated with a column of features and retrieve a list of categories each associated with a different scoring model. The systems and methods, for each category in the list of categories, apply a respective scoring model to the features of the column to generate a respective set of confidence values indicating a likelihood that the column belongs to a respective one of the categories. The systems and methods process the respective sets of confidence values to select a target category from the list of categories and associate the selected target category with the column.

Patent Agency Ranking