Metadata classification
    1.
    Granted invention patent

    Publication number: US11630853B2

    Publication date: 2023-04-18

    Application number: US17163156

    Filing date: 2021-01-29

    Applicant: Snowflake Inc.

    Abstract: Generating semantic names for a data set is described. An example method can include retrieving data from a data set, the data organized in a plurality of columns. The method may also include generating, for each column, one or more candidate semantic categories, wherein each candidate semantic category has a corresponding probability. The method may further include creating a feature vector for each column from that column's candidate semantic categories and the corresponding probabilities. Additionally, the method may include selecting, for each column, a column semantic category from the one or more candidate semantic categories using at least the feature vector and a trained machine learning model.
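
The pipeline in this abstract (candidate categories with probabilities, then a per-column feature vector, then selection by a trained model) can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration: the category vocabulary, the `ThresholdModel` stand-in, and its threshold values do not come from the patent, which does not specify the model type in its abstract.

```python
from typing import Dict, List, Optional

# Hypothetical category vocabulary; the abstract does not list one.
CATEGORIES: List[str] = ["email", "phone_number", "us_zip_code"]


def build_feature_vector(candidates: Dict[str, float]) -> List[float]:
    """Place each candidate's probability at its category's fixed index.

    Categories that were not proposed for the column get probability 0.0.
    """
    return [candidates.get(cat, 0.0) for cat in CATEGORIES]


class ThresholdModel:
    """Stand-in for a trained model: one acceptance threshold per category.

    In a real system the thresholds (or a richer classifier) would be
    learned from labeled columns; the values used below are made up.
    """

    def __init__(self, thresholds: Dict[str, float]) -> None:
        self.thresholds = thresholds

    def predict(self, features: List[float]) -> Optional[str]:
        # Pick the category whose probability clears its threshold by the
        # largest margin; return None when no category clears it.
        best_cat: Optional[str] = None
        best_margin = 0.0
        for cat, prob in zip(CATEGORIES, features):
            margin = prob - self.thresholds[cat]
            if margin > best_margin:
                best_cat, best_margin = cat, margin
        return best_cat


def classify_columns(
    column_candidates: Dict[str, Dict[str, float]], model: ThresholdModel
) -> Dict[str, Optional[str]]:
    """Select one semantic category (or None) for every column."""
    return {
        name: model.predict(build_feature_vector(candidates))
        for name, candidates in column_candidates.items()
    }


# Illustrative inputs: candidate categories and probabilities per column.
model = ThresholdModel({"email": 0.5, "phone_number": 0.6, "us_zip_code": 0.7})
column_candidates = {
    "contact": {"email": 0.92, "phone_number": 0.03},
    "zip": {"us_zip_code": 0.81, "phone_number": 0.40},
    "notes": {"email": 0.10},
}
result = classify_columns(column_candidates, model)
print(result)
```

With these made-up inputs, `contact` and `zip` each receive the single category that clears its threshold, while `notes` receives no category because its only candidate is too weak.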

    Metadata classification
    2.
    Granted invention patent

    Publication number: US11853329B2

    Publication date: 2023-12-26

    Application number: US18124415

    Filing date: 2023-03-21

    Applicant: SNOWFLAKE INC.

    CPC classification number: G06F16/285 G06F16/221 G06N5/01

    Abstract: Systems and methods are disclosed that retrieve data from a data set organized in a plurality of columns. For each column in the plurality of columns, the systems and methods generate one or more candidate semantic categories for the column, where each of the one or more candidate semantic categories has a corresponding probability. The systems and methods create a feature vector for the column from the one or more candidate semantic categories and the corresponding probabilities. The systems and methods determine a semantic category type of the column based on the feature vector. The systems and methods anonymize the data in the column based on the semantic category type, which includes replacing more specific data in the column with less specific data based on a data hierarchy that relates the more specific data to the less specific data.
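
The anonymization step in this abstract, replacing more specific values with less specific ones through a data hierarchy, can be sketched in Python as follows. This is only an illustration under stated assumptions: the city-to-state-to-country hierarchy, the `LEVELS_BY_CATEGORY` policy, and the function names are hypothetical and are not taken from the patent.

```python
from typing import Dict, List

# Hypothetical hierarchy: each key maps to its less specific parent value.
CITY_HIERARCHY: Dict[str, str] = {
    "San Mateo": "California",
    "Bozeman": "Montana",
    "California": "United States",
    "Montana": "United States",
}

# Hypothetical lookup tables keyed by the column's semantic category type.
HIERARCHIES: Dict[str, Dict[str, str]] = {"city": CITY_HIERARCHY}
LEVELS_BY_CATEGORY: Dict[str, int] = {"city": 1}


def generalize(value: str, hierarchy: Dict[str, str], levels: int) -> str:
    """Climb the hierarchy up to `levels` steps, stopping at the top."""
    for _ in range(levels):
        if value not in hierarchy:
            break
        value = hierarchy[value]
    return value


def anonymize_column(values: List[str], semantic_category: str) -> List[str]:
    """Replace each value with a less specific one from its hierarchy.

    Columns whose category has no hierarchy are returned unchanged, and
    values missing from the hierarchy are left as-is in this sketch; a
    production system would more likely suppress or mask such values.
    """
    hierarchy = HIERARCHIES.get(semantic_category)
    if hierarchy is None:
        return list(values)
    levels = LEVELS_BY_CATEGORY.get(semantic_category, 1)
    return [generalize(v, hierarchy, levels) for v in values]


cities = ["San Mateo", "Bozeman", "San Mateo"]
one_level = anonymize_column(cities, "city")
two_levels = [generalize(v, CITY_HIERARCHY, 2) for v in cities]
print(one_level)
print(two_levels)
```

Climbing one level turns each city into its state, and climbing two levels collapses every value to the country. How many levels to climb for a given category is a policy choice that this sketch hard-codes.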
