HANDLING SYSTEM-CHARACTERISTICS DRIFT IN MACHINE LEARNING APPLICATIONS

    公开(公告)号:US20230132117A1

    公开(公告)日:2023-04-27

    申请号:US18087518

    申请日:2022-12-22

    Applicant: SNOWFLAKE INC.

    Abstract: Systems and methods for managing input and output error of a machine learning (ML) model in a database system are presented herein. A set of test queries is executed on a first version of a database system to generate first test data, wherein the first version of the system comprises a ML model to generate an output corresponding to a function of the database system. An error model is trained based on the first test data and second test data generated based on a previous version of the system. The error model determines an error associated with the ML model between the first and previous versions of the system. The first version of the system is deployed with the error model, which corrects an output or an input of the ML model until sufficient data has been produced by the error model to retrain the ML model.

    Data-driven query-execution scheduling

    公开(公告)号:US11620289B1

    公开(公告)日:2023-04-04

    申请号:US17930277

    申请日:2022-09-07

    Applicant: Snowflake Inc.

    Abstract: Embodiments of the present disclosure may provide a database optimization system that can generate computational values through a database compiler and assignment data for execution of a query by a plurality of nodes of a database system. The computational values and assignment data can be generated by one or more machine learning schemes. The machine learning schemes can be trained on previous computational values and previous assignment data.

    ANALYZING QUERY COMMENTS FOR IDENTIFYING POTENTIAL SOFTWARE BUGS

    公开(公告)号:US20220245135A1

    公开(公告)日:2022-08-04

    申请号:US17654147

    申请日:2022-03-09

    Applicant: Snowflake Inc.

    Inventor: Orestis Kostakis

    Abstract: Disclosed herein are embodiments of systems and methods for analyzing query comments for identifying potential software bugs. In an example, a data platform obtains query comments associated with a query. Based on determining that the query comments include a reference to a software bug of the data platform, the data platform generates a software-bug alert based on the query comments, and transmits the software-bug alert to an endpoint.

    HANDLING SYSTEM-CHARACTERISTICS DRIFT IN MACHINE LEARNING APPLICATIONS

    公开(公告)号:US20220230093A1

    公开(公告)日:2022-07-21

    申请号:US17154928

    申请日:2021-01-21

    Applicant: SNOWFLAKE INC.

    Abstract: Systems and methods for managing input and output error of a machine learning (ML) model in a database system are presented herein. A set of test queries is executed on a first version of a database system to generate first test data, wherein the first version of the system comprises a ML model to generate an output corresponding to a function of the database system. An error model is trained based on the first test data and second test data generated based on a previous version of the system. The error model determines an error associated with the ML model between the first and previous versions of the system. The first version of the system is deployed with the error model, which corrects an output or an input of the ML model until sufficient data has been produced by the error model to retrain the ML model.

    Adaptive freepool size prediction
    45.
    发明授权

    公开(公告)号:US11138038B1

    公开(公告)日:2021-10-05

    申请号:US17173717

    申请日:2021-02-11

    Applicant: Snowflake Inc.

    Abstract: The subject technology determines usage history metadata. The subject technology predicts a size value indicating an amount of computing resources to request for executing a set of queries based at least in part on the usage history metadata. The subject technology determines, during a prefetch window of time within a first period of time, a current size of a freepool of computing resources. The subject technology, in response to the current size of the freepool of computing resources being smaller than the predicted size value, sends a request for additional computing resources to include in the freepool of computing resources. The subject technology receives an indication that the request for additional computing resources was granted. The subject technology performs an operation to include the additional computing resources in the freepool of computing resources.

Patent Agency Ranking