PIPELINE RANKING WITH MODEL-BASED DYNAMIC DATA ALLOCATION

    公开(公告)号:US20220343207A1

    公开(公告)日:2022-10-27

    申请号:US17237379

    申请日:2021-04-22

    摘要: In a method for ranking machine learning (ML) pipelines for a dataset, a processor receives first performance curves predicted by a meta learner model for a plurality of ML pipelines. A processor allocates a first subset of data points from the dataset to each of the plurality of ML pipelines. A processor receives first performance scores for each of the ML pipelines for the first subset of data points. A processor updates the meta learner model using the first performance scores. A processor receives second performance curves from the meta learner model updated with the first performance scores. A processor ranks the plurality of ML pipelines based on the second performance curves.

    CODE GENERATION FOR AUTO-AI
    4.
    发明申请

    公开(公告)号:US20220004914A1

    公开(公告)日:2022-01-06

    申请号:US16919258

    申请日:2020-07-02

    摘要: An embodiment of the invention may include a method, computer program product, and system for creating a data analysis tool. The method may include a computing device that generates an AI pipeline based on an input dataset, wherein the AI pipeline is generated using an Automated Machine Learning program. The method may include converting the AI pipeline to a non-native format of the Automated Machine Learning program. This may enable the AI pipeline to be used outside of the Automated Machine Learning program, thereby increasing the usefulness of the created program by not tying it to the Automated Machine Learning program. Additionally, this may increase the efficiency of running the AI pipeline by eliminating unnecessary computations performed by the Automated Machine Learning program.

    Code generation for Auto-AI
    9.
    发明授权

    公开(公告)号:US11861469B2

    公开(公告)日:2024-01-02

    申请号:US16919258

    申请日:2020-07-02

    IPC分类号: G06N20/00 G06F8/35 G06F8/76

    CPC分类号: G06N20/00 G06F8/35 G06F8/76

    摘要: An embodiment of the invention may include a method, computer program product, and system for creating a data analysis tool. The method may include a computing device that generates an AI pipeline based on an input dataset, wherein the AI pipeline is generated using an Automated Machine Learning program. The method may include converting the AI pipeline to a non-native format of the Automated Machine Learning program. This may enable the AI pipeline to be used outside of the Automated Machine Learning program, thereby increasing the usefulness of the created program by not tying it to the Automated Machine Learning program. Additionally, this may increase the efficiency of running the AI pipeline by eliminating unnecessary computations performed by the Automated Machine Learning program.