PIPELINE RANKING WITH MODEL-BASED DYNAMIC DATA ALLOCATION

    公开(公告)号:US20220343207A1

    公开(公告)日:2022-10-27

    申请号:US17237379

    申请日:2021-04-22

    摘要: In a method for ranking machine learning (ML) pipelines for a dataset, a processor receives first performance curves predicted by a meta learner model for a plurality of ML pipelines. A processor allocates a first subset of data points from the dataset to each of the plurality of ML pipelines. A processor receives first performance scores for each of the ML pipelines for the first subset of data points. A processor updates the meta learner model using the first performance scores. A processor receives second performance curves from the meta learner model updated with the first performance scores. A processor ranks the plurality of ML pipelines based on the second performance curves.

    Testing and modifying calendar and event sensitive timer series data analytics

    公开(公告)号:US11099979B2

    公开(公告)日:2021-08-24

    申请号:US16669761

    申请日:2019-10-31

    IPC分类号: G06F11/36 G06F8/60

    摘要: A mechanism is provided to identify wall-clock time reference dependency in one or more software components of a data analytics solution. The data analytics solution is decomposed into a set of software components. A first software component of the set of software components is deployed to a first computer server and the remaining software components are deployed to a second computer server. A system clock time on the first computer server is changed to differ from the system clock of the second computer server. Based on executing a test on the data analytics solution, a determination is made of whether the first software component, is wall-clock time independent. Responsive to the test of the of the software component failing indicating that the wall-clock time of the software component is dependent of the system clock time difference, the software component is recorded as wall-clock time dependent and an administrator is notified.

    MULTI-AGENT REINFORCEMENT LEARNING PIPELINE ENSEMBLE

    公开(公告)号:US20230237385A1

    公开(公告)日:2023-07-27

    申请号:US17583522

    申请日:2022-01-25

    IPC分类号: G06N20/20

    CPC分类号: G06N20/20

    摘要: A computer-implemented method for configuring a plurality of machine learning pipelines into a machine learning pipeline ensemble is disclosed. The computer-implemented method includes determining, by a reinforcement learning agent coupled to a machine learning pipeline, performance information of the machine learning pipeline. The computer-implemented method further includes receiving, by the reinforcement learning agent, configuration parameter values of uncoupled machine learning pipelines of the plurality of machine learning pipelines. The computer-implemented method further includes adjusting, by the reinforcement learning agent, configuration parameter values of the machine learning pipeline based on the performance information of the machine learning pipeline and the configuration parameter values of the uncoupled machine learning pipelines.