PIPELINE RANKING WITH MODEL-BASED DYNAMIC DATA ALLOCATION

    Publication No.: US20220343207A1

    Publication Date: 2022-10-27

    Application No.: US17237379

    Filing Date: 2021-04-22

    Abstract: In a method for ranking machine learning (ML) pipelines for a dataset, a processor receives first performance curves predicted by a meta learner model for a plurality of ML pipelines. A processor allocates a first subset of data points from the dataset to each of the plurality of ML pipelines. A processor receives first performance scores for each of the ML pipelines for the first subset of data points. A processor updates the meta learner model using the first performance scores. A processor receives second performance curves from the meta learner model updated with the first performance scores. A processor ranks the plurality of ML pipelines based on the second performance curves.
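
    The six steps in this abstract can be sketched as a short loop. This is a minimal illustration, not the patented implementation: `ToyMetaLearner`, its saturating curve formula, the `half_size` and `blend` parameters, and the two constant-score pipelines are all assumptions introduced here, since the abstract does not say how the meta learner predicts or updates its curves.

```python
from dataclasses import dataclass


@dataclass
class ToyMetaLearner:
    """Hypothetical stand-in for the meta learner model (not from the patent)."""
    final_score: dict          # pipeline name -> predicted score on the full dataset
    half_size: float = 100.0   # assumed data size at which half the final score is reached
    blend: float = 0.5         # assumed weight given to newly observed scores

    def predict_curve(self, name, sizes):
        # Assumed saturating curve: score(n) = final * n / (n + half_size)
        return [self.final_score[name] * n / (n + self.half_size) for n in sizes]

    def update(self, scores, n):
        # Extrapolate each observed score to a full-data estimate, then blend it in.
        for name, s in scores.items():
            estimate = min(1.0, s * (n + self.half_size) / n)
            self.final_score[name] = (
                (1 - self.blend) * self.final_score[name] + self.blend * estimate
            )


def rank_pipelines(pipelines, dataset, learner, subset_size):
    sizes = [subset_size, len(dataset)]
    # 1. First performance curves predicted by the meta learner.
    first = {name: learner.predict_curve(name, sizes) for name in pipelines}
    # 2. Allocate a first subset of data points to every pipeline.
    subset = dataset[:subset_size]
    # 3. First performance scores on that subset.
    scores = {name: evaluate(subset) for name, evaluate in pipelines.items()}
    # 4. Update the meta learner with the observed scores.
    learner.update(scores, subset_size)
    # 5. Second performance curves from the updated meta learner.
    second = {name: learner.predict_curve(name, sizes) for name in pipelines}
    # 6. Rank by the predicted score at the full dataset size.
    ranking = sorted(pipelines, key=lambda name: second[name][-1], reverse=True)
    return first, second, ranking


# Toy pipelines that return a fixed score regardless of the data (illustrative only).
pipelines = {"a": lambda data: 0.3, "b": lambda data: 0.6}
learner = ToyMetaLearner(final_score={"a": 0.8, "b": 0.5})
first, second, ranking = rank_pipelines(
    pipelines, list(range(1000)), learner, subset_size=100
)
```

    In this toy run the first curves favor pipeline "a", but the scores observed on the 100-point subset pull the updated curves the other way, so the final ranking places "b" ahead of "a".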

    OUTLIER DETECTION WITH TRANSFER LEARNING

    Publication No.: US20240428124A1

    Publication Date: 2024-12-26

    Application No.: US18338671

    Filing Date: 2023-06-21

    Abstract: Embodiments of the invention are directed to a computer system including a memory communicatively coupled to a processor system. The processor system is operable to perform processor system operations that include using a first machine learning (ML) algorithm to convert to-be-classified-data (TBC-data) from a TBC-data format to a second data format; and extract features from the TBC-data in the second data format. A second ML algorithm is used to perform a task that includes determining, based at least in part on the features of the TBC-data in the second data format, that the TBC-data having the second data format is an outlier.

    Testing and modifying calendar and event sensitive timer series data analytics

    Publication No.: US11099979B2

    Publication Date: 2021-08-24

    Application No.: US16669761

    Filing Date: 2019-10-31

    Abstract: A mechanism is provided to identify wall-clock time reference dependency in one or more software components of a data analytics solution. The data analytics solution is decomposed into a set of software components. A first software component of the set of software components is deployed to a first computer server and the remaining software components are deployed to a second computer server. A system clock time on the first computer server is changed to differ from the system clock time of the second computer server. Based on executing a test on the data analytics solution, a determination is made of whether the first software component is wall-clock time independent. Responsive to the test of the software component failing, indicating that the software component is dependent on the system clock time difference, the software component is recorded as wall-clock time dependent and an administrator is notified.
