Compatible and secure software upgrades

    公开(公告)号:US11836483B1

    公开(公告)日:2023-12-05

    申请号:US17804322

    申请日:2022-05-27

    CPC分类号: G06F8/71 G06N20/00

    摘要: Described are techniques for machine learning library management. The techniques include generating a table including a plurality of machine learning libraries and their current versions that are used in a deployed machine learning platform (MLP) instance, a first available version upgrade for a first machine learning library of the plurality of machine learning libraries, a security indication associated with the first available version upgrade relative to a current version implemented by the first machine learning library, and a compatibility indication between the first available version upgrade and the current version of the first machine learning library. The techniques further include generating a recommendation related to upgrading the first machine learning library based on the security indication and the compatibility indication.

    SYSTEM AND METHOD FOR EFFICIENT TRANSFORMATION PREDICTION IN A DATA ANALYTICS PREDICTION MODEL PIPELINE

    公开(公告)号:US20230359941A1

    公开(公告)日:2023-11-09

    申请号:US17739716

    申请日:2022-05-09

    IPC分类号: G06N20/20 G06Q20/40

    CPC分类号: G06N20/20 G06Q20/4016

    摘要: A computer-implemented system, platform, programing product, and/or method for improving transformation selection in an ensemble machine learning (ML) model that includes: providing all base ML models of the ensemble ML model; identifying all of a plurality of Derived Fields in all the base ML models; performing a Derived Field run prediction analysis for all the Derived Fields; computing the Derived Field Importance Weight for Field (DFIW4F) and the Derived Field Importance Weight for Model (DFIW4M) for all the Derived Fields; clustering all the Derived Fields into a plurality of Derived Field clusters, wherein each Derived Field cluster is based upon the DFIW4M and the DFIW4F for the Derived Field; sorting all the Derived Field clusters by best cluster based upon DFIW4M and DFIW4F; and running the base ML models based upon the Derived Fields in the best Derived Field cluster until sufficient base ML models have been run.

    INTELLIGENT EXPANSION OF REVIEWER FEEDBACK ON TRAINING DATA

    公开(公告)号:US20230214454A1

    公开(公告)日:2023-07-06

    申请号:US17568305

    申请日:2022-01-04

    IPC分类号: G06K9/62 G06N20/20

    摘要: An embodiment generates an initial set of training data from monitoring data. The initial set of training data is generated by combining outputs from a plurality of pretrained classifiers. The embodiment trains a new classification model using the initial set of training data to identify anomalies in monitoring data. The embodiment performs a multiple-level clustering of the data samples resulting in a plurality of clusters of sub-clusters of data samples, and generates a review list of data samples by selecting a representative data sample from each of the clusters. The embodiment receives an updated data sample from the expert review that includes a revised target classification for at least one of the data samples of the expert review list. The embodiment then trains another replacement classification model using a revised set of training data that includes the updated data sample and associated revised target classification.

    EFFICIENT MACHINE LEARNING MODEL INFERENCE

    公开(公告)号:US20230138987A1

    公开(公告)日:2023-05-04

    申请号:US17453565

    申请日:2021-11-04

    IPC分类号: G06N3/08 G06N20/20

    摘要: One or more computer processors calculate a cache prediction for a received inference request within an inference cache structured as a self-learning tree, wherein the inference request comprises a set of input values. The one or more computer processors responsive to the retrieved cache prediction exceeding a cache prediction threshold, transmit the cache prediction. The one or more computer processors parallel compute a model prediction for the received inference request utilizing a trained model. The one or more computer processors responsive to the retrieved model prediction exceeding a model prediction threshold, convert the trained model into a tree structure. The one or more computer processors update the inference cache with the converted train model. The one or more computer processors transmit the model prediction.

    RAPID INITIAL DEPLOYMENT DATABASE SECURITY MODEL

    公开(公告)号:US20220224722A1

    公开(公告)日:2022-07-14

    申请号:US17148684

    申请日:2021-01-14

    IPC分类号: H04L29/06 G06K9/62 G06N20/00

    摘要: A method, system, and computer program product for recommending an initial database security model. The method may include identifying a plurality of nodes connected to a security network. The method may also include analyzing security characteristics of each node of the plurality of nodes. The method may also include identifying, from the security characteristics, key factors for each node. The method may also include calculating similarities between each node of the plurality of nodes. The method may also include building a self-organized centerless network across the plurality of nodes by grouping nodes with high similarities based on the similarities between each node, where the self-organized centerless network is a centerless network without a central management server, and includes groups of nodes from the plurality of nodes. The method may also include generating federated security models for the groups of nodes.

    PERFORMANCE EVALUATION METHOD USING SIMULATED PROBE DATA MAPPING

    公开(公告)号:US20220091964A1

    公开(公告)日:2022-03-24

    申请号:US16948520

    申请日:2020-09-22

    IPC分类号: G06F11/36

    摘要: Embodiments of the present invention disclose a method, computer program product, and system for estimating the results of a performance test on an updated software application. A method, the method comprising receiving an updated software application, wherein the size of the updated software application is a first size and generating a plurality of small probe, wherein the size of each of the small probe data is a second size, wherein the second size is less than the first size. Conducting a first performance test on the plurality of small probe data and calculating an estimated elapsed time for a performance test on the updated software application. Conducting the performance test on the updated software application and determining if the updated software is given a PASS or FAIL for the performance test, based in part on the elapsed time of the performance test on the updated software application.