Feature Recommendations for Machine Learning Models Based on Feature-Model Co-Occurrences

    公开(公告)号:US20240362523A1

    公开(公告)日:2024-10-31

    申请号:US18140203

    申请日:2023-04-27

    IPC分类号: G06N20/00

    CPC分类号: G06N20/00

    摘要: A system maintains a data store for managing machine-learning (ML) models and features that are used by the models. The system generates a graph including nodes for each model and a node for each feature, and edges linking models and features that are used by the models. For a new model to be trained, the system receives a proposed feature corresponding to a node in the graph, and identifies one or more candidate features corresponding to nodes in the graph based in part on relevancy scores between the proposed feature with other features corresponding to nodes in the graph. The system presents in a user interface a suggestion to use one or more candidate features with the new model. Responsive to receiving a user selection of at least one candidate feature, the system causes the new model to be trained using the selected candidate feature and the proposed feature.

    Feature Recommendations for Machine Learning Models Using Trained Feature Prediction Model

    公开(公告)号:US20240362455A1

    公开(公告)日:2024-10-31

    申请号:US18140210

    申请日:2023-04-27

    IPC分类号: G06N3/045 G06N3/09

    CPC分类号: G06N3/045 G06N3/09

    摘要: A feature management system (the “system”) receives information about a new machine learning (ML) model to be trained. The information includes metadata about the new model. The system applies a trained feature prediction model to the information about the new model and metadata about a plurality of features. The feature prediction model is trained to predict a probability that each of the plurality of features should be selected as an input feature for the new model. The feature management system identifies one or more candidate features based on an output probability score of the feature prediction model. The system presents in a user interface a suggestion to use the one or more candidate features with the new model. The system selects at least one candidate feature and causes the new model to be trained using a set of input features, including the selected candidate feature.