MODEL SELECTION IN ENSEMBLE LEARNING
摘要:
Certain aspects of the present disclosure provide techniques for detecting data errors. A method generally includes training each of a plurality of models on a plurality of training data sets to generate a set of trained models, determining a plurality of subsets of trained models from the set of trained models, for each respective subset: determining a plurality of ensemble outputs for the respective subset based on a plurality of validation data sets; and determining at least one evaluation metric for the respective subset based on the plurality of ensemble outputs; and determining an ensemble model as a subset of trained models having a best evaluation metric among a plurality of evaluation metrics associated with the plurality of subsets, wherein each subset comprises a different selection of models from the set of trained model than each other subset of trained models in the plurality of subsets of trained models.
信息查询
0/0