MACHINE LEARNING DATA FEATURE REDUCTION AND MODEL OPTIMIZATION
摘要:
For machine learning data reduction and model optimization a method randomly assigns each data feature of a training data set to a plurality of solution groups. Each solution group has no more than a solution group number k of data features and each data feature is assigned to a plurality of solution groups. The method identifies each solution group as a high-quality solution group or a low-quality solution group. The method further calculates data feature scores for each data feature comprising a high bin number and a low bin number. The method determines level data for each data feature from the data feature scores using a fuzzy inference system. The method identifies an optimized data feature set based on the level data. The method further trains a production model using only the optimized data feature set. The method predicts a result using the production model.
信息查询
0/0