Reinforcement learning for training compression policies for machine learning models

    公开(公告)号:US11501173B1

    公开(公告)日:2022-11-15

    申请号:US16831595

    申请日:2020-03-26

    Abstract: A compression policy to produce compression profiles for compressing trained machine learning models may be trained using reinforcement learning. An iterative reinforcement learning may be performed response to a search request. Different prospective compression profiles may be generated for received machine learning models according to a compression policy being trained. Performance of compressed versions of the trained neural networks according to the compression profiles may be caused using data sets used to train the machine learning models. The compression policy may be updated according to reward signal determined from an application of a reward function for performance criteria to performance results of the different versions of the machine learning models. When a search criteria is satisfied, the trained compression policy may be provided.

Patent Agency Ranking