Compression of machine-learned models via entropy penalized weight reparameterization

Invention Grant

US11574232B2 Compression of machine-learned models via entropy penalized weight reparameterization 有权

Please log in to see more content

Patent Title: Compression of machine-learned models via entropy penalized weight reparameterization
Application No.: US15931016

Application Date: 2020-05-13
Publication No.: US11574232B2

Publication Date: 2023-02-07
Inventor: Deniz Oktay , Saurabh Singh , Johannes Balle , Abhinav Shrivastava
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Agency: Dority & Manning, P.A.
Main IPC: G06N20/00
IPC: G06N20/00 ; G06N3/08

Compression of machine-learned models via entropy penalized weight reparameterization

Abstract:

Example aspects of the present disclosure are directed to systems and methods that learn a compressed representation of a machine-learned model (e.g., neural network) via representation of the model parameters within a reparameterization space during training of the model. In particular, the present disclosure describes an end-to-end model weight compression approach that employs a latent-variable data compression method. The model parameters (e.g., weights and biases) are represented in a “latent” or “reparameterization” space, amounting to a reparameterization. In some implementations, this space can be equipped with a learned probability model, which is used first to impose an entropy penalty on the parameter representation during training, and second to compress the representation using arithmetic coding after training. The proposed approach can thus maximize accuracy and model compressibility jointly, in an end-to-end fashion, with the rate-error trade-off specified by a hyperparameter.

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N20/00	机器学习