COMPRESSING WEIGHT UPDATES FOR DECODER-SIDE NEURAL NETWORKS

    公开(公告)号:US20200311551A1

    公开(公告)日:2020-10-01

    申请号:US16828106

    申请日:2020-03-24

    Abstract: A method, apparatus, and computer program product are provided for training a neural network or providing a pre-trained neural network with the weight-updates being compressible using at least a weight-update compression loss function and/or task loss function. The weight-update compression loss function can comprise a weight-update vector defined as a latest weight vector minus an initial weight vector before training. A pre-trained neural network can be compressed by pruning one or more small-valued weights. The training of the neural network can consider the compressibility of the neural network, for instance, using a compression loss function, such as a task loss and/or a weight-update compression loss. The compressed neural network can be applied within a decoding loop of an encoder side or in a post-processing stage, as well as at a decoder side.

Patent Agency Ranking