AUTOMATIC MACHINE LEARNING POLICY NETWORK FOR PARAMETRIC BINARY NEURAL NETWORKS

    公开(公告)号:US20220164669A1

    公开(公告)日:2022-05-26

    申请号:US17442111

    申请日:2019-06-05

    Abstract: Systems, methods, apparatuses, and computer program products to receive a plurality of binary weight values for a binary neural network sampled from a policy neural network comprising a posterior distribution conditioned on a theta value. An error of a forward propagation of the binary neural network may be determined based on a training data and the received plurality of binary weight values. A respective gradient value may be computed for the plurality of binary weight values based on a backward propagation of the binary neural network. The theta value for the posterior distribution may be updated using reward values computed based on the gradient values, the plurality of binary weight values, and a scaling factor.

Patent Agency Ranking