COMPRESSION OF SPARSE DEEP CONVOLUTIONAL NETWORK WEIGHTS

    公开(公告)号:US20250131258A1

    公开(公告)日:2025-04-24

    申请号:US19001327

    申请日:2024-12-24

    Abstract: The present disclosure describes methods, computer-readable media, and apparatuses for operating neural networks. For example, an apparatus may receive a set of sparse weight vectors. The apparatus may perform a sparse computation based on the set of sparse weight vectors. The apparatus may combine sparse weight vectors in response to determining a combined time to perform respective numbers of MAC operations for the sparse weight vectors satisfies a threshold number of clock cycles. The apparatus may operate a neural network based at least in part on one or more partial sums produced in performing the sparse computation.

    APPROXIMATION OF NON-LINEAR FUNCTIONS IN FIXED POINT USING LOOK-UP TABLES

    公开(公告)号:US20180060278A1

    公开(公告)日:2018-03-01

    申请号:US15255015

    申请日:2016-09-01

    CPC classification number: G06F17/17 G06F7/544 G06F2207/5354

    Abstract: Computing a non-linear function ƒ(x) in hardware or embedded systems can be complex and resource intensive. In one or more aspects of the disclosure, a method, a computer-readable medium, and an apparatus are provided for computing a non-linear function ƒ(x) accurately and efficiently in hardware using look-up tables (LUTs) and interpolation or extrapolation. The apparatus may be a processor. The processor computes a non-linear function ƒ(x) for an input variable x, where ƒ(x)=g(y(x),z(x)). The processor determines an integer n by determining a position of a most significant bit (MSB) of an input variable x. In addition, the processor determines a value for y(x) based on a first look-up table and the determined integer n. Also, the processor determines a value for z(x) based on n and the input variable x, and based on a second look-up table. Further, the processor computes ƒ(x) based on the determined values for y(x) and z(x).

    COMPRESSION OF SPARSE DEEP CONVOLUTIONAL NETWORK WEIGHTS

    公开(公告)号:US20190087713A1

    公开(公告)日:2019-03-21

    申请号:US16137491

    申请日:2018-09-20

    Abstract: The present disclosure describes methods, computer-readable media, and apparatuses for operating neural networks. For example, a first apparatus may receive a set of sparse weight vectors. The first apparatus may compress the set of sparse weight vectors to produce a compressed set of sparse weight vectors. The first apparatus may operate a neural network based on the compressed set of sparse weight vectors. In another example, a second apparatus may receive a set of sparse weight vectors. The second apparatus may perform a sparse computation based on the set of sparse weight vectors, and the performance of the sparse computation may produce one or more partial sums. The second apparatus may operate a neural network based at least in part on the one or more partial sums.

Patent Agency Ranking