Compression Technique For Deep Neural Network Weights

    公开(公告)号:US20220321143A1

    公开(公告)日:2022-10-06

    申请号:US17220620

    申请日:2021-04-01

    Abstract: Various embodiments include methods and devices for compression and decompression of weight data sets. Some embodiments may include compressing weight data by receiving a weight data set of binary numbers representing weight values, generating a frame payload including a compressed first frame of a first subset of the weight values in the weight data set, and generating a block of compressed weight data having the frame payload. Some embodiments may include decompressing weight data by retrieving a block of compressed weight data, in which the block of compressed weight data includes a frame header associated with a frame payload, in which the frame header includes a normalization factor indicator, and in which the frame payload includes compressed weight values, and generating a first decompressed frame comprising decompressed weight values of the compressed weight values of the frame payload.

    Processing data in pixel-to-pixel neural networks

    公开(公告)号:US12254405B2

    公开(公告)日:2025-03-18

    申请号:US17200090

    申请日:2021-03-12

    Abstract: Technologies are provided for processing data in neural networks. An example method can include processing, by each layer of a neural network, a row in a first stripe of a data input, the row being processed sequentially in a horizontal direction and according to a layer-by-layer sequence; after processing the row, processing, by each layer, subsequent rows in the first stripe on a row-by-row basis, each subsequent row being processed sequentially in the horizontal direction and according to the layer-by-layer sequence; generating an output stripe based on the processing of the row and subsequent rows; processing, by each layer, a second stripe of the data input, each row in the second stripe being processed in the horizontal direction and according to the layer-by-layer sequence, wherein rows in the second stripe are processed on a row-by-row basis; and generating another output stripe based on the processing of the second stripe.

    Compression technique for deep neural network weights

    公开(公告)号:US11757469B2

    公开(公告)日:2023-09-12

    申请号:US17220620

    申请日:2021-04-01

    CPC classification number: H03M7/702 G06N3/04

    Abstract: Various embodiments include methods and devices for compression and decompression of weight data sets. Some embodiments may include compressing weight data by receiving a weight data set of binary numbers representing weight values, generating a frame payload including a compressed first frame of a first subset of the weight values in the weight data set, and generating a block of compressed weight data having the frame payload. Some embodiments may include decompressing weight data by retrieving a block of compressed weight data, in which the block of compressed weight data includes a frame header associated with a frame payload, in which the frame header includes a normalization factor indicator, and in which the frame payload includes compressed weight values, and generating a first decompressed frame comprising decompressed weight values of the compressed weight values of the frame payload.

    Packet-based universal bit-field masking coding using configurable sparsity information

    公开(公告)号:US12132502B1

    公开(公告)日:2024-10-29

    申请号:US18483389

    申请日:2023-10-09

    CPC classification number: H03M7/3066 H03M7/6011 H03M7/6076

    Abstract: Systems and techniques are provided for compressing data. A process can include generating a compressed sub-packet by removing one or more sparsity bytes from a sequence of values corresponding to a sub-packet, the sequence of values including one or more sparsity bytes each equal to a configured sparsity value and one or more non-sparsity bytes each corresponding to a respective data value different from the configured sparsity value. A sub-packet header can be generated for the compressed sub-packet, and indicative of a respective location within the sequence of values of each non-sparsity byte. A packet header can be generated for a plurality of compressed sub-packets, and indicative of the configured sparsity value and respective coding information for each compressed sub-packet. A compressed data packet can be generated to include at least the packet header, the sub-packet header, and the one or more non-sparsity bytes included in the sequence of values.

Patent Agency Ranking