Compression technique for deep neural network weights

Invention Grant

US11757469B2 Compression technique for deep neural network weights 有权

Please log in to see more content

Patent Title: Compression technique for deep neural network weights
Application No.: US17220620

Application Date: 2021-04-01
Publication No.: US11757469B2

Publication Date: 2023-09-12
Inventor: Prajakt Kulkarni , Lakshmi Narayana Macha , Haoping Xu
Applicant: QUALCOMM Incorporated
Applicant Address: US CA San Diego
Assignee: QUALCOMM Incorporated
Current Assignee: QUALCOMM Incorporated
Current Assignee Address: US CA San Diego
Agency: The Marbury Law Group, PLLC
Main IPC: H03M7/30
IPC: H03M7/30 ; G06N3/04

Compression technique for deep neural network weights

Abstract:

Various embodiments include methods and devices for compression and decompression of weight data sets. Some embodiments may include compressing weight data by receiving a weight data set of binary numbers representing weight values, generating a frame payload including a compressed first frame of a first subset of the weight values in the weight data set, and generating a block of compressed weight data having the frame payload. Some embodiments may include decompressing weight data by retrieving a block of compressed weight data, in which the block of compressed weight data includes a frame header associated with a frame payload, in which the frame header includes a normalization factor indicator, and in which the frame payload includes compressed weight values, and generating a first decompressed frame comprising decompressed weight values of the compressed weight values of the frame payload.

Public/Granted literature

US20220321143A1 Compression Technique For Deep Neural Network Weights Public/Granted day:2022-10-06

Information query

Espacenet

IPC分类:

H	电学
H03	基本电子电路
H03M	一般编码、译码或代码转换（用射流方法入F15C4/00；光学模/数转换器入G02F7/00；专用于特殊应用的编码、译码或代码转换见有关小类，例如G01D，G01R，G06F，G06T，G09G，G10L，G11B，G11C，H04B，H04L，H04M，H04N；专用于密码技术或涉及需要保密的其他目的的编码或译码入G09C）
H03M7/00	把用给定序列的数字或给定数目的数字来表示信息的码，转换到用不同序列的数字或不同数目的数字来表示相同信息的码
H03M7/30	.压缩（用于减少冗余的语言分析—合成入G10L19/00；用于图像通信的入H04N）；扩展；消除不需要的数据，例如减少冗余