Method and apparatus for quantization, adaptive block partitioning and codebook coding for neural network model compression

Invention Grant

US11245903B2 Method and apparatus for quantization, adaptive block partitioning and codebook coding for neural network model compression 有权

Please log in to see more content

Patent Title: Method and apparatus for quantization, adaptive block partitioning and codebook coding for neural network model compression
Application No.: US17099202

Application Date: 2020-11-16
Publication No.: US11245903B2

Publication Date: 2022-02-08
Inventor: Wei Wang , Wei Jiang , Shan Liu
Applicant: TENCENT AMERICA LLC
Applicant Address: US CA Palo Alto
Assignee: TENCENT AMERICA LLC
Current Assignee: TENCENT AMERICA LLC
Current Assignee Address: US CA Palo Alto
Agency: Sughrue Mion, PLLC
Main IPC: H04N19/13
IPC: H04N19/13 ; H04N19/176 ; H04N19/124 ; H04N19/119 ; H04N19/192 ; H04N19/46 ; G06N3/08 ; H04N19/597 ; H04N19/96 ; H04N19/30 ; H04N19/147

Method and apparatus for quantization, adaptive block partitioning and codebook coding for neural network model compression

Abstract:

A method of quantization, adaptive block partitioning and codebook coding for neural network model compression, is performed by at least one processor and includes determining a saturated maximum value of a multi-dimensional tensor in a layer of a neural network, and a bit depth corresponding to the saturated maximum value, and clipping weight coefficients in the multi-dimensional tensor to be within a range of the saturated maximum value. The method further includes quantizing the clipped weight coefficients, based on the bit depth, and transmitting, to a decoder, a layer header including the bit depth.

Public/Granted literature

US20210160499A1 METHOD AND APPARATUS FOR QUANTIZATION, ADAPTIVE BLOCK PARTITIONING AND CODEBOOK CODING FOR NEURAL NETWORK MODEL COMPRESSION Public/Granted day:2021-05-27

Information query

Espacenet

IPC分类:

H	电学
H04	电通信技术
H04N	图像通信，如电视
H04N19/00	用于数字视频信号编码，解码，压缩或解压缩的方法或装置
H04N19/10	.使用自适应编码
H04N19/102	..其特征在于由一个元素，参数或选择影响或通过自适应编码控制
H04N19/13	...自适应熵编码，例如自适应可变长度编码〔AVLC〕或上下文自适应二进制运算〔CABAC〕编码