Method and device for transforming CNN layers to optimize CNN parameter quantization to be used for mobile devices or compact networks with high precision via hardware optimization

发明授权

US10325352B1 Method and device for transforming CNN layers to optimize CNN parameter quantization to be used for mobile devices or compact networks with high precision via hardware optimization 有权

请登陆查看更多内容

专利标题： Method and device for transforming CNN layers to optimize CNN parameter quantization to be used for mobile devices or compact networks with high precision via hardware optimization
申请号： US16255197

申请日： 2019-01-23
公开(公告)号： US10325352B1

公开(公告)日： 2019-06-18
发明人: Kye-Hyeon Kim , Yongjoong Kim , Insu Kim , Hak-Kyoung Kim , Woonhyun Nam , SukHoon Boo , Myungchul Sung , Donghun Yeo , Wooju Ryu , Taewoong Jang , Kyungjoong Jeong , Hongmo Je , Hojin Cho
申请人： Stradvision, Inc.
申请人地址： KR Pohang
专利权人： STRADVISION, INC.
当前专利权人： STRADVISION, INC.
当前专利权人地址： KR Pohang
代理机构： XSensus LLP
主分类号： G06K9/00
IPC分类号： G06K9/00 ; G06T3/40

Method and device for transforming CNN layers to optimize CNN parameter quantization to be used for mobile devices or compact networks with high precision via hardware optimization

摘要：

There is provided a method for transforming convolutional layers of a CNN including m convolutional blocks to optimize CNN parameter quantization to be used for mobile devices, compact networks, and the like with high precision via hardware optimization. The method includes steps of: a computing device (a) generating k-th quantization loss values by referring to k-th initial weights of a k-th initial convolutional layer included in a k-th convolutional block, a (k−1)-th feature map outputted from the (k−1)-th convolutional block, and each of k-th scaling parameters; (b) determining each of k-th optimized scaling parameters by referring to the k-th quantization loss values; (c) generating a k-th scaling layer and a k-th inverse scaling layer by referring to the k-th optimized scaling parameters; and (d) transforming the k-th initial convolutional layer into a k-th integrated convolutional layer by using the k-th scaling layer and the (k−1)-th inverse scaling layer.

信息查询

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06K	图形数据读取（图像或视频识别或理解G06V）；数据的呈现；记录载体；处理记录载体
G06K9/00	识别模式的方法或装置（图形读取或将机械参数模式（例如力或存在）转换为电信号的方法或装置 G06K11/00）（图像或视频识别或理解 G06V）（语音识别 G10L15/00 )