发明授权
- 专利标题: Method and system for compressing application data for operations on multi-core systems
-
申请号: US18178237申请日: 2023-03-03
-
公开(公告)号: US12126367B2公开(公告)日: 2024-10-22
- 发明人: Tianfang Liu
- 申请人: Cornami, Inc.
- 申请人地址: US TX Dallas
- 专利权人: Cornami, Inc.
- 当前专利权人: Cornami, Inc.
- 当前专利权人地址: US TX Dallas
- 代理机构: NIXON PEABODY LLP
- 主分类号: H03M7/30
- IPC分类号: H03M7/30 ; G06F7/483 ; G06F9/445 ; G06F18/2135 ; G06F18/24 ; G06N3/02 ; G06N3/0464 ; G06V10/764 ; G06V10/77 ; G06V10/82 ; H03M7/02 ; H03M7/40
摘要:
A system and method to compress application control data, such as weights for a layer of a convolutional neural network, is disclosed. A multi-core system for executing at least one layer of the convolutional neural network includes a storage device storing a compressed weight matrix of a set of weights of the at least one layer of the convolutional network and a decompression matrix. The compressed weight matrix is formed by matrix factorization and quantization of a floating point value of each weight to a floating point format. A decompression module is operable to obtain an approximation of the weight values by decompressing the compressed weight matrix through the decompression matrix. A plurality of cores executes the at least one layer of the convolutional neural network with the approximation of weight values to produce an inference output.
公开/授权文献
信息查询
IPC分类: