- Patent title: Method and apparatus for performing operation of convolutional layers in convolutional neural network
- Application number: US16203031
- Filing date: 2018-11-28
- Publication number: US11822616B2
- Publication date: 2023-11-21
- Inventors: Delin Li, Kun Ling, Liang Chen, Jianjun Li
- Applicant: Nanjing Horizon Robotics Technology Co., Ltd.
- Applicant address: Nanjing, CN
- Assignee: Nanjing Horizon Robotics Technology Co., Ltd.
- Current assignee: Nanjing Horizon Robotics Technology Co., Ltd.
- Current assignee address: Nanjing, CN
- Agent: K&L Gates LLP
- Priority: CN 1711212061.4 2017.11.28
- Main classification: G06N3/063
- IPC classification: G06N3/063; G06N3/08; G06F17/15; G06N3/045; G06N20/10; G06F7/501; G06F7/523
Abstract:
Disclosed are a method and an apparatus for performing an operation of a convolutional layer in a convolutional neural network. The method comprises: reading unfolded feature data provided to the convolutional layer and an original convolution kernel of the convolutional layer from a dynamic random-access memory (DRAM); folding the unfolded feature data in at least one dimension of width and height to generate folded feature data; pre-processing the folded feature data and the original convolution kernel; storing the pre-processed folded feature data into a static random-access memory (SRAM); folding the pre-processed original convolution kernel in the at least one dimension to generate one or more folded convolution kernels corresponding to the original convolution kernel; storing the one or more folded convolution kernels in the SRAM; and reading the pre-processed folded feature data and the one or more folded convolution kernels from the SRAM into a calculation unit for convolving the pre-processed folded feature data with the one or more folded convolution kernels. By means of the method and/or apparatus in accordance with embodiments of the present disclosure, channel utilization may be improved, cache occupancy may be reduced, and operation efficiency may be improved.
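The folding idea in the abstract can be illustrated with a minimal sketch. It rests on assumptions the abstract does not spell out: "folding" in width is taken to mean splicing every `n` consecutive width slices together along the depth (channel) dimension, the stride is 1, only the width dimension is folded, and the DRAM/SRAM transfers and the pre-processing (padding) step are reduced to simple zero padding. All function names (`conv1d`, `fold_width`, `fold_kernel`, `folded_conv1d`) are hypothetical and chosen for illustration only. Under these assumptions, one original kernel yields `n` folded kernels, and their interleaved outputs reproduce the ordinary convolution result.

```python
def conv1d(feature, kernel):
    """Stride-1 valid convolution (cross-correlation) along width.

    feature: list of W slices, each a list of C channel values.
    kernel:  list of K slices, each a list of C channel values.
    """
    out = []
    for x in range(len(feature) - len(kernel) + 1):
        acc = 0
        for i, k_slice in enumerate(kernel):
            for a, b in zip(k_slice, feature[x + i]):
                acc += a * b
        out.append(acc)
    return out


def fold_width(data, n, channels):
    """Splice every n consecutive width slices into one deeper slice.

    Zero slices are appended so that the width is a multiple of n.
    """
    padded = list(data) + [[0] * channels] * (-len(data) % n)
    return [sum(padded[j:j + n], []) for j in range(0, len(padded), n)]


def fold_kernel(kernel, n, channels):
    """Build n folded kernels from one stride-1 kernel (assumed scheme).

    Kernel k is shifted by k leading zero slices; all shifted kernels are
    zero padded at the end to a common width, then folded like the data.
    """
    K = len(kernel)
    common = -(-(K + n - 1) // n) * n  # round K + n - 1 up to a multiple of n
    zero = [0] * channels
    folded = []
    for k in range(n):
        shifted = [zero] * k + list(kernel) + [zero] * (common - K - k)
        folded.append(fold_width(shifted, n, channels))
    return folded


def folded_conv1d(feature, kernel, n):
    """Convolve via folded data and folded kernels, then interleave."""
    channels = len(feature[0])
    out_len = len(feature) - len(kernel) + 1
    folded_kernels = fold_kernel(kernel, n, channels)
    kf = len(folded_kernels[0])        # width of each folded kernel
    groups = -(-out_len // n)          # folded output positions needed
    need = (groups + kf - 1) * n       # unfolded width that must be covered
    padded = list(feature) + [[0] * channels] * (need - len(feature))
    folded_feature = fold_width(padded, n, channels)
    partial = [conv1d(folded_feature, fk) for fk in folded_kernels]
    # partial[k][j] is the original output at width position n * j + k.
    return [partial[x % n][x // n] for x in range(out_len)]


# Hypothetical example data: width 7, 2 channels, kernel width 3.
feature = [[x + 1, 2 * x - 3] for x in range(7)]
kernel = [[1, -2], [0, 3], [4, 1]]
same_result = folded_conv1d(feature, kernel, 2) == conv1d(feature, kernel)
```

In this sketch the folded data has `n` times as many channels and roughly `1/n` of the width, which is one plausible reading of how folding could raise channel utilization on hardware whose multiply-accumulate lanes are wider than the input's channel count. The actual claimed procedure, including strides other than 1 and folding in height, is defined by the patent text rather than by this example.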