- 专利标题: Multi-GPU deep learning using CPUs
-
申请号: US15843244申请日: 2017-12-15
-
公开(公告)号: US11164079B2公开(公告)日: 2021-11-02
- 发明人: Tung D. Le , Haruki Imai , Taro Sekiyama , Yasushi Negishi
- 申请人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 申请人地址: US NY Armonk
- 专利权人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 当前专利权人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 当前专利权人地址: US NY Armonk
- 代理机构: Tutunjian & Bitetto, P.C.
- 代理商 Randall Bluestone
- 主分类号: G06N3/08
- IPC分类号: G06N3/08 ; G06T1/20 ; G06N3/04
摘要:
A computer-implemented method, computer program product, and computer processing system are provided for accelerating neural network data parallel training in multiple graphics processing units (GPUs) using at least one central processing unit (CPU). The method includes forming a set of chunks. Each of the chunks includes a respective group of neural network layers other than a last layer. The method further includes performing one or more chunk-wise synchronization operations during a backward phase of the neural network data parallel training, by each of the multiple GPUs and the at least one CPU.
公开/授权文献
- US20190188560A1 MULTI-GPU DEEP LEARNING USING CPUS 公开/授权日:2019-06-20
信息查询