-
公开(公告)号:US11961007B2
公开(公告)日:2024-04-16
申请号:US16783047
申请日:2020-02-05
Applicant: QUALCOMM Incorporated
Abstract: A method for accelerating machine learning on a computing device is described. The method includes hosting a neural network in a first inference accelerator and a second inference accelerator. The neural network split between the first inference accelerator and the second inference accelerator. The method also includes routing intermediate inference request results directly between the first inference accelerator and the second inference accelerator. The method further includes generating a final inference request result from the intermediate inference request results.