-
1.
公开(公告)号:US20240231903A1
公开(公告)日:2024-07-11
申请号:US18614639
申请日:2024-03-23
Applicant: SambaNova Systems, Inc.
Inventor: Qi ZHENG , Arnav GOEL , Conrad Alexander TURLIK , Guoyao FENG , Joshua Earle POLZIN , Fansheng CHENG , Ravinder KUMAR , Greg DYKEMA , Subhra MAZUMDAR , Milad SHARIF , Jiayu BAI , Neal SANGHVI , Arjun SABNIS , Letao CHEN
CPC classification number: G06F9/4881 , G06F9/3877
Abstract: In a computer-implemented method a Dynamic Transfer Engine (DTE) included in a computing system receives a dynamic stimulus associated with transfer of stage data during execution of a dataflow application by the system. The DTE determines, based on source and destination devices of the transfer, a transfer method and a transfer channel to transfer the stage data between memories coupled to the source and destination devices. The DTE acquires, hardware resources of the computing system to transfer the stage using the channel and, initiates the transfer. A computer program product can cause one or more processors to perform the method. A computing system can comprise source and destination processors and memories, hardware channels to transfer data between the memories, a resource manager, and a DTE configured to perform the method.
-
公开(公告)号:US20230259823A1
公开(公告)日:2023-08-17
申请号:US18109080
申请日:2023-02-13
Applicant: SambaNova Systems, Inc.
Inventor: Greg DYKEMA , Fansheng CHENG , Kuan ZHOU , Arnav GOEL , Subhra MAZUMDAR , Milad SHARIF , Po-Yu WU , Bowen YANG , Qi ZHENG
IPC: G06N20/00
CPC classification number: G06N20/00
Abstract: In a method an orchestrator of a computing system determines that results of Machine Learning model computations are available and dispatches a worker to perform model computations that include computing gradients of the results. The orchestrator determines that a set of gradients of the results is available and dispatches a gradient worker to compute a sum of the gradients. The orchestrator determines that a second set of gradients of the results is available and dispatches a second gradient worker to compute a sum of the second set of gradients. The orchestrator determines that the sums of the first and second gradients are available and dispatches a third gradient worker to compute synchronized gradients. The gradient workers compute the sums and synchronized gradients concurrent with training workers computing additional model computations results and/or gradients. A computer program product can include the method and a computing system can include the orchestrator.
-