-
Publication No.: US20210350212A1
Publication Date: 2021-11-11
Application No.: US17328028
Application Date: 2021-05-24
Applicant: Intel Corporation
Inventor: DHIRAJ D. KALAMKAR, KARTHIKEYAN VAIDYANATHAN, SRINIVAS SRIDHARAN, DIPANKAR DAS
Abstract: One embodiment provides for a non-transitory machine readable medium storing instructions which, when executed by one or more processors, cause the one or more processors to perform operations comprising providing an interface to define a neural network using machine-learning domain specific terminology, wherein the interface enables selection of a neural network topology and abstracts low-level communication details of distributed training of the neural network.
-
Publication No.: US20220101480A1
Publication Date: 2022-03-31
Application No.: US17398295
Application Date: 2021-08-10
Applicant: Intel Corporation
Inventor: DHIRAJ D. KALAMKAR, KARTHIKEYAN VAIDYANATHAN, SRINIVAS SRIDHARAN, DIPANKAR DAS
Abstract: One embodiment provides for a method of transmitting data between multiple compute nodes of a distributed compute system, the method comprising creating a global view of communication operations to be performed between the multiple compute nodes of the distributed compute system, the global view created using information specific to a machine learning model associated with the distributed compute system; using the global view to determine a communication cost of the communication operations; and automatically determining a number of network endpoints for use in transmitting the data between the multiple compute nodes of the distributed compute system.
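Illustrative sketch (not taken from the patent): the three steps in this abstract, building a global view from model information, costing the communication, and picking an endpoint count, might look roughly like the Python below. The abstract does not disclose a cost model, so the latency/bandwidth formula, the per-endpoint overhead term, all default constants, and the names CommOp, build_global_view, communication_cost, and choose_endpoints are assumptions made purely for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CommOp:
    """One communication operation in the (hypothetical) global view."""
    layer: str
    num_bytes: int


def build_global_view(layer_param_counts, bytes_per_param=4):
    """Derive one gradient-exchange operation per layer from model information.

    Assumption: each layer exchanges its full parameter count per step.
    """
    return [CommOp(name, count * bytes_per_param)
            for name, count in layer_param_counts.items()]


def communication_cost(view, endpoints, latency_s=5e-6,
                       bandwidth_bytes_per_s=1e9, endpoint_overhead_s=2e-6):
    """Estimate total time for all operations with a given endpoint count.

    Assumed model: fixed latency, plus transfer time split evenly across
    endpoints, plus a management overhead that grows with each endpoint.
    """
    total = 0.0
    for op in view:
        transfer = op.num_bytes / (endpoints * bandwidth_bytes_per_s)
        total += latency_s + transfer + endpoints * endpoint_overhead_s
    return total


def choose_endpoints(view, max_endpoints=8):
    """Pick the endpoint count with the lowest estimated cost."""
    return min(range(1, max_endpoints + 1),
               key=lambda k: communication_cost(view, k))


if __name__ == "__main__":
    view = build_global_view({"conv1": 1_000, "fc": 1_000_000})
    print(choose_endpoints(view))
```

Under these assumed constants, a model dominated by large messages favors more endpoints, while a model with only tiny messages favors a single endpoint because the per-endpoint overhead outweighs the bandwidth gain.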
-