SYSTEM AND METHOD FOR OPTIMIZING DATA-TRANSFER AMONG MULTIPLE COMPUTE UNITS IN A DATA-PARALLEL COMPUTING SYSTEM

    Publication number: US20240394218A1

    Publication date: 2024-11-28

    Application number: US18794143

    Application date: 2024-08-05

    Abstract: System and method for optimizing data-transfer among multiple compute units in a data-parallel computing system. A topological communications configurator (TCC) determines a connections-optimized configuration of processors associated with compute nodes of the computing system. The processors can execute dataflow workers of an application and form intranodal segments of an internodal interconnection topology coupling the intranodal segments. The TCC determines the connections-optimized configuration based on internodal communications costs corresponding to communications routes among the internodal segments via the internodal interconnection fabric.
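As a rough illustration of choosing a configuration from internodal communications costs, the sketch below picks the cheapest ring ordering of compute nodes by exhaustive search. The ring shape, the symmetric cost matrix, and the names `ring_cost` and `best_ring` are all assumptions made for this example; the abstract does not state which topology or search procedure the TCC uses.

```python
from itertools import permutations

def ring_cost(order, cost):
    """Sum the link costs of a ring that visits nodes in `order` and closes back on the first node."""
    n = len(order)
    return sum(cost[order[i]][order[(i + 1) % n]] for i in range(n))

def best_ring(cost):
    """Return the lowest-cost ring ordering (hypothetical helper, brute force, small node counts only)."""
    n = len(cost)
    # Node 0 is fixed as the starting point so rotations of the same ring are not re-evaluated.
    candidates = ((0,) + rest for rest in permutations(range(1, n)))
    return min(candidates, key=lambda order: ring_cost(order, cost))

# Assumed internodal cost matrix: cost[a][b] is the cost of the route between nodes a and b.
cost = [
    [0, 1, 9, 2],
    [1, 0, 2, 9],
    [9, 2, 0, 1],
    [2, 9, 1, 0],
]
best = best_ring(cost)
```

Exhaustive search grows factorially with the number of nodes, so this only serves to make the cost-driven selection concrete; a practical configurator would need a heuristic or a constrained search.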

    Overlapping Gradient Synchronization In Machine Learning

    Publication number: US20230259823A1

    Publication date: 2023-08-17

    Application number: US18109080

    Application date: 2023-02-13

    CPC classification number: G06N20/00

    Abstract: In a method, an orchestrator of a computing system determines that results of Machine Learning model computations are available and dispatches a worker to perform model computations that include computing gradients of the results. The orchestrator determines that a set of gradients of the results is available and dispatches a gradient worker to compute a sum of the gradients. The orchestrator determines that a second set of gradients of the results is available and dispatches a second gradient worker to compute a sum of the second set of gradients. The orchestrator determines that the sums of the first and second sets of gradients are available and dispatches a third gradient worker to compute synchronized gradients. The gradient workers compute the sums and synchronized gradients concurrently with training workers computing additional model computation results and/or gradients. A computer program product can include the method and a computing system can include the orchestrator.
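The dispatch sequence in this abstract can be sketched with a thread pool standing in for the workers. This is a minimal sketch under stated assumptions: the function names, the toy gradient, and the choice of averaging as the "synchronized" result are hypothetical and are not taken from the patent.

```python
from concurrent.futures import ThreadPoolExecutor

def compute_gradient(sample):
    # Stand-in for a training worker: gradient of 0.5 * (w - sample)**2 at w = 0.
    return -float(sample)

def sum_gradients(grads):
    # Stand-in for a gradient worker that sums one set of gradients.
    return sum(grads)

def synchronize(partial_sums, count):
    # Assumption: "synchronized gradients" is modeled here as the mean over all gradients.
    return sum(partial_sums) / count

def orchestrate(first_batch, second_batch):
    """Hypothetical orchestrator: dispatches training and gradient workers on one pool."""
    with ThreadPoolExecutor(max_workers=4) as pool:
        first = [pool.submit(compute_gradient, s) for s in first_batch]
        second = [pool.submit(compute_gradient, s) for s in second_batch]
        # Once the first set is available, its sum is dispatched while
        # training workers for the second set may still be running.
        sum1 = pool.submit(sum_gradients, [f.result() for f in first])
        sum2 = pool.submit(sum_gradients, [f.result() for f in second])
        count = len(first_batch) + len(second_batch)
        # A third gradient worker combines both sums once they are available.
        return pool.submit(synchronize, [sum1.result(), sum2.result()], count).result()

synced = orchestrate([1.0, 2.0], [3.0, 4.0])
```

The point of the structure is the overlap: summation of the first gradient set does not wait for the second set to finish, which mirrors the concurrency the abstract describes.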
