-
公开(公告)号:US20240220315A1
公开(公告)日:2024-07-04
申请号:US18091443
申请日:2022-12-30
Applicant: ADVANCED MICRO DEVICES, INC.
Inventor: Suchita Pati , Shaizeen Aga , Nuwan Jayasena , Matthew David Sinclair
CPC classification number: G06F9/4881 , G06F9/52
Abstract: A processing system includes a scheduling mechanism for producing data for fine-grained reordering of workgroups of a kernel to produce blocks of data, such as for communication across devices to enable overlapping of a producer computation with an all-reduce communication across the network. This scheduling mechanism enables a first parallel processor to schedule and execute a set of workgroups of a producer operation to generate data for transmission to a second parallel processor in a desired traffic pattern. At the same time, the second parallel processor schedules and executes a different set of workgroups of the producer operation to generate data for transmission in a desired traffic pattern to a third parallel processor or back to the first parallel processor.
-
公开(公告)号:US20240201990A1
公开(公告)日:2024-06-20
申请号:US18190620
申请日:2023-03-27
Applicant: Advanced Micro Devices, Inc.
Inventor: Shaizeen Dilawarhusen Aga , Suchita Pati , Nuwan S. Jayasena
CPC classification number: G06F9/30036 , G06F9/3834
Abstract: Fused data generation and associated communication techniques are described. In an implementation, a system includes processing system having a plurality of processors. A data generation and communication tracking module is configured to track programmatically defined data generation and associated communication as performed by the plurality of processors. A targeted communication module is configured to trigger targeted communication of data between the plurality of processors based on the tracked programmatically defined data generation and associated communication.
-