-
公开(公告)号:US20230333746A1
公开(公告)日:2023-10-19
申请号:US17989129
申请日:2022-11-17
Applicant: NVIDIA CORPORATION
IPC: G06F3/06
CPC classification number: G06F3/0613 , G06F3/0659 , G06F3/067
Abstract: Various embodiments include techniques for performing speculative remote memory operation tracking in a multiprocessor computing system. Conventionally, transfers of data between processors and other components of a computing system require memory synchronization operations to determine that the data is valid and coherent before the data is transferred from a destination to a requesting source. Existing techniques for performing these memory synchronization operations are increasingly inefficient as the number of components in a computing system increases, particularly for remote memory operations. The disclosed techniques track remote memory operations and speculatively perform these memory synchronization operations. As a result, a given memory synchronization operation is often complete prior to the corresponding remote memory operation arrives at the destination, leading to improved efficiency and performance of remote memory operations in complex computing systems.
-
公开(公告)号:US20230021678A1
公开(公告)日:2023-01-26
申请号:US17380424
申请日:2021-07-20
Applicant: NVIDIA CORPORATION
Inventor: Michael Allen PARKER , Debajit BHATTACHARYA , David FONTAINE , Shirish GADRE , Wishwesh Anil GANDHI , Olivier GIROUX , Hemayet HOSSAIN , Ronny M. KRASHINSKY , Ze LONG , Raymond Hoi Man WONG
Abstract: Various embodiments include a parallel processing computer system that provides multiple memory synchronization domains in a single parallel processor to reduce unneeded synchronization operations. During execution, one execution kernel may synchronize with one or more other execution kernels by processing outstanding memory references. The parallel processor tracks memory references for each domain to each portion of local and remote memory. During synchronization, the processor synchronizes the memory references for a specific domain while refraining from synchronizing memory references for other domains. As a result, synchronization operations between kernels complete in a reduced amount of time relative to prior approaches.
-