-
公开(公告)号:US20240354106A1
公开(公告)日:2024-10-24
申请号:US18755097
申请日:2024-06-26
申请人: NVIDIA Corporation
发明人: Srinivas Santosh Kumar MADUGULA , Olivier GIROUX , Wishwesh Anil GANDHI , Michael Allen PARKER , Raghuram L , Ivan TANASIC , Manan PATEL , Mark HUMMEL , Alexander L. MINKIN , Gregory Michael THORSON
IPC分类号: G06F9/30
CPC分类号: G06F9/30043 , G06F9/30087
摘要: Various embodiments include techniques for performing self-synchronizing remote memory operations in a data center or multiprocessor computing system. During a remote memory operation, a source processor transmits multiple data segments to a destination processor. For each data segment, the source processor transmits a remote memory operation to the destination processor that includes associated metadata that identifies the memory location of a corresponding synchronization object representing a count of data segments to be stored or a flag for each data segment to be stored. The remote memory operation along with the metadata is transmitted as a single unit to the destination processor. The destination processor splits the operation into the remote memory operation and the memory synchronization operation. As a result, the source processor avoids the need to perform a separate memory synchronization operation, thereby reducing inter-processor communications and increasing performance of remote memory operations.