NON-UNIFORM ALLOCATION OF SYMMETRIC MEMORY IN PARALLEL PROGRAMS

    公开(公告)号:US20250156314A1

    公开(公告)日:2025-05-15

    申请号:US18507659

    申请日:2023-11-13

    Abstract: Systems and methods herein are for at least one circuit to perform at least one process of different processes that may be associated with one or more applications, where the process may use one part of a virtual memory based on different requests by the different processes, where the virtual memory may include parts of equal allocations based on a maximum of different memory sizes in specifications associated with the different processes, where the one part of the virtual memory may be in a mapping with respect to one part of a physical memory of different allocated sizes, and where a translation for the mapping can occur using a start address and the maximum of the different memory sizes.

    Automatic coalescing of GPU-initiated network communication

    公开(公告)号:US11677839B2

    公开(公告)日:2023-06-13

    申请号:US17351002

    申请日:2021-06-17

    CPC classification number: H04L67/141 H04L67/104 H04L67/12

    Abstract: Apparatuses, systems, and techniques are directed to automatic coalescing of GPU-initiated network communications. In one method, a communication engine receives, from a shared memory application executing on a first graphics processing unit (GPU), a first communication request assigned to or having a second GPU as a destination to be processed. The communication engine determines that the first communication request satisfies a coalescing criterion and stores the first communication request in association with a group of requests that have a common property. The communication engine coalesces the group of requests into a coalesced request and transports the coalesced request to the second GPU over a network.

    POST-SEND SUBMISSION COALESCING
    4.
    发明申请

    公开(公告)号:US20250077444A1

    公开(公告)日:2025-03-06

    申请号:US18627981

    申请日:2024-04-05

    Abstract: A system is described having an input/output (IO) device and a processing unit coupled with the IO device. The processing unit is enabled to elect a thread from among a plurality of threads to atomically update a queue head pointer, uses the queue head pointer to reserve space in a plurality of memory registers for work queue elements belonging to the plurality of threads, and submit the work queue elements to the IO device.

    AUTOMATIC COALESCING OF GPU-INITIATED NETWORK COMMUNICATION

    公开(公告)号:US20220407920A1

    公开(公告)日:2022-12-22

    申请号:US17351002

    申请日:2021-06-17

    Abstract: Apparatuses, systems, and techniques are directed to automatic coalescing of GPU-initiated network communications. In one method, a communication engine receives, from a shared memory application executing on a first graphics processing unit (GPU), a first communication request assigned to or having a second GPU as a destination to be processed. The communication engine determines that the first communication request satisfies a coalescing criterion and stores the first communication request in association with a group of requests that have a common property. The communication engine coalesces the group of requests into a coalesced request and transports the coalesced request to the second GPU over a network.

Patent Agency Ranking