System and method for implementing a network-interface-based allreduce operation

    公开(公告)号:US11714765B2

    公开(公告)日:2023-08-01

    申请号:US17383606

    申请日:2021-07-23

    CPC classification number: G06F13/20 G06F12/10 G06F2212/1024

    Abstract: An apparatus is provided that includes a network interface to transmit and receive data packets over a network; a memory including one or more buffers; an arithmetic logic unit to perform arithmetic operations for organizing and combining the data packets; and a circuitry to receive, via the network interface, data packets from the network; aggregate, via the arithmetic logic unit, the received data packets in the one or more buffers at a network rate; and transmit, via the network interface, the aggregated data packets to one or more compute nodes in the network, thereby optimizing latency incurred in combining the received data packets and transmitting the aggregated data packets, and hence accelerating a bulk data allreduce operation. One embodiment provides a system and method for performing the allreduce operation. During operation, the system performs the allreduce operation by pacing network operations for enhancing performance of the allreduce operation.

    DATATYPE ENGINE TO SUPPORT HIGH PERFORMANCE COMPUTING

    公开(公告)号:US20250028450A1

    公开(公告)日:2025-01-23

    申请号:US18905441

    申请日:2024-10-03

    Abstract: A method and apparatus are provided for facilitating a datatype engine (DTE) to support high performance computing. A network interface card (NIC) receives, via a message passing interface, a command to read data from a host memory. The NIC determines that the command indicates a first datatype descriptor stored in the NIC. The NIC forms, based on the command, a packet which indicates a base address and a length associated with the data to be read from the host memory and passes the packet to the DTE. The DTE generates a plurality of read requests comprising offsets from the base address and corresponding lengths based on the first datatype descriptor. The DTE passes the plurality of read requests to a direct memory access module, thereby allowing the NIC to access the host memory while eliminating copies of the data on the host during transfer of the command across a network.

    Datatype engine to support high performance computing

    公开(公告)号:US12131029B2

    公开(公告)日:2024-10-29

    申请号:US17976721

    申请日:2022-10-28

    CPC classification number: G06F3/0613 G06F3/0659 G06F3/067

    Abstract: A method and apparatus are provided for facilitating a datatype engine (DTE) to support high performance computing. A network interface card (NIC) receives, via a message passing interface, a command to read data from a host memory. The NIC determines that the command indicates a first datatype descriptor stored in the NIC. The NIC forms, based on the command, a packet which indicates a base address and a length associated with the data to be read from the host memory and passes the packet to the DTE. The DTE generates a plurality of read requests comprising offsets from the base address and corresponding lengths based on the first datatype descriptor. The DTE passes the plurality of read requests to a direct memory access module, thereby allowing the NIC to access the host memory while eliminating copies of the data on the host during transfer of the command across a network.

    EFFICIENT KEY MANAGEMENT IN DISTRIBUTED APPLICATION

    公开(公告)号:US20240121228A1

    公开(公告)日:2024-04-11

    申请号:US18479601

    申请日:2023-10-02

    CPC classification number: H04L63/0435 H04L1/18 H04L9/0891 H04L9/3236

    Abstract: An apparatus facilitating efficient key refresh in a node is provided. During operation, the apparatus can determine a collective operation initiated by the node. The node can include a processor and can be in a distributed system comprising a plurality of nodes. The collective operation can be performed by a subset of the plurality of nodes in conjunction with each other. The apparatus can generate a new key based on a previous key maintained at the apparatus. Here, a respective key can be used for encrypting an inter-node packet in the distributed system. The apparatus can maintain the new and previous keys for the duration of the collective operation. Either of the new and previous keys can be used for decrypting messages received at the apparatus from other nodes of the distributed system. Upon determining a threshold point of the collective operation, the apparatus can discard the previous key.

    FLOW-TRIMMING BASED CONGESTION MANAGEMENT
    16.
    发明公开

    公开(公告)号:US20240121189A1

    公开(公告)日:2024-04-11

    申请号:US18479803

    申请日:2023-10-02

    CPC classification number: H04L47/11 H04L47/122 H04L47/39

    Abstract: A piece of networking equipment facilitating efficient congestion management is provided. During operation, the equipment can receive, via a network, a plurality of packets that include portions of a data segment sent from a sender device to a receiver device. The equipment can identify, among the plurality of packets, one or more payload packets comprising payload of the data segment, and at least a header packet comprising header information of the data segment and a header-packet indicator. The equipment can determine whether congestion is detected at the receiver device based on a number of sender devices sending packets to the receiver device via the equipment. Upon determining congestion at the receiver device, the equipment can perform flow trimming by forwarding the header packet to the receiver device and dropping a subset of the one or more payload packets.

    SYSTEM AND METHOD FOR FACILITATING DYNAMIC TRIGGERED OPERATION MANAGEMENT IN A NETWORK INTERFACE CONTROLLER (NIC)

    公开(公告)号:US20230036404A1

    公开(公告)日:2023-02-02

    申请号:US17387334

    申请日:2021-07-28

    Abstract: A system for facilitating efficient command management in a network interface controller (NIC) is provided. During operation, the system can determine, at the NIC, a trigger condition and a location in a command queue for a set of commands corresponding to the trigger condition. The command queue can be external to the NIC. The location can correspond to an end of the set of commands in the command queue. The system can then determine, at the NIC, whether the trigger condition has been satisfied. If the trigger condition is satisfied, the system can fetch a respective command of the set of commands from the command queue and issuing the command from the NIC until the location is reached, thereby bypassing locally storing the set of commands prior to the trigger condition being satisfied.

Patent Agency Ranking