-
公开(公告)号:US11714765B2
公开(公告)日:2023-08-01
申请号:US17383606
申请日:2021-07-23
Applicant: Hewlett Packard Enterprise Development LP
Inventor: Keith D. Underwood , Robert L. Alverson , Duncan Roweth , Nathan L. Wichmann
CPC classification number: G06F13/20 , G06F12/10 , G06F2212/1024
Abstract: An apparatus is provided that includes a network interface to transmit and receive data packets over a network; a memory including one or more buffers; an arithmetic logic unit to perform arithmetic operations for organizing and combining the data packets; and a circuitry to receive, via the network interface, data packets from the network; aggregate, via the arithmetic logic unit, the received data packets in the one or more buffers at a network rate; and transmit, via the network interface, the aggregated data packets to one or more compute nodes in the network, thereby optimizing latency incurred in combining the received data packets and transmitting the aggregated data packets, and hence accelerating a bulk data allreduce operation. One embodiment provides a system and method for performing the allreduce operation. During operation, the system performs the allreduce operation by pacing network operations for enhancing performance of the allreduce operation.
-
公开(公告)号:US20250028450A1
公开(公告)日:2025-01-23
申请号:US18905441
申请日:2024-10-03
Applicant: Hewlett Packard Enterprise Development LP
Inventor: Keith D. Underwood , Robert L. Alverson
IPC: G06F3/06
Abstract: A method and apparatus are provided for facilitating a datatype engine (DTE) to support high performance computing. A network interface card (NIC) receives, via a message passing interface, a command to read data from a host memory. The NIC determines that the command indicates a first datatype descriptor stored in the NIC. The NIC forms, based on the command, a packet which indicates a base address and a length associated with the data to be read from the host memory and passes the packet to the DTE. The DTE generates a plurality of read requests comprising offsets from the base address and corresponding lengths based on the first datatype descriptor. The DTE passes the plurality of read requests to a direct memory access module, thereby allowing the NIC to access the host memory while eliminating copies of the data on the host during transfer of the command across a network.
-
公开(公告)号:US12131029B2
公开(公告)日:2024-10-29
申请号:US17976721
申请日:2022-10-28
Applicant: Hewlett Packard Enterprise Development LP
Inventor: Keith D. Underwood , Robert L. Alverson
IPC: G06F3/06
CPC classification number: G06F3/0613 , G06F3/0659 , G06F3/067
Abstract: A method and apparatus are provided for facilitating a datatype engine (DTE) to support high performance computing. A network interface card (NIC) receives, via a message passing interface, a command to read data from a host memory. The NIC determines that the command indicates a first datatype descriptor stored in the NIC. The NIC forms, based on the command, a packet which indicates a base address and a length associated with the data to be read from the host memory and passes the packet to the DTE. The DTE generates a plurality of read requests comprising offsets from the base address and corresponding lengths based on the first datatype descriptor. The DTE passes the plurality of read requests to a direct memory access module, thereby allowing the NIC to access the host memory while eliminating copies of the data on the host during transfer of the command across a network.
-
14.
公开(公告)号:US20240143494A1
公开(公告)日:2024-05-02
申请号:US17976730
申请日:2022-10-28
Applicant: Hewlett Packard Enterprise Development LP
Inventor: Keith D. Underwood , Robert L. Alverson , Christopher Michael Brueggen
CPC classification number: G06F12/0207 , G06F3/0604 , G06F3/0659 , G06F3/0673 , G06F2212/1016
Abstract: A system, method, and apparatus are provided to facilitate data structures for a datatype engine and provide inline compaction. The system receives, by a network interface card (NIC), a command to read data from a host memory, wherein the command indicates a datatype. The system generates a plurality of read requests comprising offsets from a base address and corresponding lengths based on the datatype. The system issues the plurality of read requests to the host memory to obtain the data from the host memory. The system obtains a byte-mask descriptor corresponding to the datatype. The system performs, based on the obtained data and the byte-mask descriptor, on-the-fly compaction of the obtained data, thereby allowing the NIC to return a requested subset of the obtained data.
-
公开(公告)号:US20240121228A1
公开(公告)日:2024-04-11
申请号:US18479601
申请日:2023-10-02
Applicant: Hewlett Packard Enterprise Development LP
Inventor: Keith D. Underwood , Duncan Roweth
CPC classification number: H04L63/0435 , H04L1/18 , H04L9/0891 , H04L9/3236
Abstract: An apparatus facilitating efficient key refresh in a node is provided. During operation, the apparatus can determine a collective operation initiated by the node. The node can include a processor and can be in a distributed system comprising a plurality of nodes. The collective operation can be performed by a subset of the plurality of nodes in conjunction with each other. The apparatus can generate a new key based on a previous key maintained at the apparatus. Here, a respective key can be used for encrypting an inter-node packet in the distributed system. The apparatus can maintain the new and previous keys for the duration of the collective operation. Either of the new and previous keys can be used for decrypting messages received at the apparatus from other nodes of the distributed system. Upon determining a threshold point of the collective operation, the apparatus can discard the previous key.
-
公开(公告)号:US20240121189A1
公开(公告)日:2024-04-11
申请号:US18479803
申请日:2023-10-02
Applicant: Hewlett Packard Enterprise Development LP
Inventor: Keith D. Underwood
IPC: H04L47/11 , H04L47/10 , H04L47/122
CPC classification number: H04L47/11 , H04L47/122 , H04L47/39
Abstract: A piece of networking equipment facilitating efficient congestion management is provided. During operation, the equipment can receive, via a network, a plurality of packets that include portions of a data segment sent from a sender device to a receiver device. The equipment can identify, among the plurality of packets, one or more payload packets comprising payload of the data segment, and at least a header packet comprising header information of the data segment and a header-packet indicator. The equipment can determine whether congestion is detected at the receiver device based on a number of sender devices sending packets to the receiver device via the equipment. Upon determining congestion at the receiver device, the equipment can perform flow trimming by forwarding the header packet to the receiver device and dropping a subset of the one or more payload packets.
-
公开(公告)号:US11665113B2
公开(公告)日:2023-05-30
申请号:US17387334
申请日:2021-07-28
Applicant: Hewlett Packard Enterprise Development LP
Inventor: Keith D. Underwood , Duncan Roweth , Robert L. Alverson
IPC: H04L12/28 , H04L49/901 , H04L47/62 , H04L47/625
CPC classification number: H04L49/901 , H04L47/622 , H04L47/625
Abstract: A system for facilitating efficient command management in a network interface controller (NIC) is provided. During operation, the system can determine, at the NIC, a trigger condition and a location in a command queue for a set of commands corresponding to the trigger condition. The command queue can be external to the NIC. The location can correspond to an end of the set of commands in the command queue. The system can then determine, at the NIC, whether the trigger condition has been satisfied. If the trigger condition is satisfied, the system can fetch a respective command of the set of commands from the command queue and issuing the command from the NIC until the location is reached, thereby bypassing locally storing the set of commands prior to the trigger condition being satisfied.
-
公开(公告)号:US20230036404A1
公开(公告)日:2023-02-02
申请号:US17387334
申请日:2021-07-28
Applicant: Hewlett Packard Enterprise Development LP
Inventor: Keith D. Underwood , Duncan Roweth , Robert L. Alverson
IPC: H04L12/879 , H04L12/863
Abstract: A system for facilitating efficient command management in a network interface controller (NIC) is provided. During operation, the system can determine, at the NIC, a trigger condition and a location in a command queue for a set of commands corresponding to the trigger condition. The command queue can be external to the NIC. The location can correspond to an end of the set of commands in the command queue. The system can then determine, at the NIC, whether the trigger condition has been satisfied. If the trigger condition is satisfied, the system can fetch a respective command of the set of commands from the command queue and issuing the command from the NIC until the location is reached, thereby bypassing locally storing the set of commands prior to the trigger condition being satisfied.
-
-
-
-
-
-
-