Flow-based congestion control
    Published patent application

    Publication No.: US20240073141A1

    Publication date: 2024-02-29

    Application No.: US17895108

    Filing date: 2022-08-25

    IPC classification: H04L47/12 H04L47/30

    CPC classification: H04L47/12 H04L47/30

    Abstract: A network device includes multiple ports, a Shared Buffer (SB) and a SB controller. The ports to connect to a communication network. The SB to temporarily store packets received from the communication network via the ports, the packets belonging to multiple flows. The SB controller to allocate one or more flow-specific storage regions in the SB, a given flow-specific storage region being allocated to store the packets that (i) belong to respective one or more of the flows and (ii) are to be transmitted via a respective egress queue. In response to detecting that an occupancy level in the given flow-specific storage region exceeds a specified occupancy threshold, the SB controller to report the flows in the given flow-specific storage region as congested.
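
The threshold check described in this abstract can be illustrated with a minimal sketch. It is a toy software model, not the patented hardware design: the class name `SharedBufferController`, the region key (an egress queue paired with a flow group), and the byte-based threshold are all assumptions made for illustration.

```python
from collections import defaultdict


class SharedBufferController:
    """Toy model: one storage region per (egress_queue, flow_group) key (assumed)."""

    def __init__(self, occupancy_threshold_bytes):
        self.threshold = occupancy_threshold_bytes
        self.occupancy = defaultdict(int)  # region key -> bytes currently stored
        self.flows = defaultdict(set)      # region key -> flow ids stored there

    def enqueue(self, region, flow_id, size):
        """Store a packet; return the flows to report as congested, if any."""
        self.occupancy[region] += size
        self.flows[region].add(flow_id)
        if self.occupancy[region] > self.threshold:
            # Every flow sharing the region is reported, not just the sender.
            return sorted(self.flows[region])
        return []

    def dequeue(self, region, size):
        """Release bytes when a packet leaves through the egress queue."""
        self.occupancy[region] = max(0, self.occupancy[region] - size)
```

With a 3000-byte threshold, two 1500-byte packets fill the region exactly and report nothing; a third packet pushes occupancy past the threshold and every flow in the region is returned.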

    REMOTE DIRECT MEMORY ACCESS (RDMA) MULTIPATH

    Publication No.: US20240195728A1

    Publication date: 2024-06-13

    Application No.: US18443928

    Filing date: 2024-02-16

    Abstract: Technologies for spreading a burst of data across multiple network paths in remote direct memory access (RDMA) over converged Ethernet (RoCE) and InfiniBand are described. A network interface controller sends a first burst of a transport flow directed to a second node over a first network path. The network interface controller determines that a second burst is to be sent over a different network path, and identifies a second network path using a multipath context. The multipath context stores a first weight value or a first state associated with the first network path and a second weight value or a second state associated with the second network path. The network interface controller sends the second burst of data to the second node via the second network path.
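
As a rough illustration of selecting the second path from a multipath context, the sketch below keeps a weight and a usable/unusable state per path. The abstract does not state a selection policy, so `PathEntry`, `next_path`, and the "highest weight among usable paths" rule are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class PathEntry:
    path_id: int
    weight: int          # assumed: larger means more preferred
    usable: bool = True  # assumed two-valued path state


def next_path(context, current_path_id):
    """Pick the highest-weight usable path other than the current one."""
    candidates = [
        p for p in context if p.usable and p.path_id != current_path_id
    ]
    if not candidates:
        return current_path_id  # no alternative: stay on the current path
    return max(candidates, key=lambda p: p.weight).path_id


context = [PathEntry(0, 5), PathEntry(1, 3), PathEntry(2, 8, usable=False)]
second_burst_path = next_path(context, current_path_id=0)  # path 1
```

Path 2 has the largest weight but is marked unusable, so the second burst goes to path 1.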

    Flexible per-flow multipath managed by sender-side network adapter

    Publication No.: US20240080266A1

    Publication date: 2024-03-07

    Application No.: US17902920

    Filing date: 2022-09-05

    IPC classification: H04L45/00 H04L45/24

    Abstract: A network adapter includes a port and one or more circuits. The port communicates packets over a network in which switches forward packets in accordance with tuples of the packets. The one or more circuits are to hold a user-programmable scheme specifying assignments of the packets of a given flow destined to a peer node to sub-flows having respective different tuples, assign first packets of the given flow to one or more of the sub-flows in accordance with the user-programmable scheme, by setting respective tuples of the first packets, transmit the first packets to the peer node via the port, monitor notifications received from the network, the notifications being indicative of respective states of the sub-flows, based on the notifications and on the user-programmable scheme determine an assignment of second packets of the given flow to the sub-flows, and transmit the second packets to the peer node via the port.
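
The interplay of a user-programmable scheme, per-sub-flow tuples, and network notifications might be sketched as follows. Everything here is an assumption for illustration: the scheme is modeled as a plain Python callable, sub-flows differ only in source port, and the states are the strings "ok" and "congested".

```python
def round_robin_scheme(seq, states):
    """Hypothetical user scheme: rotate over sub-flows not marked congested."""
    healthy = [i for i, s in enumerate(states) if s != "congested"]
    pool = healthy or list(range(len(states)))  # fall back to all sub-flows
    return pool[seq % len(pool)]


class SubflowAssigner:
    def __init__(self, base_tuple, num_subflows, scheme):
        src_ip, dst_ip, src_port, dst_port = base_tuple
        # Assumption: each sub-flow gets its own source port, so switches
        # that forward by tuple may place sub-flows on different paths.
        self.tuples = [
            (src_ip, dst_ip, src_port + i, dst_port)
            for i in range(num_subflows)
        ]
        self.states = ["ok"] * num_subflows
        self.scheme = scheme
        self.seq = 0

    def on_notification(self, subflow, state):
        """Record a state reported by the network for one sub-flow."""
        self.states[subflow] = state

    def assign(self):
        """Return the tuple to stamp on the next packet of the flow."""
        idx = self.scheme(self.seq, self.states)
        self.seq += 1
        return self.tuples[idx]
```

Swapping `round_robin_scheme` for another callable changes the assignment policy without touching the assigner, which is the sense in which the scheme is treated as programmable in this sketch.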

    REMOTE DIRECT MEMORY ACCESS (RDMA) MULTIPATH

    Publication No.: US20240080256A1

    Publication date: 2024-03-07

    Application No.: US17901671

    Filing date: 2022-09-01

    Abstract: Technologies for spreading a single transport flow across multiple network paths in remote direct memory access (RDMA) over converged Ethernet (RoCE) and InfiniBand are described. A network interface controller receives a first packet and a second packet of a transport flow directed to a second node. The network interface controller assigns a first network routing identifier to the first packet and a second network routing identifier to the second packet, the first network routing identifier corresponding to a first network path between the first and second nodes, the second network routing identifier corresponding to a second network path between the first node and the second node. The network interface controller schedules a first packet of the transport flow to be sent via the first network path and a second packet of the transport flow to be sent via the second network path.

    Zero-copy processing
    Granted patent

    Publication No.: US11757796B2

    Publication date: 2023-09-12

    Application No.: US17488362

    Filing date: 2021-09-29

    Abstract: In one embodiment, a system includes a peripheral device including a memory access interface to receive from a host device headers of packets, while corresponding payloads of the packets are stored in a host memory of the host device, and descriptors being indicative of respective locations in the host memory at which the corresponding payloads are stored, a data processing unit memory to store the received headers and the descriptors without the payloads of the packets, and a data processing unit to process the received headers, wherein the peripheral device is configured, upon completion of the processing of the received headers by the data processing unit, to fetch the payloads of the packets over the memory access interface from the respective locations in the host memory responsively to respective ones of the descriptors, and packet processing circuitry to receive the headers and payloads of the packets, and process the packets.
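
The ordering this abstract describes (process headers first, fetch payloads only afterwards, using descriptors) can be mimicked in software. The sketch below is a loose analogy rather than the claimed device: host memory is modeled as a `bytearray`, and `Descriptor`, `process_zero_copy`, and the `process_header` callback are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class Descriptor:
    offset: int  # where the payload starts in host memory
    length: int  # payload size in bytes


def process_zero_copy(host_memory, headers, descriptors, process_header):
    """Process all headers first; read payload bytes only afterwards."""
    # Header stage: only headers and descriptors are held, never payloads.
    processed = [process_header(h) for h in headers]

    packets = []
    for hdr, d in zip(processed, descriptors):
        # Late fetch: the payload is read from host memory once, when the
        # packet is finally assembled for the packet-processing stage.
        payload = bytes(host_memory[d.offset:d.offset + d.length])
        packets.append(hdr + payload)
    return packets


host = bytearray(b"AAAABBBBBB")
out = process_zero_copy(
    host,
    headers=[b"h1", b"h2"],
    descriptors=[Descriptor(0, 4), Descriptor(4, 6)],
    process_header=bytes.upper,
)
# out == [b"H1AAAA", b"H2BBBBBB"]
```

The point of the ordering is that the header-processing stage never holds a copy of the payload.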

    Explicit notification of operative conditions along a network path

    Publication No.: US11622028B2

    Publication date: 2023-04-04

    Application No.: US17198292

    Filing date: 2021-03-11

    Abstract: A network element includes circuitry and multiple ports. The multiple ports are configured to connect to a communication network. The circuitry is configured to receive via one of the ports a packet that originated from a source node and is destined to a destination node, the packet including a mark that is indicative of a cumulative state derived from at least bandwidth utilization conditions of output ports that were traversed by the packet along a path, from the source node up to the network element, to select a port for forwarding the packet toward the destination node, to update the mark of the packet based at least on a value of the mark in the received packet and on a local bandwidth utilization condition of the selected port, and to transmit the packet having the updated mark to the destination node via the selected port.
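
One way to picture the per-hop mark update is to treat the mark as the worst utilization level seen so far along the path. The abstract only says the new mark depends on the received mark and the local utilization, so the `max` rule, the four quantization levels, and the name `update_mark` are assumptions.

```python
def update_mark(received_mark, local_utilization, levels=4):
    """Return the new mark: the worst utilization level seen so far."""
    # Quantize utilization in [0.0, 1.0] into `levels` buckets (assumed).
    local_level = min(levels - 1, int(local_utilization * levels))
    return max(received_mark, local_level)


# A packet crossing three hops whose selected ports are 30%, 90% and 10%
# utilized carries the worst level forward to the destination.
mark = 0
for utilization in (0.30, 0.90, 0.10):
    mark = update_mark(mark, utilization)
# mark == 3
```

Under this rule a lightly loaded final hop cannot hide an earlier heavily loaded one, which is what makes the mark cumulative in this sketch.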

    Adjusting rate of outgoing data requests for avoiding incast congestion

    Publication No.: US20200084150A1

    Publication date: 2020-03-12

    Application No.: US16559640

    Filing date: 2019-09-04

    Abstract: A network adapter includes circuitry and one or more ports. The ports connect to a communication network including multiple network elements. The circuitry accesses outbound messages that are pending to be sent over the communication network to multiple remote nodes via the ports. At least some of the outbound messages request the remote nodes to send respective amounts of data back to the network adapter. Based on the amounts of data requested by the outbound messages, the circuitry forecasts a bandwidth of inbound response traffic, which is expected to traverse a selected network element in response to the outbound messages toward the network adapter, determines a schedule for transmitting the outbound messages to the remote nodes so that the forecasted bandwidth meets a bandwidth supported by the selected network element, and transmits the outbound messages to the remote nodes in accordance with the determined schedule.
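
A simple way to see how request sizes can drive a transmission schedule is to pace each outbound request by the time its response is forecast to occupy the bottleneck link. The sketch below assumes responses arrive back-to-back at the full link rate and ignores round-trip time and per-node differences; `schedule_requests` and its parameters are hypothetical.

```python
def schedule_requests(requested_bytes, link_bytes_per_sec):
    """Return send times (seconds) so forecast inbound rate <= link rate."""
    send_times = []
    t = 0.0
    for size in requested_bytes:
        send_times.append(t)
        # Delay the next request by the time this response is forecast
        # to occupy the bottleneck link.
        t += size / link_bytes_per_sec
    return send_times


times = schedule_requests([1000, 2000, 500], link_bytes_per_sec=1000)
# times == [0.0, 1.0, 3.0]
```

Sending all three requests at time zero would ask for 3500 bytes at once over a 1000 byte/s link; spacing them out keeps the forecast inbound rate at or below what the link supports.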