Single-step collective operations

    公开(公告)号:US11922237B1

    公开(公告)日:2024-03-05

    申请号:US18105846

    申请日:2023-02-05

    发明人: Richard Graham

    IPC分类号: G06F9/54

    CPC分类号: G06F9/546

    摘要: A method for collective communications includes invoking a collective operation over a group of computing processes in which the processes concurrently transmit and receive data to and from other processes in the group via a communication medium. Messages are composed for transmission by source processes including metadata indicating how the data to be transmitted by the source processes in the collective operation are to be handled by destination processes that are to receive the data and also including in at least some of the messages the data to be transmitted by one or more of the source processes to one or more of the destination processes. The composed messages are transmitted concurrently from the source processes to the destination processes in the group over the communication medium. The data are processed by the destination processes in response to the metadata included in the messages received by the destination processes.

    Patterned Remote Direct Memory Access (RDMA)

    公开(公告)号:US20240015217A1

    公开(公告)日:2024-01-11

    申请号:US17858097

    申请日:2022-07-06

    IPC分类号: H04L67/1097

    CPC分类号: H04L67/1097

    摘要: A network device includes a first interface, a second interface and circuitry. The first interface is configured to communicate at least with a first memory. The second interface is configured to communicate over a network with a peer network device coupled to a second memory. The circuitry is configured to (i) receive a request to transfer data over the network between the first memory and the second memory in accordance with a pattern of offsets to be accessed in the first memory or in the second memory, and (ii) transfer the data in accordance with the request.

    Single-step collective operations
    5.
    发明公开

    公开(公告)号:US20240118949A1

    公开(公告)日:2024-04-11

    申请号:US18389893

    申请日:2023-12-20

    发明人: Richard Graham

    IPC分类号: G06F9/54

    CPC分类号: G06F9/546

    摘要: A method for collective communications includes invoking a collective operation over a group of computing processes in which the processes concurrently transmit and receive data to and from other processes in the group via a communication medium. Messages are composed for transmission by source processes including metadata indicating how the data to be transmitted by the source processes in the collective operation are to be handled by destination processes that are to receive the data and also including in at least some of the messages the data to be transmitted by one or more of the source processes to one or more of the destination processes. The composed messages are transmitted concurrently from the source processes to the destination processes in the group over the communication medium. The data are processed by the destination processes in response to the metadata included in the messages received by the destination processes.

    Memory-based synchronization of distributed operations

    公开(公告)号:US20220398197A1

    公开(公告)日:2022-12-15

    申请号:US17863453

    申请日:2022-07-13

    摘要: A network device in a communication network includes a controller and processing circuitry. The controller is configured to manage execution of an operation whose execution depends on inputs from a group of one or more work-request initiators. The processing circuitry is configured to read one or more values, which are set by the work-request initiators in one or more memory locations that are accessible to the work-request initiators and to the network device, and to trigger execution of the operation in response to verifying that the one or more values read from the one or more memory locations indicate that the work-request initiators in the group have provided the respective inputs.

    Memory-based synchronization of distributed operations

    公开(公告)号:US20210406179A1

    公开(公告)日:2021-12-30

    申请号:US16916153

    申请日:2020-06-30

    摘要: A network device in a communication network includes a controller and processing circuitry. The controller is configured to manage execution of an operation whose execution depends on inputs from a group of one or more work-request initiators. The processing circuitry is configured to read one or more values, which are set by the work-request initiators in one or more memory locations that are accessible to the work-request initiators and to the network device, and to trigger execution of the operation in response to verifying that the one or more values read from the one or more memory locations indicate that the work-request initiators in the group have provided the respective inputs.

    Aggregation protocol
    10.
    发明申请
    Aggregation protocol 审中-公开
    聚合协议

    公开(公告)号:US20170063613A1

    公开(公告)日:2017-03-02

    申请号:US15250953

    申请日:2016-08-30

    IPC分类号: H04L12/24 H04L12/18 H04L12/44

    摘要: A switch in a data network is configured to mediate data exchanges among network elements. The apparatus further includes a processor, which organizes the network elements into a hierarchical tree having a root node network element, vertex node network elements child node network elements that include leaf node network elements. The leaf node network elements are originate aggregation data and transmit the aggregation data to respective parent vertex node network elements. The vertex node network elements combine the aggregation data from at least a portion of the child node network elements, and transmit the combined aggregation data from the vertex node network elements to parent vertex node network elements. The root node network element is operative for initiating a reduction operation on the aggregation data.

    摘要翻译: 数据网络中的交换机被配置为中介网元之间的数据交换。 该装置还包括处理器,其将网络元件组织成具有根节点网络元件的分层树,顶点节点网络元素包括叶节点网络元素的子节点网络元素。 叶节点网元是始发聚合数据,并将聚合数据发送到相应的父顶点节点网元。 顶点节点网络元素组合来自至少一部分子节点网元的聚合数据,并将组合聚合数据从顶点节点网元发送到父顶点节点网元。 根节点网元可用于启动对聚合数据的简化操作。