Collective communication system and methods

    公开(公告)号:US12177039B2

    公开(公告)日:2024-12-24

    申请号:US18513565

    申请日:2023-11-19

    Abstract: A method includes providing a plurality of processes interconnected by a network, each of the plurality of processes being configured to hold a block of data destined for others of the plurality of processes. A set of data for all-to-all data exchange is received from one or more of the processes. The set of data is configured as a plurality of blocks of data in a matrix as matrix data, the matrix being distributed among the plurality of processes. The matrix data is transposed by changing the position of selected blocks of data of the plurality of blocks of data relative to the other blocks of data of the plurality of the blocks of data, without changing the structure of each of the blocks of data. The transposed matrix data is over the network and is then received, repacked, and conveyed to destination processes.

    Work scheduling
    35.
    发明公开
    Work scheduling 审中-公开

    公开(公告)号:US20240168797A1

    公开(公告)日:2024-05-23

    申请号:US17988812

    申请日:2022-11-17

    CPC classification number: G06F9/4881 G06F1/12 G06F13/405

    Abstract: In one embodiment, a system includes a peripheral data connection bus configured to connect to devices and transfer data between the devices, a scheduling machine configured to connect to the peripheral data connection bus and send a read request message to a first processing device, and the first processing device configured to be connected to the peripheral data connection bus, and responsively to the read request message add a time value to a read response message, and provide the read response message to the scheduling machine, and wherein the scheduling machine is configured to read the time value from the provided read response message and schedule processing of an operation by a second processing device responsively to the read time value.

    Efficient network device work queue
    36.
    发明公开

    公开(公告)号:US20240146664A1

    公开(公告)日:2024-05-02

    申请号:US17979018

    申请日:2022-11-02

    CPC classification number: H04L47/6255 H04L47/6225 H04L47/6275

    Abstract: In one embodiment, a system includes a memory to store a work queue including work queue entry slots, a processing device to write work queue entries to the work queue in a consecutive and cyclic manner, and a network device including a network interface to share packet over a network, and packet processing circuitry to read the work queue entries from the work queue in a consecutive and cyclic manner, the work queue entries indicating work to be performed associated with the packets, dequeue respective ones of the work queue entries read from the work queue responsively to reading the respective work queue entries from the work queue, add the work queue entries to an execution database used to track execution of the work queue entries, and execute the work queue entries in the execution database.

    Offloading execution of a multi-task parameter-dependent operation to a network device

    公开(公告)号:US20230089099A1

    公开(公告)日:2023-03-23

    申请号:US18071692

    申请日:2022-11-30

    Abstract: A processing device includes an interface and one or more processing circuits. The interface is to connect to a host processor. The one or more processing circuits are to receive from the host processor, via the interface, a notification specifying an operation for execution by the processing device, the operation including (i) multiple tasks that are executable by the network device, and (ii) execution dependencies among the tasks, in response to the notification, to determine a schedule for executing the tasks, the schedule complying with the execution dependencies, and to execute the operation by executing the tasks of the operation in accordance with the schedule.

Patent Agency Ranking