MECHANISM TO PROVIDE RELIABLE RECEIPT OF EVENT MESSAGES

    公开(公告)号:US20230056665A1

    公开(公告)日:2023-02-23

    申请号:US17405217

    申请日:2021-08-18

    Abstract: Devices and techniques for providing receipts for event messages in a processor are described herein. A system includes multiple memory-compute nodes coupled to one another over a scale fabric; a set of registers; and an event manager hardware circuitry to: receive an event message corresponding to an event, and the event associated with an event mode; track a counter value representing a number of received event messages related to the event, the counter value stored in the set of registers; compare the number of received event messages to a trigger value; and in response to the number of received event messages equaling the trigger value: use an atomic operation to reset the counter value in the set of registers while maintaining the event mode; and alert a thread of the event.

    LOOP EXECUTION IN A RECONFIGURABLE COMPUTE FABRIC

    公开(公告)号:US20220206804A1

    公开(公告)日:2022-06-30

    申请号:US17405371

    申请日:2021-08-18

    Abstract: Various examples are directed to systems and methods for executing a loop in a reconfigurable compute fabric. A first flow controller may initiate a first thread at a first synchronous flow to execute a first portion of a first iteration of the loop. A second flow controller may receive a first asynchronous message instructing the second flow controller to initiate a first thread at a second synchronous flow to execute a second portion of the first iteration. The second flow controller may determine that the first iteration of the loop is the last iteration of the loop to be executed and initiate the first thread at the second synchronous flow with a last iteration flag set.

    NEURAL NETWORK TRANSPOSE LAYER REMOVAL

    公开(公告)号:US20230068168A1

    公开(公告)日:2023-03-02

    申请号:US17405738

    申请日:2021-08-18

    Inventor: Patrick Estep

    Abstract: Devices and techniques for neural network transpose layer removal are described herein. A neural network model that includes matrices of synaptic weights arranged in several layers is obtained. The neural network model is inspected to determine whether a transposition of a matrix to a fully connected layer exists. If there is a matrix transposition, then a modified neural network model is created by changing values of the fully connected layer to correspond to values in the matrix prior to the transposition and eliminating the transposition. The modified neural network model can then be provided to computer hardware to perform inference operations.

    MANAGING RETURN PARAMETER ALLOCATION

    公开(公告)号:US20230058935A1

    公开(公告)日:2023-02-23

    申请号:US17405646

    申请日:2021-08-18

    Abstract: A hybrid threading processor (HTP) supports thread creation by executing an instruction that indicates an amount of storage space to reserve for return values. Before a thread is created, the indicated amount of space is reserved. The newly created child thread sends a return packet back to the parent thread when the child thread completes. The thread writes its return information into the reserved space and waits for the parent thread to execute a thread join instruction. The thread join instruction takes the returned information from the reserved space and transfers it to the parent thread's register state. The reserved space is released once the child thread is joined. Using a configurable amount of space for each child thread may allow for more child threads to be executed simultaneously.

Patent Agency Ranking