HARDWARE ACCELERATION OF MACHINE LEARNING DESIGNS

    公开(公告)号:US20230401480A1

    公开(公告)日:2023-12-14

    申请号:US17806906

    申请日:2022-06-14

    Applicant: Xilinx, Inc.

    CPC classification number: G06N20/00

    Abstract: Hardware acceleration of machine learning (ML) designs includes translating an ML primitive into an intermediate representation. The intermediate representation is subdivided to specify a functional compute block. The functional compute block is sized according to a compute node primitive adapted for implementing the ML primitive on target hardware. An overlay is generated for the ML primitive, at least in part, by mapping the functional compute block to the compute node primitive. The overlay is synthesizable to implement the ML primitive on the target hardware. The overlay can be scheduled for operation within the target hardware as part of an ML design including the ML primitive.

    Instruction set architecture for data processing array control

    公开(公告)号:US12248786B2

    公开(公告)日:2025-03-11

    申请号:US17818309

    申请日:2022-08-08

    Applicant: Xilinx, Inc.

    Abstract: Controlling a data processing (DP) array includes creating a replica of a register address space of the DP array based on the design and the DP array. A sequence of instructions, including write instructions and read instructions, is received. The write instructions correspond to buffer descriptors specifying runtime data movements for a design for a DP array. The write instructions are converted into transaction instructions and the read instructions are converted into wait instructions based on the replica of the register address space. The transaction instructions and the wait instructions are included in an instruction buffer. The instruction buffer is provided to a microcontroller configured to execute the transaction instructions and the wait instructions to implement the runtime data movements for the design as implemented in the DP array. In another aspect, the instruction buffer is stored in a file for subsequent execution by the microcontroller.

Patent Agency Ranking