Neural core, neural processing device including same, and method for loading data of neural processing device

    公开(公告)号:US11954584B2

    公开(公告)日:2024-04-09

    申请号:US18322519

    申请日:2023-05-23

    CPC classification number: G06N3/063 G06F9/544 G06F12/084

    Abstract: A neural core, a neural processing device including same and a method for lauding data of a neural processing device are provided. The neural core comprises a processing unit configured to perform operations, an L0 memory configured to store input data and an LSU configured to perform a load task and a store task of data between the processing unit and the L0 memory, wherein the LSU comprises a local memory load unit configured to transmit the input data in the L0 memory to the processing unit, and the local memory load unit comprises a target decision module configured to identify and retrieve the input data in the L0 memory, a transformation logic configured to transform the input data and thereby generate transformed data and an output FIFO configured to receive the transformed data and transmit the transformed data to the processing unit in the received order.

    NEURAL CORE, NEURAL PROCESSING DEVICE INCLUDING SAME, AND METHOD FOR LOADING DATA OF NEURAL PROCESSING DEVICE

    公开(公告)号:US20240013038A1

    公开(公告)日:2024-01-11

    申请号:US18322519

    申请日:2023-05-23

    CPC classification number: G06N3/063 G06F9/544

    Abstract: A neural core, a neural processing device including same and a method for lauding data of a neural processing device are provided. The neural core comprises a processing unit configured to perform operations, an L0 memory configured to store input data and an LSU configured to perform a load task and a store task of data between the processing unit and the L0 memory, wherein the LSU comprises a local memory load unit configured to transmit the input data in the L0 memory to the processing unit, and the local memory load unit comprises a target decision module configured to identify and retrieve the input data in the L0 memory, a transformation logic configured to transform the input data and thereby generate transformed data and an output FIFO configured to receive the transformed data and transmit the transformed data to the processing unit in the received order.

    NEURAL PROCESSING DEVICE AND METHOD FOR CONVERTING DATA THEREOF

    公开(公告)号:US20230315336A1

    公开(公告)日:2023-10-05

    申请号:US18191737

    申请日:2023-03-28

    Inventor: Jinwook Oh

    CPC classification number: G06F3/0656 G06F3/0604 G06F3/0679

    Abstract: A neural processing device and a method for converting data thereof are provided. The neural processing device comprises a first compute unit configured to receive first input data in first precision and generate first output data in the first precision by performing calculations, a second compute unit configured to receive second input data in second precision which is different from the first precision and generate second output data in the second precision by performing calculation, and a first converting buffer configured to receive and store the first output data, generate the second input data by converting the first output data into the second precision, and transmit the second input data to the second compute unit.

    PROCESSING ELEMENT, NEURAL PROCESSING DEVICE INCLUDING SAME, AND MULTIPLICATION OPERATION METHOD USING SAME

    公开(公告)号:US20220405560A1

    公开(公告)日:2022-12-22

    申请号:US17807082

    申请日:2022-06-15

    Abstract: The present disclosure discloses a processing element and a neural processing device including the processing element. The processing element includes a weight register configured to store a weight, an input activation register configured to store input activation, a flexible multiplier configured to generate result data by performing a multiplication operation of the weight and the input activation by using a first multiplier of a first precision or using both the first multiplier and a second multiplier of the first precision in response to a calculation mode signal and a saturating adder configured to generate a partial sum by using the result data.

    NEURAL PROCESSING DEVICE AND METHOD FOR PRUNING THEREOF

    公开(公告)号:US20220300817A1

    公开(公告)日:2022-09-22

    申请号:US17656625

    申请日:2022-03-25

    Inventor: Jinwook Oh

    Abstract: A neural processing device and method for pruning thereof are provided. The neural processing device includes a processing unit configured to perform calculations, an L0 memory configured to store input and output data of the processing unit, wherein the input and output data include a two-dimensional weight matrix and a weight manipulator configured to receive the two-dimensional weight matrix and partition it into preset sizes to thereby generate partitioned matrices, to generate a pruning matrix by pruning the partitioned matrix, and to transmit the pruning matrix to the processing unit.

    PROCESSING ELEMENT, NEURAL PROCESSING DEVICE INCLUDING SAME, AND METHOD FOR CALCULATING THEREOF

    公开(公告)号:US20220300794A1

    公开(公告)日:2022-09-22

    申请号:US17656631

    申请日:2022-03-25

    Inventor: Jinwook Oh

    Abstract: A processing element, a neural processing device including the same, and a method for calculating thereof are provided. The processing element includes a weight register configured to receive and store weights, an input activation register configured to store input activations, a flexible multiplier configured to receive the weight and the input activation, to perform a multiplication calculation in a first precision or a second precision different from the first precision according to a mode signal, occurrence of an overflow, and occurrence of an underflow, and to generates result data; and a saturating adder configured to receive the result data and generate subtotals.

    NEURAL CORE, NEURAL PROCESSING DEVICE INCLUDING SAME, AND METHOD FOR LOADING DATA OF NEURAL PROCESSING DEVICE

    公开(公告)号:US20240211742A1

    公开(公告)日:2024-06-27

    申请号:US18597728

    申请日:2024-03-06

    CPC classification number: G06N3/063 G06F9/544 G06F12/084

    Abstract: A neural core, a neural processing device including same and a method for lauding data of a neural processing device are provided. The neural core comprises a processing unit configured to perform operations, an L0 memory configured to store input data and an LSU configured to perform a load task and a store task of data between the processing unit and the L0 memory, wherein the LSU comprises a local memory load unit configured to transmit the input data in the L0 memory to the processing unit, and the local memory load unit comprises a target decision module configured to identify and retrieve the input data in the L0 memory, a transformation logic configured to transform the input data and thereby generate transformed data and an output FIFO configured to receive the transformed data and transmit the transformed data to the processing unit in the received order.

    PROCESSING DEVICE AND METHOD FOR MANAGING TASKS THEREOF

    公开(公告)号:US20240152391A1

    公开(公告)日:2024-05-09

    申请号:US18491695

    申请日:2023-10-20

    CPC classification number: G06F9/4881 G06F9/3838

    Abstract: A neural processing device and a method for managing tasks thereof are provided. The neural processing device includes a neural core configured to perform a task and generate a completion signal for completion of the task, a core global configured to transfer task information for the task to the neural core and receive the completion signal of the task from the neural core, and a task manager configured to generate and transmit the task information to the core global, receive the completion signal from the core global, generate a completion report, and transmit the completion report.

    NEURAL PROCESSING DEVICE
    20.
    发明公开

    公开(公告)号:US20230385198A1

    公开(公告)日:2023-11-30

    申请号:US18448102

    申请日:2023-08-10

    CPC classification number: G06F12/084 G06F2212/622

    Abstract: A neural processing device is provided. The neural processing device comprises: a processing unit configured to perform calculations, an L0 memory configured to receive data from the processing unit and provide data to the processing unit, and an LSU (Load/Store Unit) configured to perform load and store operations of the data, wherein the LSU comprises: a neural core load unit configured to issue a load instruction of the data, a neural core store unit configured to issue a store instruction for transmitting and storing the data, and a sync ID logic configured to provide a sync ID to the neural core load unit and the neural core store unit to thereby cause a synchronization signal to be generated for each sync ID.

Patent Agency Ranking