-
公开(公告)号:US11954584B2
公开(公告)日:2024-04-09
申请号:US18322519
申请日:2023-05-23
Applicant: Rebellions Inc.
Inventor: Jinseok Kim , Kyeongryeol Bong , Jinwook Oh , Yoonho Boo
IPC: G06F9/54 , G06N3/063 , G06F12/084
CPC classification number: G06N3/063 , G06F9/544 , G06F12/084
Abstract: A neural core, a neural processing device including same and a method for lauding data of a neural processing device are provided. The neural core comprises a processing unit configured to perform operations, an L0 memory configured to store input data and an LSU configured to perform a load task and a store task of data between the processing unit and the L0 memory, wherein the LSU comprises a local memory load unit configured to transmit the input data in the L0 memory to the processing unit, and the local memory load unit comprises a target decision module configured to identify and retrieve the input data in the L0 memory, a transformation logic configured to transform the input data and thereby generate transformed data and an output FIFO configured to receive the transformed data and transmit the transformed data to the processing unit in the received order.
-
12.
公开(公告)号:US20240013038A1
公开(公告)日:2024-01-11
申请号:US18322519
申请日:2023-05-23
Applicant: Rebellions Inc.
Inventor: Jinseok Kim , Kyeongryeol Bong , Jinwook Oh , Yoonho Boo
Abstract: A neural core, a neural processing device including same and a method for lauding data of a neural processing device are provided. The neural core comprises a processing unit configured to perform operations, an L0 memory configured to store input data and an LSU configured to perform a load task and a store task of data between the processing unit and the L0 memory, wherein the LSU comprises a local memory load unit configured to transmit the input data in the L0 memory to the processing unit, and the local memory load unit comprises a target decision module configured to identify and retrieve the input data in the L0 memory, a transformation logic configured to transform the input data and thereby generate transformed data and an output FIFO configured to receive the transformed data and transmit the transformed data to the processing unit in the received order.
-
公开(公告)号:US20230315336A1
公开(公告)日:2023-10-05
申请号:US18191737
申请日:2023-03-28
Applicant: Rebellions Inc.
Inventor: Jinwook Oh
IPC: G06F3/06
CPC classification number: G06F3/0656 , G06F3/0604 , G06F3/0679
Abstract: A neural processing device and a method for converting data thereof are provided. The neural processing device comprises a first compute unit configured to receive first input data in first precision and generate first output data in the first precision by performing calculations, a second compute unit configured to receive second input data in second precision which is different from the first precision and generate second output data in the second precision by performing calculation, and a first converting buffer configured to receive and store the first output data, generate the second input data by converting the first output data into the second precision, and transmit the second input data to the second compute unit.
-
14.
公开(公告)号:US20220405560A1
公开(公告)日:2022-12-22
申请号:US17807082
申请日:2022-06-15
Applicant: Rebellions Inc.
Inventor: Jaewan Bae , Jinwook Oh , Karim Charfi
Abstract: The present disclosure discloses a processing element and a neural processing device including the processing element. The processing element includes a weight register configured to store a weight, an input activation register configured to store input activation, a flexible multiplier configured to generate result data by performing a multiplication operation of the weight and the input activation by using a first multiplier of a first precision or using both the first multiplier and a second multiplier of the first precision in response to a calculation mode signal and a saturating adder configured to generate a partial sum by using the result data.
-
公开(公告)号:US20220300817A1
公开(公告)日:2022-09-22
申请号:US17656625
申请日:2022-03-25
Applicant: Rebellions Inc.
Inventor: Jinwook Oh
Abstract: A neural processing device and method for pruning thereof are provided. The neural processing device includes a processing unit configured to perform calculations, an L0 memory configured to store input and output data of the processing unit, wherein the input and output data include a two-dimensional weight matrix and a weight manipulator configured to receive the two-dimensional weight matrix and partition it into preset sizes to thereby generate partitioned matrices, to generate a pruning matrix by pruning the partitioned matrix, and to transmit the pruning matrix to the processing unit.
-
16.
公开(公告)号:US20220300794A1
公开(公告)日:2022-09-22
申请号:US17656631
申请日:2022-03-25
Applicant: Rebellions Inc.
Inventor: Jinwook Oh
Abstract: A processing element, a neural processing device including the same, and a method for calculating thereof are provided. The processing element includes a weight register configured to receive and store weights, an input activation register configured to store input activations, a flexible multiplier configured to receive the weight and the input activation, to perform a multiplication calculation in a first precision or a second precision different from the first precision according to a mode signal, occurrence of an overflow, and occurrence of an underflow, and to generates result data; and a saturating adder configured to receive the result data and generate subtotals.
-
17.
公开(公告)号:US20240211742A1
公开(公告)日:2024-06-27
申请号:US18597728
申请日:2024-03-06
Applicant: Rebellions Inc.
Inventor: Jinseok Kim , Kyeongryeol Bong , Jinwook Oh , Yoonho Boo
IPC: G06N3/063 , G06F9/54 , G06F12/084
CPC classification number: G06N3/063 , G06F9/544 , G06F12/084
Abstract: A neural core, a neural processing device including same and a method for lauding data of a neural processing device are provided. The neural core comprises a processing unit configured to perform operations, an L0 memory configured to store input data and an LSU configured to perform a load task and a store task of data between the processing unit and the L0 memory, wherein the LSU comprises a local memory load unit configured to transmit the input data in the L0 memory to the processing unit, and the local memory load unit comprises a target decision module configured to identify and retrieve the input data in the L0 memory, a transformation logic configured to transform the input data and thereby generate transformed data and an output FIFO configured to receive the transformed data and transmit the transformed data to the processing unit in the received order.
-
公开(公告)号:US20240152391A1
公开(公告)日:2024-05-09
申请号:US18491695
申请日:2023-10-20
Applicant: Rebellions Inc.
Inventor: Wongyu Shin , Miock Chi , Hongyun Kim , Jinwook Oh , Juyeong Yoon
CPC classification number: G06F9/4881 , G06F9/3838
Abstract: A neural processing device and a method for managing tasks thereof are provided. The neural processing device includes a neural core configured to perform a task and generate a completion signal for completion of the task, a core global configured to transfer task information for the task to the neural core and receive the completion signal of the task from the neural core, and a task manager configured to generate and transmit the task information to the core global, receive the completion signal from the core global, generate a completion report, and transmit the completion report.
-
19.
公开(公告)号:US11868741B2
公开(公告)日:2024-01-09
申请号:US17807082
申请日:2022-06-15
Applicant: Rebellions Inc.
Inventor: Jaewan Bae , Jinwook Oh , Karim Charfi
CPC classification number: G06F7/533 , G06F7/50 , G06F7/535 , G06F7/5443 , G06N3/063
Abstract: The present disclosure discloses a processing element and a neural processing device including the processing element. The processing element includes a weight register configured to store a weight, an input activation register configured to store input activation, a flexible multiplier configured to generate result data by performing a multiplication operation of the weight and the input activation by using a first multiplier of a first precision or using both the first multiplier and a second multiplier of the first precision in response to a calculation mode signal and a saturating adder configured to generate a partial sum by using the result data.
-
公开(公告)号:US20230385198A1
公开(公告)日:2023-11-30
申请号:US18448102
申请日:2023-08-10
Applicant: Rebellions Inc.
Inventor: Jinseok Kim , Jinwook Oh , Donghan Kim
IPC: G06F12/084
CPC classification number: G06F12/084 , G06F2212/622
Abstract: A neural processing device is provided. The neural processing device comprises: a processing unit configured to perform calculations, an L0 memory configured to receive data from the processing unit and provide data to the processing unit, and an LSU (Load/Store Unit) configured to perform load and store operations of the data, wherein the LSU comprises: a neural core load unit configured to issue a load instruction of the data, a neural core store unit configured to issue a store instruction for transmitting and storing the data, and a sync ID logic configured to provide a sync ID to the neural core load unit and the neural core store unit to thereby cause a synchronization signal to be generated for each sync ID.
-
-
-
-
-
-
-
-
-