SYSTEM AND METHOD FOR PERFORMING COMPUTATIONS FOR DEEP NEURAL NETWORKS

    Publication Number: US20210326686A1

    Publication Date: 2021-10-21

    Application Number: US16900845

    Filing Date: 2020-06-12

    Abstract: A computation unit for performing a computation of a neural network layer is disclosed. A number of processing element (PE) units are arranged in an array. First input values are provided in parallel in an input dimension of the array during a first processing period, and second input values are provided in parallel in the input dimension during a second processing period. Computations are performed by the PE units based on stored weight values. A first adder coupled to a first set of the PE units generates a first sum of the results of the computations by the first set of PE units during the first processing period, and generates a second sum of the results of the computations during the second processing period. A first accumulator coupled to the first adder stores the first sum, and further shifts the first sum to a second accumulator prior to storing the second sum.
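
    The following is a minimal Python sketch (an illustration written for this listing, not code from the patent) of the accumulate-and-shift behavior the abstract describes: each PE multiplies one of the parallel input values by its stored weight, an adder sums the PE results once per processing period, and each accumulator shifts its held sum toward the next accumulator before storing the new one. The helper names (pe_column_multiply, run_cycles) and the two-accumulator chain are assumptions made for the example.

    from typing import List

    def pe_column_multiply(inputs: List[float], weights: List[float]) -> List[float]:
        # Each PE multiplies one parallel input value by its stored weight value.
        return [x * w for x, w in zip(inputs, weights)]

    def run_cycles(input_vectors: List[List[float]],
                   weights: List[float],
                   num_accumulators: int) -> List[float]:
        # One input vector is fed per processing period; the adder sums the PE results,
        # and partial sums ripple through a chain of accumulators (shift, then store).
        accumulators = [0.0] * num_accumulators
        for inputs in input_vectors:
            products = pe_column_multiply(inputs, weights)  # computations by the PE units
            period_sum = sum(products)                      # adder coupled to the PE set
            # Shift the previously stored sums toward the last accumulator...
            for i in range(num_accumulators - 1, 0, -1):
                accumulators[i] = accumulators[i - 1]
            accumulators[0] = period_sum                    # ...then store this period's sum.
        return accumulators

    if __name__ == "__main__":
        first_inputs = [1.0, 2.0, 3.0]     # provided in parallel during the first period
        second_inputs = [4.0, 5.0, 6.0]    # provided in parallel during the second period
        stored_weights = [0.5, 0.25, 0.125]
        # After two periods the second accumulator holds the first sum and the
        # first accumulator holds the second sum: [4.0, 1.375]
        print(run_cycles([first_inputs, second_inputs], stored_weights, num_accumulators=2))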

    Mixed-precision neural processing unit (NPU) using spatial fusion with load balancing

    Publication Number: US12001929B2

    Publication Date: 2024-06-04

    Application Number: US16898433

    Filing Date: 2020-06-10

    CPC classification number: G06N20/00 H04L67/1001

    Abstract: According to one general aspect, an apparatus may include a machine learning system. The machine learning system may include a precision determination circuit configured to determine a precision level of data and to divide the data into data subdivisions. The machine learning system may exploit sparsity during the computation of each subdivision. The machine learning system may include a load balancing circuit configured to select a load balancing technique, wherein the load balancing technique includes alternately loading a computation circuit with at least a first data/weight subdivision combination and a second data/weight subdivision combination. The load balancing circuit may be configured to load the computation circuit with a selected data subdivision and a selected weight subdivision based, at least in part, upon the load balancing technique. The machine learning system may include the computation circuit, which may be configured to compute a partial computation result based, at least in part, upon the selected data subdivision and the selected weight subdivision.
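
    As a rough illustration of the mixed-precision idea in the abstract, the Python sketch below (written for this listing, not taken from the patent) divides 8-bit data and weight values into 4-bit subdivisions, alternates over the data/weight subdivision combinations, skips combinations containing a zero subdivision (exploiting sparsity), and accumulates the shifted partial products into the full-precision result. The function names (split_nibbles, fused_dot) and the nibble-level split are assumptions made for the example.

    from typing import List, Tuple

    def split_nibbles(value: int) -> Tuple[int, int]:
        # Divide an unsigned 8-bit value into its high and low 4-bit subdivisions.
        return (value >> 4) & 0xF, value & 0xF

    def fused_dot(data: List[int], weights: List[int]) -> int:
        # Dot product computed from shifted partial products of the nibble subdivisions,
        # alternating the data/weight subdivision combinations and skipping zero ones.
        total = 0
        for d, w in zip(data, weights):
            d_hi, d_lo = split_nibbles(d)
            w_hi, w_lo = split_nibbles(w)
            # The four subdivision combinations, each with the shift restoring its place value.
            combinations = [(d_hi, w_hi, 8), (d_hi, w_lo, 4), (d_lo, w_hi, 4), (d_lo, w_lo, 0)]
            for a, b, shift in combinations:
                if a == 0 or b == 0:        # exploit sparsity: skip zero subdivisions
                    continue
                total += (a * b) << shift   # partial computation result
        return total

    if __name__ == "__main__":
        data = [0x1A, 0x00, 0x7F]
        weights = [0x03, 0x55, 0x10]
        # The fused nibble-wise result equals the plain 8-bit dot product.
        assert fused_dot(data, weights) == sum(d * w for d, w in zip(data, weights))
        print(fused_dot(data, weights))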

    SYSTEM AND METHOD FOR PERFORMING COMPUTATIONS FOR DEEP NEURAL NETWORKS

    Publication Number: US20230047273A1

    Publication Date: 2023-02-16

    Application Number: US17966488

    Filing Date: 2022-10-14

    Abstract: A computation unit for performing a computation of a neural network layer is disclosed. A number of processing element (PE) units are arranged in an array. First input values are provided in parallel in an input dimension of the array during a first processing period, and second input values are provided in parallel in the input dimension during a second processing period. Computations are performed by the PE units based on stored weight values. A first adder coupled to a first set of the PE units generates a first sum of the results of the computations by the first set of PE units during the first processing period, and generates a second sum of the results of the computations during the second processing period. A first accumulator coupled to the first adder stores the first sum, and further shifts the first sum to a second accumulator prior to storing the second sum.

    System and method for performing computations for deep neural networks

    Publication Number: US11507817B2

    Publication Date: 2022-11-22

    Application Number: US16900845

    Filing Date: 2020-06-12

    Abstract: A computation unit for performing a computation of a neural network layer is disclosed. A number of processing element (PE) units are arranged in an array. First input values are provided in parallel in an input dimension of the array during a first processing period, and second input values are provided in parallel in the input dimension during a second processing period. Computations are performed by the PE units based on stored weight values. A first adder coupled to a first set of the PE units generates a first sum of the results of the computations by the first set of PE units during the first processing period, and generates a second sum of the results of the computations during the second processing period. A first accumulator coupled to the first adder stores the first sum, and further shifts the first sum to a second accumulator prior to storing the second sum.