-
公开(公告)号:US20220058487A1
公开(公告)日:2022-02-24
申请号:US17520326
申请日:2021-11-05
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Dongsoo LEE , Sejung KWON , Byeoungwook Kim
Abstract: An electronic apparatus, including a memory configured to store weight data used for computation of a neural network model; and a processor configured to: identify, from among weight values included in the weight data, at least one weight value having a size less than or equal to a threshold value, quantize remaining weight values other than the identified at least one weight value to obtain first quantized data including quantized values corresponding to the remaining weight values, identify, from among the quantized values, a quantized value closest to a predetermined value, obtain second quantized data including a quantized value corresponding to the at least one weight value based on the quantized value closest to the predetermined value, and store the first quantized data and the second quantized data in the memory
-
公开(公告)号:US11568254B2
公开(公告)日:2023-01-31
申请号:US16727323
申请日:2019-12-26
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Dongsoo Lee , Sejung Kwon , Parichay Kapoor , Byeoungwook Kim
Abstract: An electronic apparatus is provided. The electronic apparatus includes sample data and memory storing a first matrix included in an artificial intelligence model trained based on sample data, and a processor configured to prunes each of a plurality of first elements included in the first matrix based on a first threshold, and acquire a first pruning index matrix that indicates whether each of the plurality of first elements has been pruned with binary data, factorize the first matrix to a second matrix of which size was determined based on the number of rows and the rank, and a third matrix of which size was determined based on the rank and the number of columns of the first matrix, prunes each of a plurality of second elements included in the second matrix based on a second threshold, and acquire a second pruning index matrix that indicates whether each of the plurality of second elements has been pruned with binary data, prunes each of a plurality of third elements included in the third matrix based on a third threshold, and acquire a third pruning index matrix that indicates whether each of the plurality of third elements has been pruned with binary data, acquire a final index matrix based on the second pruning index matrix and the third pruning index matrix, and update at least one of the second pruning index matrix or the third pruning index matrix by comparing the final index matrix with the first pruning index matrix.
-
公开(公告)号:US11475281B2
公开(公告)日:2022-10-18
申请号:US16555331
申请日:2019-08-29
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Parichay Kapoor , Saehyung Lee , Dongsoo Lee , Byeoungwook Kim
Abstract: An electronic apparatus is provided. The electronic apparatus includes a storage storing a matrix included in an artificial intelligence model, and a processor. The processor divides data included in at least a portion of the matrix by one of rows and columns of the matrix to form groups, clusters the groups into clusters based on data included in each of the groups, and quantizes data divided by the other one of rows and columns of the matrix among data included in each of the clusters.
-
4.
公开(公告)号:US10608664B2
公开(公告)日:2020-03-31
申请号:US16240075
申请日:2019-01-04
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Dongsoo Lee , Youngchul Cho , Kwanghoon Son , Byeoungwook Kim
IPC: H04N7/26 , H03M7/32 , G06N20/00 , G06N3/08 , H03M13/41 , G05B13/00 , H03M5/00 , G06N7/02 , H03M1/12 , H03M7/00
Abstract: A data compression method and a data decompression method are provided. The method includes pruning an original data including a plurality of weight parameters, identifying at least one first weight parameter of which at least one first value is not changed by the pruning, among multiple weight parameters included in the pruned original data, and obtaining a first index data including location information of the at least one first weight parameter of which the at least one first value is not changed, identifying at least one second weight parameter of which at least one second value is changed by the pruning, among the multiple weight parameters included in the pruned original data, and substituting the at least one second weight parameter of which the at least one second value is changed with a don't care parameter.
-
公开(公告)号:US11595062B2
公开(公告)日:2023-02-28
申请号:US17130538
申请日:2020-12-22
Applicant: Samsung Electronics Co., Ltd.
Inventor: Dongsoo Lee , Sejung Kwon , Byeoungwook Kim , Parichay Kapoor , Baeseong Park
Abstract: A decompression apparatus is provided. The decompression apparatus includes a memory configured to store compressed data decompressed and used in neural network processing of an artificial intelligence model, a decoder configured to include a plurality of logic circuits related to a compression method of the compressed data, decompress the compressed data through the plurality of logic circuits based on an input of the compressed data, and output the decompressed data, and a processor configured to obtain data of a neural network processible form from the data output from the decoder.
-
公开(公告)号:US10917121B2
公开(公告)日:2021-02-09
申请号:US16854285
申请日:2020-04-21
Applicant: Samsung Electronics Co., Ltd.
Inventor: Dongsoo Lee , Sejung Kwon , Byeoungwook Kim , Parichay Kapoor , Baeseong Park
Abstract: A decompression apparatus is provided. The decompression apparatus includes a memory configured to store compressed data decompressed and used in neural network processing of an artificial intelligence model, a decoder configured to include a plurality of logic circuits related to a compression method of the compressed data, decompress the compressed data through the plurality of logic circuits based on an input of the compressed data, and output the decompressed data, and a processor configured to obtain data of a neural network processible form from the data output from the decoder.
-
-
-
-
-