-
公开(公告)号:US20210294784A1
公开(公告)日:2021-09-23
申请号:US17204168
申请日:2021-03-17
Applicant: Samsung Electronics Co., Ltd.
Inventor: Vasyltsov IHOR , Wooseok CHANG
IPC: G06F16/22 , G06F16/2455
Abstract: A method of operating a hardware accelerator, includes loading a lookup table, mapping each of input data values of input data to an index of indexes in the lookup table based on an input data distribution of the input data, and obtaining output data values corresponding to the input data values using the lookup table. The output data values are proportional to corresponding softmax values of the input data values.
-
公开(公告)号:US20250156331A1
公开(公告)日:2025-05-15
申请号:US18944463
申请日:2024-11-12
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Zhen ZHANG , Jiali PANG , Xiaohui SU , Lin CHEN , Vasyltsov IHOR
IPC: G06F12/0875
Abstract: Disclosed is an apparatus and method. The method includes generating dispatchable streams and binding the dispatchable streams one-to-one to cache slices, where the cache slices are pre-partitioned from an accelerated cache, and, for each of dispatchable streams binding a dispatchable kernel function, determined for a corresponding dispatchable stream, to the corresponding dispatchable stream, for a first cache slice, of the cache slices, first duplicating the dispatchable kernel function to the first cache slice and starting the first duplicated dispatchable kernel function with respect to the first cache slice, and for a second cache slice, of the cache slices, second duplicating the dispatchable kernel function to the second cache slice and starting the second duplicated dispatchable kernel function with respect to the second cache slice, wherein the starting of the first duplicated dispatchable kernel function is performed asynchronously with the starting of the second duplicated dispatchable kernel function.
-