-
公开(公告)号:US11836256B2
公开(公告)日:2023-12-05
申请号:US16256107
申请日:2019-01-24
Applicant: International Business Machines Corporation
Inventor: Pin-Yu Chen , Sijia Liu , Lingfei Wu , Chia-Yu Chen
IPC: G06F21/57 , G06N3/04 , G06N3/08 , G06V10/764 , G06V10/82
CPC classification number: G06F21/577 , G06N3/04 , G06N3/08 , G06V10/764 , G06V10/82 , G06F2221/034
Abstract: An adversarial robustness testing method, system, and computer program product include testing a robustness of a black-box system under different access settings via an accelerator.
-
公开(公告)号:US20230121677A1
公开(公告)日:2023-04-20
申请号:US18068637
申请日:2022-12-20
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Chia-Yu Chen , Jui-Hsin Lai , Ko-Tao Lee , Li-Wen Hung
IPC: G06N3/08 , G06N5/04 , G11C11/419 , G11C11/412
Abstract: A Static Random Access Memory (SRAM) device in a binary neural network is provided. The SRAM device includes an SRAM inference engine having an SRAM computation architecture with a forward path that include multiple SRAM cells forming a chain of SRAM cells such that an output of a given one of the multiple SRAM cells is an input to a following one of the multiple SRAM cells. The SRAM computation architecture is configured to compute a prediction from an input. The SRAM computation architecture is configured to store ternary data and perform local computations on the ternary data.
-
公开(公告)号:US11562235B2
公开(公告)日:2023-01-24
申请号:US16797587
申请日:2020-02-21
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Mingu Kang , Kyu-Hyoun Kim , Seyoung Kim , Chia-Yu Chen
Abstract: A computer-implemented method for improving the efficiency of computing an activation function in a neural network system includes initializing, by a controller, weights in a weight vector associated with the neural network system. Further, the method includes receiving, by the controller, an input vector of input values for computing a dot product with the weight vector for the activation function, which determines an output value of a node in the neural network system. The method further includes predicting, by a rectifier linear unit (ReLU), which computes the activation function, that the output value of the node will be negative based on computing an intermediate value for computing the dot product, and based on a magnitude of the intermediate value exceeding a precomputed threshold value. Further, the method includes, in response to the prediction, terminating, by the ReLU, the computation of the dot product, and outputting a 0 as the output value.
-
公开(公告)号:US11138010B1
公开(公告)日:2021-10-05
申请号:US17060788
申请日:2020-10-01
Applicant: International Business Machines Corporation
Inventor: Chia-Yu Chen , Jungwook Choi , Brian William Curran , Bruce Fleischer , Kailash Gopalakrishnan , Jinwook Oh , Sunil K Shukla , Vijayalakshmi Srinivasan
IPC: G06F9/30
Abstract: Embodiments of the present invention include a computer system that manages execution of one or more programs with one or more loops where each loop having a loop level. Embodiments that manage loops that can skip execution and the number of loops changing during execution are also disclosed. A loop level register (LLEV) stores the loop level for a currently executing loop. A Loop-Back Program Counter Register (LBPR) has a table of one or more Loop-Back Registers. Each Loop-Back Register stores the loop level for a LBPR respective loop and a loop back PC location for the LBPR respective loop. A Program Counter points back to the PC location for each iteration of the loop. A Loop Current Count Register table (LCCR) tracks a number of iterations remaining to executed for of the loop. A loop management process causes one of the CPUs to execute all the one or more instructions of an iteration of the currently executing program loop. When all iterations of the executing loop are complete, the LLEV is decremented to a next loop level that contained the executed loop.
-
公开(公告)号:US10734385B2
公开(公告)日:2020-08-04
申请号:US16548265
申请日:2019-08-22
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Chia-Yu Chen , Bruce B. Doris , Hong He , Rajasekhar Venigalla
IPC: H01L29/04 , H01L27/092 , H01L29/78 , H01L29/165 , H01L21/18 , H01L27/12 , H01L21/8238 , H01L21/84 , H01L29/66
Abstract: A semiconductor device that includes at least one germanium containing fin structure having a length along a direction and a sidewall orientated along the (100) plane. The semiconductor device also includes at least one germanium free fin structure having a length along a direction and a sidewall orientated along the (100) plane. A gate structure is present on a channel region of each of the germanium containing fin structure and the germanium free fin structure. N-type epitaxial semiconductor material having a square geometry present on the source and drain portions of the sidewalls having the (100) plane orientation of the germanium free fin structures. P-type epitaxial semiconductor material having a square geometry is present on the source and drain portions of the sidewalls having the (100) plane orientation of the germanium containing fin structures.
-
公开(公告)号:US20200012706A1
公开(公告)日:2020-01-09
申请号:US16576144
申请日:2019-09-19
Applicant: International Business Machines Corporation
Inventor: Chia-Yu Chen , Jungwook Choi , Kailash Gopalakrishnan , Victor Han , Vijayalakshmi Srinivasan , Jintao Zhang
IPC: G06F17/16
Abstract: Techniques facilitating matrix multiplication on a systolic array are provided. A computer-implemented method can comprise populating, by a system operatively coupled to a processor, respective first registers of one or more processing elements of a systolic array structure with respective input data bits of a first data matrix. The one or more processing elements can comprise a first processing element that comprises a first input data bit of the first data matrix and a first activation bit of a second data matrix. The method can also include determining, by the system, at the first processing element, a first partial sum of a third data matrix. Further, the method can include streaming, by the system, the first partial sum of the third data matrix from the first processing element.
-
87.
公开(公告)号:US20190340542A1
公开(公告)日:2019-11-07
申请号:US15972108
申请日:2018-05-04
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: LINGFEI WU , Kun Xu , Pin-Yu Chen , Chia-Yu Chen
Abstract: A method and system of analyzing a symbolic sequence is provided. Metadata of a symbolic sequence is received from a computing device of an owner. A set of R random sequences are generated based on the received metadata and sent to the computing device of the owner of the symbolic sequence for computation of a feature matrix based on the set of R random sequences and the symbolic sequence. The feature matrix is received from the computing device of the owner. Upon determining that an inner product of the feature matrix is below a threshold accuracy, the iterative process returns to generating R random sequences. Upon determining that the inner product of the feature matrix is at or above the threshold accuracy, the feature matrix is categorized based on machine learning. The categorized global feature matrix is sent to be displayed on a user interface of the computing device of the owner.
-
公开(公告)号:US20190236113A1
公开(公告)日:2019-08-01
申请号:US16381530
申请日:2019-04-11
Applicant: International Business Machines Corporation
Inventor: Chia-Yu Chen , Jungwook Choi , Kailash Gopalakrishnan , Victor Han , Vijayalakshmi Srinivasan , Jintao Zhang
IPC: G06F17/16
CPC classification number: G06F17/16
Abstract: Techniques facilitating matrix multiplication on a systolic array are provided. A computer-implemented method can comprise populating, by a system operatively coupled to a processor, respective first registers of one or more processing elements of a systolic array structure with respective input data bits of a first data matrix. The one or more processing elements can comprise a first processing element that comprises a first input data bit of the first data matrix and a first activation bit of a second data matrix. The method can also include determining, by the system, at the first processing element, a first partial sum of a third data matrix. Further, the method can include streaming, by the system, the first partial sum of the third data matrix from the first processing element.
-
公开(公告)号:US20190164050A1
公开(公告)日:2019-05-30
申请号:US15827465
申请日:2017-11-30
Applicant: International Business Machines Corporation
Inventor: Chia-Yu Chen , Jungwook Choi , Kailash Gopalakrishnan , Suyog Gupta , Pritish Narayanan
Abstract: A system, having a memory that stores computer executable components, and a processor that executes the computer executable components, reduces data size in connection with training a neural network by exploiting spatial locality to weight matrices and effecting frequency transformation and compression. A receiving component receives neural network data in the form of a compressed frequency-domain weight matrix. A segmentation component segments the initial weight matrix into original sub-components, wherein respective original sub-components have spatial weights. A sampling component applies a generalized weight distribution to the respective original sub-components to generate respective normalized sub-components. A transform component applies a transform to the respective normalized sub-components. A cropping component crops high-frequency weights of the respective transformed normalized sub-components to yield a set of low-frequency normalized sub-components to generate a compressed representation of the original sub-components.
-
公开(公告)号:US10241972B2
公开(公告)日:2019-03-26
申请号:US15460755
申请日:2017-03-16
Applicant: International Business Machines Corporation
Inventor: Chia-Yu Chen , Jungwook Choi , Kailash Gopalakrishnan , Victor Han , Vijayalakshmi Srinivasan , Jintao Zhang
IPC: G06F17/16
Abstract: Techniques facilitating matrix multiplication on a systolic array are provided. A computer-implemented method can comprise populating, by a system operatively coupled to a processor, respective first registers of one or more processing elements of a systolic array structure with respective input data bits of a first data matrix. The one or more processing elements can comprise a first processing element that comprises a first input data bit of the first data matrix and a first activation bit of a second data matrix. The method can also include determining, by the system, at the first processing element, a first partial sum of a third data matrix. Further, the method can include streaming, by the system, the first partial sum of the third data matrix from the first processing element.
-
-
-
-
-
-
-
-
-