-
公开(公告)号:US12141229B2
公开(公告)日:2024-11-12
申请号:US17325120
申请日:2021-05-19
Applicant: NVIDIA CORPORATION
Inventor: Hanrui Wang , James Michael O'Connor , Donghyuk Lee
IPC: G06F17/16 , G06F9/50 , G06F16/901
Abstract: One embodiment sets forth a technique for performing one or more matrix multiplication operations based on a first matrix and a second matrix. The technique includes receiving data associated with the first matrix from a first traversal engine that accesses nonzero elements included in the first matrix via a first tree structure. The technique also includes performing one or more computations on the data associated with the first matrix and the data associated with the second matrix to produce a plurality of partial results. The technique further includes combining the plurality of partial results into one or more intermediate results and storing the one or more intermediate results in a first buffer memory.