Invention Grant
- Patent Title: Optimized and scalable sparse triangular linear systems on networks of accelerators
-
Application No.: US16044145Application Date: 2018-07-24
-
Publication No.: US10936697B2Publication Date: 2021-03-02
- Inventor: Khaled Hamidouche , Michael W. LeBeane , Nicholas P. Malaya , Joseph L. Greathouse
- Applicant: Advanced Micro Devices, Inc.
- Applicant Address: US CA Santa Clara
- Assignee: Advanced Micro Devices, Inc.
- Current Assignee: Advanced Micro Devices, Inc.
- Current Assignee Address: US CA Santa Clara
- Agency: Liang & Cheng, PC
- Main IPC: G06F17/16
- IPC: G06F17/16 ; G06F9/38 ; G06F9/30 ; G06F17/12

Abstract:
A method includes storing a first portion of a sparse triangular matrix in a local memory and launching a kernel for executing a set of workgroups. The first portion includes a plurality of row blocks, and each workgroup in the set of workgroups is associated with one of the plurality of row blocks. The method also includes, for each workgroup in the set of workgroups, solving the row block. The row block is solved by, for each row segment of a first subset of row segments in the row block, calculating a partial sum for the row segment based on one or more matrix elements in the row segment, and writing the partial sum to a remote memory of a first remote processing unit prior to terminating the kernel.
Public/Granted literature
- US20200034405A1 OPTIMIZED AND SCALABLE SPARSE TRIANGULAR LINEAR SYSTEMS ON NETWORKS OF ACCELERATORS Public/Granted day:2020-01-30
Information query