Invention Application
- Patent Title: RUNTIME OF CUBLAS MATRIX MULTIPLICATION ON GPU
- Patent Title (中): GPU上的CUBLAS矩阵多项式运行
-
Application No.: US14823889Application Date: 2015-08-11
-
Publication No.: US20170046307A1Publication Date: 2017-02-16
- Inventor: Alexey Y. Lvov , Jinjun Xiong , Vladimir Zolotov
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Main IPC: G06F17/16
- IPC: G06F17/16 ; G06T1/20

Abstract:
Methods for improving matrix multiplication runtimes are provided. A method includes determining, by a GPU, optimal partitions for matrix-by-matrix multiplication of two factor matrices having sizes known a priori. The determining step includes performing offline a plurality of matrix-by-matrix multiplication executions, each for a respective different combination of two-way partitions across a plurality of partition sizes. The determining step further includes determining offline a respective performance number for each of the executions based on runtime. The determining step also includes recursively repeating offline said performing and determining steps until the respective performance number ceases to improve for best-performing combinations of the two-way partitions and saving the best performing combinations of the two-way partitions as the optimal partitions. The method further includes performing online, by the GPU, the matrix-by-matrix multiplication of the two factor matrices using calls for a given one of the best performing combinations of the two-way partitions.
Public/Granted literature
- US09836434B2 Runtime of CUBLAS matrix multiplication on GPU Public/Granted day:2017-12-05
Information query