Invention Grant
- Patent Title: Device profiling in GPU accelerators by using host-device coordination
-
Application No.: US16939313Application Date: 2020-07-27
-
Publication No.: US11579852B2Publication Date: 2023-02-14
- Inventor: Hariharan Sandanagobalane , Sean Lee , Vinod Grover
- Applicant: NVIDIA Corporation
- Applicant Address: US CA Santa Clara
- Assignee: NVIDIA Corporation
- Current Assignee: NVIDIA Corporation
- Current Assignee Address: US CA Santa Clara
- Agency: Taylor English Duma LLP
- Main IPC: G06F8/41
- IPC: G06F8/41 ; G06F11/36 ; G06F16/903 ; G06F16/901 ; G06F9/445

Abstract:
System and method of compiling a program having a mixture of host code and device code to enable Profile Guided Optimization (PGO) for device code execution. An exemplary integrated compiler can compile source code programmed to be executed by a host processor (e.g., CPU) and a co-processor (e.g., a GPU) concurrently. The compilation can generate an instrumented executable code which includes: profile instrumentation counters for the device functions; and instructions for the host processor to allocate and initialize device memory for the counters and to retrieve collected profile information from the device memory to generate instrumentation output. The output is fed back to the compiler for compiling the source code a second time to generate optimized executable code for the device functions defined in the source code.
Information query