SYSTEM AND METHOD FOR COMPILER SUPPORT FOR KERNEL LAUNCHES IN DEVICE CODE
    1.
    Invention application (granted)

    Publication number: US20130300752A1

    Publication date: 2013-11-14

    Application number: US13735981

    Filing date: 2013-01-07

    CPC classification number: G06F9/54 G06F8/41 G06F9/4843

    Abstract: A system and method for compiling source code (e.g., with a compiler). The method includes accessing a portion of device source code and determining whether the portion of the device source code comprises a piece of work to be launched on a device from the device. The method further includes determining a plurality of application programming interface (API) calls based on the piece of work to be launched on the device and generating compiled code based on the plurality of API calls. The compiled code comprises a first portion operable to execute on a central processing unit (CPU) and a second portion operable to execute on the device (e.g., GPU).
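
As a rough illustration of the steps in this abstract, the sketch below scans a portion of device source for a CUDA-style `<<<...>>>` launch and expands it into a list of runtime calls. The regular expression, the `Launch` record, and the `rt_*` call names are hypothetical and are not taken from the patent; a real compiler would operate on a parsed syntax tree rather than on text.

```python
import re
from dataclasses import dataclass

# Matches a CUDA-style launch such as `child<<<grid, block>>>(a, b);`
LAUNCH_RE = re.compile(
    r"(\w+)\s*<<<\s*([^,>]+?)\s*,\s*([^>]+?)\s*>>>\s*\(([^)]*)\)\s*;"
)


@dataclass
class Launch:
    kernel: str
    grid: str
    block: str
    args: list


def find_device_launch(device_source):
    """Return the first launch found in a portion of device source, or None."""
    match = LAUNCH_RE.search(device_source)
    if match is None:
        return None
    args = [a.strip() for a in match.group(4).split(",") if a.strip()]
    return Launch(match.group(1), match.group(2), match.group(3), args)


def lower_launch(launch):
    """Expand one launch into a sequence of (hypothetical) runtime API calls."""
    calls = [f"buf = rt_get_parameter_buffer({len(launch.args)})"]
    for index, arg in enumerate(launch.args):
        calls.append(f"rt_set_parameter(buf, {index}, {arg})")
    calls.append(
        f"rt_launch_device({launch.kernel}, buf, {launch.grid}, {launch.block})"
    )
    return calls


source = "__global__ void parent(int *data, int n) { child<<<4, 64>>>(data, n); }"
launch = find_device_launch(source)
api_calls = lower_launch(launch)
```

Here `api_calls` holds four entries: one buffer request, one parameter store per argument, and the final launch call. Splitting the launch into several calls mirrors the abstract's "plurality of API calls"; the split into CPU and device portions of the compiled code is not modeled.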

    MEMORY MANAGEMENT SYSTEM
    2.
    Invention application

    Publication number: US20200264970A1

    Publication date: 2020-08-20

    Application number: US16279884

    Filing date: 2019-02-19

    Abstract: A computer system manages the allocation of memory to an application program using a dependency tree. The dependency tree informs a memory manager of data inputs, data outputs, and intermediate values associated with execution of the application program. The memory manager allocates a single heap structure within a physical memory. Data associated with each node of the dependency tree is allocated to the heap structure so that data input values are allocated in a contiguous block, and intermediate values are allocated separately. In various examples, as execution of the application program proceeds, the separation of intermediate values from non-intermediate values within the heap reduces memory fragmentation providing improved performance of the computer system as a whole.
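
One way to picture the allocation scheme in this abstract is the minimal sketch below, which plans offsets inside a single heap from a dependency tree. It assumes that non-intermediate values (inputs, then outputs) are packed upward from offset 0 and intermediates are packed downward from the end of the heap; that placement policy, the `Node` record, and `plan_heap` are illustrative assumptions, not details from the patent.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    name: str
    kind: str  # "input", "output", or "intermediate"
    size: int  # bytes
    children: list = field(default_factory=list)


def plan_heap(root, heap_size):
    """Assign each tree node a byte offset inside one heap of heap_size bytes."""
    # Depth-first walk of the dependency tree, visiting each node once.
    nodes, stack, seen = [], [root], set()
    while stack:
        node = stack.pop()
        if id(node) in seen:
            continue
        seen.add(id(node))
        nodes.append(node)
        stack.extend(reversed(node.children))

    offsets = {}
    low, high = 0, heap_size
    # Inputs first, so they form one contiguous block at the start of the heap.
    for kind in ("input", "output"):
        for node in nodes:
            if node.kind == kind:
                offsets[node.name] = low
                low += node.size
    # Intermediates are kept apart, growing down from the end of the heap.
    for node in nodes:
        if node.kind == "intermediate":
            high -= node.size
            offsets[node.name] = high
    if low > high:
        raise MemoryError("heap too small for this dependency tree")
    return offsets


tree = Node("result", "output", 16, [
    Node("tmp", "intermediate", 32, [
        Node("a", "input", 8),
        Node("b", "input", 8),
    ]),
])
layout = plan_heap(tree, heap_size=128)
```

Because short-lived intermediates never sit between inputs and outputs in this layout, freeing them leaves one free region rather than scattered holes, which is the fragmentation effect the abstract describes.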

    CODE COVERAGE GENERATION IN GPU BY USING HOST-DEVICE COORDINATION

    Publication number: US20190108006A1

    Publication date: 2019-04-11

    Application number: US16154542

    Filing date: 2018-10-08

    Abstract: System and method of compiling a program having a mixture of host code and device code to enable code coverage data collection for device code execution. An exemplary integrated compiler can compile source code programmed to be executed by a host processor (e.g., CPU) and a co-processor (e.g., a GPU) concurrently. The compilation can generate an instrumented executable code which includes: coverage instrumentation counters for the device functions; mapping information that maps the counters with the instrumented source points; and instructions for the host processor to allocate and initialize device memory for the counters and to retrieve collected code coverage information from the device memory to the host memory. Execution of the instrumented executable can yield a coverage report on the device code functions.
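
The host-device coordination described in this abstract can be simulated in a few lines. In the sketch below a plain Python list stands in for device memory, and all function names are hypothetical; it only illustrates the three pieces the abstract lists (counters, a counter-to-source mapping, and host-side allocate/retrieve steps), not the patented compiler.

```python
def build_counter_map(source_points):
    """Map counter index -> instrumented source point (function name, line)."""
    return {index: point for index, point in enumerate(source_points)}


def host_allocate_counters(counter_map):
    """Host step: allocate and zero one counter per source point.

    The returned list stands in for a buffer in device memory.
    """
    return [0] * len(counter_map)


def device_hit(device_counters, index):
    """Device step: instrumented code bumps its counter when executed."""
    device_counters[index] += 1


def host_collect_report(device_counters, counter_map):
    """Host step: copy counters back and join them with the mapping."""
    host_copy = list(device_counters)  # stands in for a device-to-host copy
    return {counter_map[i]: count for i, count in enumerate(host_copy)}


counter_map = build_counter_map([("scale", 10), ("scale", 12), ("clamp", 20)])
counters = host_allocate_counters(counter_map)
for index in (0, 0, 1):  # pretend `scale` ran; `clamp` never did
    device_hit(counters, index)
report = host_collect_report(counters, counter_map)
```

A source point whose count stays at zero, such as `("clamp", 20)` here, is reported as uncovered.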

    Optimizing multiple invocations of graphics processing unit programs in Java

    Publication number: US10324693B2

    Publication date: 2019-06-18

    Application number: US14950134

    Filing date: 2015-11-24

    Abstract: A system and method for optimizing multiple invocations of a graphics processing unit (GPU) program in Java. In one embodiment, the system includes: (1) a frontend component in a computer system and configured to compile Java bytecode associated with a class object that implements a functional interface into Intermediate Representation (IR) code and store the IR code with the associated jogArray and (2) a collector/composer component in the computer system, associated with the frontend and configured to traverse a tree containing the multiple invocations from the result to collect the IR code and compose the IR code collected in the traversing into aggregate IR code when a result of the GPU program is explicitly requested to be transferred to a host.
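
The deferred collect-and-compose idea in this abstract can be sketched as follows. The example is written in Python rather than Java, uses plain strings as stand-ins for IR code, and the `LazyArray` class with its `map` and `to_host` methods is a hypothetical substitute for the jogArray mentioned in the abstract.

```python
class LazyArray:
    """Array handle that records IR instead of running it immediately."""

    def __init__(self, ir=None, inputs=()):
        self.ir = ir                  # IR stored with this array, if any
        self.inputs = tuple(inputs)   # arrays this one was computed from

    def map(self, ir):
        """Record one more invocation; nothing executes yet."""
        return LazyArray(ir, (self,))

    def to_host(self):
        """Explicit transfer request: compose all pending IR into one unit."""
        return compose(self)


def compose(result):
    """Walk the invocation tree from the result and aggregate its IR.

    Post-order traversal, so producers come before their consumers.
    """
    collected, seen = [], set()

    def visit(node):
        if id(node) in seen:
            return
        seen.add(id(node))
        for parent in node.inputs:
            visit(parent)
        if node.ir is not None:
            collected.append(node.ir)

    visit(result)
    return "\n".join(collected)


source = LazyArray()
doubled = source.map("t0 = x * 2")
shifted = doubled.map("t1 = t0 + 1")
aggregate_ir = shifted.to_host()
```

Deferring composition until `to_host` is called means the two recorded invocations can be handed to the GPU as one aggregate program instead of two separate ones.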

    System and method for compiler support for kernel launches in device code

    Publication number: US10025643B2

    Publication date: 2018-07-17

    Application number: US13735981

    Filing date: 2013-01-07

    Abstract: A system and method for compiling source code (e.g., with a compiler). The method includes accessing a portion of device source code and determining whether the portion of the device source code comprises a piece of work to be launched on a device from the device. The method further includes determining a plurality of application programming interface (API) calls based on the piece of work to be launched on the device and generating compiled code based on the plurality of API calls. The compiled code comprises a first portion operable to execute on a central processing unit (CPU) and a second portion operable to execute on the device (e.g., GPU).

    MEMORY MANAGEMENT SYSTEM
    9.
    Invention application

    Publication number: US20250138999A1

    Publication date: 2025-05-01

    Application number: US19011300

    Filing date: 2025-01-06

    Abstract: A computer system manages the allocation of memory to an application program using a dependency tree. The dependency tree informs a memory manager of data inputs, data outputs, and intermediate values associated with execution of the application program. The memory manager allocates a single heap structure within a physical memory. Data associated with each node of the dependency tree is allocated to the heap structure so that data input values are allocated in a contiguous block, and intermediate values are allocated separately. In various examples, as execution of the application program proceeds, the separation of intermediate values from non-intermediate values within the heap reduces memory fragmentation providing improved performance of the computer system as a whole.