Compute work distribution reference counters
    2.
    Granted invention patent
    Status: in force

    Publication No.: US09507638B2

    Publication date: 2016-11-29

    Application No.: US13291369

    Filing date: 2011-11-08

    IPC classification: G06F9/455 G06F9/50

    CPC classification: G06F9/5022

    Abstract: One embodiment of the present invention sets forth a technique for managing the allocation and release of resources during multi-threaded program execution. Programmable reference counters are initialized to values that limit the amount of resources for allocation to tasks that share the same reference counter. Resource parameters are specified for each task to define the amount of resources allocated for consumption by each array of execution threads that is launched to execute the task. The resource parameters also specify the behavior of the array for acquiring and releasing resources. Finally, during execution of each thread in the array, an exit instruction may be configured to override the release of the resources that were allocated to the array. The resources may then be retained for use by a child task that is generated during execution of a thread.
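
The counter behavior described in this abstract can be illustrated with a minimal sketch. The class and function names below (`ReferenceCounter`, `launch_thread_array`, `exit_thread_array`, `retain_for_child`) are hypothetical and do not come from the patent; the sketch only assumes that a counter starts at a limit, that each thread array acquires a fixed amount at launch, and that an exit can skip the release so a child task keeps the resources.

```python
class ReferenceCounter:
    """Hypothetical programmable counter that caps resources shared by tasks."""

    def __init__(self, limit):
        # The counter is initialized to the resource limit for its tasks.
        self.available = limit

    def acquire(self, amount):
        """Reserve `amount` units; return False if the limit would be exceeded."""
        if amount > self.available:
            return False
        self.available -= amount
        return True

    def release(self, amount):
        """Return `amount` units to the pool."""
        self.available += amount


def launch_thread_array(counter, amount):
    """Launch a thread array only if its resources can be allocated."""
    return counter.acquire(amount)


def exit_thread_array(counter, amount, retain_for_child=False):
    """Release resources on exit unless the exit overrides the release."""
    if not retain_for_child:
        counter.release(amount)
```

With a limit of 4, an array that holds 3 units and exits with `retain_for_child=True` leaves only 1 unit available until the child task's own exit releases the retained 3.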

    Scheduling and execution of compute tasks
    7.
    Granted invention patent
    Status: in force

    Publication No.: US09069609B2

    Publication date: 2015-06-30

    Application No.: US13353150

    Filing date: 2012-01-18

    IPC classification: G06F9/46 G06F9/48 G06F9/50

    Abstract: One embodiment of the present invention sets forth a technique for assigning a compute task to a first processor included in a plurality of processors. The technique involves analyzing each compute task in a plurality of compute tasks to identify one or more compute tasks that are eligible for assignment to the first processor, where each compute task is listed in a first table and is associated with a priority value and an allocation order that indicates the relative time at which the compute task was added to the first table. The technique further involves selecting a first compute task from the identified one or more compute tasks based on at least one of the priority value and the allocation order, and assigning the first compute task to the first processor for execution.
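
The selection step in this abstract can be sketched as follows. The abstract only says the choice is based on at least one of the priority value and the allocation order, so the specific policy here is an assumption: a larger priority value wins, and ties go to the task that was added to the table earlier. The names `ComputeTask`, `eligible_processors`, and `select_task` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ComputeTask:
    name: str
    priority: int                   # assumption: larger value = more urgent
    allocation_order: int           # assumption: smaller value = added earlier
    eligible_processors: frozenset  # hypothetical eligibility record


def select_task(table, processor_id):
    """Pick the task to assign to `processor_id`, or None if none is eligible."""
    # Step 1: identify the tasks that may run on this processor.
    eligible = [t for t in table if processor_id in t.eligible_processors]
    if not eligible:
        return None
    # Step 2: highest priority first, then earliest allocation order.
    return min(eligible, key=lambda t: (-t.priority, t.allocation_order))
```

Sorting on the pair `(-priority, allocation_order)` keeps the policy in one place, so a different tie-break rule only requires changing the key.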

    Pixel shader output map
    8.
    Granted invention patent
    Status: in force

    Publication No.: US08922555B2

    Publication date: 2014-12-30

    Application No.: US12898998

    Filing date: 2010-10-06

    IPC classification: G06T15/00

    CPC classification: G06T15/005

    Abstract: One embodiment of the present invention sets forth a technique for storing only the enabled components for each enabled vector and writing only enabled components to one or more specified render targets. A shader program header (SPH) file provides per-component mask bits for each render target. Each enabled mask bit indicates that the pixel shader generates the corresponding component as an output to the raster operations unit (ROP). In the hardware, the per-component mask bits are combined with the application programming interface (API)-level per-component write masks to determine the components that are updated by the shader program. The combined mask is used as the write enable bits for components in one or more render targets. One advantage of the combined mask is that the components that are not updated are not forwarded from the pixel shader to the ROP, thereby saving bandwidth between those processing units.
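
A minimal sketch of the mask combination described in this abstract, for a single render target. It assumes that "combined" means a bitwise AND of the SPH mask and the API write mask, and that bit 0 through bit 3 map to the r, g, b, and a components; neither detail is stated in the abstract, and the function names are hypothetical.

```python
def combined_write_mask(sph_mask, api_write_mask):
    """Combine the SPH per-component mask with the API write mask.

    Assumption: a component is written only if both masks enable it.
    """
    return sph_mask & api_write_mask


def components_to_forward(color, mask):
    """Return only the enabled components of an (r, g, b, a) tuple.

    Assumption: bit i of `mask` enables component i in r, g, b, a order.
    """
    return {
        name: value
        for i, (name, value) in enumerate(zip("rgba", color))
        if mask & (1 << i)
    }
```

If the shader produces r, g, and b (`0b0111`) and the API allows r, b, and a (`0b1101`), the combined mask is `0b0101`, so only r and b would be forwarded to the ROP.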

    Shader program headers
    9.
    Granted invention patent
    Status: in force

    Publication No.: US08786618B2

    Publication date: 2014-07-22

    Application No.: US12899431

    Filing date: 2010-10-06

    IPC classification: G06T1/00

    CPC classification: G06T15/005

    Abstract: One embodiment of the present invention sets forth a technique for configuring a graphics processing pipeline (GPP) to process data according to one or more shader programs. The method includes receiving a plurality of pointers, where each pointer references a different shader program header (SPH) included in a plurality of SPHs, and each SPH is associated with a different shader program that executes within the GPP. For each SPH included in the plurality of SPHs, one or more GPP configuration parameters included in the SPH are identified, and the GPP is adjusted based on the one or more GPP configuration parameters.
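
The configuration loop in this abstract might look like the sketch below. Everything concrete here is an assumption made for illustration: pointers are modeled as dictionary keys into a `memory` mapping, each SPH is a dictionary with a `config` entry, and the parameter names in the usage note are invented rather than taken from any real SPH layout.

```python
def configure_pipeline(pipeline_config, sph_pointers, memory):
    """Adjust a pipeline configuration from a list of SPH pointers.

    pipeline_config: dict of current pipeline settings (modified in place).
    sph_pointers:    iterable of hypothetical addresses, one per SPH.
    memory:          dict mapping each address to an SPH record.
    """
    for pointer in sph_pointers:
        sph = memory[pointer]  # follow the pointer to its header
        # Identify the configuration parameters carried by this header
        # and apply each one to the pipeline.
        for parameter, value in sph["config"].items():
            pipeline_config[parameter] = value
    return pipeline_config
```

For example, with `memory = {0x10: {"config": {"vertex_outputs": 4}}, 0x20: {"config": {"pixel_outputs": 1}}}`, calling `configure_pipeline({}, [0x10, 0x20], memory)` collects both settings into one configuration.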

    GPU work creation and stateless graphics in OPENGL
    10.
    Granted invention patent
    Status: in force

    Publication No.: US09275491B2

    Publication date: 2016-03-01

    Application No.: US13078878

    Filing date: 2011-04-01

    Abstract: One embodiment of the present invention sets forth a method for generating work to be processed by a graphics pipeline residing within a graphics processor. The method includes the steps of receiving an indication that a first graphics workload is to be submitted to a command queue associated with the graphics processor, allocating a first portion of shader accessible memory for one or more units of state information that are necessary for processing the first graphics workload, populating the first portion of shader accessible memory with the one or more units of state information, and transmitting to the command queue of the graphics processor the one or more units of state information stored within the first portion of shader accessible memory, wherein the first graphics workload is processed within the graphics pipeline based on the one or more units of state information.
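
The allocate, populate, and transmit steps listed in this abstract can be sketched as below. This is a toy model under stated assumptions, not OpenGL or driver code: `ShaderAccessibleMemory`, `submit_workload`, and the list used as a command queue are all hypothetical stand-ins.

```python
class ShaderAccessibleMemory:
    """Hypothetical pool of memory that shaders can read state from."""

    def __init__(self):
        self._blocks = {}
        self._next_handle = 0

    def allocate(self, count):
        """Reserve room for `count` state units and return a handle."""
        handle = self._next_handle
        self._next_handle += 1
        self._blocks[handle] = [None] * count
        return handle

    def populate(self, handle, units):
        """Fill a previously allocated portion with state units."""
        block = self._blocks[handle]
        if len(units) != len(block):
            raise ValueError("unit count does not match the allocation")
        block[:] = units

    def read(self, handle):
        """Return a copy of the state units stored under `handle`."""
        return list(self._blocks[handle])


def submit_workload(memory, command_queue, workload, state_units):
    """Allocate, populate, then transmit the state with the workload."""
    handle = memory.allocate(len(state_units))
    memory.populate(handle, state_units)
    command_queue.append((workload, memory.read(handle)))
    return handle
```

Because each workload carries its own copy of the state it needs, a consumer of the queue can process an entry without consulting any global state, which is the sense of "stateless" suggested by the patent title.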
