Removal of degenerated sub-primitives in tessellation

    公开(公告)号:US10580209B2

    公开(公告)日:2020-03-03

    申请号:US15913480

    申请日:2018-03-06

    Abstract: This disclosure describes a method and apparatus for performing tessellation in a graphics process. A graphics processing unit may be configured to determine tessellation factors for a first patch of the graphics data, determine, based on the tessellation factors, that a first edge of an outermost ring of the first patch will produce only degenerated sub-primitives, and skip performing tessellation for the first edge. A graphics processing unit may determine that a second edge of the outermost ring of the first patch will produce at least some normal sub-primitives, and perform tessellation for the second edge to produce output primitives.

    Runtime mechanism to optimize shader execution flow

    公开(公告)号:US12229864B2

    公开(公告)日:2025-02-18

    申请号:US17817815

    申请日:2022-08-05

    Abstract: This disclosure provides systems, devices, apparatus, and methods, including computer programs encoded on storage media, for runtime optimization of the shader execution flow. A graphics processor may obtain instruction execution data associated with a graphics workload, the instruction execution data including graphics data for a set of shader operations. The graphics processor may configure, at a first iteration, at least one predication value based on the instruction execution data including the graphics data for the set of shader operations. The graphics processor may adjust, at a second iteration, an execution flow of the graphics workload based on the configured at least one predication value, the execution flow of the graphics workload including the set of shader operations. The graphics processor may execute or refrain from executing, at the second iteration, each of the set of shader operations based on the adjusted execution flow of the graphics workload.

    Run-time mechanism for optimal shader

    公开(公告)号:US12067666B2

    公开(公告)日:2024-08-20

    申请号:US17664033

    申请日:2022-05-18

    CPC classification number: G06T15/005 G06T1/60

    Abstract: Aspects presented herein relate to methods and devices for graphics processing including an apparatus, e.g., a GPU. The apparatus may receive a set of draw call instructions corresponding to a graphics workload, where the set of draw call instructions is associated with at least one run-time parameter. The apparatus may also obtain a first shader program associated with storing data in a system memory and at least one second shader program associated with storing data in a constant memory. Further, the apparatus may execute the first shader program or the at least one second shader program based on whether the at least one run-time parameter is less than or equal to a size of the constant memory. The apparatus may also update or maintain a configuration of a shader processor or a streaming processor based on executing the first shader program or the at least one second shader program.

    SHADER PROGRAM EXECUTION TECHNIQUES FOR USE IN GRAPHICS PROCESSING
    7.
    发明申请
    SHADER PROGRAM EXECUTION TECHNIQUES FOR USE IN GRAPHICS PROCESSING 有权
    用于图形处理的较差程序执行技术

    公开(公告)号:US20160055667A1

    公开(公告)日:2016-02-25

    申请号:US14466554

    申请日:2014-08-22

    CPC classification number: G06T15/83 G06T15/005

    Abstract: This disclosure describes techniques for executing shader programs in a graphics processing unit (GPU). In some examples, the techniques for executing shader programs may include executing, with a shader unit of a graphics processor, a shader program that performs vertex shader processing and that generates multiple output vertices for each input vertex that is received by the shader program. In further examples, the techniques for executing shader programs may include executing a merged vertex/geometry shader program using a non-replicated mode of execution. The non-replicated mode of execution may involve assigning each of a plurality of primitives to one merged vertex/geometry shader program instance per primitive and causing each of the instances to output a plurality of vertices. In additional examples, the techniques for executing shader programs may include techniques for selecting one of a non-replicated mode and a replicated mode for executing a merged vertex/geometry shader program.

    Abstract translation: 本公开描述了用于在图形处理单元(GPU)中执行着色器程序的技术。 在一些示例中,用于执行着色器程序的技术可以包括使用图形处理器的着色器单元执行着色器程序,该着色器程序执行顶点着色器处理,并且为着色器程序接收的每个输入顶点生成多个输出顶点。 在另外的示例中,用于执行着色器程序的技术可以包括使用非复制的执行模式来执行合并的顶点/几何着色器程序。 非复制的执行模式可以包括将多个基元中的每一个分配给每个基元的一个合并的顶点/几何着色器程序实例,并使每个实例输出多个顶点。 在附加示例中,用于执行着色器程序的技术可以包括用于选择非复制模式和用于执行合并顶点/几何着色器程序的复制模式之一的技术。

    Dynamic wave pairing
    8.
    发明授权

    公开(公告)号:US11954758B2

    公开(公告)日:2024-04-09

    申请号:US17652478

    申请日:2022-02-24

    CPC classification number: G06T1/20 G06F9/505

    Abstract: This disclosure provides systems, devices, apparatus, and methods, including computer programs encoded on storage media, for dynamic wave pairing. A graphics processor may allocate one or more GPU workloads to one or more wave slots of a plurality of wave slots. The graphics processor may select a first execution slot of a plurality of execution slots for executing the one or more GPU workloads. The selection may be based on one of a plurality of granularities. The graphics processor may execute, at the selected first execution slot, the one or more GPU workloads at the one of the plurality of granularities.

Patent Agency Ranking