Method and apparatus for occlusion culling of graphic objects
    71.
    发明申请
    Method and apparatus for occlusion culling of graphic objects 审中-公开
    图形对象遮挡剔除的方法和装置

    公开(公告)号:US20060209065A1

    公开(公告)日:2006-09-21

    申请号:US11298167

    申请日:2005-12-08

    IPC分类号: G06T15/40

    CPC分类号: G06T15/40

    摘要: A method of occlusion culling of graphic objects, comprising the steps of storing a first mask and one or more depth values associated with areas inside and outside the mask for a pre-defined region, and evaluating the visibility of the primitive covering the same region, wherein visibility evaluation begins after the computation of the coverage mask of the primitive in the region, and the computation of one or more depth values representing the pixels of the primitive. The method of the present invention is a real-time method of generating per-region coverage mask and associated Z values after the second primitive is rendered in the same region, which can maximize the bandwidth savings for Z read for both overlapping and non-overlapping primitives, with different relations between their depth values.

    摘要翻译: 一种图形对象的遮挡剔除方法,包括以下步骤:存储第一掩码和与掩模内部和外部的区域相关联的一个或多个深度值用于预定义区域,以及评估覆盖相同区域的图元的可视性, 其中在计算区域中的原语的覆盖掩码之后开始可视性评估,以及计算表示图元的像素的一个或多个深度值。 本发明的方法是在将第二原语呈现在相同区域中之后生成每区域覆盖掩码和相关联的Z值的实时方法,这可以使重叠和非重叠的Z读取的带宽节省最大化 原始图像,其深度值之间具有不同的关系。

    PIXEL VALUE COMPACTION FOR GRAPHICS PROCESSING
    72.
    发明申请
    PIXEL VALUE COMPACTION FOR GRAPHICS PROCESSING 有权
    用于图形处理的像素值压缩

    公开(公告)号:US20120262493A1

    公开(公告)日:2012-10-18

    申请号:US13085855

    申请日:2011-04-13

    IPC分类号: G06F13/14 G06T3/40

    摘要: In general, aspects of this disclosure describe example techniques for efficient usage of the fixed data rate processing of a graphics processing unit (GPU) for a variable data rate processing. For example, the GPU may be coupled to a pixel value processing unit that receives pixel values for pixels in an image processed by the GPU. The pixel value processing unit may determine whether the pixel values are for pixels that require further processing, and store the pixel values for the pixels that are required for further processing in a buffer.

    摘要翻译: 通常,本公开的方面描述了用于有效使用用于可变数据速率处理的图形处理单元(GPU)的固定数据速率处理的示例技术。 例如,GPU可以耦合到像素值处理单元,其接收由GPU处理的图像中的像素的像素值。 像素值处理单元可以确定像素值是否用于需要进一步处理的像素,并且将用于进一步处理所需的像素的像素值存储在缓冲器中。

    Efficient scissoring for graphics application
    73.
    发明授权
    Efficient scissoring for graphics application 有权
    图形应用程序的高效剪裁

    公开(公告)号:US08269792B2

    公开(公告)日:2012-09-18

    申请号:US11562379

    申请日:2006-11-21

    IPC分类号: G06T11/00

    CPC分类号: G06T15/30 G06T2200/28

    摘要: Scissoring for any number of scissoring regions is performed in a sequential order by drawing one scissoring region at a time on a drawing surface and updating scissor values for pixels within each scissoring region. A scissor value for a pixel may indicate the number of scissoring regions covering the pixel and may be incremented for each scissoring region covering the pixel. A scissor value for a pixel may also be a bitmap, and a bit for a scissoring region may be set to one if the pixel is within the scissoring region. Pixels within a region of interest are passed and rendered, and pixels outside of the region are discarded. This region may be defined by a reference value, which may be set to (a) one for the union of all scissoring regions, for a scissoring UNION operation, or (b) larger than one for the intersection of multiple (e.g., all) scissoring regions, for a scissoring AND operation.

    摘要翻译: 通过在绘图面上一次绘制一个剪切区域并且更新每个剪切区域内的像素的剪刀值,以顺序的顺序执行任意数量的剪切区域的剪切。 像素的剪刀值可以指示覆盖像素的剪切区域的数量,并且可以针对覆盖像素的每个剪切区域增加剪裁值。 像素的剪刀值也可以是位图,并且如果像素在剪切区域内,则可以将剪切区域的位设置为1。 传递和渲染感兴趣区域内的像素,并且丢弃该区域外的像素。 该区域可以由参考值定义,其可以被设置为(a)用于所有剪切区域的联合,用于剪切UNION操作,或者(b)对于多个(例如,全部)的交点大于一个, 剪切区域,用于剪切和操作。

    Discarding of vertex points during two-dimensional graphics rendering using three-dimensional graphics hardware
    74.
    发明授权
    Discarding of vertex points during two-dimensional graphics rendering using three-dimensional graphics hardware 有权
    使用三维图形硬件在二维图形渲染期间舍弃顶点

    公开(公告)号:US08269775B2

    公开(公告)日:2012-09-18

    申请号:US12331273

    申请日:2008-12-09

    IPC分类号: G06T11/20

    CPC分类号: G06T11/203

    摘要: This disclosure describes techniques for removing vertex points during two-dimensional (2D) graphics rendering using three-dimensional (3D) graphics hardware. In accordance with the described techniques one or more vertex points may be removed during 2D graphics rendering using 3D graphics hardware. For example, the techniques may remove redundant vertex points in the display coordinate space by discarding vertex points that have the substantially same positional coordinates in the display coordinate space as a previous vertex point. Alternatively or additionally, the techniques may remove excess vertex points that lie in a straight line. Removing the redundant vertex points or vertex points that lie in a straight line allow for more efficient utilization of the hardware resources of the GPU and increase the speed at which the GPU renders the image for display.

    摘要翻译: 本公开描述了使用三维(3D)图形硬件在二维(2D)图形渲染期间去除顶点的技术。 根据描述的技术,可以使用3D图形硬件在2D图形渲染期间移除一个或多个顶点。 例如,这些技术可以通过丢弃在显示坐标空间中具有与先前顶点相同的位置坐标的顶点来去除显示坐标空间中的冗余顶点。 或者或另外,这些技术可以去除位于直线上的多余顶点。 去除位于直线上的冗余顶点或顶点可以更有效地利用GPU的硬件资源,并提高GPU渲染图像以进行显示的速度。

    OUT-OF-ORDER COMMAND EXECUTION
    75.
    发明申请
    OUT-OF-ORDER COMMAND EXECUTION 审中-公开
    不合格的命令执行

    公开(公告)号:US20120017069A1

    公开(公告)日:2012-01-19

    申请号:US12837600

    申请日:2010-07-16

    IPC分类号: G06F9/30

    摘要: Techniques are described for reordering commands to improve the speed at which at least one command stream may execute. Prior to distributing commands in the at least one command stream to multiple pipelines, a multimedia processor analyzes any inter-pipeline dependencies and determines the current execution state of the pipelines. The processor may, based on this information, reorder the at least one command stream by prioritizing commands that lack any current dependencies and therefore may be executed immediately by the appropriate pipeline. Such out of order execution of commands in the at least one command stream may increase the throughput of the multimedia processor by increasing the rate at which the command stream is executed.

    摘要翻译: 描述了用于重新排序命令以提高至少一个命令流可以执行的速度的技术。 在将至少一个命令流中的命令分配到多个管线之前,多媒体处理器分析任何流水线间依赖性并确定管道的当前执行状态。 基于该信息,处理器可以通过对缺少任何当前依赖性的命令进行优先级排序来重新排序至少一个命令流,因此可以由适当的管道立即执行。 在至少一个命令流中命令执行的这种不正常执行可以通过增加命令流被执行的速率来增加多媒体处理器的吞吐量。

    DATA ACCESS TOOL FOR PROGRAMMABLE GRAPHICS HARDWARE
    76.
    发明申请
    DATA ACCESS TOOL FOR PROGRAMMABLE GRAPHICS HARDWARE 有权
    数据访问工具可编程图形硬件

    公开(公告)号:US20090027407A1

    公开(公告)日:2009-01-29

    申请号:US11782509

    申请日:2007-07-24

    IPC分类号: G06T1/00

    CPC分类号: G06T15/005

    摘要: Methods and apparatuses for accessing data within programmable graphics hardware are provided. According to one aspect, a user inserts special log commands into a software program, which is compiled into instructions for the programmable graphics hardware to execute. The hardware writes data to an external memory during runtime according to a flow control protocol, and the software driver reads the data from the memory to display to the user.

    摘要翻译: 提供了在可编程图形硬件内访问数据的方法和装置。 根据一个方面,用户将特殊的日志命令插入到可编程图形硬件执行的指令中的软件程序中。 根据流控制协议,硬件在运行时将数据写入外部存储器,软件驱动程序从存储器读取数据以显示给用户。

    Methods and systems for performance monitoring in a graphics processing unit
    77.
    发明申请
    Methods and systems for performance monitoring in a graphics processing unit 审中-公开
    用于图形处理单元中性能监控的方法和系统

    公开(公告)号:US20070139421A1

    公开(公告)日:2007-06-21

    申请号:US11314184

    申请日:2005-12-21

    IPC分类号: G06T1/00

    CPC分类号: G06T1/20

    摘要: Provided is a system for monitoring the performance in a computer graphics processor having a plurality of pipeline processing blocks in a graphics pipeline. The system includes: performance monitoring logic, configured to gather data corresponding to graphics pipeline performance; a plurality of counting logic blocks, located within the performance monitoring logic; a plurality of logical counters, located in each of the plurality of pipeline processing blocks, configured to transmit a plurality of count signals to the performance monitoring logic; a plurality of counter configuration registers, configured to map a portion of the plurality of logical counters to the plurality of counting logic blocks; and a command processor configured to provide a plurality of commands to the performance monitoring logic.

    摘要翻译: 提供了一种用于在图形管线中监视具有多个流水线处理块的计算机图形处理器中的性能的系统。 该系统包括:性能监控逻辑,配置为收集对应于图形流水线性能的数据; 多个计数逻辑块,位于性能监视逻辑内; 多个逻辑计数器,位于多个流水线处理块中的每一个中,被配置为将多个计数信号发送到性能监视逻辑; 多个计数器配置寄存器,被配置为将所述多个逻辑计数器的一部分映射到所述多个计数逻辑块; 以及命令处理器,被配置为向所述性能监视逻辑提供多个命令。

    Program flow control for multiple divergent SIMD threads using a minimum resume counter
    78.
    发明授权
    Program flow control for multiple divergent SIMD threads using a minimum resume counter 有权
    使用最小恢复计数器对多个发散SIMD线程进行程序流控制

    公开(公告)号:US08832417B2

    公开(公告)日:2014-09-09

    申请号:US13227274

    申请日:2011-09-07

    IPC分类号: G06F9/38

    摘要: This disclosure describes techniques for handling divergent thread conditions in a multi-threaded processing system. In some examples, a control flow unit may obtain a control flow instruction identified by a program counter value stored in a program counter register. The control flow instruction may include a target value indicative of a target program counter value for the control flow instruction. The control flow unit may select one of the target program counter value and a minimum resume counter value as a value to load into the program counter register. The minimum resume counter value may be indicative of a smallest resume counter value from a set of one or more resume counter values associated with one or more inactive threads. Each of the one or more resume counter values may be indicative of a program counter value at which a respective inactive thread should be activated.

    摘要翻译: 本公开描述了用于处理多线程处理系统中的发散线程状况的技术。 在一些示例中,控制流程单元可以获得由存储在程序计数器寄存器中的程序计数器值所标识的控制流程指令。 控制流程指令可以包括指示控制流程指令的目标程序计数器值的目标值。 控制流程单元可以选择目标程序计数器值和最小恢复计数器值之一作为加载到程序计数器寄存器中的值。 最小恢复计数器值可以指示与一个或多个非活动线程相关联的一个或多个恢复计数器值的集合中的最小恢复计数器值。 一个或多个恢复计数器值中的每一个可以指示应该激活相应的非活动线程的程序计数器值。

    TECHNIQUES FOR HANDLING DIVERGENT THREADS IN A MULTI-THREADED PROCESSING SYSTEM
    79.
    发明申请
    TECHNIQUES FOR HANDLING DIVERGENT THREADS IN A MULTI-THREADED PROCESSING SYSTEM 有权
    在多螺纹加工系统中处理多条螺纹的技术

    公开(公告)号:US20130061027A1

    公开(公告)日:2013-03-07

    申请号:US13227274

    申请日:2011-09-07

    IPC分类号: G06F9/38 G06F9/30

    摘要: This disclosure describes techniques for handling divergent thread conditions in a multi-threaded processing system. In some examples, a control flow unit may obtain a control flow instruction identified by a program counter value stored in a program counter register. The control flow instruction may include a target value indicative of a target program counter value for the control flow instruction. The control flow unit may select one of the target program counter value and a minimum resume counter value as a value to load into the program counter register. The minimum resume counter value may be indicative of a smallest resume counter value from a set of one or more resume counter values associated with one or more inactive threads. Each of the one or more resume counter values may be indicative of a program counter value at which a respective inactive thread should be activated.

    摘要翻译: 本公开描述了用于处理多线程处理系统中的发散线程状况的技术。 在一些示例中,控制流程单元可以获得由存储在程序计数器寄存器中的程序计数器值所标识的控制流程指令。 控制流程指令可以包括指示控制流程指令的目标程序计数器值的目标值。 控制流程单元可以选择目标程序计数器值和最小恢复计数器值之一作为加载到程序计数器寄存器中的值。 最小恢复计数器值可以指示与一个或多个非活动线程相关联的一个或多个恢复计数器值的集合中的最小恢复计数器值。 一个或多个恢复计数器值中的每一个可以指示应该激活相应的非活动线程的程序计数器值。

    System and method of mapping shader variables into physical registers
    80.
    发明授权
    System and method of mapping shader variables into physical registers 有权
    将着色器变量映射到物理寄存器的系统和方法

    公开(公告)号:US08379032B2

    公开(公告)日:2013-02-19

    申请号:US11864484

    申请日:2007-09-28

    IPC分类号: G06F15/00

    CPC分类号: G06T15/005 G06F8/441

    摘要: The present disclosure includes system and method of mapping shader variables into physical registers. In an embodiment, a graphics processing unit (GPU) and a memory coupled to the GPU are disclosed. The memory includes a processor readable data file that has a register file portion. The register file portion has a rectangular structure including a plurality of data items. At least two of the plurality of data items corresponding to data elements of a shader program. The data elements have different data storage types.

    摘要翻译: 本公开包括将着色器变量映射到物理寄存器的系统和方法。 在一个实施例中,公开了一种图形处理单元(GPU)和耦合到GPU的存储器。 存储器包括具有寄存器文件部分的处理器可读数据文件。 寄存器文件部分具有包括多个数据项的矩形结构。 与着色器程序的数据元素对应的多个数据项中的至少两个。 数据元素具有不同的数据存储类型。