Programmable streaming processor with mixed precision instruction execution
    1.
    Granted invention patent
    Legal status: In force

    Publication number: US08633936B2

    Publication date: 2014-01-21

    Application number: US12106654

    Filing date: 2008-04-21

    IPC classification: G06T1/00 G06F15/00 G06F15/16

    CPC classification: G06T15/005 G06F8/47

    Abstract: The disclosure relates to a programmable streaming processor that is capable of executing mixed-precision (e.g., full-precision, half-precision) instructions using different execution units. The various execution units are each capable of using graphics data to execute instructions at a particular precision level. An exemplary programmable shader processor includes a controller and multiple execution units. The controller is configured to receive an instruction for execution and to receive an indication of a data precision for execution of the instruction. The controller is also configured to receive a separate conversion instruction that, when executed, converts graphics data associated with the instruction to the indicated data precision. When operable, the controller selects one of the execution units based on the indicated data precision. The controller then causes the selected execution unit to execute the instruction with the indicated data precision using the graphics data associated with the instruction.

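The dispatch idea in this abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the patented design: the names `Controller`, `ExecutionUnit`, `Instruction`, `to_half` and `to_full` are hypothetical, and precision is emulated by rounding through the `struct` module's half (`e`) and single (`f`) formats.

```python
import struct
from dataclasses import dataclass


def to_half(x: float) -> float:
    """Round a value to IEEE 754 half precision (emulated in software)."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def to_full(x: float) -> float:
    """Round a value to IEEE 754 single precision (emulated in software)."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


@dataclass
class Instruction:
    op: str            # "add" or "mul" (hypothetical opcode names)
    precision: str     # "half" or "full": the indicated data precision
    operands: tuple    # two input values


class ExecutionUnit:
    """One execution unit that always works at a single precision level."""

    def __init__(self, name, rounder):
        self.name = name
        self.rounder = rounder

    def execute(self, op, a, b):
        if op == "add":
            result = a + b
        elif op == "mul":
            result = a * b
        else:
            raise ValueError(f"unknown op: {op}")
        return self.rounder(result)


class Controller:
    def __init__(self):
        self.units = {
            "half": ExecutionUnit("half", to_half),
            "full": ExecutionUnit("full", to_full),
        }

    def convert(self, values, precision):
        """Stand-in for the separate conversion instruction."""
        rounder = self.units[precision].rounder
        return tuple(rounder(v) for v in values)

    def dispatch(self, instr):
        """Select the unit from the indicated precision, then execute."""
        unit = self.units[instr.precision]
        a, b = instr.operands
        return unit.name, unit.execute(instr.op, a, b)
```

A caller would first convert the operands and then dispatch, for example `ctrl.dispatch(Instruction("add", "half", ctrl.convert((1.0, 0.5), "half")))`.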

    Demand based power control in a graphics processing unit
    2.
    Granted invention patent
    Legal status: In force

    Publication number: US08458497B2

    Publication date: 2013-06-04

    Application number: US11870597

    Filing date: 2007-10-11

    Abstract: Disclosed herein is a power controller for use with a graphics processing unit. The power controller monitors, manages and controls power supplied to components of a pipeline of the graphics processing unit. The power controller determines whether and to what extent power is to be supplied to a pipeline component based on status information received by the power controller in connection with the pipeline component. The power controller is capable of identifying a trend using the received status information, and determining whether and to what extent power is to be supplied to a pipeline component based on the identified trend.

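A rough Python sketch of trend-based power control follows. The abstract does not say what the status information contains or how the trend is computed, so everything here is an assumption: status is modelled as a busy fraction between 0 and 1, the trend is the average change per sample over a short window, and `PowerController`, `report` and `power_level` are hypothetical names.

```python
from collections import deque


class PowerController:
    """Tracks one pipeline component's status and derives a power level."""

    def __init__(self, window=4):
        # Keep only the most recent status samples.
        self.history = deque(maxlen=window)

    def report(self, busy_fraction):
        """Record status information received for the component."""
        self.history.append(busy_fraction)

    def trend(self):
        """Average change per sample across the window (0.0 if unknown)."""
        if len(self.history) < 2:
            return 0.0
        return (self.history[-1] - self.history[0]) / (len(self.history) - 1)

    def power_level(self):
        """Extrapolate one step ahead and clamp the result to [0.0, 1.0]."""
        if not self.history:
            return 0.0
        predicted = self.history[-1] + self.trend()
        return min(1.0, max(0.0, predicted))
```

A level of 0.0 corresponds to "no power supplied" and intermediate values to the "to what extent" part of the abstract; a real controller would map these onto whatever clock or voltage states the hardware offers.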

    PROGRAMMABLE STREAMING PROCESSOR WITH MIXED PRECISION INSTRUCTION EXECUTION
    3.
    Invention patent application
    Legal status: In force

    Publication number: US20090265528A1

    Publication date: 2009-10-22

    Application number: US12106654

    Filing date: 2008-04-21

    IPC classification: G06F9/30

    CPC classification: G06T15/005 G06F8/47

    Abstract: The disclosure relates to a programmable streaming processor that is capable of executing mixed-precision (e.g., full-precision, half-precision) instructions using different execution units. The various execution units are each capable of using graphics data to execute instructions at a particular precision level. An exemplary programmable shader processor includes a controller and multiple execution units. The controller is configured to receive an instruction for execution and to receive an indication of a data precision for execution of the instruction. The controller is also configured to receive a separate conversion instruction that, when executed, converts graphics data associated with the instruction to the indicated data precision. When operable, the controller selects one of the execution units based on the indicated data precision. The controller then causes the selected execution unit to execute the instruction with the indicated data precision using the graphics data associated with the instruction.


    DEMAND BASED POWER CONTROL IN A GRAPHICS PROCESSING UNIT
    4.
    Invention patent application
    Legal status: In force

    Publication number: US20090096797A1

    Publication date: 2009-04-16

    Application number: US11870597

    Filing date: 2007-10-11

    IPC classification: G06F15/80

    Abstract: Disclosed herein is a power controller for use with a graphics processing unit. The power controller monitors, manages and controls power supplied to components of a pipeline of the graphics processing unit. The power controller determines whether and to what extent power is to be supplied to a pipeline component based on status information received by the power controller in connection with the pipeline component. The power controller is capable of identifying a trend using the received status information, and determining whether and to what extent power is to be supplied to a pipeline component based on the identified trend.


    Graphics processor with arithmetic and elementary function units
    5.
    Granted invention patent
    Legal status: In force

    Publication number: US08884972B2

    Publication date: 2014-11-11

    Application number: US11441696

    Filing date: 2006-05-25

    Abstract: A graphics processor capable of efficiently performing arithmetic operations and computing elementary functions is described. The graphics processor has at least one arithmetic logic unit (ALU) that can perform arithmetic operations and at least one elementary function unit that can compute elementary functions. The ALU(s) and elementary function unit(s) may be arranged such that they can operate in parallel to improve throughput. The graphics processor may also include fewer elementary function units than ALUs, e.g., four ALUs and a single elementary function unit. The four ALUs may perform an arithmetic operation on (1) four components of an attribute for one pixel or (2) one component of an attribute for four pixels. The single elementary function unit may operate on one component of one pixel at a time. The use of a single elementary function unit may reduce cost while still providing good performance.

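The two ALU operating modes named in this abstract differ only in how data is laid out across the four lanes. The Python sketch below illustrates that point under assumptions of our own: `quad_alu` and `elementary_unit` are hypothetical names, the four "ALUs" are modelled as four list lanes, and `math.sqrt` stands in for an arbitrary elementary function.

```python
import math
import operator


def quad_alu(op, xs, ys):
    """Apply one arithmetic op across four lanes, one lane per ALU."""
    if len(xs) != 4 or len(ys) != 4:
        raise ValueError("the quad ALU expects exactly four lanes")
    return [op(x, y) for x, y in zip(xs, ys)]


def elementary_unit(fn, component):
    """The single elementary function unit: one component per call."""
    return fn(component)


# Mode (1): four components (R, G, B, A) of an attribute for one pixel.
pixel_rgba = [0.5, 0.25, 0.0, 1.0]
offset_rgba = [0.25, 0.25, 0.5, 0.0]
mode1 = quad_alu(operator.add, pixel_rgba, offset_rgba)

# Mode (2): one component (here, red) of an attribute for four pixels.
red_of_four_pixels = [0.0, 0.25, 0.5, 0.75]
mode2 = quad_alu(operator.mul, red_of_four_pixels, [2.0, 2.0, 2.0, 2.0])

# The elementary function unit handles one component of one pixel at a time.
root = elementary_unit(math.sqrt, pixel_rgba[1])
```

The same four lanes serve both layouts; only the meaning of each lane changes, which is what lets a single set of ALUs cover both cases.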

    Graphics processing unit with deferred vertex shading
    6.
    Granted invention patent
    Legal status: In force

    Publication number: US08436854B2

    Publication date: 2013-05-07

    Application number: US12557427

    Filing date: 2009-09-10

    IPC classification: G06T15/40

    CPC classification: G06T15/40 G06T1/20 G06T15/005

    Abstract: Techniques are described for processing graphics images with a graphics processing unit (GPU) using deferred vertex shading. An example method includes the following: generating, within a processing pipeline of a graphics processing unit (GPU), vertex coordinates for vertices of each primitive within an image geometry, wherein the vertex coordinates comprise a location and a perspective parameter for each one of the vertices, and wherein the image geometry represents a graphics image; identifying, within the processing pipeline of the GPU, visible primitives within the image geometry based upon the vertex coordinates; and, responsive to identifying the visible primitives, generating, within the processing pipeline of the GPU, vertex attributes only for the vertices of the visible primitives in order to determine surface properties of the graphics image.

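The three steps of the example method (coordinates first, visibility second, attributes last) can be sketched as follows. This is a minimal illustration, not the patented pipeline: `deferred_shade`, `front_facing` and the shader callbacks are hypothetical, and a simple back-face test on the perspective-divided coordinates stands in for whatever visibility tests a real GPU applies.

```python
def front_facing(prim, coords):
    """Hypothetical visibility test: keep counter-clockwise triangles."""
    # Perspective divide: the location (x, y) scaled by the parameter w.
    (x0, y0), (x1, y1), (x2, y2) = [
        (coords[i][0] / coords[i][3], coords[i][1] / coords[i][3]) for i in prim
    ]
    signed_area = (x1 - x0) * (y2 - y0) - (x2 - x0) * (y1 - y0)
    return signed_area > 0


def deferred_shade(positions, primitives, position_shader, attribute_shader,
                   is_visible=front_facing):
    # Step 1: vertex coordinates (location + perspective parameter) for all.
    coords = [position_shader(p) for p in positions]
    # Step 2: identify visible primitives from the coordinates alone.
    visible = [prim for prim in primitives if is_visible(prim, coords)]
    # Step 3: generate attributes only for vertices of visible primitives.
    needed = sorted({i for prim in visible for i in prim})
    attributes = {i: attribute_shader(positions[i]) for i in needed}
    return visible, attributes
```

The saving comes from step 3: the (usually more expensive) attribute shader never runs for vertices that belong only to culled primitives.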

    3-D clipping in a graphics processing unit
    7.
    Granted invention patent
    Legal status: In force

    Publication number: US08212840B2

    Publication date: 2012-07-03

    Application number: US11551900

    Filing date: 2006-10-23

    IPC classification: G09G5/00

    Abstract: A graphics processing unit (GPU) efficiently performs 3-dimensional (3-D) clipping using processing units used for other graphics functions. The GPU includes first and second hardware units and at least one buffer. The first hardware unit performs 3-D clipping of primitives using a first processing unit used for a first graphics function, e.g., an ALU used for triangle setup, depth gradient setup, etc. The first hardware unit may perform 3-D clipping by (a) computing clip codes for each vertex of each primitive, (b) determining whether to pass, discard or clip each primitive based on the clip codes for all vertices of the primitive, and (c) clipping each primitive to be clipped against clipping planes. The second hardware unit computes attribute component values for new vertices resulting from the 3-D clipping, e.g., using an ALU used for attribute gradient setup, attribute interpolation, etc. The buffer(s) store intermediate results of the 3-D clipping.

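Steps (a) and (b) of the clipping procedure correspond to the classic clip-code (outcode) technique, sketched below in Python. The abstract does not define the clip volume, so the sketch assumes the common homogeneous convention -w <= x, y, z <= w; the function names and the bit assignment are hypothetical. Step (c), the actual clipping against planes, is not shown.

```python
from functools import reduce


def clip_code(vertex):
    """Step (a): one bit per clipping plane the vertex lies outside of."""
    x, y, z, w = vertex
    # Bit order (an arbitrary choice): left, right, bottom, top, near, far.
    outside = (x < -w, x > w, y < -w, y > w, z < -w, z > w)
    return sum(1 << bit for bit, is_out in enumerate(outside) if is_out)


def classify(primitive):
    """Step (b): decide pass, discard or clip from all vertex clip codes."""
    codes = [clip_code(v) for v in primitive]
    if all(code == 0 for code in codes):
        return "pass"      # every vertex is inside the clip volume
    if reduce(lambda a, b: a & b, codes) != 0:
        return "discard"   # all vertices are outside the same plane
    return "clip"          # the primitive may straddle a plane
```

The bitwise AND is what makes the discard test cheap: a shared set bit proves the whole primitive lies beyond one plane, without any intersection arithmetic.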

    Efficient 2-D and 3-D graphics processing
    8.
    Granted invention patent
    Legal status: In force

    Publication number: US08203564B2

    Publication date: 2012-06-19

    Application number: US11675662

    Filing date: 2007-02-16

    IPC classification: G06T1/20 G06T1/00 G06T15/40

    Abstract: Techniques for supporting both 2-D and 3-D graphics are described. A graphics processing unit (GPU) may perform 3-D graphics processing in accordance with a 3-D graphics pipeline to render 3-D images and may also perform 2-D graphics processing in accordance with a 2-D graphics pipeline to render 2-D images. Each stage of the 2-D graphics pipeline may be mapped to at least one stage of the 3-D graphics pipeline. For example, a clipping, masking and scissoring stage in 2-D graphics may be mapped to a depth test stage in 3-D graphics. Coverage values for pixels within paths in 2-D graphics may be determined using rasterization and depth test stages in 3-D graphics. A paint generation stage and an image interpolation stage in 2-D graphics may be mapped to a fragment shader stage in 3-D graphics. A blending stage in 2-D graphics may be mapped to a blending stage in 3-D graphics.


    Graphics processing unit with shared arithmetic logic unit
    9.
    Granted invention patent
    Legal status: In force

    Publication number: US08009172B2

    Publication date: 2011-08-30

    Application number: US11550344

    Filing date: 2006-10-17

    IPC classification: G06T1/20

    CPC classification: G06T15/005

    Abstract: This disclosure describes a graphics processing unit (GPU) pipeline that uses one or more shared arithmetic logic units (ALUs). In order to facilitate such sharing of ALUs, the stages of the disclosed GPU pipeline may be rearranged relative to conventional GPU pipelines. In addition, by rearranging the stages of the GPU pipeline, efficiencies may be achieved in the image processing. Unlike conventional GPU pipelines, for example, an attribute gradient setup stage can be located much later in the pipeline, and the attribute interpolator stage may immediately follow the attribute gradient setup stage. This allows sharing of an ALU by the attribute gradient setup and attribute interpolator stages. Several other techniques and features for the GPU pipeline are also described, which may improve performance and possibly achieve additional processing efficiencies.


    PROGRAMMABLE BLENDING IN A GRAPHICS PROCESSING UNIT
    10.
    Invention patent application
    Legal status: In force

    Publication number: US20080094410A1

    Publication date: 2008-04-24

    Application number: US11550958

    Filing date: 2006-10-19

    IPC classification: G09G5/02

    CPC classification: G06T15/503 G06T2210/32

    Abstract: Techniques for implementing blending equations for various blending modes with a base set of operations are described. Each blending equation may be decomposed into a sequence of operations. In one design, a device includes a processing unit that implements a set of operations for multiple blending modes and a storage unit that stores operands and results. The processing unit receives a sequence of instructions for a sequence of operations for a blending mode selected from the plurality of blending modes and executes each instruction in the sequence to perform blending in accordance with the selected blending mode. The processing unit may include (a) an ALU that performs at least one operation in the base set, e.g., a dot product, (b) a pre-formatting unit that performs gamma correction and alpha scaling of inbound color values, and (c) a post-formatting unit that performs gamma compression and alpha scaling of outbound color values.

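To make "decomposed into a sequence of operations" concrete, here is a small Python sketch of one blending mode expressed as a two-instruction program. It is an assumption-laden illustration rather than the design in the application: the opcodes `SUB` and `DP2`, the register names and `run_blend` are all hypothetical, and the mode shown is the familiar source-over equation out = src_c * src_a + dst_c * (1 - src_a) for a single color channel.

```python
def run_blend(program, registers):
    """Execute a sequence of base operations over a register file."""
    regs = dict(registers)  # the storage unit holding operands and results
    for op, dst, *srcs in program:
        if op == "SUB":
            regs[dst] = regs[srcs[0]] - regs[srcs[1]]
        elif op == "DP2":
            # Two-term dot product: a0 * b0 + a1 * b1.
            a0, b0, a1, b1 = (regs[s] for s in srcs)
            regs[dst] = a0 * b0 + a1 * b1
        else:
            raise ValueError(f"unknown op: {op}")
    return regs


# Source-over blending, decomposed into two base operations.
SRC_OVER = [
    ("SUB", "inv_a", "one", "src_a"),                    # inv_a = 1 - src_a
    ("DP2", "out", "src_c", "src_a", "dst_c", "inv_a"),  # weighted sum
]

result = run_blend(
    SRC_OVER,
    {"one": 1.0, "src_c": 0.5, "src_a": 0.5, "dst_c": 1.0},
)
```

Supporting another blending mode then means supplying a different instruction sequence, not new hardware operations, which is the point of keeping the base set small.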