GRAPHICS PROCESSING SYSTEM
    1.
    发明申请
    GRAPHICS PROCESSING SYSTEM 有权
    图形处理系统

    公开(公告)号:US20160267709A1

    公开(公告)日:2016-09-15

    申请号:US15161339

    申请日:2016-05-23

    Applicant: ARM Limited

    Abstract: When rendering a scene that includes a complex object made up of many individual primitives, rather than processing each primitive making up the object in turn, a bounding volume which surrounds the complex object is generated and the scene is then processed using the bounding volume in place of the actual primitives making up the complex object. If it is determined that the bounding volume representation of the object will be completely occluded in the scene (e.g. by a foreground object), then the individual primitives making up the complex object are not processed. This can save significantly on processing time and resources for the scene.

    Abstract translation: 当渲染包括由许多单独的基元组成的复杂对象的场景时,而不是依次对每个组成对象的每个基元进行处理,生成围绕复杂对象的边界体积,然后使用边界体积来处理场景 构成复杂对象的实际原语。 如果确定对象的边界体积表示将被完全遮蔽在场景中(例如,通过前景对象),则构成复杂对象的各个基元不被处理。 这可以显着节省场景的处理时间和资源。

    REDUCING ENERGY AND INCREASING SPEED BY AN INSTRUCTION SUBSTITUTING SUBSEQUENT INSTRUCTIONS WITH SPECIFIC FUNCTION INSTRUCTION
    2.
    发明申请
    REDUCING ENERGY AND INCREASING SPEED BY AN INSTRUCTION SUBSTITUTING SUBSEQUENT INSTRUCTIONS WITH SPECIFIC FUNCTION INSTRUCTION 有权
    指导性降低能源和增加速度取代具体功能指导的后续指令

    公开(公告)号:US20140176584A1

    公开(公告)日:2014-06-26

    申请号:US14190131

    申请日:2014-02-26

    Applicant: ARM Limited

    Inventor: Jorn NYSTAD

    Abstract: A data processing system is used to evaluate a data processing function by executing a sequence of program instructions including an intermediate value generating instruction Inst0 and an intermediate value consuming instruction Inst1. In dependence upon one or more input operands to the evaluation, an embedded opcode within the intermediate value passed between the intermediate value generating instruction and the intermediate value consuming instruction may be set to have a value indicating that a substitute instruction should be used in place of the intermediate value consuming instruction. The instructions may be floating point instructions, such as a floating point power instruction evaluating the data processing function ab.

    Abstract translation: 数据处理系统用于通过执行包括中间值产生指令Inst0和中间值消耗指令Inst1的程序指令序列来评估数据处理功能。 根据评估的一个或多个输入操作数,在中间值生成指令和中间值消耗指令之间传递的中间值内的嵌入式操作码可以被设置为具有指示替代指令应被代替的值 中间值消耗指令。 指令可以是浮点指令,例如评估数据处理功能ab的浮点功率指令。

    DATA PROCESSING APPARATUS AND METHOD FOR PROCESSING A RECEIVED WORKLOAD IN ORDER TO GENERATE RESULT DATA
    3.
    发明申请
    DATA PROCESSING APPARATUS AND METHOD FOR PROCESSING A RECEIVED WORKLOAD IN ORDER TO GENERATE RESULT DATA 有权
    数据处理设备和用于处理接收到的工作负载以生成结果数据的方法

    公开(公告)号:US20130332939A1

    公开(公告)日:2013-12-12

    申请号:US13909149

    申请日:2013-06-04

    Applicant: ARM Limited

    Abstract: A data processing apparatus and method are provided for processing a received workload in order to generate result data. A thread group generator generates from the received workload a plurality of thread groups to be executed to process the received workload. Each thread group consists of a plurality of threads, and at least one thread group has an inter-thread dependency existing between the plurality of threads. Each thread may be either an active thread whose output is required to form the result data, or a dummy thread required to resolve the inter-thread dependency for one of the active threads but whose output is not required to form the result data. The thread group generator identifies for each thread group any dummy thread within that thread group. A thread execution unit then executes each thread within a thread group received from the thread group generator by executing a predetermined program comprising a plurality of program instructions. Execution flow modification circuitry is responsive to the received thread group having at least one dummy thread, to cause the thread execution unit to selectively omit at least part of the execution of at least one of the plurality of instructions when executing each dummy thread, in dependence on control information associated with the predetermined program. In one particular embodiment the received workload is a graphics rendering workload and the thread execution unit performs graphics rendering operations in order to generate as the result data pixel values and associated control values. Such an approach can yield significant improvements in performance, as well as reducing power consumption.

    Abstract translation: 提供了一种数据处理装置和方法,用于处理所接收的工作负载以产生结果数据。 线程组生成器从接收到的工作负载生成要执行的多个线程组以处理所接收的工作负载。 每个线程组由多个线程组成,并且至少一个线程组具有存在于多个线程之间的线间依存关系。 每个线程可以是要求其输出来形成结果数据的活动线程,也可以是解决对其中一个活动线程但不需要输出结果数据的线程间依赖性所需的虚拟线程。 线程组生成器为每个线程组标识该线程组中的任何虚拟线程。 线程执行单元然后通过执行包括多个程序指令的预定程序来执行从线程组生成器接收的线程组内的每个线程。 执行流修改电路响应于具有至少一个虚拟线程的所接收的线程组,以使得线程执行单元在执行每个虚拟线程时有选择地省略至少一部分执行多条指令,依赖 关于与预定程序相关联的控制信息。 在一个特定实施例中,所接收的工作负载是图形渲染工作负载,并且线程执行单元执行图形绘制操作,以便生成结果数据像素值和相关联的控制值。 这种方法可以显着提高性能,同时降低功耗。

    PROCESSING ORDER WITH INTEGER INPUTS AND FLOATING POINT INPUTS
    4.
    发明申请
    PROCESSING ORDER WITH INTEGER INPUTS AND FLOATING POINT INPUTS 有权
    具有整数输入和浮点输入的处理订单

    公开(公告)号:US20140225907A1

    公开(公告)日:2014-08-14

    申请号:US14257090

    申请日:2014-04-21

    Applicant: ARM Limited

    CPC classification number: G06F12/0802 G06T1/20 G09G5/39

    Abstract: A graphics processing unit 2 includes a texture pipeline 6 which performs filter operations upon texture values. If the texture values are integer texture values, then they may be processed by the texture pipeline in a variable order corresponding to the order in which they are retrieved from a memory 4. If the texture values are floating point texture values, then they are processed in a fixed order in order to ensure result invariants as the filter operation is non-associative for floating point values. The filter operation is not commenced until all of the floating point texture values have been retrieved from the memory 4 and other available for processing.

    Abstract translation: 图形处理单元2包括对纹理值执行滤波操作的纹理管线6。 如果纹理值是整数纹理值,则它们可以由纹理流水线以与从存储器4检索的顺序相对应的可变顺序来处理。如果纹理值是浮点纹理值,则它们被处理 以固定顺序,以确保结果不变量,因为过滤器操作对于浮点值是非关联的。 过滤器操作不会开始,直到从存储器4检索到所有浮点纹理值,并且可以处理的其他值。

    GRAPHICS PROCESSING SYSTEM
    5.
    发明申请

    公开(公告)号:US20180197326A1

    公开(公告)日:2018-07-12

    申请号:US15914255

    申请日:2018-03-07

    Applicant: ARM Limited

    Abstract: When rendering a scene that includes a complex object made up of many individual primitives, rather than processing each primitive making up the object in turn, a bounding volume which surrounds the complex object is generated and the scene is then processed using the bounding volume in place of the actual primitives making up the complex object. If it is determined that the bounding volume representation of the object will be completely occluded in the scene (e.g. by a foreground object), then the individual primitives making up the complex object are not processed. This can save significantly on processing time and resources for the scene.

    NUMBER FORMAT PRE-CONVERSION INSTRUCTIONS
    6.
    发明申请
    NUMBER FORMAT PRE-CONVERSION INSTRUCTIONS 审中-公开
    NUMBER格式转换指令

    公开(公告)号:US20150120795A1

    公开(公告)日:2015-04-30

    申请号:US14584237

    申请日:2014-12-29

    Applicant: ARM Limited

    Abstract: Apparatus for processing data includes processing circuitry 16, 18, 20, 22, 24, 26 and decoder circuitry 14 for decoding program instructions. The program instructions decoded include a floating point pre-conversion instruction which performs round-to-nearest ties to even rounding upon the mantissa field of an input floating number to generate an output floating point number with the same mantissa length but with the mantissa rounded to a position corresponding to a shorter mantissa field. The output mantissa field includes a suffix of zero values concatenated the rounded value. The decoder for circuitry 14 is also responsive to an integer pre-conversion instruction to quantise and input integer value using round-to-nearest ties to even rounding to form an output integer operand with a number of significant bits matched to the mantissa size of a floating point number to which the integer is later to be converted using an integer-to-floating point conversion instruction.

    Abstract translation: 用于处理数据的装置包括用于解码程序指令的处理电路16,18,20,22,24,26和解码器电路14。 所解码的程序指令包括一个浮点预转换指令,其执行圆到最近的连接,以便在输入浮点数的尾数字上偶数舍入以产生具有相同尾数长度但尾数四舍五入的输出浮点数 对应于较短的尾数字段的位置。 输出尾数字段包括将值的后缀连接在舍入值上。 用于电路14的解码器还响应于整数预转换指令,以使用向量到最近的系数对偶数四舍五入进行定量和输入整数值,以形成输出整数运算数,该输出整数操作数的数目与 使用整数到浮点转换指令后续整数的浮点数。

    INTERMEDIATE VALUE STORAGE WITHIN A GRAPHICS PROCESSING APPARATUS
    7.
    发明申请
    INTERMEDIATE VALUE STORAGE WITHIN A GRAPHICS PROCESSING APPARATUS 有权
    在图形处理设备中的中间价值存储

    公开(公告)号:US20130155103A1

    公开(公告)日:2013-06-20

    申请号:US13658997

    申请日:2012-10-24

    Applicant: ARM LIMITED

    Abstract: A tile-based graphics processor includes tile processing circuitry that has both a tile buffer and a per-pixel general purpose data store. The per-pixel general purpose data store is read accessible and write accessible by the tile processing circuitry to store intermediate values. These intermediate values are generated by the tile processing circuitry and then consumed by the tile processing circuitry to generate the output values for the tile being processed.

    Abstract translation: 基于瓦片的图形处理器包括具有瓦片缓冲器和每像素通用数据存储器的瓦片处理电路。 每像素通用数据存储器被读取可访问并且可由片处理电路写入以存储中间值。 这些中间值由瓦片处理电路产生,然后由瓦片处理电路消耗以产生正被处理的瓦片的输出值。

Patent Agency Ranking