Per-instance preamble for graphics processing

    Publication No.: US09799094B1

    Publication date: 2017-10-24

    Application No.: US15162198

    Filing date: 2016-05-23

    Abstract: A method for processing data in a graphics processing unit (GPU), including: receiving an instance identifier for an instance and a shader program comprising a preamble code block and a main shader code block; assigning the instance identifier to a general purpose register at wave creation; allocating address space within the constant memory for instance uniforms; determining that the preamble code block has not been executed and that the wave is a first wave of the instance to be executed; based on that determination, executing the preamble code block to store the plurality of instance uniforms in the constant memory; and, based at least in part on executing the preamble code block, executing the wave of the plurality of waves using at least one of the plurality of instance constants stored in constant memory.
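
    The flow in this abstract can be illustrated with a small host-side simulation. This is only a sketch under stated assumptions, not the patented implementation: the names `Instance`, `ConstantMemory`, `preamble`, `main_shader`, and `run_wave` are hypothetical, and the "uniforms" computed by the preamble are invented for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Instance:
    instance_id: int
    preamble_done: bool = False  # has the preamble code block run yet?


@dataclass
class ConstantMemory:
    # Hypothetical layout: one slot of instance uniforms per instance id.
    slots: dict = field(default_factory=dict)


def preamble(instance_id):
    # Hypothetical preamble: compute per-instance uniforms once.
    return {"offset": instance_id * 10}


def main_shader(lane, uniforms):
    # Hypothetical main shader body: uses a uniform from constant memory.
    return lane + uniforms["offset"]


def run_wave(instance, wave_index, lanes, cmem, log):
    gpr_instance_id = instance.instance_id  # stands in for the GPR set at wave creation
    # Run the preamble only if it has not run and this is the first wave.
    if not instance.preamble_done and wave_index == 0:
        cmem.slots[gpr_instance_id] = preamble(gpr_instance_id)
        instance.preamble_done = True
        log.append(("preamble", gpr_instance_id))
    uniforms = cmem.slots[gpr_instance_id]
    return [main_shader(lane, uniforms) for lane in lanes]


log = []
cmem = ConstantMemory()
inst = Instance(instance_id=3)
results = [run_wave(inst, w, range(w * 4, w * 4 + 4), cmem, log) for w in range(2)]
```

    In this sketch both waves read the same uniforms from constant memory, but the preamble is logged only once, for the first wave.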

    Conditional page fault control for page residency
    35.
    Patent type: Granted patent
    Legal status: In force

    Publication No.: US09489313B2

    Publication date: 2016-11-08

    Application No.: US14035643

    Filing date: 2013-09-24

    CPC classification number: G06F12/1009 G06F12/08 G06F2212/302 G06T1/60

    Abstract: The present disclosure provides systems and methods for processing a non-resident page. One example includes attempting to access the non-resident page, where an address for the non-resident page points to a memory page containing default values; determining, based on an indicator that a particular non-resident page should not generate a page fault, that the non-resident page should not cause a page fault; and, when the access is a read and the non-resident page should not cause a page fault, returning an indication that the memory read did not translate and returning the default value. Another example may discontinue a write when the access of the non-resident page is a write and the non-resident page should not cause a page fault.
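
    The read and write behavior described in this abstract can be modeled in a few lines. This is a minimal sketch, assuming a per-page "no fault" flag and a single default value; the class `Memory`, the exception `PageFault`, and the constant `DEFAULT_VALUE` are hypothetical names, not taken from the patent.

```python
DEFAULT_VALUE = 0  # assumed content of the page of default values


class PageFault(Exception):
    """Raised when a non-resident page is accessed and faulting is allowed."""


class Memory:
    def __init__(self):
        self.resident = {}     # page number -> {offset: value}
        self.no_fault = set()  # non-resident pages flagged not to fault

    def read(self, page, offset):
        """Return (value, translated)."""
        if page in self.resident:
            return self.resident[page].get(offset, DEFAULT_VALUE), True
        if page in self.no_fault:
            # Default value plus an indication that the read did not translate.
            return DEFAULT_VALUE, False
        raise PageFault(page)

    def write(self, page, offset, value):
        """Return True if the write landed, False if it was discontinued."""
        if page in self.resident:
            self.resident[page][offset] = value
            return True
        if page in self.no_fault:
            return False  # write is dropped instead of faulting
        raise PageFault(page)


mem = Memory()
mem.resident[1] = {0: 42}
mem.no_fault.add(2)

resident_read = mem.read(1, 0)
sparse_read = mem.read(2, 0)
sparse_write = mem.write(2, 0, 7)
```

    A page that is neither resident nor flagged still raises `PageFault` in this model, which corresponds to the ordinary faulting path.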

    EMULATION OF FUSED MULTIPLY-ADD OPERATIONS
    36.
    Patent type: Patent application
    Legal status: In force

    Publication No.: US20160048374A1

    Publication date: 2016-02-18

    Application No.: US14461890

    Filing date: 2014-08-18

    CPC classification number: G06F7/5443 G06F5/01 G06F7/483 G06F7/57

    Abstract: At least one processor may emulate a fused multiply-add operation for a first operand, a second operand, and a third operand. The at least one processor may: determine an intermediate value based at least in part on multiplying the first operand with the second operand; determine at least one of an upper intermediate value or a lower intermediate value, wherein determining the upper intermediate value comprises rounding the intermediate value towards zero by a specified number of bits, and wherein determining the lower intermediate value comprises subtracting the upper intermediate value from the intermediate value; determine an upper value and a lower value based at least in part on adding the third operand to, or subtracting it from, one of the upper intermediate value or the lower intermediate value; and determine an emulated fused multiply-add result by adding the upper value and the lower value.
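
    The upper/lower split can be sketched for float32 operands held in Python's float64. This is an illustration under assumptions, not the patented method: it assumes the inputs are exactly representable in float32 (so their float64 product is exact), keeps 24 significant bits in the upper part, adds the third operand to the upper part only, and does not claim correct rounding in every case. The helper names `to_f32`, `truncate_toward_zero`, `emulated_fma_f32`, and `naive_f32` are hypothetical.

```python
import math
import struct


def to_f32(x):
    # Round a Python float (float64) to the nearest float32 value.
    return struct.unpack("f", struct.pack("f", x))[0]


def truncate_toward_zero(x, keep_bits):
    # Keep only the top `keep_bits` significant bits of x, rounding toward zero.
    if x == 0.0:
        return 0.0
    m, e = math.frexp(x)  # x == m * 2**e with 0.5 <= |m| < 1
    kept = math.trunc(math.ldexp(m, keep_bits))
    return math.ldexp(float(kept), e - keep_bits)


def emulated_fma_f32(a, b, c):
    p = a * b                              # intermediate value (exact for float32 inputs)
    upper = truncate_toward_zero(p, 24)    # upper intermediate value
    lower = p - upper                      # lower intermediate value (exact)
    hi = upper + c                         # upper value: third operand added here
    lo = lower                             # lower value
    return to_f32(hi + lo)


def naive_f32(a, b, c):
    # Separate multiply and add, each rounded to float32.
    return to_f32(to_f32(a * b) + c)


a = b = 1.0 + 2.0 ** -12
c = -(1.0 + 2.0 ** -11)
fused = emulated_fma_f32(a, b, c)
unfused = naive_f32(a, b, c)
```

    With these inputs the separately rounded multiply loses the low-order term of the product, so `unfused` is zero, while the split keeps that term in `lower` and `fused` recovers it.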

    On-Demand Shareability Conversion In A Heterogeneous Shared Virtual Memory
    37.
    Patent type: Patent application
    Legal status: Under examination (published)

    Publication No.: US20160019168A1

    Publication date: 2016-01-21

    Application No.: US14510804

    Filing date: 2014-10-09

    Abstract: The aspects include systems and methods of managing virtual memory page shareability. A processor or memory management unit may set in a page table an indication that a virtual memory page is not shareable with an outer domain processor. The processor or memory management unit may monitor for when the outer domain processor attempts or has attempted to access the virtual memory page. In response to the outer domain processor attempting to access the virtual memory page, the processor may perform a virtual memory page operation on the virtual memory page.
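
    The monitoring step can be pictured with a toy page table. This is a sketch only: the abstract does not say what the "virtual memory page operation" is, so the code assumes it converts the page to shareable on first outer-domain access. `PageTable`, `map_page`, and `access` are hypothetical names.

```python
class PageTable:
    def __init__(self):
        self.shareable = {}    # virtual page -> shareable with outer domain?
        self.conversions = []  # pages converted on demand, in order

    def map_page(self, vpage):
        # New pages start out marked as not shareable with the outer domain.
        self.shareable[vpage] = False

    def access(self, vpage, from_outer_domain):
        # Monitor for outer-domain accesses to non-shareable pages.
        if from_outer_domain and not self.shareable[vpage]:
            # Assumed page operation: convert the page to shareable.
            self.shareable[vpage] = True
            self.conversions.append(vpage)
        return self.shareable[vpage]


pt = PageTable()
pt.map_page(0x10)
pt.map_page(0x20)

inner = pt.access(0x10, from_outer_domain=False)  # stays non-shareable
outer = pt.access(0x20, from_outer_domain=True)   # converted on demand
again = pt.access(0x20, from_outer_domain=True)   # already shareable
```

    Only pages that the outer domain actually touches pay the conversion cost in this model, which is the point of doing it on demand.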

    VECTOR SCALING INSTRUCTIONS FOR USE IN AN ARITHMETIC LOGIC UNIT
    38.
    Patent type: Patent application
    Legal status: Under examination (published)

    Publication No.: US20160019027A1

    Publication date: 2016-01-21

    Application No.: US14331991

    Filing date: 2014-07-15

    Abstract: At least one processor may receive components of a vector, wherein each of the components of the vector comprises at least an exponent. The at least one processor may further determine a maximum exponent out of the respective exponents of the components of the vector, and may determine a scaling value based at least in part on the maximum exponent. An arithmetic logic unit of the at least one processor may scale the vector by subtracting the scaling value from each of the respective exponents of the components of the vector.
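
    The exponent-based scaling can be tried out in software with `math.frexp` and `math.ldexp`. This is a rough sketch, assuming the scaling value is simply the maximum exponent (the abstract only says it is based on it); `scale_vector` is a hypothetical helper, and the overflow-safe norm is just one possible use.

```python
import math


def scale_vector(components):
    """Return (scaled_components, scaling_value)."""
    # Binary exponents of the non-zero components (x == m * 2**e, 0.5 <= |m| < 1).
    exponents = [math.frexp(x)[1] for x in components if x != 0.0]
    if not exponents:
        return list(components), 0
    scaling_value = max(exponents)  # assumption: scaling value == maximum exponent
    # Subtracting the scaling value from each exponent is a multiply by 2**-scaling_value.
    scaled = [math.ldexp(x, -scaling_value) for x in components]
    return scaled, scaling_value


small_scaled, small_shift = scale_vector([8.0, 2.0, 0.5])

# Usage: a vector norm whose naive sum of squares would overflow float64.
big = [3e200, 4e200]
scaled, shift = scale_vector(big)
norm = math.ldexp(math.sqrt(sum(x * x for x in scaled)), shift)
naive_sum = sum(x * x for x in big)
```

    Because only exponents change, the scaling itself introduces no rounding error as long as no component underflows.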

    LOAD SCHEME FOR SHARED REGISTER IN GPU
    39.
    Patent type: Patent application
    Legal status: In force

    Publication No.: US20150379680A1

    Publication date: 2015-12-31

    Application No.: US14316391

    Filing date: 2014-06-26

    CPC classification number: G06T1/60 G06T15/80 G09G5/363 G09G2352/00 G09G2360/06

    Abstract: Techniques are described for determining whether the data of a variable is the same for each of a plurality of graphics items. If the data is determined to be the same, the techniques store the data in a storage location of a specialized shared general purpose register that is associated with the variable.
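
    The decision described here can be sketched as a uniformity check before a load. This is an illustration only: `load_variable`, the dictionary standing in for the shared general purpose register, and the per-item fallback are all assumptions, since the abstract covers only the case where the data is the same.

```python
def load_variable(name, values_per_item, shared_gpr, per_item_gprs):
    """Store a variable once if every graphics item holds the same data."""
    first = values_per_item[0]
    if all(v == first for v in values_per_item):
        # Same data for every item: one copy in the shared register slot
        # associated with this variable.
        shared_gpr[name] = first
        return "shared"
    # Assumed fallback (not in the abstract): one copy per graphics item.
    per_item_gprs[name] = list(values_per_item)
    return "per_item"


shared_gpr = {}
per_item_gprs = {}

uniform_kind = load_variable("light_dir", [0.5, 0.5, 0.5, 0.5], shared_gpr, per_item_gprs)
varying_kind = load_variable("uv", [0.0, 0.25, 0.5, 0.75], shared_gpr, per_item_gprs)
```

    In this model a uniform variable occupies one slot instead of one per graphics item, which is the saving the shared register is meant to provide.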
