Processor and system using a mask register to track progress of gathering and prefetching elements from memory
    4.
    发明授权
    Processor and system using a mask register to track progress of gathering and prefetching elements from memory 有权
    处理器和系统使用掩码寄存器跟踪从内存中采集和预取元素的进度

    公开(公告)号:US08892848B2

    公开(公告)日:2014-11-18

    申请号:US13175953

    申请日:2011-07-05

    摘要: A device, system and method for assigning values to elements in a first register, where each data field in a first register corresponds to a data element to be written into a second register, and where for each data field in the first register, a first value may indicate that the corresponding data element has not been written into the second register and a second value indicates that the corresponding data element has been written into the second register, reading the values of each of the data fields in the first register, and for each data field in the first register having the first value, gathering the corresponding data element and writing the corresponding data element into the second register, and changing the value of the data field in the first register from the first value to the second value. Other embodiments are described and claimed.

    摘要翻译: 一种用于将值分配给第一寄存器中的元件的装置,系统和方法,其中第一寄存器中的每个数据字段对应于要写入第二寄存器的数据元素,并且对于第一寄存器中的每个数据字段, 值可以指示相应的数据元素尚未被写入第二寄存器,第二值指示对应的数据元素已被写入第二寄存器,读取第一寄存器中每个数据域的值,并且为 第一寄存器中的每个数据字段具有第一值,收集对应的数据元素并将相应的数据元素写入第二寄存器,并将第一寄存器中的数据字段的值从第一值改变为第二值。 描述和要求保护其他实施例。

    Method and apparatus for prefetching data to a lower level cache memory
    8.
    发明申请
    Method and apparatus for prefetching data to a lower level cache memory 有权
    用于将数据预取到较低级高速缓冲存储器的方法和装置

    公开(公告)号:US20060047915A1

    公开(公告)日:2006-03-02

    申请号:US10933188

    申请日:2004-09-01

    IPC分类号: G06F12/00

    摘要: A prefetching scheme to detect when a load misses the lower level cache and hits the next level cache. Consequently, the prefetching scheme utilizes the previous information for the cache miss to the lower level cache and hit to the next higher level of cache memory that may result in initiating a sidedoor prefetch load for fetching the previous or next cache line into the lower level cache. In order to generate an address for the sidedoor prefetch, a history of cache access is maintained in a queue.

    摘要翻译: 一种预取方案,用于检测何时加载错过较低级别的缓存,并触发下一级缓存。 因此,预取方案利用先前的信息将高速缓存未命中用于较低级别的高速缓存并且命中到下一个较高级别的高速缓冲存储器,这可能导致发起侧面预取负载,以将先前或下一个高速缓存行提取到下一级高速缓存 。 为了生成二进制预取的地址,在队列中保持高速缓存访​​问的历史。

    Mechanism for effectively handling texture sampling
    9.
    发明授权
    Mechanism for effectively handling texture sampling 有权
    有效处理纹理采样的机制

    公开(公告)号:US08933946B2

    公开(公告)日:2015-01-13

    申请号:US11967408

    申请日:2007-12-31

    申请人: Eric Sprangle

    发明人: Eric Sprangle

    摘要: A method and apparatus for efficiently handling texture sampling is described herein. A compiler or other software is capable of breaking a texture sampling operation for a pixel into a pre-fetch operation and a use operation. A processing element, in response to executing the pre-fetch operation, delegates computation of the texture sample of the pixel to a hardware texture sample unit. In parallel to the hardware texture sample unit performing a texture sample for the pixel and providing the result, i.e. a textured pixel (texel), to a destination address, the processing element is capable of executing other independent code. After an amount of time, the processing element executes the use operation, such as a load operation to load the texel from the destination address.

    摘要翻译: 本文描述了一种用于有效处理纹理采样的方法和装置。 编译器或其他软件能够将像素的纹理采样操作分解成预取操作和使用操作。 响应于执行预取操作,处理元件将像素的纹理样本的计算委托给硬件纹理采样单元。 与硬件纹理采样单元平行地执行像素的纹理样本并将结果(即,纹理像素(纹素))提供给目的地址,处理元件能够执行其他独立代码。 在一段时间之后,处理元件执行使用操作,例如从目的地地址加载纹素的加载操作。