Method and apparatus for prefetching data to a lower level cache memory
    1.
    发明申请
    Method and apparatus for prefetching data to a lower level cache memory 有权
    用于将数据预取到较低级高速缓冲存储器的方法和装置

    公开(公告)号:US20060047915A1

    公开(公告)日:2006-03-02

    申请号:US10933188

    申请日:2004-09-01

    IPC分类号: G06F12/00

    摘要: A prefetching scheme to detect when a load misses the lower level cache and hits the next level cache. Consequently, the prefetching scheme utilizes the previous information for the cache miss to the lower level cache and hit to the next higher level of cache memory that may result in initiating a sidedoor prefetch load for fetching the previous or next cache line into the lower level cache. In order to generate an address for the sidedoor prefetch, a history of cache access is maintained in a queue.

    摘要翻译: 一种预取方案,用于检测何时加载错过较低级别的缓存,并触发下一级缓存。 因此,预取方案利用先前的信息将高速缓存未命中用于较低级别的高速缓存并且命中到下一个较高级别的高速缓冲存储器,这可能导致发起侧面预取负载,以将先前或下一个高速缓存行提取到下一级高速缓存 。 为了生成二进制预取的地址,在队列中保持高速缓存访​​问的历史。

    Method and apparatus for prefetching data to a lower level cache memory
    2.
    发明授权
    Method and apparatus for prefetching data to a lower level cache memory 有权
    用于将数据预取到较低级高速缓冲存储器的方法和装置

    公开(公告)号:US07383418B2

    公开(公告)日:2008-06-03

    申请号:US10933188

    申请日:2004-09-01

    IPC分类号: G06F12/00 G06F9/34 G06F9/26

    摘要: A prefetching scheme to detect when a load misses the lower level cache and hits the next level cache. Consequently, the prefetching scheme utilizes the previous information for the cache miss to the lower level cache and hit to the next higher level of cache memory that may result in initiating a sidedoor prefetch load for fetching the previous or next cache line into the lower level cache. In order to generate an address for the sidedoor prefetch, a history of cache access is maintained in a queue.

    摘要翻译: 一种预取方案,用于检测何时加载错过较低级别的缓存,并触发下一级缓存。 因此,预取方案利用先前的信息将高速缓存未命中用于较低级别的高速缓存并且命中到下一个较高级别的高速缓冲存储器,这可能导致发起侧面预取负载,以将先前或下一个高速缓存行提取到下一级高速缓存 。 为了生成二进制预取的地址,在队列中保持高速缓存访​​问的历史。

    Processor and system using a mask register to track progress of gathering and prefetching elements from memory
    3.
    发明授权
    Processor and system using a mask register to track progress of gathering and prefetching elements from memory 有权
    处理器和系统使用掩码寄存器跟踪从内存中采集和预取元素的进度

    公开(公告)号:US08892848B2

    公开(公告)日:2014-11-18

    申请号:US13175953

    申请日:2011-07-05

    摘要: A device, system and method for assigning values to elements in a first register, where each data field in a first register corresponds to a data element to be written into a second register, and where for each data field in the first register, a first value may indicate that the corresponding data element has not been written into the second register and a second value indicates that the corresponding data element has been written into the second register, reading the values of each of the data fields in the first register, and for each data field in the first register having the first value, gathering the corresponding data element and writing the corresponding data element into the second register, and changing the value of the data field in the first register from the first value to the second value. Other embodiments are described and claimed.

    摘要翻译: 一种用于将值分配给第一寄存器中的元件的装置,系统和方法,其中第一寄存器中的每个数据字段对应于要写入第二寄存器的数据元素,并且对于第一寄存器中的每个数据字段, 值可以指示相应的数据元素尚未被写入第二寄存器,第二值指示对应的数据元素已被写入第二寄存器,读取第一寄存器中每个数据域的值,并且为 第一寄存器中的每个数据字段具有第一值,收集对应的数据元素并将相应的数据元素写入第二寄存器,并将第一寄存器中的数据字段的值从第一值改变为第二值。 描述和要求保护其他实施例。

    MECHANISM FOR EFFECTIVELY CACHING STREAMING AND NON-STREAMING DATA PATTERNS
    4.
    发明申请
    MECHANISM FOR EFFECTIVELY CACHING STREAMING AND NON-STREAMING DATA PATTERNS 有权
    有效的高速缓存和非流动数据模式的机制

    公开(公告)号:US20110099333A1

    公开(公告)日:2011-04-28

    申请号:US12908183

    申请日:2010-10-20

    IPC分类号: G06F12/12

    摘要: A method and apparatus for efficiently caching streaming and non-streaming data is described herein. Software, such as a compiler, identifies last use streaming instructions/operations that are the last instruction/operation to access streaming data for a number of instructions or an amount of time. As a result of performing an access to a cache line for a last use instruction/operation, the cache line is updated to a streaming data no longer needed (SDN) state. When control logic is to determine a cache line to be replaced, a modified Least Recently Used (LRU) algorithm is biased to select SDN state lines first to replace no longer needed streaming data.

    摘要翻译: 本文描述了用于有效地高速缓存流和非流数据的方法和装置。 诸如编译器的软件识别最后使用的流指令/操作,这些指令/操作是用于访问多个指令或一定时间量的最后指令/操作来访问流数据。 作为执行对最后使用指令/操作的高速缓存线的访问的结果,将高速缓存行更新为不再需要的流数据(SDN)状态。 当控制逻辑要确定要替换的高速缓存行时,修改的最近最少使用(LRU)算法被偏置以首先选择SDN状态行来替换不再需要的流数据。

    Methods, apparatus, and instructions for converting vector data
    6.
    发明授权
    Methods, apparatus, and instructions for converting vector data 有权
    用于转换矢量数据的方法,装置和指令

    公开(公告)号:US08667250B2

    公开(公告)日:2014-03-04

    申请号:US11964631

    申请日:2007-12-26

    IPC分类号: G06F9/312

    摘要: A computer processor includes a decoder for decoding machine instructions and an execution unit for executing those instructions. The decoder and the execution unit are capable of decoding and executing vector instructions that include one or more format conversion indicators. For instance, the processor may be capable of executing a vector-load-convert-and-write (VLoadConWr) instruction that provides for loading data from memory to a vector register. The VLoadConWr instruction may include a format conversion indicator to indicate that the data from memory should be converted from a first format to a second format before the data is loaded into the vector register. Other embodiments are described and claimed.

    摘要翻译: 计算机处理器包括用于解码机器指令的解码器和用于执行这些指令的执行单元。 解码器和执行单元能够解码和执行包括一个或多个格式转换指示符的向量指令。 例如,处理器可能能够执行矢量加载转换和写入(VLoadConWr)指令,该指令提供将数据从存储器加载到向量寄存器。 VLoadConWr指令可以包括格式转换指示符,以指示在将数据加载到向量寄存器之前,来自存储器的数据应该从第一格式转换为第二格式。 描述和要求保护其他实施例。

    Mechanism for effectively caching streaming and non-streaming data patterns
    8.
    发明授权
    Mechanism for effectively caching streaming and non-streaming data patterns 有权
    有效缓存流和非流数据模式的机制

    公开(公告)号:US08065488B2

    公开(公告)日:2011-11-22

    申请号:US12908183

    申请日:2010-10-20

    IPC分类号: G06F12/12

    摘要: A method and apparatus for efficiently caching streaming and non-streaming data is described herein. Software, such as a compiler, identifies last use streaming instructions/operations that are the last instruction/operation to access streaming data for a number of instructions or an amount of time. As a result of performing an access to a cache line for a last use instruction/operation, the cache line is updated to a streaming data no longer needed (SDN) state. When control logic is to determine a cache line to be replaced, a modified Least Recently Used (LRU) algorithm is biased to select SDN state lines first to replace no longer needed streaming data.

    摘要翻译: 本文描述了用于有效地高速缓存流和非流数据的方法和装置。 诸如编译器的软件识别最后使用的流指令/操作,这些指令/操作是用于访问多个指令或一定时间量的最后指令/操作来访问流数据。 作为执行对最后使用指令/操作的高速缓存线的访问的结果,将高速缓存行更新为不再需要的流数据(SDN)状态。 当控制逻辑要确定要替换的高速缓存行时,修改的最近最少使用(LRU)算法被偏置以首先选择SDN状态行来替换不再需要的流数据。

    METHODS, APPARATUS, AND INSTRUCTIONS FOR CONVERTING VECTOR DATA
    9.
    发明申请
    METHODS, APPARATUS, AND INSTRUCTIONS FOR CONVERTING VECTOR DATA 审中-公开
    用于转换矢量数据的方法,装置和说明

    公开(公告)号:US20140019720A1

    公开(公告)日:2014-01-16

    申请号:US13762220

    申请日:2013-02-07

    IPC分类号: G06F9/30

    摘要: A computer processor includes a decoder for decoding machine instructions and an execution unit for executing those instructions. The decoder and the execution unit are capable of decoding and executing vector instructions that include one or more format conversion indicators. For instance, the processor may be capable of executing a vector-load-convert-and-write (VLoadConWr) instruction that provides for loading data from memory to a vector register. The VLoadConWr instruction may include a format conversion indicator to indicate that the data from memory should be converted from a first format to a second format before the data is loaded into the vector register. Other embodiments are described and claimed.

    摘要翻译: 计算机处理器包括用于解码机器指令的解码器和用于执行这些指令的执行单元。 解码器和执行单元能够解码和执行包括一个或多个格式转换指示符的向量指令。 例如,处理器可能能够执行矢量加载转换和写入(VLoadConWr)指令,该指令提供将数据从存储器加载到向量寄存器。 VLoadConWr指令可以包括格式转换指示符,以指示在将数据加载到向量寄存器之前,来自存储器的数据应该从第一格式转换为第二格式。 描述和要求保护其他实施例。

    DEVICE, SYSTEM, AND METHOD FOR USING A MASK REGISTER TO TRACK PROGRESS OF GATHERING ELEMENTS FROM MEMORY
    10.
    发明申请
    DEVICE, SYSTEM, AND METHOD FOR USING A MASK REGISTER TO TRACK PROGRESS OF GATHERING ELEMENTS FROM MEMORY 有权
    使用掩码寄存器跟踪记忆元素进度的设备,系统和方法

    公开(公告)号:US20110264863A1

    公开(公告)日:2011-10-27

    申请号:US13175953

    申请日:2011-07-05

    IPC分类号: G06F12/08 G06F9/30

    摘要: A device, system and method for assigning values to elements in a first register, where each data field in a first register corresponds to a data element to be written into a second register, and where for each data field in the first register, a first value may indicate that the corresponding data element has not been written into the second register and a second value indicates that the corresponding data element has been written into the second register, reading the values of each of the data fields in the first register, and for each data field in the first register having the first value, gathering the corresponding data element and writing the corresponding data element into the second register, and changing the value of the data field in the first register from the first value to the second value. Other embodiments are described and claimed.

    摘要翻译: 一种用于将值分配给第一寄存器中的元件的装置,系统和方法,其中第一寄存器中的每个数据字段对应于要写入第二寄存器的数据元素,并且对于第一寄存器中的每个数据字段, 值可以指示相应的数据元素尚未被写入第二寄存器,第二值指示对应的数据元素已被写入第二寄存器,读取第一寄存器中每个数据域的值,并且为 第一寄存器中的每个数据字段具有第一值,收集对应的数据元素并将相应的数据元素写入第二寄存器,并将第一寄存器中的数据字段的值从第一值改变为第二值。 描述和要求保护其他实施例。