System and method for reducing unnecessary cache operations
    21.
    发明授权
    System and method for reducing unnecessary cache operations 失效
    减少不必要的缓存操作的系统和方法

    公开(公告)号:US07698508B2

    公开(公告)日:2010-04-13

    申请号:US11674960

    申请日:2007-02-14

    IPC分类号: G06F12/00

    摘要: A system and method for cache management in a data processing system. The data processing system includes a processor and a memory hierarchy. The memory hierarchy includes at least an upper memory cache, at least a lower memory cache, and a write-back data structure. In response to replacing data from the upper memory cache, the upper memory cache examines the write-back data structure to determine whether or not the data is present in the lower memory cache. If the data is present in the lower memory cache, the data is replaced in the upper memory cache without casting out the data to the lower memory cache.

    摘要翻译: 一种用于数据处理系统中缓存管理的系统和方法。 数据处理系统包括处理器和存储器层级。 存储器层级至少包括上部存储器高速缓存,至少下部存储器高速缓存和回写数据结构。 响应于从上部存储器高速缓存替换数据,上部存储器高速缓存检查回写数据结构以确定数据是否存在于下部存储器高速缓存中。 如果数据存在于较低存储器高速缓存中,则数据将在上部存储器高速缓存中替换,而不会将数据丢弃到较低的内存高速缓存。

    Method and system for reducing cache tag bits
    22.
    发明授权
    Method and system for reducing cache tag bits 失效
    减少缓存标签位的方法和系统

    公开(公告)号:US07546417B1

    公开(公告)日:2009-06-09

    申请号:US12173613

    申请日:2008-07-15

    IPC分类号: G06F12/00 G06F13/00 G06F13/28

    CPC分类号: G06F12/0864 G06F2212/1044

    摘要: A method of accessing data from a cache is disclosed. Tag bits of data among sets and ways of cache lines are divided into common subtags and remaining subtags. Similarly, an access address tag is divided into an address common subtag and address remaining tag. When the index of an access address selects a set, a match comparison of the address common subtag and the selected set common subtag is performed. Also, the address remaining tag and selected set remaining subtags are compared for matching before the selected set and associated data is supplied to the requester.

    摘要翻译: 公开了一种从缓存访问数据的方法。 标记数据组中的数据位和高速缓存行的方式分为常见的子标签和剩余子标签。 类似地,访问地址标签被分成地址共享子标签和地址剩余标签。 当访问地址的索引选择一个集合时,执行地址公共子标记和所选择的公共子标记的匹配比较。 此外,在选择的集合和相关联的数据被提供给请求者之前,比较地址剩余标签和选择的集合剩余子标签以进行匹配。

    System and method for reducing unnecessary cache operations
    23.
    发明申请
    System and method for reducing unnecessary cache operations 审中-公开
    减少不必要的缓存操作的系统和方法

    公开(公告)号:US20060155934A1

    公开(公告)日:2006-07-13

    申请号:US11032875

    申请日:2005-01-11

    IPC分类号: G06F12/00

    摘要: A system and method for cache management in a data processing system. The data processing system includes a processor and a memory hierarchy. The memory hierarchy includes at least an upper memory cache, at least a lower memory cache, and a write-back data structure. In response to replacing data from the upper memory cache, the upper memory cache examines the write-back data structure to determine whether or not the data is present in the lower memory cache. If the data is present in the lower memory cache, the data is replaced in the upper memory cache without casting out the data to the lower memory cache.

    摘要翻译: 一种用于数据处理系统中缓存管理的系统和方法。 数据处理系统包括处理器和存储器层级。 存储器层级至少包括上部存储器高速缓存,至少下部存储器高速缓存和回写数据结构。 响应于从上部存储器高速缓存替换数据,上部存储器高速缓存检查回写数据结构以确定数据是否存在于下部存储器高速缓存中。 如果数据存在于较低存储器高速缓存中,则数据将在上部存储器高速缓存中替换,而不会将数据丢弃到较低的内存高速缓存。

    Instruction set architecture extensions for performing power versus performance tradeoffs
    24.
    发明授权
    Instruction set architecture extensions for performing power versus performance tradeoffs 失效
    用于执行功率与性能折衷的指令集架构扩展

    公开(公告)号:US08589665B2

    公开(公告)日:2013-11-19

    申请号:US12788940

    申请日:2010-05-27

    IPC分类号: G06F9/00

    摘要: Mechanisms are provided for processing an instruction in a processor of a data processing system. The mechanisms operate to receive, in a processor of the data processing system, an instruction, the instruction including power/performance tradeoff information associated with the instruction. The mechanisms further operate to determine power/performance tradeoff priorities or criteria, specifying whether power conservation or performance is prioritized with regard to execution of the instruction, based on the power/performance tradeoff information. Moreover, the mechanisms process the instruction in accordance with the power/performance tradeoff priorities or criteria identified based on the power/performance tradeoff information of the instruction.

    摘要翻译: 提供了用于处理数据处理系统的处理器中的指令的机制。 这些机制操作以在数据处理系统的处理器中接收指令,该指令包括与指令相关联的功率/性能权衡信息。 这些机制进一步操作以基于功率/性能折衷信息来确定功率/性能折衷优先级或标准,指定功率节省或关于指令的执行是否优先的性能。 此外,机构根据功率/性能折衷优先级或基于指令的功率/性能折衷信息识别的标准处理指令。

    Techniques for dynamically sharing a fabric to facilitate off-chip communication for multiple on-chip units
    25.
    发明授权
    Techniques for dynamically sharing a fabric to facilitate off-chip communication for multiple on-chip units 失效
    用于动态共享结构以促进多个片上单元的片外通信的技术

    公开(公告)号:US08346988B2

    公开(公告)日:2013-01-01

    申请号:US12786716

    申请日:2010-05-25

    IPC分类号: G06F3/00 G06F13/00

    摘要: A technique for sharing a fabric to facilitate off-chip communication for on-chip units includes dynamically assigning a first unit that implements a first communication protocol to a first portion of the fabric when private fabrics are indicated for the on-chip units. The technique also includes dynamically assigning a second unit that implements a second communication protocol to a second portion of the fabric when the private fabrics are indicated for the on-chip units. In this case, the first and second units are integrated in a same chip and the first and second protocols are different. The technique further includes dynamically assigning, based on off-chip traffic requirements of the first and second units, the first unit or the second unit to the first and second portions of the fabric when the private fabrics are not indicated for the on-chip units.

    摘要翻译: 一种用于共享一个结构以促进片上单元的片外通信的技术包括:当针对片上单元指示专用结构时,动态分配实现第一通信协议的第一单元到该结构的第一部分。 该技术还包括当为片上单元指示专用结构时,动态地将实现第二通信协议的第二单元分配给该结构的第二部分。 在这种情况下,第一和第二单元集成在相同的芯片中,并且第一和第二协议是不同的。 该技术还包括:当私有结构未被指示用于片上单元时,基于第一单元或第二单元的片外流量要求将第一单元或第二单元动态地分配给该结构的第一和第二部分 。

    Techniques for Indirect Data Prefetching
    27.
    发明申请
    Techniques for Indirect Data Prefetching 有权
    间接数据预取技术

    公开(公告)号:US20090198950A1

    公开(公告)日:2009-08-06

    申请号:US12024239

    申请日:2008-02-01

    IPC分类号: G06F12/02 G06F12/10

    CPC分类号: G06F12/0862 G06F2212/6028

    摘要: A processor includes a first address translation engine, a second address translation engine, and a prefetch engine. The first address translation engine is configured to determine a first memory address of a pointer associated with a data prefetch instruction. The prefetch engine is coupled to the first translation engine and is configured to fetch content, included in a first data block (e.g., a first cache line) of a memory, at the first memory address. The second address translation engine is coupled to the prefetch engine and is configured to determine a second memory address based on the content of the memory at the first memory address. The prefetch engine is also configured to fetch (e.g., from the memory or another memory) a second data block (e.g., a second cache line) that includes data at the second memory address.

    摘要翻译: 处理器包括第一地址转换引擎,第二地址转换引擎和预取引擎。 第一地址转换引擎被配置为确定与数据预取指令相关联的指针的第一存储器地址。 预取引擎被耦合到第一翻译引擎,并被配置为在第一存储器地址处提取包含在存储器的第一数据块(例如,第一高速缓存行)中的内容。 第二地址转换引擎耦合到预取引擎,并且被配置为基于第一存储器地址处的存储器的内容来确定第二存储器地址。 预取引擎还被配置为从第二存储器地址提取包括数据的第二数据块(例如,第二高速缓存行)(例如,从存储器或另一存储器)。

    DATA PROCESSING SYSTEM, PROCESSOR AND METHOD THAT SUPPORT A TOUCH OF A PARTIAL CACHE LINE OF DATA
    28.
    发明申请
    DATA PROCESSING SYSTEM, PROCESSOR AND METHOD THAT SUPPORT A TOUCH OF A PARTIAL CACHE LINE OF DATA 审中-公开
    数据处理系统,处理器和方法,支持部分缓存行数据

    公开(公告)号:US20090198910A1

    公开(公告)日:2009-08-06

    申请号:US12024174

    申请日:2008-02-01

    IPC分类号: G06F12/00

    CPC分类号: G06F12/0862 G06F12/0831

    摘要: According to method of data processing in a multiprocessor data processing system, in response to a processor touch request targeting a target granule of a cache line of data containing multiple granules, a processing unit originates on an interconnect of the multiprocessor data processing system a partial touch request that requests a copy of only the target granule for subsequent query access. In response to a combined response to the partial touch request indicating success, the combined response representing a system-wide response to the partial touch request, the processing unit receives the target granule of the target cache line and updates a coherency state of the target granule while retaining a coherency state of at least one other granule of the cache line.

    摘要翻译: 根据多处理器数据处理系统中的数据处理方法,响应于针对包含多个粒子的数据的高速缓存行的目标颗粒的处理器触摸请求,处理单元起源于多处理器数据处理系统的互连部分触摸 请求仅请求目标颗粒的副本用于后续查询访问。 响应于指示成功的部分触摸请求的组合响应,表示对部分触摸请求的系统范围响应的组合响应,处理单元接收目标高速缓存行的目标颗粒并更新目标颗粒的一致性状态 同时保持高速缓存行的至少另一个颗粒的一致性状态。

    Complier assisted victim cache bypassing
    29.
    发明授权
    Complier assisted victim cache bypassing 失效
    Complier辅助受害者缓存绕过

    公开(公告)号:US07506119B2

    公开(公告)日:2009-03-17

    申请号:US11381563

    申请日:2006-05-04

    IPC分类号: G06F12/08 G06F12/12

    摘要: A method for compiler assisted victim cache bypassing including: identifying a cache line as a candidate for victim cache bypassing; conveying a bypassing-the-victim-cache information to a hardware; and checking a state of the cache line to determine a modified state of the cache line, wherein the cache line is identified for cache bypassing if the cache line that has no reuse within a loop or loop nest and there is no immediate loop reuse or there is a substantial across loop reuse distance so that it will be replaced from both main and victim cache before being reused.

    摘要翻译: 一种用于编译器辅助的受害者缓存旁路的方法,包括:将高速缓存行标识为用于受害者缓存旁路的候选者; 向硬件传送绕过受害者缓存信息; 并且检查高速缓存行的状态以确定高速缓存行的修改状态,其中如果在循环或循环嵌套内没有重用的高速缓存行并且不存在立即循环重用或那里,则高速缓存行被识别用于高速缓存绕过 是一个实质的跨循环重用距离,因此它将被重新使用之前被替换为主缓存和受害缓存。