Empirically based dynamic control of transmission of victim cache lateral castouts
    31.
    发明授权
    Empirically based dynamic control of transmission of victim cache lateral castouts 有权
    基于经验的动态控制受害者缓存横向传播的传输

    公开(公告)号:US08347036B2

    公开(公告)日:2013-01-01

    申请号:US12421180

    申请日:2009-04-09

    IPC分类号: G06F12/08

    摘要: In response to a data request, a victim cache line is selected for castout from a lower level cache, and a target lower level cache of one of the plurality of processing units is selected. A determination is made whether the selected target lower level cache has provided more than a threshold number of retry responses to lateral castout (LCO) commands of the first lower level cache, and if so, a different target lower level cache is selected. The first processing unit thereafter issues a LCO command on the interconnect fabric. The LCO command identifies the victim cache line to be castout and indicates that the target lower level cache is an intended destination of the victim cache line. In response to a successful coherence response to the LCO command, the victim cache line is removed from the first lower level cache and held in the second lower level cache.

    摘要翻译: 响应于数据请求,选择从较低级别高速缓冲存储器进行丢弃的受害者高速缓存行,并且选择多个处理单元之一的目标下级高速缓存。 确定所选择的目标下层高速缓存是否为第一较低级别高速缓存的横向转移(LCO)命令提供了超过阈值数量的重试响应,如果是,则选择不同的目标低级高速缓存。 此后,第一处理单元在互连结构上发出LCO命令。 LCO命令标识要丢弃的受害者缓存行,并指示目标下级缓存是受害缓存行的预期目标。 响应于对LCO命令的成功的一致性响应,从第一低级缓存中移除受害者高速缓存行并保存在第二较低级缓存中。

    Victim cache replacement
    32.
    发明授权
    Victim cache replacement 有权
    受害者缓存替换

    公开(公告)号:US08327072B2

    公开(公告)日:2012-12-04

    申请号:US12177912

    申请日:2008-07-23

    IPC分类号: G06F12/00

    摘要: A data processing system includes a processor core having an associated upper level cache and a lower level victim cache. In response to a memory access request of the processor core, the lower level cache victim determines whether the memory access request hits or misses in the directory of the lower level victim cache, and the upper level cache determines whether a castout from the upper level cache is to be performed and selects a victim coherency granule for eviction from the upper level cache. In response to determining that a castout from the upper level cache is to be performed, the upper level cache evicts the selected victim coherency granule. In the eviction, the upper level cache reads out the victim coherency granule from the data array of the upper level cache only in response to an indication that the memory access request misses in the directory of the lower level victim cache.

    摘要翻译: 数据处理系统包括具有相关联的高级缓存和较低级别的受害缓存的处理器核心。 响应于处理器核心的存储器访问请求,下级缓存受害者确定存储器访问请求是否在较低级别的受害者高速缓存的目录中命中或丢失,并且上级缓存确定来自上级缓存的丢弃 将被执行,并从上级缓存中选择被驱逐的受害者一致性粒子。 响应于确定要执行来自上级高速缓存的停顿,上级高速缓存驱逐所选择的受害者一致性粒子。 在逐出时,高级缓存器只有在响应于存储器访问请求在较低级别的受害者缓存的目录中丢失的指示时才从高级缓存的数据阵列读出受害者一致性粒子。

    Mode-Based Castout Destination Selection
    33.
    发明申请
    Mode-Based Castout Destination Selection 失效
    基于模式的Castout目的地选择

    公开(公告)号:US20100262783A1

    公开(公告)日:2010-10-14

    申请号:US12420933

    申请日:2009-04-09

    IPC分类号: G06F12/08

    CPC分类号: G06F12/0811 G06F12/12

    摘要: In response to a data request of a first of a plurality of processing units, the first processing unit selects a victim cache line to be castout from the lower level cache of the first processing unit and determines whether a mode is set. If not, the first processing unit issues on the interconnect fabric an LCO command identifying the victim cache line and indicating that a lower level cache is the intended destination. If the mode is set, the first processing unit issues a castout command with an alternative intended destination. In response to a coherence response to the LCO command indicating success of the LCO command, the first processing unit removes the victim cache line from its lower level cache, and the victim cache line is held elsewhere in the data processing system. The mode can be set to inhibit castouts to system memory, for example, for testing.

    摘要翻译: 响应于多个处理单元中的第一处理单元的数据请求,第一处理单元从第一处理单元的较低级高速缓存中选择要丢弃的牺牲高速缓存行,并且确定是否设置了模式。 如果不是,则第一处理单元在互连结构上发出识别受害者高速缓存行的LCO命令,并指示较低级别的高速缓存是预期的目的地。 如果模式被设置,则第一处理单元发出具有替代预定目的地的停顿命令。 响应于指示LCO命令成功的LCO命令的一致性响应,第一处理单元从其较低级高速缓存中去除受害者高速缓存行,并且将受害者高速缓存行保持在数据处理系统的其他地方。 该模式可以设置为抑制系统内存的丢弃,例如进行测试。

    Victim Cache Replacement
    34.
    发明申请
    Victim Cache Replacement 有权
    受害者缓存替换

    公开(公告)号:US20100023695A1

    公开(公告)日:2010-01-28

    申请号:US12177912

    申请日:2008-07-23

    IPC分类号: G06F12/08

    摘要: A data processing system includes a processor core having an associated upper level cache and a lower level victim cache. In response to a memory access request of the processor core, the lower level cache victim determines whether the memory access request hits or misses in the directory of the lower level victim cache, and the upper level cache determines whether a castout from the upper level cache is to be performed and selects a victim coherency granule for eviction from the upper level cache. In response to determining that a castout from the upper level cache is to be performed, the upper level cache evicts the selected victim coherency granule. In the eviction, the upper level cache reads out the victim coherency granule from the data array of the upper level cache only in response to an indication that the memory access request misses in the directory of the lower level victim cache.

    摘要翻译: 数据处理系统包括具有相关联的高级缓存和较低级别的受害缓存的处理器核心。 响应于处理器核心的存储器访问请求,下级缓存受害者确定存储器访问请求是否在较低级别的受害者高速缓存的目录中命中或丢失,并且上级缓存确定来自上级缓存的丢弃 将被执行,并从上级缓存中选择被驱逐的受害者一致性粒子。 响应于确定要执行来自上级高速缓存的停顿,上级高速缓存驱逐所选择的受害者一致性粒子。 在逐出时,高级缓存器只有在响应于存储器访问请求在较低级别的受害者缓存的目录中丢失的指示时才从高级缓存的数据阵列读出受害者一致性粒子。

    Facilitating data coherency using in-memory tag bits and tag test instructions
    35.
    发明授权
    Facilitating data coherency using in-memory tag bits and tag test instructions 失效
    使用内存中标记位和标签测试指令促进数据一致性

    公开(公告)号:US08656121B2

    公开(公告)日:2014-02-18

    申请号:US13109254

    申请日:2011-05-17

    IPC分类号: G06F12/00

    摘要: Fine-grained detection of data modification of original data is provided by associating separate guard bits with granules of memory storing original data from which translated data has been obtained. The guard bits indicating whether the original data stored in the associated granule is protected for data coherency. The guard bits are set and cleared by special-purpose instructions. Responsive to attempting access to translated data obtained from the original data, the guard bit(s) associated with the original data is checked to determine whether the guard bit(s) fail to indicate coherency of the original data, and if so, discarding of the translated data is initiated to facilitate maintaining data coherency between the original data and the translated data.

    摘要翻译: 通过将单独的保护位与存储已经获得翻译数据的原始数据的存储器的颗粒相关联来提供对原始数据的数据修改的细粒度检测。 保护位指示存储在相关联的粒子中的原始数据是否受到数据一致性的保护。 保护位通过专用指令进行设置和清除。 响应于尝试访问从原始数据获得的翻译数据,检查与原始数据相关联的保护位,以确定保护位是否不能指示原始数据的一致性,如果是,则丢弃 启动翻译的数据以便于维持原始数据和翻译数据之间的数据一致性。

    Facilitating data coherency using in-memory tag bits and tag test instructions
    36.
    发明授权
    Facilitating data coherency using in-memory tag bits and tag test instructions 失效
    使用内存中标记位和标签测试指令促进数据一致性

    公开(公告)号:US08645644B2

    公开(公告)日:2014-02-04

    申请号:US13451682

    申请日:2012-04-20

    IPC分类号: G06F12/00

    摘要: A method is provided for fine-grained detection of data modification of original data by associating separate guard bits with granules of memory storing original data from which translated data has been obtained. The guard bits indicating whether the original data stored in the associated granule is protected for data coherency. The guard bits are set and cleared by special-purpose instructions. Responsive to attempting access to translated data obtained from the original data, the guard bit(s) associated with the original data is checked to determine whether the guard bit(s) fail to indicate coherency of the original data, and if so, discarding of the translated data is initiated to facilitate maintaining data coherency between the original data and the translated data.

    摘要翻译: 提供了一种通过将单独的保护位与存储已经获得翻译数据的原始数据的存储器的颗粒相关联的方法来对原始数据的数据修改进行细粒度检测。 保护位指示存储在相关联的粒子中的原始数据是否受到数据一致性的保护。 保护位通过专用指令进行设置和清除。 响应于尝试访问从原始数据获得的翻译数据,检查与原始数据相关联的保护位,以确定保护位是否不能指示原始数据的一致性,如果是,则丢弃 启动翻译的数据以便于维持原始数据和翻译数据之间的数据一致性。

    Victim cache lateral castout targeting
    37.
    发明授权
    Victim cache lateral castout targeting 失效
    受害者缓存横向倾斜定位

    公开(公告)号:US08489819B2

    公开(公告)日:2013-07-16

    申请号:US12340511

    申请日:2008-12-19

    IPC分类号: G06F12/00 G06F13/00 G06F13/28

    摘要: A data processing system includes a plurality of processing units coupled by an interconnect fabric. In response to a data request, a victim cache line is selected for castout from a first lower level cache of a first processing unit, and a target lower level cache of one of the plurality of processing units is selected based upon architectural proximity of the target lower level cache to a home system memory to which the address of the victim cache line is assigned. The first processing unit issues on the interconnect fabric a lateral castout (LCO) command that identifies the victim cache line to be castout from the first lower level cache and indicates that the target lower level cache is an intended destination. In response to a coherence response indicating success of the LCO command, the victim cache line is removed from the first lower level cache and held in the second lower level cache.

    摘要翻译: 数据处理系统包括通过互连结构耦合的多个处理单元。 响应于数据请求,从第一处理单元的第一较低级别高速缓存中选择牺牲缓存行来进行舍弃,并且基于目标的体系结构接近来选择多个处理单元之一的目标下级高速缓存 低级缓存到分配了受害者缓存行的地址的归属系统存储器。 所述第一处理单元在所述互连结构上发出侧向锁定(LCO)命令,所述侧向锁定(LCO)命令标识要从所述第一低级缓存中抛出的所述牺牲缓存线,并且指示所述目标低级缓存是预期目的地。 响应于指示LCO命令的成功的一致性响应,从第一低级缓存中删除受害者高速缓存行并保存在第二较低级高速缓存中。

    Cache-based speculation of stores following synchronizing operations
    38.
    发明授权
    Cache-based speculation of stores following synchronizing operations 失效
    同步操作后,存储器中基于缓存的推测

    公开(公告)号:US08412888B2

    公开(公告)日:2013-04-02

    申请号:US12985590

    申请日:2011-01-06

    IPC分类号: G06F12/00

    CPC分类号: G06F12/0837 G06F12/0895

    摘要: A store request in enqueued in a store queue of a cache memory of the data processing system. The store request identifies a target memory block by a target address and specifies store data. While the store request and a barrier request older than the store request are enqueued in the store queue, a read-claim machine of the cache memory is dispatched to acquire coherence ownership of target memory block of the store request. After coherence ownership of the target memory block is acquired and the barrier request has been retired from the store queue, a cache array of the cache memory is updated with the store data.

    摘要翻译: 在数据处理系统的高速缓冲存储器的存储队列中排队的存储请求。 存储请求通过目标地址识别目标存储器块并指定存储数据。 当存储请求和存储请求之前的屏障请求在存储队列中排队时,调度高速缓冲存储器的读取机器以获取存储请求的目标存储器块的一致性所有权。 在获取目标存储器块的一致性所有权并且屏障请求已经从存储队列中退出之后,用存储数据更新高速缓冲存储器的高速缓存阵列。

    Aggregate data processing system having multiple overlapping synthetic computers
    39.
    发明授权
    Aggregate data processing system having multiple overlapping synthetic computers 有权
    具有多个重叠合成计算机的综合数据处理系统

    公开(公告)号:US08370595B2

    公开(公告)日:2013-02-05

    申请号:US12643800

    申请日:2009-12-21

    IPC分类号: G06F12/00

    摘要: A first SMP computer has first and second processing units and a first system memory pool, a second SMP computer has third and fourth processing units and a second system memory pool, and a third SMP computer has at least fifth and sixth processing units and third, fourth and fifth system memory pools. The fourth system memory pool is inaccessible to the third, fourth and sixth processing units and accessible to at least the second and fifth processing units, and the fifth system memory pool is inaccessible to the first, second and sixth processing units and accessible to at least the fourth and fifth processing units. A first interconnect couples the second processing unit for load-store coherent, ordered access to the fourth system memory pool, and a second interconnect couples the fourth processing unit for load-store coherent, ordered access to the fifth system memory pool.

    摘要翻译: 第一SMP计算机具有第一和第二处理单元和第一系统存储器池,第二SMP计算机具有第三和第四处理单元和第二系统存储器池,并且第三SMP计算机具有至少第五和第六处理单元,第三SMP计算机具有至少第五和第六处理单元, 第四和第五系统内存池。 第四系统存储器池对于第三,第四和第六处理单元是不可访问的,并且可访问至少第二和第五处理单元,并且第五系统存储器池对于第一,第二和第六处理单元是不可访问的,并且至少可访问 第四和第五处理单元。 第一互连耦合第二处理单元,用于对第四系统存储池进行加载存储相关的有序访问,并且第二互连耦合第四处理单元,用于加载存储相关的有序访问到第五系统存储池。

    MEMORY COHERENCE DIRECTORY SUPPORTING REMOTELY SOURCED REQUESTS OF NODAL SCOPE
    40.
    发明申请
    MEMORY COHERENCE DIRECTORY SUPPORTING REMOTELY SOURCED REQUESTS OF NODAL SCOPE 失效
    记忆协调指导原则支持远程请求的NODAL范围

    公开(公告)号:US20120203976A1

    公开(公告)日:2012-08-09

    申请号:US13445010

    申请日:2012-04-12

    IPC分类号: G06F12/08

    CPC分类号: G06F12/0817

    摘要: A data processing system includes at least a first through third processing nodes coupled by an interconnect fabric. The first processing node includes a master, a plurality of snoopers capable of participating in interconnect operations, and a node interface that receives a request of the master and transmits the request of the master to the second processing unit with a nodal scope of transmission limited to the second processing node. The second processing node includes a node interface having a directory. The node interface of the second processing node permits the request to proceed with the nodal scope of transmission if the directory does not indicate that a target memory block of the request is cached other than in the second processing node and prevents the request from succeeding if the directory indicates that the target memory block of the request is cached other than in the second processing node.

    摘要翻译: 数据处理系统至少包括通过互连结构耦合的第一至第三处理节点。 第一处理节点包括主机,能够参与互连操作的多个侦听器,以及接收主机请求的节点接口,并将主机的请求传送到第二处理单元,传送范围限于 第二处理节点。 第二处理节点包括具有目录的节点接口。 第二处理节点的节点接口允许请求继续进行节点传输范围,如果该目录没有指示该请求的目标存储器块不是在第二处理节点中被缓存,并且如果该请求成功 目录指示除第二处理节点之外的请求的目标存储块被缓存。