PROCESSOR PERFORMANCE IMPROVEMENT FOR INSTRUCTION SEQUENCES THAT INCLUDE BARRIER INSTRUCTIONS
    32.
    发明申请
    PROCESSOR PERFORMANCE IMPROVEMENT FOR INSTRUCTION SEQUENCES THAT INCLUDE BARRIER INSTRUCTIONS 有权
    包括障碍指示的指令序列的处理器性能改进

    公开(公告)号:US20130205120A1

    公开(公告)日:2013-08-08

    申请号:US13369029

    申请日:2012-02-08

    IPC分类号: G06F9/312

    摘要: A technique for processing an instruction sequence that includes a barrier instruction, a load instruction preceding the barrier instruction, and a subsequent memory access instruction following the barrier instruction includes determining that the load instruction is resolved based upon receipt of an earliest of a good combined response for a read operation corresponding to the load instruction and data for the load instruction. The technique also includes if execution of the subsequent memory access instruction is not initiated prior to completion of the barrier instruction, initiating in response to determining the barrier instruction completed, execution of the subsequent memory access instruction. The technique further includes if execution of the subsequent memory access instruction is initiated prior to completion of the barrier instruction, discontinuing in response to determining the barrier instruction completed, tracking of the subsequent memory access instruction with respect to invalidation.

    摘要翻译: 一种用于处理指示序列的技术,该指令序列包括屏障指令,屏障指令之前的加载指令,以及跟随障碍指令之后的随后存储器访问指令,包括:基于接收到最早的良好组合响应来确定加载指令是否被解决 用于与加载指令相对应的读取操作和用于加载指令的数据。 该技术还包括如果在完成屏障指令之前没有启动后续存储器访问指令的执行,则响应于确定完成的屏障指令启动后续存储器访问指令的执行。 该技术还包括如果在完成屏障指令之前启动后续存储器访问指令的执行,则响应于确定所完成的屏障指令而中断,跟踪关于无效的后续存储器访问指令。

    Fault tolerant encoding of directory states for stuck bits
    33.
    发明授权
    Fault tolerant encoding of directory states for stuck bits 有权
    卡位的目录状态的容错编码

    公开(公告)号:US08205136B2

    公开(公告)日:2012-06-19

    申请号:US12189808

    申请日:2008-08-12

    IPC分类号: G11C29/00

    CPC分类号: G11C29/832 G06F11/1064

    摘要: A method of handling a stuck bit in a directory of a cache memory, by defining multiple binary encodings to indicate a defective cache state, detecting an error in a tag stored in a member of the directory (wherein the tag at least includes an address field, a state field and an error-correction field), determining that the error is associated with a stuck bit of the directory member, and writing new state information to the directory member which is selected from one of the binary encodings based on a field location of the stuck bit within the directory member. The multiple binary encodings may include a first binary encoding when the stuck bit is in the address field, a second binary encoding when the stuck bit is in the state field, and a third binary encoding when the stuck bit is in the error-correction field. The new state information may also further be selected based on the value of the stuck bit, e.g., a state bit corresponding to the stuck bit is assigned a bit value from the new state information which matches the value of the stuck bit.

    摘要翻译: 一种通过定义多个二进制编码来指示缺陷高速缓存状态来处理高速缓冲存储器的目录中的卡住位的方法,检测存储在目录成员中的标签中的错误(其中标签至少包括地址字段 ,状态字段和纠错字段),确定错误与目录成员的卡住位相关联,并且基于字段位置将新状态信息写入从二进制编码之一中选择的目录成员 的目录成员中的卡住位。 多个二进制编码可以包括当卡住位在地址字段中时的第一二进制编码,当卡位位于状态字段时的第二二进制编码,以及当卡位位于错误校正字段中时的第三二进制编码 。 还可以基于卡住位的值进一步选择新的状态信息,例如,对应于该卡住位的状态位从与该卡位的值匹配的新状态信息中分配一位值。

    MEMORY COHERENCE DIRECTORY SUPPORTING REMOTELY SOURCED REQUESTS OF NODAL SCOPE
    34.
    发明申请
    MEMORY COHERENCE DIRECTORY SUPPORTING REMOTELY SOURCED REQUESTS OF NODAL SCOPE 失效
    记忆协调指导原则支持远程请求的NODAL范围

    公开(公告)号:US20110047352A1

    公开(公告)日:2011-02-24

    申请号:US12545246

    申请日:2009-08-21

    IPC分类号: G06F15/76 G06F9/02 G06F12/08

    CPC分类号: G06F12/0817

    摘要: A data processing system includes at least a first through third processing nodes coupled by an interconnect fabric. The first processing node includes a master, a plurality of snoopers capable of participating in interconnect operations, and a node interface that receives a request of the master and transmits the request of the master to the second processing unit with a nodal scope of transmission limited to the second processing node. The second processing node includes a node interface having a directory. The node interface of the second processing node permits the request to proceed with the nodal scope of transmission if the directory does not indicate that a target memory block of the request is cached other than in the second processing node and prevents the request from succeeding if the directory indicates that the target memory block of the request is cached other than in the second processing node.

    摘要翻译: 数据处理系统至少包括通过互连结构耦合的第一至第三处理节点。 第一处理节点包括主机,能够参与互连操作的多个侦听器,以及接收主机请求的节点接口,并将主机的请求传送到第二处理单元,传送范围限于 第二处理节点。 第二处理节点包括具有目录的节点接口。 第二处理节点的节点接口允许请求继续进行节点传输范围,如果该目录没有指示该请求的目标存储器块不是在第二处理节点中被缓存,并且如果该请求成功 目录指示除第二处理节点之外的请求的目标存储块被缓存。

    Virtual Barrier Synchronization Cache
    35.
    发明申请
    Virtual Barrier Synchronization Cache 失效
    虚拟障碍同步缓存

    公开(公告)号:US20100257317A1

    公开(公告)日:2010-10-07

    申请号:US12419364

    申请日:2009-04-07

    IPC分类号: G06F12/08 G06F12/00

    CPC分类号: G06F12/0811 G06F9/522

    摘要: A data processing system includes an interconnect fabric, a system memory coupled to the interconnect fabric and including a virtual barrier synchronization region allocated to storage of virtual barrier synchronization registers (VBSRs), and a plurality of processing units coupled to the interconnect fabric and operable to access the virtual barrier synchronization region of the system memory. Each of the plurality of processing units includes a processor core and a cache memory including a cache array that caches VBSR lines from the virtual barrier synchronization region of the system memory and a cache controller. The cache controller, responsive to a store request from the processor core to update a particular VBSR line, performs a non-blocking update of the cache array in each other of the plurality of processing units contemporaneously holding a copy of the particular VBSR line by transmitting a VBSR update command on the interconnect fabric.

    摘要翻译: 数据处理系统包括互连结构,耦合到互连结构并包括分配给虚拟屏障同步寄存器(VBSR)的存储的虚拟屏障同步区域的系统存储器,以及耦合到互连结构的多个处理单元, 访问系统内存的虚拟屏障同步区域。 多个处理单元中的每一个包括处理器核心和高速缓存存储器,其包括从系统存储器的虚拟屏障同步区域缓存VBSR行的缓存阵列和高速缓存控制器。 高速缓存控制器响应于来自处理器核心的存储请求来更新特定VBSR线路,通过发送来同时保存特定VBSR线路的副本的多个处理单元中的彼此之间的高速缓存阵列的非阻塞更新 互连结构上的VBSR更新命令。

    VICTIM CACHE LATERAL CASTOUT TARGETING

    公开(公告)号:US20100235577A1

    公开(公告)日:2010-09-16

    申请号:US12340511

    申请日:2008-12-19

    IPC分类号: G06F12/08 G06F12/00

    摘要: A data processing system includes a plurality of processing units coupled by an interconnect fabric. In response to a data request, a victim cache line is selected for castout from a first lower level cache of a first processing unit, and a target lower level cache of one of the plurality of processing units is selected based upon architectural proximity of the target lower level cache to a home system memory to which the address of the victim cache line is assigned. The first processing unit issues on the interconnect fabric a lateral castout (LCO) command that identifies the victim cache line to be castout from the first lower level cache and indicates that the target lower level cache is an intended destination. In response to a coherence response indicating success of the LCO command, the victim cache line is removed from the first lower level cache and held in the second lower level cache.

    摘要翻译: 数据处理系统包括通过互连结构耦合的多个处理单元。 响应于数据请求,从第一处理单元的第一较低级别高速缓存中选择牺牲缓存行来进行舍弃,并且基于目标的体系结构接近来选择多个处理单元之一的目标下级高速缓存 低级缓存到分配了受害者缓存行的地址的归属系统存储器。 所述第一处理单元在所述互连结构上发出侧向锁定(LCO)命令,所述侧向锁定(LCO)命令标识要从所述第一低级缓存中抛出的所述牺牲缓存线,并且指示所述目标低级缓存是预期目的地。 响应于指示LCO命令的成功的一致性响应,从第一低级缓存中删除受害者高速缓存行并保存在第二较低级高速缓存中。

    Data processing system, method and interconnect fabric supporting destination data tagging
    37.
    发明授权
    Data processing system, method and interconnect fabric supporting destination data tagging 失效
    数据处理系统,方法和互连结构支持目标数据标记

    公开(公告)号:US07761631B2

    公开(公告)日:2010-07-20

    申请号:US12117539

    申请日:2008-05-08

    IPC分类号: G06F13/42 G06F13/00

    CPC分类号: G06F15/16

    摘要: A data processing system includes a plurality of communication links and a plurality of processing units including a local master processing unit. The local master processing unit includes interconnect logic that couples the processing unit to one or more of the plurality of communication links and an originating master coupled to the interconnect logic. The originating master originates an operation by issuing a write-type request on at least one of the one or more communication links, receives from a snooper in the data processing system a destination tag identifying a route to the snooper, and, responsive to receipt of the combined response and the destination tag, initiates a data transfer including a data payload and a data tag identifying the route provided within the destination tag.

    摘要翻译: 数据处理系统包括多个通信链路和包括本地主处理单元的多个处理单元。 本地主处理单元包括将处理单元耦合到多个通信链路中的一个或多个以及耦合到互连逻辑的始发主机的互连逻辑。 始发主机通过在一个或多个通信链路中的至少一个发出写入请求来发起操作,从数据处理系统中的窥探者接收标识到窥探者的路由的目的地标签,并且响应于接收到 组合响应和目的地标签,发起包括数据有效载荷和标识目的地标签内提供的路由的数据标签的数据传输。

    Method and system for handling stuck bits in cache directories
    38.
    发明授权
    Method and system for handling stuck bits in cache directories 有权
    用于处理缓存目录中的卡位的方法和系统

    公开(公告)号:US07689891B2

    公开(公告)日:2010-03-30

    申请号:US11225640

    申请日:2005-09-13

    IPC分类号: G11C29/00

    CPC分类号: G06F11/1064

    摘要: A method of handling a stuck bit in a directory of a cache memory which detects an error in a stored tag having an address field, a state field and an error-correction field, determines that the error is associated with a stuck bit of the directory member, marks the directory member as defective, and casts out corrected address information. The error is detected during processing of a cache directory access request, and is determined to be associated with a stuck bit of the directory member by attempting to correct a first error and then detecting a second error after the first correction attempt. The address information is cast out by routing a surrogate tag contained in a surrogate member of the cache directory through error-correction pipeline circuitry while transmitting the address information from the surrogate member to a cast-out machine.

    摘要翻译: 一种处理高速缓冲存储器的目录中的卡住位的方法,该高速缓冲存储器的目录中检测到具有地址字段,状态字段和纠错字段的存储标签中的错误,确定该错误与该目录的卡住位相关联 会员,将目录成员标记为有缺陷,并丢弃修正的地址信息。 在处理高速缓存目录访问请求期间检测到错误,并且通过尝试校正第一错误然后在第一次校正尝试之后检测第二错误来确定与目录成员的卡住位相关联。 通过错误校正流水线电路路由包含在高速缓存目录的代理成员中的代理标签,同时将地址信息从代理成员发送到投放机器,来丢弃地址信息。

    Method and Apparatus for Handling Multiple Memory Requests Within a Multiprocessor System
    39.
    发明申请
    Method and Apparatus for Handling Multiple Memory Requests Within a Multiprocessor System 有权
    在多处理器系统中处理多个存储器请求的方法和装置

    公开(公告)号:US20090198933A1

    公开(公告)日:2009-08-06

    申请号:US12024181

    申请日:2008-02-01

    IPC分类号: G06F12/14

    CPC分类号: G06F9/526

    摘要: A method for handling multiple memory requests within a multi-processor system is disclosed. A lock control section is initially assigned to a data block within a system memory. In response to a request for accessing the data block by a processing unit, a determination is made whether or not the lock control section of the data block has been set. If the lock control section has been set, another determination is made whether or not the requesting processing unit is located beyond a predetermined distance from a memory controller. If the requesting processing unit is located beyond a predetermined distance from the memory controller, the requesting processing unit is invited to perform other functions; otherwise, the number of the requesting processing unit is placed in a queue table. However, if the lock control section has not been set, the lock control section of the data block is set, and the access request is allowed.

    摘要翻译: 公开了一种在多处理器系统内处理多个存储器请求的方法。 锁控制部分最初被分配给系统存储器内的数据块。 响应于由处理单元访问数据块的请求,确定数据块的锁定控制部分是否已经被设置。 如果已经设置了锁定控制部分,则另外确定请求处理单元是否位于距离存储器控制器超过预定距离的位置。 如果请求处理单元位于距存储器控制器超过预定距离的位置,则请求处理单元被邀请执行其他功能; 否则,请求处理单元的号码被放置在队列表中。 然而,如果锁定控制部分尚未设置,则数据块的锁定控制部分被设置,并且允许访问请求。

    Data processing system, cache system and method for issuing a request on an interconnect fabric without reference to a lower level cache based upon a tagged cache state
    40.
    发明授权
    Data processing system, cache system and method for issuing a request on an interconnect fabric without reference to a lower level cache based upon a tagged cache state 有权
    数据处理系统,缓存系统和方法,用于基于标记的高速缓存状态在不引用低级缓存的情况下在互连结构上发出请求

    公开(公告)号:US07536513B2

    公开(公告)日:2009-05-19

    申请号:US11095734

    申请日:2005-03-31

    IPC分类号: G06F13/00 G06F13/28 G06F12/00

    摘要: In response to a master receiving a memory access request indicating a target address, the master accesses a first cache directory of an upper level cache of a cache hierarchy. In response to the target address being associated in the first cache directory with an entry having a valid address tag and a first invalid coherency state, the master issues a request specifying the target address on an interconnect fabric without regard to a coherency state associated with the target address in a second cache directory of a lower level cache of the cache hierarchy. In response to the target address having a second invalid coherency state with respect to the first cache directory, the master issues a request specifying the target address on an interconnect fabric after determining a coherency state associated with the target address in the second cache directory of the lower level cache of the cache hierarchy.

    摘要翻译: 响应于主机接收到指示目标地址的存储器访问请求,主机访问高速缓存层级的高级缓存的第一高速缓存目录。 响应于目标地址与具有有效地址标签的条目和第一无效一致性状态在第一高速缓存目录中相关联的目标地址,主机在不考虑与该互连结构相关联的一致性状态的情况下发布指定互连结构上的目标地址的请求 目标地址在缓存层次结构的较低级缓存的第二高速缓存目录中。 响应于相对于第一高速缓存目录具有第二无效一致性状态的目标地址,主机在确定与第二高速缓存目录中的目标地址相关联的一致性状态之后,在互连结构上发出指定目标地址的请求 缓存层次结构的较低级缓存。