Mechanism for selectively imposing interference order between page-table fetches and corresponding data fetches
    1.
    发明授权
    Mechanism for selectively imposing interference order between page-table fetches and corresponding data fetches 失效
    选择性地强制页表提取之间的干扰顺序和相应数据提取的机制

    公开(公告)号:US06286090B1

    公开(公告)日:2001-09-04

    申请号:US09084621

    申请日:1998-05-26

    IPC分类号: G06F1200

    CPC分类号: G06F12/1054 G06F12/0813

    摘要: A technique selectively imposes inter-reference ordering between memory reference operations issued by a processor of a multiprocessor system to addresses within a page pertaining to a page table entry (PTE) that is affected by a translation buffer (TB) miss flow routine. The TB miss flow is used to retrieve information contained in the PTE for mapping a virtual address to a physical address and, subsequently, to allow retrieval of data at the mapped physical address. The PTE that is retrieved in response to a memory reference (read) operation is not loaded into the TB until a commit-signal associated with that read operation is returned to the processor. Once the PTE and associated commit-signal are returned, the processor loads the PTE into the TB so that it can be used for a subsequent read operation directed to the data at the physical address.

    摘要翻译: 一种技术选择性地将由多处理器系统的处理器发出的存储器参考操作之间的参考间排序施加于与由翻译缓冲器(TB)错过流程程影响的页表项(PTE)相关的页面内的地址。 TB错误流被用于检索包含在PTE中的信息,用于将虚拟地址映射到物理地址,并且随后允许在映射的物理地址处检索数据。 响应于存储器引用(读取)操作检索的PTE不会被加载到TB中,直到与该读取操作相关联的提交信号返回到处理器。 一旦返回了PTE和相关联的提交信号,处理器将PTE加载到TB中,以便它可以用于针对物理地址的数据的后续读取操作。

    Mechanism for optimizing generation of commit-signals in a distributed shared-memory system
    2.
    发明授权
    Mechanism for optimizing generation of commit-signals in a distributed shared-memory system 失效
    优化分布式共享内存系统中提交信号生成的机制

    公开(公告)号:US06209065B1

    公开(公告)日:2001-03-27

    申请号:US08957230

    申请日:1997-10-24

    IPC分类号: G06F1314

    CPC分类号: G06F9/542 G06F9/52

    摘要: A mechanism optimizes the generation of a commit-signal by control logic of the multiprocessor system in response to a memory reference operation issued by a processor to a local node of a multiprocessor system having a hierarchical switch for interconnecting a plurality of nodes. The mechanism generally comprises a structure that indicates whether the memory reference operation affects other processors of other nodes of the multiprocessor system. An ordering point of the local node generates an optimized commit-signal when the structure indicates that the memory reference operation does not affect the other processors.

    摘要翻译: 一种机制响应于处理器向具有用于互连多个节点的分层交换机的多处理器系统的本地节点发出的存储器参考操作来优化多处理器系统的控制逻辑的生成提交信号。 该机制通常包括指示存储器参考操作是否影响多处理器系统的其他节点的其他处理器的结构。 当结构指示存储器参考操作不影响其他处理器时,本地节点的排序点生成优化的提交信号。

    Mechanism for reducing latency of memory barrier operations on a
multiprocessor system
    3.
    发明授权
    Mechanism for reducing latency of memory barrier operations on a multiprocessor system 失效
    减少多处理器系统上存储器屏障操作延迟的机制

    公开(公告)号:US6088771A

    公开(公告)日:2000-07-11

    申请号:US957501

    申请日:1997-10-24

    IPC分类号: G06F9/45 G06F13/00

    摘要: A technique reduces the latency of a memory barrier (MB) operation used to impose an inter-reference order between sets of memory reference operations issued by a processor to a multiprocessor system having a shared memory. The technique comprises issuing the MB operation immediately after issuing a first set of memory reference operations (i.e., the pre-MB operations) without waiting for responses to those pre-MB operations. Issuance of the MB operation to the system results in serialization of that operation and generation of a MB Acknowledgment (MB-Ack) command. The MB-Ack is loaded into a probe queue of the issuing processor and, according to the invention, functions to pull-in all previously ordered invalidate and probe commands in that queue. By ensuring that the probes and invalidates are ordered before the MB-Ack is received at the issuing processor, the inventive technique provides the appearance that all pre-MB references have completed.

    摘要翻译: 一种技术减少了用于在处理器向具有共享存储器的多处理器系统发出的存储器参考操作的集合之间施加参考间顺序的存储器屏障(MB)操作的等待时间。 该技术包括在发出第一组存储器参考操作(即,预MB操作)之前立即发出MB操作,而不等待对那些MB前操作的响应。 向系统发出MB操作会导致该操作的序列化和生成MB确认(MB-Ack)命令。 MB-Ack被加载到发布处理器的探测队列中,并且根据本发明,该功能用于在该队列中引入所有先前订购的无效和探测命令。 通过确保在发布处理器接收到MB-Ack之前对探测和无效进行排序,本发明技术提供了所有pre-MB引用完成的外观。

    Technique for reducing latency of inter-reference ordering using commit
signals in a multiprocessor system having shared caches
    4.
    发明授权
    Technique for reducing latency of inter-reference ordering using commit signals in a multiprocessor system having shared caches 失效
    用于在具有共享高速缓存的多处理器系统中使用提交信号来减少参考间排序的等待时间的技术

    公开(公告)号:US6055605A

    公开(公告)日:2000-04-25

    申请号:US957544

    申请日:1997-10-24

    IPC分类号: G06F12/08 G06F13/00 G06F12/00

    CPC分类号: G06F12/084

    摘要: A technique reduces the latency of inter-reference ordering between sets of memory reference operations in a multiprocessor system having a shared memory that is distributed among a plurality of processors that share a cache. According to the technique, each processor sharing a cache inherits a commit-signal that is generated by control logic of the multiprocessor system in response to a memory reference operation issued by another processor sharing that cache. The commit-signal facilitates serialization among the processors and shared memory entities of the multiprocessor system by indicating the apparent completion of the memory reference operation to those entities of the system.

    摘要翻译: 一种技术减少了具有分配在共享高速缓存的多个处理器之间的共享存储器的多处理器系统中的存储器参考操作组之间的参考间排序的等待时间。 根据该技术,共享高速缓存的每个处理器响应由共享该高速缓存的另一个处理器发出的存储器引用操作而继承由多处理器系统的控制逻辑产生的提交信号。 提交信号通过指示对系统的那些实体的存储器参考操作的明显完成来促进多处理器系统的处理器和共享存储器实体之间的串行化。

    Method and apparatus for employing commit-signals and prefetching to
maintain inter-reference ordering in a high-performance I/O processor
    5.
    发明授权
    Method and apparatus for employing commit-signals and prefetching to maintain inter-reference ordering in a high-performance I/O processor 失效
    用于采用提交信号和预取以在高性能I / O处理器中维持参考间排序的方法和装置

    公开(公告)号:US6085263A

    公开(公告)日:2000-07-04

    申请号:US956861

    申请日:1997-10-24

    IPC分类号: G06F12/08 G06F13/12 G06F13/14

    摘要: An improved I/O processor (IOP) delivers high I/O performance while maintaining inter-reference ordering among memory reference operations issued by an I/O device as specified by a consistency model in a shared memory multiprocessor system. The IOP comprises a retire controller which imposes inter-reference ordering among the operations based on receipt of a commit signal for each operation, wherein the commit signal for a memory reference operation indicates the apparent completion of the operation rather than actual completion of the operation. In addition, the IOP comprises a prefetch controller coupled to an I/O cache for prefetching data into cache without any ordering constraints (or out-of-order). The ordered retirement functions of the IOP are separated from its prefetching operations, which enables the latter operations to be performed in an arbitrary manner so as to improve the overall performance of the system.

    摘要翻译: 改进的I / O处理器(IOP)提供高I / O性能,同时在由共享存储器多处理器系统中的一致性模型指定的I / O设备发出的存储器参考操作之间保持参考间排序。 IOP包括退出控制器,其基于对每个操作的提交信号的接收,在操作之间施加参考间排序,其中用于存储器参考操作的提交信号指示操作的明显完成,而不是实际完成操作。 此外,IOP包括耦合到I / O缓存的预取控制器,用于将数据预取到高速缓存中,而没有任何排序限制(或无序)。 IOP的有序退休功能与其预取操作分离,这使得后面的操作能够以任意方式执行,从而提高系统的整体性能。

    Livelock prevention by delaying surrender of ownership upon intervening ownership request during load locked / store conditional atomic memory operation
    7.
    发明授权
    Livelock prevention by delaying surrender of ownership upon intervening ownership request during load locked / store conditional atomic memory operation 失效
    在加载锁定/存储条件原子存储器操作期间,通过延迟所有权所有权投降来实现预防行为

    公开(公告)号:US06801986B2

    公开(公告)日:2004-10-05

    申请号:US09933536

    申请日:2001-08-20

    IPC分类号: G06F1200

    摘要: A method, for executing a load locked and a store conditional instruction in a processor, achieves an atomic read-write operation to a memory block. First the load locked instruction is executed to read a memory block, and the processor in response to executing the load locked instruction issues a read modify system command to read the block and to take ownership of the block by the processor, and also sets a lock flag for the address of the memory block, and writes a value of the memory block into a cache of the processor as a cache copy of the memory block. The lock flag, upon receipt of an invalidate message by the processor for the cache copy of the memory block, is reset if any invalidate messages for the memory block are received by the processor. The processor waits for a selected time interval before the processor surrenders ownership of the memory block upon receipt of an ownership request message, if any is received by the processor after execution of the load locked instruction. The processor executes the store conditional instruction, and the processor in response to executing the store conditional instruction tests the lock flag, and if the lock flag is set, writing to the cache copy of the memory block. The processor ends, in the event that the lock flag is reset, the store conditional instruction and does not write to the cache copy of the memory block.

    摘要翻译: 一种用于在处理器中执行加载锁定和存储条件指令的方法,对存储器块实现原子读写操作。 首先执行加载锁定指令以读取存储器块,并且响应于执行加载锁定指令的处理器发出读取修改系统命令来读取块并由处理器获取块的所有权,并且还设置锁定 标记存储器块的地址,并将存储器块的值写入处理器的高速缓存作为存储器块的高速缓存副本。 如果处理器接收到存储块的任何无效消息,则锁定标志在由处理器接收到存储器块的高速缓存副本的无效消息时被重置。 处理器在接收到所有权请求消息之后处理器递交所述存储器块的所有权,等待处理器选定的时间间隔(如果在执行加载锁定指令之后由处理器接收到)。 处理器执行存储条件指令,并且处理器响应于执行存储条件指令测试锁定标志,并且如果设置了锁定标志,则写入存储器块的高速缓存副本。 处理器在锁定标志被复位的情况下结束,存储条件指令,并且不写入存储器块的高速缓存副本。

    Low latency inter-reference ordering in a multiple processor system employing a multiple-level inter-node switch
    8.
    发明授权
    Low latency inter-reference ordering in a multiple processor system employing a multiple-level inter-node switch 失效
    在采用多级节点间交换机的多处理器系统中的低延迟相互参考排序

    公开(公告)号:US06904465B2

    公开(公告)日:2005-06-07

    申请号:US09843228

    申请日:2001-04-26

    摘要: A multiple-processor system in which a commit message is returned to a source processor that requests a memory access operation so as to indicate the apparent completion of the operation includes a multiple-level switch unit linking nodes that contain the processors. The switch unit includes multiple input switches each of which receives messages from multiple nodes, and a set of output switches whose inputs are the outputs of the input switches and whose outputs are the inputs of the nodes. Each switch processes messages in the order in which they are received by the switch and each output switch follows the same rule as the other output switches.

    摘要翻译: 将提交消息返回给源处理器的多处理器系统,其请求存储器访问操作以指示操作的明显完成包括链接包含处理器的节点的多级交换单元。 开关单元包括多个输入开关,每个输入开关从多个节点接收消息,以及一组输出开关,其输入是输入开关的输出,其输出是节点的输入。 每个交换机按交换机接收的顺序处理消息,每个输出交换机遵循与其他输出交换机相同的规则。

    Transaction references for requests in a multi-processor network
    9.
    发明授权
    Transaction references for requests in a multi-processor network 失效
    多处理器网络中的请求的事务引用

    公开(公告)号:US07856534B2

    公开(公告)日:2010-12-21

    申请号:US10758352

    申请日:2004-01-15

    IPC分类号: G06F12/00

    CPC分类号: G06F12/0828 G06F12/0831

    摘要: One disclosed embodiment may comprise a system that includes a home node that provides a transaction reference to a requester in response to a request from the requester. The requester provides an acknowledgement message to the home node in response to the transaction reference, the transaction reference enabling the requester to determine an order of requests at the home node relative to the request from the requester.

    摘要翻译: 一个公开的实施例可以包括系统,其包括家庭节点,其响应于来自请求者的请求向请求者提供事务参考。 请求者响应于事务参考向家庭节点提供确认消息,事务参考使得请求者能够相对于来自请求者的请求确定家庭节点处的请求的顺序。