LOW LATENCY MEMORY ACCESS AND SYNCHRONIZATION
    32.
    发明申请
    LOW LATENCY MEMORY ACCESS AND SYNCHRONIZATION 失效
    低延迟存储器访问和同步

    公开(公告)号:US20070204112A1

    公开(公告)日:2007-08-30

    申请号:US11617276

    申请日:2006-12-28

    IPC分类号: G06F12/14

    摘要: A low latency memory system access is provided in association with a weakly-ordered multiprocessor system. Each processor in the multiprocessor shares resources, and each shared resource has an associated lock within a locking device that provides support for synchronization between the multiple processors in the multiprocessor and the orderly sharing of the resources. A processor only has permission to access a resource when it owns the lock associated with that resource, and an attempt by a processor to own a lock requires only a single load operation, rather than a traditional atomic load followed by store, such that the processor only performs a read operation and the hardware locking device performs a subsequent write operation rather than the processor. A simple prefetching for non-contiguous data structures is also disclosed. A memory line is redefined so that in addition to the normal physical memory data, every line includes a pointer that is large enough to point to any other line in the memory, wherein the pointers to determine which memory line to prefetch rather than some other predictive algorithm. This enables hardware to effectively prefetch memory access patterns that are non-contiguous, but repetitive.

    摘要翻译: 与弱有序的多处理器系统相关联地提供低延迟存储器系统访问。 多处理器中的每个处理器共享资源,并且每个共享资源在锁定设备内具有关联的锁,其提供对多处理器中的多个处理器之间的同步的支持以及资源的有序共享。 当处理器拥有与该资源相关联的锁定时,处理器仅具有访问资源的权限,并且处理器拥有锁的尝试仅需要单个加载操作,而不是传统的原子负载后跟存储,使得处理器 只执行读取操作,并且硬件锁定装置执行后续的写入操作而不是处理器。 还公开了用于非连续数据结构的简单预取。 重新定义存储器线,使得除了正常的物理存储器数据之外,每行包括足够大的指针以指向存储器中的任何其他行,其中指针用于确定要预取的存储器行而不是一些其它预测 算法。 这使得硬件能够有效地预取不连续但重复的存储器访问模式。

    REMOTE PROCESSING AND MEMORY UTILIZATION
    34.
    发明申请
    REMOTE PROCESSING AND MEMORY UTILIZATION 有权
    远程处理和存储器的使用

    公开(公告)号:US20140047060A1

    公开(公告)日:2014-02-13

    申请号:US13570916

    申请日:2012-08-09

    IPC分类号: G06F15/167

    摘要: According to one embodiment of the present invention, a system for operating memory includes a first node coupled to a second node by a network, the system configured to perform a method including receiving the remote transaction message from the second node in a processing element in the first node via the network, wherein the remote transaction message bypasses a main processor in the first node as it is transmitted to the processing element. In addition, the method includes accessing, by the processing element, data from a location in a memory in the first node based on the remote transaction message, and performing, by the processing element, computations based on the data and the remote transaction message.

    摘要翻译: 根据本发明的一个实施例,一种用于操作存储器的系统包括由网络耦合到第二节点的第一节点,所述系统被配置为执行一种方法,该方法包括从所述第二节点接收来自所述第二节点的处理元件中的所述远程事务消息 第一节点经由网络,其中当所述远程事务消息被传送到所述处理元件时,所述远程事务消息绕过所述第一节点中的主处理器。 此外,该方法包括基于远程事务消息,由处理元件访问来自第一节点中的存储器中的位置的数据,以及由处理元件基于数据和远程事务消息执行计算。

    Preventing messaging queue deadlocks in a DMA environment
    35.
    发明授权
    Preventing messaging queue deadlocks in a DMA environment 失效
    防止DMA环境中的消息队列死锁

    公开(公告)号:US08631086B2

    公开(公告)日:2014-01-14

    申请号:US12241514

    申请日:2008-09-30

    IPC分类号: G06F15/167

    CPC分类号: G06F15/17331

    摘要: Embodiments of the invention may be used to manage message queues in a parallel computing environment to prevent message queue deadlock. A direct memory access controller of a compute node may determine when a messaging queue is full. In response, the DMA may generate an interrupt. An interrupt handler may stop the DMA and swap all descriptors from the full messaging queue into a larger queue (or enlarge the original queue). The interrupt handler then restarts the DMA. Alternatively, the interrupt handler stops the DMA, allocates a memory block to hold queue data, and then moves descriptors from the full messaging queue into the allocated memory block. The interrupt handler then restarts the DMA. During a normal messaging advance cycle, a messaging manager attempts to inject the descriptors in the memory block into other messaging queues until the descriptors have all been processed.

    摘要翻译: 本发明的实施例可以用于在并行计算环境中管理消息队列以防止消息队列死锁。 计算节点的直接存储器访问控制器可以确定消息队列何时已满。 作为响应,DMA可能会产生中断。 中断处理程序可能会停止DMA,并将所有描述符从完整消息队列交换到更大的队列(或放大原始队列)。 然后中断处理程序重新启动DMA。 或者,中断处理程序停止DMA,分配存储块来保存队列数据,然后将描述符从完整消息队列移动到分配的内存块中。 然后中断处理程序重新启动DMA。 在正常消息传递提前周期期间,消息收发管理器尝试将描述符注入到其他消息队列中,直到描述符全部被处理。

    Multiprocessor system with multiple concurrent modes of execution
    36.
    发明授权
    Multiprocessor system with multiple concurrent modes of execution 有权
    具有多个并发执行模式的多处理器系统

    公开(公告)号:US08621478B2

    公开(公告)日:2013-12-31

    申请号:US13008502

    申请日:2011-01-18

    IPC分类号: G06F9/46

    CPC分类号: G06F9/524 G06F12/08

    摘要: A multiprocessor system supports multiple concurrent modes of speculative execution. Speculation identification numbers (IDs) are allocated to speculative threads from a pool of available numbers. The pool is divided into domains, with each domain being assigned to a mode of speculation. Modes of speculation include TM, TLS, and rollback. Allocation of the IDs is carried out with respect to a central state table and using hardware pointers. The IDs are used for writing different versions of speculative results in different ways of a set in a cache memory.

    摘要翻译: 多处理器系统支持多种并发模式的推测执行。 投机标识号(ID)从可用数字池中分配给投机线程。 池被分为域,每个域被分配到一种投机模式。 投机模式包括TM,TLS和回滚。 对于中央状态表并使用硬件指针执行ID的分配。 ID用于以高速缓冲存储器中的集合的不同方式写入不同版本的推测结果。

    Calculating A Checksum With Inactive Networking Components In A Computing System
    37.
    发明申请
    Calculating A Checksum With Inactive Networking Components In A Computing System 有权
    在计算系统中使用非活动网络组件计算校验和

    公开(公告)号:US20130212253A1

    公开(公告)日:2013-08-15

    申请号:US13370059

    申请日:2012-02-09

    IPC分类号: G06F15/173

    CPC分类号: H04L43/04 H04L1/00 H04L1/0061

    摘要: Calculating a checksum utilizing inactive networking components in a computing system, including: identifying, by a checksum distribution manager, an inactive networking component, wherein the inactive networking component includes a checksum calculation engine for computing a checksum; sending, to the inactive networking component by the checksum distribution manager, metadata describing a block of data to be transmitted by an active networking component; calculating, by the inactive networking component, a checksum for the block of data; transmitting, to the checksum distribution manager from the inactive networking component, the checksum for the block of data; and sending, by the active networking component, a data communications message that includes the block of data and the checksum for the block of data.

    摘要翻译: 使用计算系统中的非活动网络组件来计算校验和,包括:由校验和分发管理器识别非活动网络组件,其中所述非活动网络组件包括用于计算校验和的校验和计算引擎; 由校验和分发管理器向不活动网络组件发送描述要由主动网络组件发送的数据块的元数据; 由非活动网络组件计算数据块的校验和; 从非活动网络组件向校验和分发管理器发送数据块的校验和; 以及由所述主动网络组件发送包括所述数据块和所述数据块的校验和的数据通信消息。

    Extended write combining using a write continuation hint flag
    38.
    发明授权
    Extended write combining using a write continuation hint flag 失效
    使用写入连续提示标志进行扩展写入组合

    公开(公告)号:US08458282B2

    公开(公告)日:2013-06-04

    申请号:US11768593

    申请日:2007-06-26

    摘要: A computing apparatus for reducing the amount of processing in a network computing system which includes a network system device of a receiving node for receiving electronic messages comprising data. The electronic messages are transmitted from a sending node. The network system device determines when more data of a specific electronic message is being transmitted. A memory device stores the electronic message data and communicating with the network system device. A memory subsystem communicates with the memory device. The memory subsystem stores a portion of the electronic message when more data of the specific message will be received, and the buffer combines the portion with later received data and moves the data to the memory device for accessible storage.

    摘要翻译: 一种用于减少网络计算系统中的处理量的计算装置,其包括用于接收包括数据的电子消息的接收节点的网络系统设备。 从发送节点发送电子消息。 网络系统设备确定何时正在发送特定电子消息的更多数据。 存储装置存储电子消息数据并与网络系统装置进行通信。 存储器子系统与存储器件通信。 当更多的特定消息的数据将被接收时,存储器子系统存储电子消息的一部分,并且缓冲器将该部分与稍后接收的数据组合,并将数据移动到存储器装置以进行存取。

    Network support for system initiated checkpoints
    40.
    发明授权
    Network support for system initiated checkpoints 失效
    网络支持系统发起的检查点

    公开(公告)号:US08359367B2

    公开(公告)日:2013-01-22

    申请号:US12731796

    申请日:2010-03-25

    IPC分类号: G06F15/167 G06F11/00 G06F7/38

    CPC分类号: G06F15/167 G06F11/141

    摘要: A system, method and computer program product for supporting system initiated checkpoints in parallel computing systems. The system and method generates selective control signals to perform checkpointing of system related data in presence of messaging activity associated with a user application running at the node. The checkpointing is initiated by the system such that checkpoint data of a plurality of network nodes may be obtained even in the presence of user applications running on highly parallel computers that include ongoing user messaging activity.

    摘要翻译: 一种用于在并行计算系统中支持系统启动的检查点的系统,方法和计算机程序产品。 系统和方法产生选择性控制信号,以在存在与在节点处运行的用户应用相关联的消息传送活动的情况下执行系统相关数据的检查点。 检查点由系统启动,使得即使在存在包括正在进行的用户消息活动的高度并行计算机上的用户应用的情况下,也可以获得多个网络节点的检查点数据。