Failover mechanisms in RDMA operations
    2.
    发明申请
    Failover mechanisms in RDMA operations 失效
    RDMA操作中的故障切换机制

    公开(公告)号:US20060045005A1

    公开(公告)日:2006-03-02

    申请号:US11017574

    申请日:2004-12-20

    IPC分类号: H04J1/16

    摘要: In remote direct memory access transfers in a multinode data processing system in which the nodes communicate with one another through communication adapters coupled to a switch or network, failures in the nodes or in the communication adapters can produce the phenomenon known as trickle traffic, which is data that has been received from the switch or from the network that is stale but which may have all the signatures of a valid packet data. The present invention addresses the trickle traffic problem in two situations: node failure and adapter failure. In the node failure situation randomly generated keys are used to reestablish connections to the adapter while providing a mechanism for the recognition of stale packets. In the adapter failure situation, a round robin context allocation approach is used with adapter state contexts being provided with state information which helps to identify stale packets. In another approach to handling the adapter failure situation counts are assigned which provide an adapter failure number to the node which will not match a corresponding number in a context field in the adapter, thus enabling the identification of stale packets.

    摘要翻译: 在多节点数据处理系统中的远程直接存储器访问传输中,其中节点通过耦合到交换机或网络的通信适配器彼此通信,节点或通信适配器中的故障可能产生称为流量流量的现象, 已经从交换机接收到的数据或者来自网络的数据已经过时,但是可能具有有效分组数据的所有签名。 本发明解决了两种情况下的流量流量问题:节点故障和适配器故障。 在节点故障情况下,随机生成的密钥用于重新建立与适配器的连接,同时提供用于识别过时数据包的机制。 在适配器故障情况下,使用循环上下文分配方法,适配器状态上下文被提供有状态信息,其有助于识别过时的分组。 在处理适配器故障情况的另一种方法中,分配了向适配器上下文字段中不匹配相应号码的节点提供适配器故障号,从而能够识别过时的数据包。

    Remote direct memory access system and method
    5.
    发明申请
    Remote direct memory access system and method 审中-公开
    远程直接内存访问系统和方法

    公开(公告)号:US20060075057A1

    公开(公告)日:2006-04-06

    申请号:US10929943

    申请日:2004-08-30

    IPC分类号: G06F15/167

    CPC分类号: H04L67/1097 H04L67/18

    摘要: A remote direct memory access (RDMA) system is provided in which data is transferred over a network by DMA between from a memory of a first node of a multi-processor system having a plurality of nodes connected by a network and a memory of a second node of the multi-processor system. The system includes a first network adapter at the first node, operable to transmit data stored in the memory of the first node to a second node in a plurality of portions in fulfillment of a DMA request. The first network adapter is operable to transmit each portion together with identifying information and information identifying a location for storing the transmitted portion in the memory of the second node, such that each portion is capable of being received independently by the second node according to the identifying information. Each portion is further capable of being stored in the memory of the second node at the location identified by the location identifying information.

    摘要翻译: 提供了一种远程直接存储器访问(RDMA)系统,其中通过DMA在具有由网络连接的多个节点的多处理器系统的第一节点的存储器和第二个存储器连接的存储器之间的网络上通过网络传送数据 多处理器系统的节点。 该系统包括第一节点处的第一网络适配器,可操作以在DMA请求的多个部分中将存储在第一节点的存储器中的数据发送到多个部分中的第二节点。 第一网络适配器可操作以将每个部分与识别信息和信息一起发送,识别信息和信息标识用于存储第二节点的存储器中的发送部分的位置,使得每个部分能够由第二节点独立地根据识别 信息。 每个部分还能够在由位置识别信息标识的位置处存储在第二节点的存储器中。

    Efficient probabilistic duplicate packet detector in computer networks
    6.
    发明申请
    Efficient probabilistic duplicate packet detector in computer networks 失效
    计算机网络中的高效概率重复数据包检测器

    公开(公告)号:US20070097875A1

    公开(公告)日:2007-05-03

    申请号:US11264529

    申请日:2005-11-01

    IPC分类号: H04L12/26

    摘要: In order to solve the problem of the detection of the arrival of duplicate data packets in an interconnected, multinode data processing system, each data packet is provided with a field of r bits that are randomly generated for each data packet. However, one of the packets is provided with a field that is computed from the other randomly generated field entries in a checksum computation which yields a selected nonzero checksum value. A running checksum at the receiver is used to determine whether or not, after the receipt of the specified number, k, of data packets, a duplicate packet has been received.

    摘要翻译: 为了解决在互连的多节点数据处理系统中检测到重复数据分组的问题的问题,每个数据分组被提供有为每个数据分组随机生成的r比特的字段。 然而,其中一个分组被提供有一个字段,该字段是在产生选定的非零校验和值的校验和计算中的其他随机生成的字段条目中计算的。 在接收机上运行的校验和用于确定在接收到数据包的指定数量k之后是否已经接收到重复分组。

    Processing system and method including a dedicated collective offload engine providing collective processing in a distributed computing environment
    7.
    发明申请
    Processing system and method including a dedicated collective offload engine providing collective processing in a distributed computing environment 审中-公开
    处理系统和方法包括在分布式计算环境中提供集体处理的专用集合卸载引擎

    公开(公告)号:US20050097300A1

    公开(公告)日:2005-05-05

    申请号:US10697859

    申请日:2003-10-30

    IPC分类号: G06F9/50 G06F15/00

    CPC分类号: G06F9/5066 G06F2209/509

    摘要: A dedicated collective offload engine provides collective processing of data from processing nodes in a distributed computing environment. The dedicated collective offload engine and the processing nodes are coupled to a switch fabric. A result is produced by the collective offload engine based on the collective processing of the data, and is forwarded to at least one processing node. Collective processing is facilitated by communication among a plurality of dedicated collective offload engines via the switch fabric or via a private channel disposed between the collective offload engines.

    摘要翻译: 专用的集体卸载引擎在分布式计算环境中提供来自处理节点的数据的集体处理。 专用集体卸载引擎和处理节点耦合到交换结构。 结果是由集体卸载引擎基于数据的集合处理产生的,并被转发到至少一个处理节点。 通过经由交换结构的多个专用集合卸载发动机之间的通信或经由设置在集体卸载发动机之间的专用通道进行通信来促进集体处理。

    Efficient and flexible memory copy operation
    8.
    发明申请
    Efficient and flexible memory copy operation 失效
    高效灵活的内存复制操作

    公开(公告)号:US20070150676A1

    公开(公告)日:2007-06-28

    申请号:US11316663

    申请日:2005-12-22

    IPC分类号: G06F12/16

    摘要: A system, method, and computer program product for semi-synchronously copying data from a first portion of memory to a second portion of memory are disclosed. The method comprises receiving, in a processor, a call for a semi-synchronous memory copy operation. The semi-synchronous memory copy operation preserves temporal persistence of validity for a virtual source address corresponding to a source location in a memory and a virtual target address corresponding to a target location in the memory by setting a flag bit. The call includes at least the virtual source address, the virtual target address, and an indicator identifying a number of bytes to be copied. The memory copy operation is placed in a queue for execution by a memory controller. The queue is coupled to the memory controller. At least one subsequent instruction is continued to be executed as the subsequent instruction becomes available from an instruction pipeline.

    摘要翻译: 公开了一种用于将数据从存储器的第一部分半数同步地复制到存储器的第二部分的系统,方法和计算机程序产品。 该方法包括在处理器中接收对半同步存储器复制操作的呼叫。 半同步存储器复制操作通过设置标志位来保持对应于存储器中的源位置的虚拟源地址和对应于存储器中的目标位置的虚拟目标地址的有效性的时间持续性。 该呼叫至少包括虚拟源地址,虚拟目标地址和标识要复制的字节数的指示符。 存储器复制操作被放置在队列中以由存储器控制器执行。 队列耦合到存储器控制器。 随着随后的指令从指令流水线可用,继续执行至少一个后续指令。

    Validity of address ranges used in semi-synchronous memory copy operations
    9.
    发明申请
    Validity of address ranges used in semi-synchronous memory copy operations 有权
    在半同步存储器复制操作中使用的地址范围的有效性

    公开(公告)号:US20070150675A1

    公开(公告)日:2007-06-28

    申请号:US11315757

    申请日:2005-12-22

    IPC分类号: G06F12/16

    摘要: A system, method, and a computer readable for protecting content of a memory page are disclosed. The method includes determining a start of a semi-synchronous memory copy operation. A range of addresses is determined where the semi-synchronous memory copy operation is being performed. An issued instruction that removes a page table entry is detected. The method further includes determining whether the issued instruction is destined to remove a page table entry associated with at least one address in the range of addresses. In response to the issued instruction being destined to remove the page table entry, the execution of the issued instruction is stalled until the semi-synchronous memory copy operation is completed.

    摘要翻译: 公开了一种用于保护存储器页面的内容的系统,方法和可读取的计算机。 该方法包括确定半同步存储器复制操作的开始。 确定正在执行半同步存储器复制操作的地址范围。 检测到发出的删除页表条目的指令。 所述方法还包括确定所发出的指令是否旨在去除与地址范围中的至少一个地址相关联的页表条目。 响应于发出的指令旨在去除页表条目,所发出的指令的执行停止,直到半同步存储器复制操作完成。