Failover mechanisms in RDMA operations
    1.
    发明申请
    Failover mechanisms in RDMA operations 失效
    RDMA操作中的故障切换机制

    公开(公告)号:US20060045005A1

    公开(公告)日:2006-03-02

    申请号:US11017574

    申请日:2004-12-20

    IPC分类号: H04J1/16

    摘要: In remote direct memory access transfers in a multinode data processing system in which the nodes communicate with one another through communication adapters coupled to a switch or network, failures in the nodes or in the communication adapters can produce the phenomenon known as trickle traffic, which is data that has been received from the switch or from the network that is stale but which may have all the signatures of a valid packet data. The present invention addresses the trickle traffic problem in two situations: node failure and adapter failure. In the node failure situation randomly generated keys are used to reestablish connections to the adapter while providing a mechanism for the recognition of stale packets. In the adapter failure situation, a round robin context allocation approach is used with adapter state contexts being provided with state information which helps to identify stale packets. In another approach to handling the adapter failure situation counts are assigned which provide an adapter failure number to the node which will not match a corresponding number in a context field in the adapter, thus enabling the identification of stale packets.

    摘要翻译: 在多节点数据处理系统中的远程直接存储器访问传输中,其中节点通过耦合到交换机或网络的通信适配器彼此通信,节点或通信适配器中的故障可能产生称为流量流量的现象, 已经从交换机接收到的数据或者来自网络的数据已经过时,但是可能具有有效分组数据的所有签名。 本发明解决了两种情况下的流量流量问题:节点故障和适配器故障。 在节点故障情况下,随机生成的密钥用于重新建立与适配器的连接,同时提供用于识别过时数据包的机制。 在适配器故障情况下,使用循环上下文分配方法,适配器状态上下文被提供有状态信息,其有助于识别过时的分组。 在处理适配器故障情况的另一种方法中,分配了向适配器上下文字段中不匹配相应号码的节点提供适配器故障号,从而能够识别过时的数据包。

    Remote direct memory access system and method
    4.
    发明申请
    Remote direct memory access system and method 审中-公开
    远程直接内存访问系统和方法

    公开(公告)号:US20060075057A1

    公开(公告)日:2006-04-06

    申请号:US10929943

    申请日:2004-08-30

    IPC分类号: G06F15/167

    CPC分类号: H04L67/1097 H04L67/18

    摘要: A remote direct memory access (RDMA) system is provided in which data is transferred over a network by DMA between from a memory of a first node of a multi-processor system having a plurality of nodes connected by a network and a memory of a second node of the multi-processor system. The system includes a first network adapter at the first node, operable to transmit data stored in the memory of the first node to a second node in a plurality of portions in fulfillment of a DMA request. The first network adapter is operable to transmit each portion together with identifying information and information identifying a location for storing the transmitted portion in the memory of the second node, such that each portion is capable of being received independently by the second node according to the identifying information. Each portion is further capable of being stored in the memory of the second node at the location identified by the location identifying information.

    摘要翻译: 提供了一种远程直接存储器访问(RDMA)系统,其中通过DMA在具有由网络连接的多个节点的多处理器系统的第一节点的存储器和第二个存储器连接的存储器之间的网络上通过网络传送数据 多处理器系统的节点。 该系统包括第一节点处的第一网络适配器,可操作以在DMA请求的多个部分中将存储在第一节点的存储器中的数据发送到多个部分中的第二节点。 第一网络适配器可操作以将每个部分与识别信息和信息一起发送,识别信息和信息标识用于存储第二节点的存储器中的发送部分的位置,使得每个部分能够由第二节点独立地根据识别 信息。 每个部分还能够在由位置识别信息标识的位置处存储在第二节点的存储器中。

    Communication resource reservation system for improved messaging performance
    5.
    发明申请
    Communication resource reservation system for improved messaging performance 审中-公开
    通信资源预留系统,提高消息传递性能

    公开(公告)号:US20060034167A1

    公开(公告)日:2006-02-16

    申请号:US10903322

    申请日:2004-07-30

    IPC分类号: H04L12/26

    CPC分类号: G06F15/17375

    摘要: A system and method are provided for facilitating zero-copy communications between computing systems of a group of computing systems. The method includes allocating, in a first computing system of the group of computing systems, a pool of privileged communication resources from a privileged resource controller to a communications controller. The communications controller designates the privileged communication resources from the pool for use in handling individual ones of the zero-copy communications, thereby avoiding a requirement to obtain individual ones of the privileged resources from the owner of the privileged resources at setup time for each zero-copy communication.

    摘要翻译: 提供了一种用于促进一组计算系统的计算系统之间的零复制通信的系统和方法。 该方法包括在该组计算系统的第一计算系统中将特权通信资源池从特权资源控制器分配给通信控制器。 通信控制器从池中指定用于处理零拷贝通信中的各个的特权通信资源,从而避免在建立时针对每个零拷贝通信从特权资源的所有者获得各个特权资源的要求, 复制通讯。

    Method and system for interfacing components of a computing system with a pair of unidirectional, point-to-point buses
    6.
    发明申请
    Method and system for interfacing components of a computing system with a pair of unidirectional, point-to-point buses 失效
    用于将计算系统的组件与一对单向点对点总线接口的方法和系统

    公开(公告)号:US20070143511A1

    公开(公告)日:2007-06-21

    申请号:US11304474

    申请日:2005-12-15

    IPC分类号: G06F13/00

    CPC分类号: G06F13/4269

    摘要: A method of interfacing two components of a computing system is provided wherein the method includes providing a pair of unidirectional, point-to-point buses to transmit data between a master bus controller of the computing system and a slave bus controller of a processor unit of the computing system. The method also includes providing means for transmitting a command packet with an address associated with data pertaining to the command from the master bus controller to the slave bus controller. In addition, the method includes providing means for determining by the slave bus controller whether the slave bus controller can accept the command. The method further includes providing means for transmitting an acknowledgement from the slave bus controller to the master bus controller after the slave bus controller receives a first signaling interval for the command packet if the slave bus controller can accept the command packet.

    摘要翻译: 提供了一种接口计算系统的两个组件的方法,其中所述方法包括提供一对单向点对点总线以在所述计算系统的主总线控制器与所述计算系统的总线控制器之间传送数据, 计算系统。 该方法还包括提供用于发送具有与从主总线控制器到从总线控制器的命令有关的数据相关联的地址的命令分组的装置。 此外,该方法包括提供用于由从总线控制器确定从总线控制器是否可以接受命令的装置。 该方法还包括提供用于在从总线控制器接收到命令分组之后从属总线控制器接收到用于命令分组的第一信令间隔的从总线控制器向主总线控制器发送确认的装置。