Node self-start in a decentralized cluster
    1.
    Granted patent
    Status: active

    Publication number: US07240088B2

    Publication date: 2007-07-03

    Application number: US10057188

    Filing date: 2002-01-25

    IPC classification: G06F15/16

    Abstract: Methods, systems and articles of manufacture for automatically starting a node in a clustered computer system. A starting state value may be assigned to the node and a discovery process initiated to find a sponsor node. If a sponsor node is found, the node is joined with the sponsor node in the clustered computer system. If a sponsor node is not found, the node is started as a one-node cluster in the clustered computer system. An active state value is assigned to the node upon inclusion into the clustered computer system.
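The start-up flow described in this abstract can be sketched as follows. This is only an illustration under stated assumptions, not the patented implementation: the names `Node`, `Cluster`, `NodeState`, `self_start` and the `discover_sponsor` callback are hypothetical, and the discovery process is reduced to a function that either returns a sponsor node or `None`.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Callable, List, Optional


class NodeState(Enum):
    STARTING = "starting"
    ACTIVE = "active"


@dataclass
class Cluster:
    members: List[str] = field(default_factory=list)


@dataclass
class Node:
    name: str
    state: Optional[NodeState] = None
    cluster: Optional[Cluster] = None


def self_start(node: Node,
               discover_sponsor: Callable[[Node], Optional[Node]]) -> Cluster:
    """Start a node: join a sponsor's cluster, or form a one-node cluster."""
    # Assign the starting state before discovery begins.
    node.state = NodeState.STARTING

    # Hypothetical discovery hook: returns a sponsor node or None.
    sponsor = discover_sponsor(node)

    if sponsor is not None and sponsor.cluster is not None:
        cluster = sponsor.cluster      # sponsor found: join its cluster
    else:
        cluster = Cluster()            # no sponsor: start as a one-node cluster

    cluster.members.append(node.name)
    node.cluster = cluster

    # The node becomes active once it is included in a cluster.
    node.state = NodeState.ACTIVE
    return cluster
```

A first node started with a discovery function that finds nothing forms its own cluster; a second node whose discovery returns the first node joins that same cluster.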

    Node shutdown in clustered computer system
    2.
    Granted patent
    Status: lapsed

    Publication number: US06918051B2

    Publication date: 2005-07-12

    Application number: US09827804

    Filing date: 2001-04-06

    IPC classification: G06F11/00 G06F15/16 H04L12/28

    CPC classification: G06F11/0796 G06F11/0715

    Abstract: A clustered computer system, apparatus, program product and method utilize a group member-initiated shutdown process to terminate clustering on a node in an automated and orderly fashion, typically in the event of a failure detected by a group member residing on that node. As a component of such a process, node leave operations are initiated on the other nodes in a clustered computer system, thereby permitting any dependency failovers to occur in an automated fashion. Moreover, other group members on a node to be shut down are preemptively terminated prior to local detection of the failure within those other group members, so that termination of clustering on the node may be initiated to complete a shutdown operation.

    Dynamic modification of fragmentation size cluster communication parameter in clustered computer system
    3.
    Granted patent
    Status: lapsed

    Publication number: US07529855B2

    Publication date: 2009-05-05

    Application number: US11154228

    Filing date: 2005-06-16

    IPC classification: G06F15/173 G06F15/16

    Abstract: An apparatus, program product and method support the dynamic modification of cluster communication parameters such as a fragmentation size parameter through controllably deferring the processing of a requested fragmentation size change in a source node until after receipt of an acknowledgment message for at least one unacknowledged message sent by the source node to a plurality of target nodes. By controllably deferring such processing until it is confirmed that any such previously-unacknowledged messages sent by a source node have been received by any target nodes, synchronization between the source node and the target nodes may be obtained, and a fragmentation size change may occur in a coordinated fashion such that future messages from the source node to the target node will be processed by both the source and the target nodes using the modified fragmentation size parameter.
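The deferral idea in this abstract can be illustrated with a small sketch. It rests on simplifying assumptions that are not taken from the patent: the class `SourceNode` and its methods are hypothetical, a single acknowledgment stands in for the acknowledgments of all target nodes, and the change is applied only when no sent message remains unacknowledged.

```python
from typing import List, Optional, Set, Tuple


class SourceNode:
    """Toy source node that defers a fragmentation size change until
    every previously sent message has been acknowledged."""

    def __init__(self, fragment_size: int) -> None:
        self.fragment_size = fragment_size
        self.unacked: Set[int] = set()           # sequence numbers awaiting an ack
        self.pending_size: Optional[int] = None  # requested but not yet applied
        self._next_seq = 0

    def send(self, payload: bytes) -> Tuple[int, List[bytes]]:
        """Fragment a payload with the current size and track it as unacked."""
        seq = self._next_seq
        self._next_seq += 1
        self.unacked.add(seq)
        size = self.fragment_size
        fragments = [payload[i:i + size] for i in range(0, len(payload), size)]
        return seq, fragments

    def request_fragment_size(self, new_size: int) -> None:
        """Record the requested size; apply it now only if nothing is in flight."""
        if new_size <= 0:
            raise ValueError("fragment size must be positive")
        self.pending_size = new_size
        self._apply_if_safe()

    def on_ack(self, seq: int) -> None:
        """Handle an acknowledgment and retry the deferred change."""
        self.unacked.discard(seq)
        self._apply_if_safe()

    def _apply_if_safe(self) -> None:
        # The change takes effect only when no message is still unacknowledged.
        if self.pending_size is not None and not self.unacked:
            self.fragment_size = self.pending_size
            self.pending_size = None
```

In this sketch a message sent before the request keeps the old fragment size, and the first message sent after the outstanding acknowledgment arrives uses the new one, so both ends switch at a well-defined point.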

    Dynamic modification of cluster communication parameters in clustered computer system
    4.
    Granted patent
    Status: lapsed

    Publication number: US06983324B1

    Publication date: 2006-01-03

    Application number: US09694586

    Filing date: 2000-10-23

    IPC classification: G06F15/16

    CPC classification: H04L67/1002 H04L69/40

    Abstract: An apparatus, program product and method support the dynamic modification of cluster communication parameters through a distributed protocol whereby individual nodes locally confirm initiation and status information for every node participating in a parameter modification operation. By doing so, individual nodes are also able to locally determine the need to undo locally-performed parameter modifications should any other node be incapable of performing a parameter modification. Moreover, specifically with respect to cluster communication parameters such as heartbeat parameters, such parameters may be dynamically modified by configuring a sending node to send a heartbeat message to a receiving node, with the heartbeat message indicating that a heartbeat parameter is to be modified. In response to the heartbeat message, the receiving node may then send an acknowledgment message to the sending node that indicates whether the heartbeat parameter has been modified in the receiving node. Further, modification of the heartbeat parameter in the sending node may be deferred until the acknowledgment message from the receiving node indicates that the heartbeat parameter has been modified in the receiving node.
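The heartbeat exchange in this abstract can be sketched as a two-party handshake. Everything named here is an assumption made for illustration: the `Heartbeat` and `Ack` message types, the `Sender` and `Receiver` classes, and the `min_interval` check, which exists only to show a receiver that is unable to apply a requested value.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Heartbeat:
    # None marks an ordinary heartbeat; a value asks the receiver to change.
    new_interval: Optional[float] = None


@dataclass
class Ack:
    modified: bool  # whether the receiver applied the requested change


class Receiver:
    def __init__(self, interval: float, min_interval: float = 0.1) -> None:
        self.interval = interval
        self.min_interval = min_interval  # hypothetical local constraint

    def on_heartbeat(self, hb: Heartbeat) -> Ack:
        if hb.new_interval is None:
            return Ack(modified=False)
        if hb.new_interval < self.min_interval:
            return Ack(modified=False)    # cannot apply: report it in the ack
        self.interval = hb.new_interval
        return Ack(modified=True)


class Sender:
    def __init__(self, interval: float) -> None:
        self.interval = interval
        self.pending: Optional[float] = None

    def request_change(self, new_interval: float) -> None:
        # The sender's own value is left unchanged until the receiver confirms.
        self.pending = new_interval

    def next_heartbeat(self) -> Heartbeat:
        return Heartbeat(new_interval=self.pending)

    def on_ack(self, ack: Ack) -> None:
        # Apply the deferred change only after a positive acknowledgment.
        if self.pending is not None and ack.modified:
            self.interval = self.pending
            self.pending = None
```

Because the sender changes its own value last, a rejected request leaves both sides on the old interval, which is the property the deferral is meant to preserve.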

    Dynamic modification of fragmentation size cluster communication parameter in clustered computer system
    5.
    Granted patent
    Status: lapsed

    Publication number: US06934768B1

    Publication date: 2005-08-23

    Application number: US09694599

    Filing date: 2000-10-23

    IPC classification: G06F11/00 G06F15/16

    Abstract: An apparatus, program product and method support the dynamic modification of cluster communication parameters such as a fragmentation size parameter through controllably deferring the processing of a requested fragmentation size change in a source node until after receipt of an acknowledgment message for at least one unacknowledged message sent by the source node to a plurality of target nodes. By controllably deferring such processing until it is confirmed that any such previously-unacknowledged messages sent by a source node have been received by any target nodes, synchronization between the source node and the target nodes may be obtained, and a fragmentation size change may occur in a coordinated fashion such that future messages from the source node to the target node will be processed by both the source and the target nodes using the modified fragmentation size parameter.

    Cluster destination address table—IP routing for clusters
    6.
    Granted patent
    Status: lapsed

    Publication number: US06993034B1

    Publication date: 2006-01-31

    Application number: US09173090

    Filing date: 1998-10-15

    IPC classification: H04L12/56

    Abstract: According to the present invention, a communications protocol supporting cluster configurations more complex than a single LAN is disclosed. A cluster destination address table (CDAT) is used in conjunction with a network message servicer to communicate between computer systems in a cluster. Each computer system preferably contains a cluster servicer, a CDAT, and a network message servicer. The CDAT contains network addresses, status and adapter information for each computer system in a cluster. Although computer systems may have alternate network addresses when they have multiple adapters, the CDAT indexes primary and alternate address information under a single named system. Thus, redundant connections amongst computer systems are identified, while still using the numeric addresses upon which the network message servicer is based. To send a message using the methods of the present invention, the cluster servicer retrieves a network address for a computer system from a CDAT. A message to be sent and the retrieved address are passed to the network message servicer, preferably an Internet Protocol suite. The network message servicer formats the information into a packet and routes the packet.
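A table of this kind can be pictured as a mapping from a system name to a primary address plus alternates. The sketch below is a loose illustration, not the patented data structure: the class and field names are hypothetical, and the rule of falling back to the first reachable alternate is an assumption about how the stored status information might be used.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class AddressEntry:
    address: str            # numeric network address
    adapter: str            # adapter that owns the address
    reachable: bool = True  # status information kept alongside the address


@dataclass
class SystemEntry:
    primary: AddressEntry
    alternates: List[AddressEntry] = field(default_factory=list)


class ClusterDestinationAddressTable:
    """Indexes primary and alternate addresses under one system name."""

    def __init__(self) -> None:
        self._systems: Dict[str, SystemEntry] = {}

    def register(self, name: str, primary: AddressEntry,
                 *alternates: AddressEntry) -> None:
        self._systems[name] = SystemEntry(primary, list(alternates))

    def resolve(self, name: str) -> Optional[str]:
        """Return the first reachable address for a named system, if any."""
        entry = self._systems[name]
        for candidate in [entry.primary, *entry.alternates]:
            if candidate.reachable:
                return candidate.address
        return None
```

A cluster servicer would look up the address by system name and hand the message and the resolved address to the network layer, which keeps redundant paths visible without changing how that layer addresses packets.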

    Data tracing identifiers
    7.
    Granted patent
    Status: active

    Publication number: US07281052B2

    Publication date: 2007-10-09

    Application number: US10020382

    Filing date: 2001-12-14

    IPC classification: G06F15/16 G06F15/173

    Abstract: A method for assigning an identifier to data processed through protocol layers in one or more computers over a network. A space for the identifier is reserved in the header of each protocol layer. The identifier is then generated at one of the protocol layers. In an embodiment, the identifier is generated at the lowest protocol layer of a computer that sends the data, i.e., the sending computer. Once the identifier is generated, it is then stored in the reserved space in the header.
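One way to picture the reserved header space is a fixed-size field that every layer includes and only one layer fills. The sketch below makes several assumptions that the abstract does not specify: the header layout (`!H16s`), the 16-byte identifier taken from `uuid.uuid4()`, the helper names, and the choice to store the identifier only in the lowest layer's header.

```python
import struct
import uuid
from typing import List, Tuple

# Hypothetical header: 2-byte layer id followed by a 16-byte reserved field.
HEADER = struct.Struct("!H16s")
EMPTY_ID = bytes(16)  # reserved space left unfilled


def wrap(layer_id: int, payload: bytes, trace_id: bytes = EMPTY_ID) -> bytes:
    """Prepend a layer header whose reserved field may hold a trace id."""
    return HEADER.pack(layer_id, trace_id) + payload


def unwrap(packet: bytes) -> Tuple[int, bytes, bytes]:
    """Split a packet into (layer id, trace id field, inner payload)."""
    layer_id, trace_id = HEADER.unpack_from(packet)
    return layer_id, trace_id, packet[HEADER.size:]


def send_through_layers(payload: bytes,
                        layer_ids: List[int]) -> Tuple[bytes, bytes]:
    """Wrap a payload top-down; the last (lowest) layer generates the id."""
    packet = payload
    for layer_id in layer_ids[:-1]:
        packet = wrap(layer_id, packet)  # space reserved, not yet filled
    trace_id = uuid.uuid4().bytes        # generated at the lowest layer
    return wrap(layer_ids[-1], packet, trace_id), trace_id
```

Reserving the field in every header keeps header sizes constant regardless of which layer ends up generating the identifier, which is the main design point this sketch is meant to show.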

    Implementing System to System Communication in a Switchless Non-IB Compliant Environment Using Infiniband Multicast Facilities
    8.
    Patent application
    Status: lapsed

    Publication number: US20100082853A1

    Publication date: 2010-04-01

    Application number: US12240059

    Filing date: 2008-09-29

    IPC classification: G06F13/00 G06F3/00

    CPC classification: G06F13/409 G06F13/387

    Abstract: A method and apparatus are provided for implementing system to system communication in a switchless non-InfiniBand (IB) compliant environment. IB architected multicast facilities are used to communicate between HCAs in a loop or string topology. Multiple HCAs in the network subscribe to a predetermined multicast address. Multicast messages sent by one HCA destined to the predetermined multicast address are received by other HCAs in the network. Intermediate TCA hardware, per IB architected multicast support, forwards the multicast messages on via hardware facilities, which do not require invocation of software facilities, thereby providing performance efficiencies. The messages flow until picked up by an HCA on the network. Architected higher level IB connections, such as IB supported Reliable Connections (RCs), are established using the multicast message flow, eliminating the need for an IB Subnet Manager (SM).

    Multicast cluster servicer for communicating amongst a plurality of nodes without a dedicated local area network
    10.
    Granted patent
    Status: active

    Publication number: US06192417B1

    Publication date: 2001-02-20

    Application number: US09280469

    Filing date: 1999-03-30

    IPC classification: G06F15/16

    Abstract: According to the present invention, a cluster communications system is provided that supports reliable and efficient cluster communications. The preferred embodiment cluster communication systems can be used to provide this reliable and efficient cluster communication for cluster configurations extending beyond a single local area network (LAN). The cluster communications system provides reliable and efficient cluster communication by facilitating multicast messaging between systems in the cluster. In particular, the preferred embodiment provides for the establishment of multicast groups between which multicast messaging is provided. The preferred embodiment provides this multicasting while providing the needed mechanisms to assure ordered message delivery between systems. The preferred embodiment extends this efficient and reliable cluster communication by providing for additional point-to-point communication between systems not on the same LAN. Thus, the preferred embodiment provides a cluster communication system that uses reliable multicasting for efficient cluster communication in a way that can be used for clusters that extend beyond a single local area network.
