Method for detecting the quick restart of liveness daemons in a distributed multinode data processing system
    12.
    发明授权
    Method for detecting the quick restart of liveness daemons in a distributed multinode data processing system 失效
    用于检测分布式多节点数据处理系统中活动守护进程快速重启的方法

    公开(公告)号:US07203748B2

    公开(公告)日:2007-04-10

    申请号:US10078076

    申请日:2002-02-15

    IPC分类号: G06F15/16 G06F15/173

    CPC分类号: H04L29/06 H04L69/40

    摘要: In distributed multinode data processing systems, mechanisms are employed to insure that the nodes are properly informed about the liveness of the other nodes in node groups in the network. In particular, the present invention employs group membership indicia as part of a mechanism for detecting that a node and/or its adapter have failed and have been recently restarted. Having detected this situation, group membership inconsistencies which it can engender are avoided.

    摘要翻译: 在分布式多节点数据处理系统中,采用机制来确保节点正确地了解网络中节点组中其他节点的活动性。 特别地,本发明使用组成员标记作为用于检测节点和/或其适配器已经失败并且最近重新启动的机制的一部分。 检测到这种情况后,可以避免组织成员身份不一致。

    Topology propagation in a distributed computing environment with no topology message traffic in steady state
    13.
    发明授权
    Topology propagation in a distributed computing environment with no topology message traffic in steady state 失效
    在稳定状态下没有拓扑消息流量的分布式计算环境中的拓扑传播

    公开(公告)号:US06885644B1

    公开(公告)日:2005-04-26

    申请号:US09580945

    申请日:2000-05-30

    摘要: A topology propagation facility is provided for maintaining a common network topology database at different nodes in a distributed computing environment. The facility generates no message traffic when the distributed computing environment is in steady state. This is accomplished by discontinuing sending of group connectivity messages during a time period of no topology change within the distributed environment. Sending of group connectivity messages is reinitiated from at least one group leader upon identification by the group leader of at least one topology change within the distributed computing environment. Group connectivity messages are forwarded from one group of nodes on a first network to another group of nodes on a second network using a node common to both groups of nodes. The networks of the distributed computing environment can comprise heterogenous networks such that the topology propagation facility presented facilitates interoperability of the networks.

    摘要翻译: 提供拓扑传播设施用于在分布式计算环境中的不同节点处维护公共网络拓扑数据库。 当分布式计算环境处于稳定状态时,该设施不会生成消息流量。 这是通过在分布式环境中的无拓扑变化的时间段期间中止发送组连接消息来实现的。 组群连接消息的发送在分组计算环境中由群组领导确定至少一个拓扑变化后,从至少一个组长中重新启动。 组连接消息使用两个节点组共同的节点从第一网络上的一组节点转发到第二网络上的另一组节点。 分布式计算环境的网络可以包括异构网络,使得所呈现的拓扑传播设施有助于网络的互操作性。

    Reconfiguring a network by utilizing a predetermined length quiescent state
    14.
    发明授权
    Reconfiguring a network by utilizing a predetermined length quiescent state 有权
    通过利用预定长度的静态状态重新配置网络

    公开(公告)号:US07069320B1

    公开(公告)日:2006-06-27

    申请号:US09411515

    申请日:1999-10-04

    IPC分类号: G06F15/173

    摘要: A network having a plurality of nodes is reconfigured to reflect a change in topology of the network. In particular, upon receiving a reconfiguration request, each node enters a quiescent state for a predetermined period of time sufficient to allow at least one other node to also enter a quiescent state. Then, upon termination of the quiescent state, the node is reconfigured to reflect the change in the topology of the network without having to check with any other nodes of the network. In other embodiments, the predetermined period of time is sufficient to allow currently executing protocols to complete execution as well as to allow the transmission of reconfiguration requests for propagating reconfiguration in the network.

    摘要翻译: 重新配置具有多个节点的网络以反映网络拓扑的变化。 特别地,在接收到重新配置请求时,每个节点进入静态状态达预定时间段,足以允许至少一个其他节点也进入静止状态。 然后,在静止状态终止时,重新配置节点以反映网络拓扑的变化,而不必与网络的任何其他节点进行检查。 在其他实施例中,预定时间段足以允许当前执行的协议完成执行以及允许传输重新配置请求以在网络中传播重新配置。

    Fast node failure detection via disk based last gasp mechanism
    15.
    发明授权
    Fast node failure detection via disk based last gasp mechanism 失效
    基于最后一个喘气机制的快速节点故障检测

    公开(公告)号:US07937610B2

    公开(公告)日:2011-05-03

    申请号:US11741384

    申请日:2007-04-27

    IPC分类号: G06F11/00

    CPC分类号: G06F11/0709 G06F11/0757

    摘要: A method for communicating node liveness in a multinode data processing system employs an operating system function that provides a degree of self-awareness in “sensing” an imminent but still pending failure as the basis for providing special flag signals over a channel which employs nonvolatile storage in a heartbeat signaling path.

    摘要翻译: 用于在多节点数据处理系统中传送节点活动性的方法采用操作系统功能,其提供一定程度的自我意识,以“感测”迫在眉睫但仍然未决的故障,作为在使用非易失性存储器的信道上提供特殊标志信号的基础 在心跳信号通路。

    Method for managing cluster node-specific quorum roles
    16.
    发明授权
    Method for managing cluster node-specific quorum roles 有权
    管理集群节点特定仲裁角色的方法

    公开(公告)号:US07543046B1

    公开(公告)日:2009-06-02

    申请号:US12130080

    申请日:2008-05-30

    摘要: A method for managing node-specific quorum roles in a cluster of nodes. The method comprises assigning a configuration change of node role attributes to a node in the cluster, with each attribute being represented by a bit of a binary value. The binary value corresponding to the specified node role attributes is converted to an integer value and the integer value is stored in a node record associated with the node. The node record is accessed to isolate each bit of the integer value associated with the node and the node role attributes assigned to the node are identified from each of the isolated bits of the integer value associated with the node. A determination of whether the configuration change can be safely supported by a configuration quorum of nodes required to represent the cluster's configuration accurately is made based on the identified node role attributes. An additional determination of whether a required operational quorum of nodes permits access to critical and shared resources is made based on the identified node role attributes. Endorsement for the configuration change is sought from subsystems that may be affected by the configuration change. The identified node role attributes are propagated to other nodes within the cluster.

    摘要翻译: 在节点集群中管理节点特定仲裁角色的方法。 该方法包括将节点角色属性的配置改变分配给集群中的节点,每个属性由二进制值的位表示。 对应于指定节点角色属性的二进制值被转换为整数值,整数值存储在与节点相关联的节点记录中。 访问节点记录以隔离与节点相关联的整数值的每一位,并且从与节点相关联的整数值的每个孤立位识别分配给节点的节点角色属性。 基于所识别的节点角色属性,确定配置更改是否可以由正确表示集群配置所需的节点的配置数量安全地支持。 基于所识别的节点角色属性,进一步确定节点的所需操作定额是否允许访问关键和共享资源。 从可能受配置更改影响的子系统寻求配置更改的认可。 所识别的节点角色属性将传播到群集中的其他节点。

    Method for improving cluster bring-up in a distributed topology liveness system
    17.
    发明申请
    Method for improving cluster bring-up in a distributed topology liveness system 失效
    在分布式拓扑活动系统中改进集群提升的方法

    公开(公告)号:US20070121667A1

    公开(公告)日:2007-05-31

    申请号:US11290032

    申请日:2005-11-30

    申请人: John Hare Felipe Knop

    发明人: John Hare Felipe Knop

    CPC分类号: H04L45/02 H04L45/46

    摘要: A method is provided for establishing clusters in a distributed data processing environment having a plurality of nodes. The frequency of messages sent for the purpose of cluster formation is selected based on ranking of the nodes. Higher ranking nodes transmit such messages more frequently than lower ranking nodes. Lower ranked nodes thus are provided with an opportunity to join a cluster without first having to transmit their own messages to effectuate the joining.

    摘要翻译: 提供了一种在具有多个节点的分布式数据处理环境中建立簇的方法。 基于节点的排名选择发送用于群集形成目的的消息的频率。 较高等级的节点比较低等级的节点更频繁地发送这样的消息。 因此,较低等级的节点有机会加入群集,而无需先发送自己的消息来实现加入。

    Use of a storage medium as a communications network for liveness determination in a high-availability cluster
    19.
    发明授权
    Use of a storage medium as a communications network for liveness determination in a high-availability cluster 失效
    使用存储介质作为高可用性集群中的活动确定的通信网络

    公开(公告)号:US07003645B2

    公开(公告)日:2006-02-21

    申请号:US10323010

    申请日:2002-12-18

    IPC分类号: G06F12/06

    CPC分类号: H04L12/66

    摘要: Liveness determination in a multinode data processing system is enhanced through the use of a shared nonvolatile memory, typically a disk, which is utilized in conjunction with defined transmission protocols to provide an independent communication path usable for both the transmission of node liveness information and for other, more general, data transfer purposes. This path is considered as having the same degree of importance as other network paths and liveness indications from the shared memory path are likewise considered at the same time as liveness indications from another path or paths.

    摘要翻译: 通过使用共享的非易失性存储器(通常为磁盘)来增强多节点数据处理系统中的生命力确定,所述非易失性存储器通常与定义的传输协议一起使用,以提供可用于节点活动信息传输和其他用途的独立通信路径 ,更一般,数据传输的目的。 该路径被认为具有与其他网络路径相同程度的重要性,并且来自共享存储器路径的活动性指示同时被考虑在来自另一路径或路径的活动指示的同时。

    Method, system and program products for controlling system traffic of a clustered computing environment
    20.
    发明授权
    Method, system and program products for controlling system traffic of a clustered computing environment 失效
    用于控制集群计算环境的系统流量的方法,系统和程序产品

    公开(公告)号:US06925490B1

    公开(公告)日:2005-08-02

    申请号:US09583849

    申请日:2000-05-31

    IPC分类号: G06F9/46 G06F15/16

    CPC分类号: G06F9/465

    摘要: Various components are provided to manage a clustered environment. These components include a System Registry that provides a global data storage; a Configuration manager that stores data locally on nodes of the clustered environment and globally within the System Registry; a Liveness component to provide status of communications paths of the cluster; a Group Services component that provides services to one or more other components of the clustered environment; and a Resource Management component that communicates with one or more resource controllers of the clustered environment. Each of the components of the clustered environment has one or more data and/or functional dependencies on one or more other components of the environment. However, relationships between the components are created such that the data and functional dependencies form an acyclic graph (i.e., a cycle of dependency relationships is avoided).

    摘要翻译: 提供了各种组件来管理群集环境。 这些组件包括提供全局数据存储的系统注册表; 一个配置管理器,用于将数据本地存储在集群环境的节点上,并在系统注册表中全局; 提供集群通信路径状态的Liveness组件; 组服务组件,为集群环境的一个或多个其他组件提供服务; 以及与集群环境的一个或多个资源控制器进行通信的资源管理组件。 集群环境的每个组件对环境的一个或多个其他组件具有一个或多个数据和/或功能依赖性。 然而,创建组件之间的关系使得数据和功能依赖性形成非循环图(即,避免了依赖关系的循环)。