Abstract:
An apparatus, program product and method utilize hidden group membership to facilitate the processing of originator requests to a group in a clustered computer system. With hidden group membership, a requesting originator is temporarily joined to a group in such a manner that the originator is both hidden and provided with limited access rights, e.g., so that some of the messages sent by the members of a group when processing the request are neither sent to nor received by the originator.
Abstract:
Members of a primary-backup group in a clustered computer system are organized into subgroups to manage primary and backup resources being managed by the group. Group members are placed into subgroups based upon their access to particular resources, such that a primary subgroup may be defined comprised of members having access to a common primary resource, with one or more backup subgroups defined comprised of members having access to a common backup resource. A join protocol is used to determine to which of a plurality of resources managed by the primary-backup group a joining member has access, and to add the joining member to a subgroup for a resource to which the joining member has access.
Abstract:
An apparatus, program product and method utilize subgroup-specific leader members to exchange group data between group members during the handling of a request to organize members into a group in a clustered computer system, e.g., when handling a membership change operation such as a merge or join. Such subgroup leaders may be determined locally within individual subgroup members, and moreover, the subgroup members may locally track the transmission status of group data for the various subgroups. Each subgroup includes one or more members that are known to store group data that is coherent among all subgroup members.
Abstract:
An apparatus, program product and method support the dynamic modification of cluster communication parameters such as a fragmentation size parameter through controllably deferring the processing of a requested fragmentation size change in a source node until after receipt an acknowledgment message for at least one unacknowledged message sent by the source node to a plurality of target nodes. By controllably deferring such processing until it is confirmed that any such previously-unacknowledged messages sent by a source node have been received by any target nodes, synchronization between the source node and the target nodes may be obtained, and a fragmentation size change may occur in a coordinated fashion such that future messages from the source node to the target node will be processed by both the source and the target nodes using the modified fragmentation size parameter.
Abstract:
A computing node that functions as a member within a computing system group, such as a cluster, that has a status allowing receipt of group messages even though the node is not an active member of the cluster. The node is able to function as a primary member or as a backup member that controls redundant resources to be utilized in case of a failure. The computing node is able to have one of two status values, an “Active” status and an “Ineligible” status. Members that are able to function as a primary member have an “Active” status assigned, and a member that is not configured or otherwise eligible to perform as a primary member is assigned an “Ineligible” status. Members with an Ineligible status receive all group messages and therefore are able to become configured and eligible to become a primary member.
Abstract:
An apparatus, program product and method utilize ordered messages in a clustered computer system to defer the execution of a merge protocol in a cluster group until all pending protocols in each partition of a group are handled, typically by ensuring either cancellation or completion of each pending protocol prior to execution of the merge protocol. From the perspective of each group member, the execution of the merge protocol is deferred by inhibiting processing of the merge request by such member until after processing of all earlier-received pending requests has been completed.
Abstract:
Members of a primary-backup group in a clustered computer system are organized into subgroups to manage primary and backup resources being managed by the group. Group members are placed into subgroups based upon their access to particular resources, such that a primary subgroup may be defined comprised of members having access to a common primary resource, with one or more backup subgroups defined comprised of members having access to a common backup resource. A join protocol is used to determine to which of a plurality of resources managed by the primary-backup group a joining member has access, and to add the joining member to a subgroup for a resource to which the joining member has access.
Abstract:
An apparatus, program product and method support the dynamic modification of cluster communication parameters through a distributed protocol whereby individual nodes locally confirm initiation and status information for every node participating in a parameter modification operation. By doing so, individual nodes are also able to locally determine the need to undo locally-performed parameter modifications should any other node be incapable of performing a parameter modification. Moreover, specifically with respect to cluster communication parameters such as heartbeat parameters, such parameters may be dynamically modified by configuring a sending node to send a heartbeat message to a receiving node, with the heartbeat message indicating that a heartbeat parameter is to be modified. In response to the heartbeat message, the receiving node may then send an acknowledgment message to the sending node that indicates whether the heartbeat parameter has been modified in the receiving node. Further, modification of the heartbeat parameter in the sending node may be deferred until the acknowledgment message from the receiving node indicates that the heartbeat parameter has been modified in the receiving node.
Abstract:
An apparatus, program product and method support the dynamic modification of cluster communication parameters such as a fragmentation size parameter through controllably deferring the processing of a requested fragmentation size change in a source node until after receipt an acknowledgment message for at least one unacknowledged message sent by the source node to a plurality of target nodes. By controllably deferring such processing until it is confirmed that any such previously-unacknowledged messages sent by a source node have been received by any target nodes, synchronization between the source node and the target nodes may be obtained, and a fragmentation size change may occur in a coordinated fashion such that future messages from the source node to the target node will be processed by both the source and the target nodes using the modified fragmentation size parameter.
Abstract:
Method and apparatus for validating and ranking of resources that may be switched between a primary system and one or more backup systems at a single site. One embodiment provides a method for ensuring accessibility of one or more disk units by a system, comprising: configuring a disk pool for the system; validating availability of the one or more disk units for the disk pool; verifying that the disk units are at the same site as the system, and selecting one or more valid disk units for the disk pool. The method may further comprise ranking of each disk unit for the disk pool and selecting one or more valid disk units for the disk pool according to ranking.