Dynamic cluster versioning for a group

    公开(公告)号:US07130897B2

    公开(公告)日:2006-10-31

    申请号:US09975442

    申请日:2001-10-11

    IPC分类号: G06F15/167 G06F9/44

    摘要: An apparatus, program product, and method update the cluster infrastructure version used by a group resident in a clustered computer system without requiring a shut down of the group during the update. The cluster infrastructure software in individual nodes in the clustered computer system is updated while the group is maintained in an active state. After the cluster infrastructure software is updated, the group is then notified of the update. In response to the notification, the cluster infrastructure version used by the group is dynamically updated to that of the updated cluster infrastructure software, thus making additional functions supported by the new version of the cluster infrastructure software available for use by all group members.

    Merge protocol for clustered computer system
    2.
    发明授权
    Merge protocol for clustered computer system 失效
    集群计算机系统合并协议

    公开(公告)号:US06968359B1

    公开(公告)日:2005-11-22

    申请号:US09638328

    申请日:2000-08-14

    CPC分类号: G06F9/5061 G06F2209/505

    摘要: An apparatus, program product and method utilize ordered messages in a clustered computer system to defer the execution of a merge protocol in a cluster group until all pending protocols in each partition of a group are handled, typically by ensuring either cancellation or completion of each pending protocol prior to execution of the merge protocol. From the perspective of each group member, the execution of the merge protocol is deferred by inhibiting processing of the merge request by such member until after processing of all earlier-received pending requests has been completed.

    摘要翻译: 装置,程序产品和方法利用集群计算机系统中的有序消息推迟集群组中的合并协议的执行,直到处理组中每个分区中的所有未决协议,通常通过确保取消或完成每个待处理 协议在执行合并协议之前。 从每个组成员的角度来看,合并协议的执行通过禁止这种成员的合并请求的处理来延迟,直到所有早先接收到的等待请求的处理已经完成为止。

    Apparatus and method for processing a task in a clustered computing environment
    3.
    发明授权
    Apparatus and method for processing a task in a clustered computing environment 失效
    用于在集群计算环境中处理任务的装置和方法

    公开(公告)号:US06625639B1

    公开(公告)日:2003-09-23

    申请号:US09438207

    申请日:1999-11-12

    IPC分类号: G06F900

    摘要: A clustered computer system includes multiple computer systems (or nodes) on a network that can become members of a group to work on a particular task. Each node includes group state data that represents the status of all members of the group. A group state data update mechanism in each node updates the group state data at acknowledge (ACK) rounds, so that all the group state data in all nodes are synchronized and identical if all members respond properly during the ACK round. Each node also includes a main thread and one or more work threads. The main thread receives messages from other computer systems in the group, and routes messages intended for the work thread to either a response queue or a work queue in the work thread, depending on the type of the message. If the message is a response to a currently-executing task, the message is placed in the response queue. Otherwise, the message is placed in the work queue for processing at a later time.

    摘要翻译: 集群计算机系统包括网络上的多个计算机系统(或节点),其可以成为组成特定任务的组成员。 每个节点包括表示组的所有成员的状态的组状态数据。 每个节点中的组状态数据更新机制在确认(ACK)循环中更新组状态数据,使得如果所有成员在ACK循环期间正确响应,则所有节点中的所有组状态数据都是同步的和相同的。 每个节点还包括主线程和一个或多个工作线程。 主线程从组中的其他计算机系统接收消息,并根据消息的类型将针对工作线程的消息路由到工作线程中的响应队列或工作队列。 如果消息是对当前正在执行的任务的响应,则该消息被放置在响应队列中。 否则,消息将放置在工作队列中,以便稍后进行处理。

    Group access privatization in clustered computer system
    4.
    发明授权
    Group access privatization in clustered computer system 失效
    集群计算机系统中的组访问私有化

    公开(公告)号:US08078754B2

    公开(公告)日:2011-12-13

    申请号:US12131475

    申请日:2008-06-02

    IPC分类号: G06F15/16

    摘要: An apparatus, clustered computer system, and program product rely on cluster-private group names to perform accesses to groups that are resident in a clustered computer system. Thus, for a cluster-accessible group, all nodes capable of participating in a cluster are configured to map to the same cluster-private group name for that group, so that any external user that has access to the clustered computer system can access the group name and utilize the group name to initiate operations by the group.

    摘要翻译: 设备,集群计算机系统和程序产品依赖于集群 - 专用组名来执行对驻留在群集计算机系统中的组的访问。 因此,对于集群可访问组,能够参与集群的所有节点都配置为映射到该组的相同集群 - 专用组名称,以便可以访问集群计算机系统的任何外部用户可以访问该组 名称并利用组名来启动组的操作。

    Group access privatization in clustered computer system
    5.
    发明授权
    Group access privatization in clustered computer system 失效
    集群计算机系统中的组访问私有化

    公开(公告)号:US07433957B2

    公开(公告)日:2008-10-07

    申请号:US09845596

    申请日:2001-04-30

    IPC分类号: G06F15/16 G06F15/173

    摘要: An apparatus, clustered computer system, program product and method rely on cluster-private group names to perform accesses to groups that are resident in a clustered computer system. Thus, for a cluster-accessible group, all nodes capable of participating in a cluster are configured to map to the same cluster-private group name for that group, so that any external user that has access to the clustered computer system can access the group name and utilize the group name to initiate operations by the group.

    摘要翻译: 设备,集群计算机系统,程序产品和方法依赖集群 - 专用组名来执行对驻留在群集计算机系统中的组的访问。 因此,对于集群可访问组,能够参与集群的所有节点都配置为映射到该组的相同集群 - 专用组名称,以便可以访问集群计算机系统的任何外部用户可以访问该组 名称并利用组名来启动组的操作。

    GROUP ACCESS PRIVATIZATION IN CLUSTERED COMPUTER SYSTEM
    6.
    发明申请
    GROUP ACCESS PRIVATIZATION IN CLUSTERED COMPUTER SYSTEM 失效
    集群计算机系统中的组访问私有化

    公开(公告)号:US20080235333A1

    公开(公告)日:2008-09-25

    申请号:US12131475

    申请日:2008-06-02

    IPC分类号: G06F15/16 G06F17/30

    摘要: An apparatus, clustered computer system, and program product rely on cluster-private group names to perform accesses to groups that are resident in a clustered computer system. Thus, for a cluster-accessible group, all nodes capable of participating in a cluster are configured to map to the same cluster-private group name for that group, so that any external user that has access to the clustered computer system can access the group name and utilize the group name to initiate operations by the group.

    摘要翻译: 设备,集群计算机系统和程序产品依赖于集群 - 专用组名来执行对驻留在群集计算机系统中的组的访问。 因此,对于集群可访问组,能够参与集群的所有节点都配置为映射到该组的相同集群 - 专用组名称,以便可以访问集群计算机系统的任何外部用户可以访问该组 名称并利用组名来启动组的操作。

    Group data sharing during membership change in clustered computer system
    7.
    发明授权
    Group data sharing during membership change in clustered computer system 失效
    集群计算机系统中成员资格更改期间的组数据共享

    公开(公告)号:US06839752B1

    公开(公告)日:2005-01-04

    申请号:US09697398

    申请日:2000-10-27

    IPC分类号: G06F15/173

    CPC分类号: G06F11/1479 G06F11/1425

    摘要: An apparatus, program product and method utilize subgroup-specific leader members to exchange group data between group members during the handling of a request to organize members into a group in a clustered computer system, e.g., when handling a membership change operation such as a merge or join. Such subgroup leaders may be determined locally within individual subgroup members, and moreover, the subgroup members may locally track the transmission status of group data for the various subgroups. Each subgroup includes one or more members that are known to store group data that is coherent among all subgroup members.

    摘要翻译: 一种装置,程序产品和方法利用子组特定的领导成员在处理组群成员到集群计算机系统中的组的请求期间在组成员之间交换组数据,例如,当处理诸如合并的成员变更操作 或加入。 这样的小组领导可以在各个子组成员内本地确定,而且,小组成员可以在本地跟踪各个子组的组数据的传输状态。 每个子组包括一个或多个已知存储在所有子组成员中相干的组数据的成员。

    Synchronization of group state data when rejoining a member to a primary-backup group in a clustered computer system
    8.
    发明授权
    Synchronization of group state data when rejoining a member to a primary-backup group in a clustered computer system 有权
    在将成员重新连接到群集计算机系统中的主备份组时,将组状态数据同步

    公开(公告)号:US07231461B2

    公开(公告)日:2007-06-12

    申请号:US09952392

    申请日:2001-09-14

    IPC分类号: G06F15/16

    摘要: An apparatus, program product and method to synchronize group state data in a primary-backup group in connection with the rejoining of a member to the primary-backup group in a clustered computer system. Each member in the group includes a copy of replicated group state data for the primary-backup group. In connection with rejoining the member, it is determined whether the rejoining member is the primary member for the primary-backup group. Then, a selection is made between member and group overwrite operations based upon such determination. The member overwrite operation includes overwriting the copy of the replicated group state data for the rejoining member with data from the copy of the replicated group state data for an existing member in the primary-backup group. The group overwrite operation includes overwriting the copy of the replicated group state data for the existing member in the primary-backup group with data from the copy of the replicated group state data for the rejoining member.

    摘要翻译: 一种用于将主备份组中的组状态数据与集群计算机系统中的主备份组重新连接相关联的装置,程序产品和方法。 组中的每个成员包括主备份组的复制组状态数据的副本。 关于重新加入成员,确定重新连接成员是否是主备份组的主要成员。 然后,基于这样的确定,在成员和组覆盖操作之间进行选择。 成员覆盖操作包括使用来自主备份组中现有成员的复制组状态数据的副本的数据覆盖重新连接成员的复制组状态数据的副本。 组重写操作包括使用来自重新连接成员的复制组状态数据的副本的数据覆盖主备份组中现有成员的复制组状态数据的副本。

    Apparatus and method for passively monitoring liveness of jobs in a clustered computing environment
    9.
    发明授权
    Apparatus and method for passively monitoring liveness of jobs in a clustered computing environment 失效
    在集群计算环境中被动地监视作业活动的装置和方法

    公开(公告)号:US06990668B1

    公开(公告)日:2006-01-24

    申请号:US09421585

    申请日:1999-10-20

    IPC分类号: G06F9/461

    摘要: An apparatus and method passively determine when a job in a clustered computing environment is dead. Each node in the cluster has a cluster engine for communicating between each job on the node and jobs on other nodes. A protocol is defined that includes one or more acknowledge (ACK) rounds, and that only performs local processing between ACK rounds. The protocol is executed by jobs that are members of a defined group. Each job in the group has one or more work threads that execute the protocol. In addition, each job has a main thread that communicates between the job and jobs on other nodes (through the cluster engine), routes appropriate messages from the cluster engine to a work thread, and signals to the cluster engine when a fault occurs when the work thread executes the protocol. By assuring that a dead job is reported to other members of the group, liveness information for group members can be monitored without the overhead associated with active liveness checking.

    摘要翻译: 设备和方法被动地确定群集计算环境中的作业何时死亡。 集群中的每个节点都有一个集群引擎,用于在节点上的每个作业和其他节点上的作业之间进行通信。 定义了包括一个或多个确认(ACK)轮,并且仅在ACK轮之间执行本地处理的协议。 协议由作为定义组成员的作业执行。 组中的每个作业都有一个或多个执行协议的工作线程。 此外,每个作业都有一个主线程,通过其他节点(通过群集引擎)在作业和作业之间进行通信,将适当的消息从群集引擎路由到工作线程,并在发生故障时向群集引擎发出信号 工作线程执行协议。 通过确保向组织的其他成员报告死亡的工作,可以监视组成员的活动信息,而不需要与主动活动检查相关的开销。