Handling restart attempts for high availability managed resources
    1.
    发明授权
    Handling restart attempts for high availability managed resources 有权
    处理高可用性管理资源的重新启动尝试

    公开(公告)号:US07496789B2

    公开(公告)日:2009-02-24

    申请号:US11146531

    申请日:2005-06-06

    IPC分类号: G06F11/00

    CPC分类号: G06F11/0793 G06F11/0709

    摘要: Techniques are provided for managing a resource in a High Availability (HA) system. The techniques involve incrementing a count when a particular type of remedial action is performed on a resource, so that the count that reflects how often the particular type of remedial action has been performed for the resource. When it is determined that the resource has been in stable operation, the count is automatically reduced. After a failure, the count is used to determine whether to attempt to perform the particular type of remedial action on the resource. Examples of remedial actions include restarting the resource, and relocating the resource to another node of a cluster. By using the count, the system insures that a faulty resource does not get constantly “bounced”. By reducing the count when a resource has become stable, there is less likelihood that failure of otherwise stable resources will require manual intervention.

    摘要翻译: 提供了用于管理高可用性(HA)系统中的资源的技术。 这些技术涉及在对资源执行特定类型的补救措施时增加计数,以便反映针对资源执行特定类型的补救措施的次数。 当确定资源处于稳定运行状态时,计数自动减少。 失败后,计数用于确定是否尝试对资源执行特定类型的补救措施。 补救措施的示例包括重新启动资源,以及将资源重定位到群集的另一个节点。 通过使用计数,系统确保错误的资源不会不断“反弹”。 通过在资源变得稳定的情况下减少计数,则不太可能需要手动干预资源的稳定。

    Handling restart attempts for high availability managed resources

    公开(公告)号:US20060277429A1

    公开(公告)日:2006-12-07

    申请号:US11146531

    申请日:2005-06-06

    IPC分类号: G06F11/00

    CPC分类号: G06F11/0793 G06F11/0709

    摘要: Techniques are provided for managing a resource in a High Availability (HA) system. The techniques involve incrementing a count when a particular type of remedial action is performed on a resource, so that the count that reflects how often the particular type of remedial action has been performed for the resource. When it is determined that the resource has been in stable operation, the count is automatically reduced. After a failure, the count is used to determine whether to attempt to perform the particular type of remedial action on the resource. Examples of remedial actions include restarting the resource, and relocating the resource to another node of a cluster. By using the count, the system insures that a faulty resource does not get constantly “bounced”. By reducing the count when a resource has become stable, there is less likelihood that failure of otherwise stable resources will require manual intervention.

    DEPENDENCY ON A RESOURCE TYPE
    3.
    发明申请
    DEPENDENCY ON A RESOURCE TYPE 有权
    资源类型依赖

    公开(公告)号:US20110179419A1

    公开(公告)日:2011-07-21

    申请号:US12688739

    申请日:2010-01-15

    IPC分类号: G06F9/46

    CPC分类号: G06F9/5061

    摘要: A clusterware manager on a cluster of nodes interprets a resource profile. The resource profile defines resource profile attributes. The attributes include at least one attribute that defines a cluster dependency based on resource type. The attribute does not identify any particular resource of that resource type. Dependencies between resources are managed based on the attribute that specifies the cluster dependency.

    摘要翻译: 节点集群上的集群管理器解释资源配置文件。 资源配置文件定义资源配置文件属性。 属性包括至少一个基于资源类型定义集群依赖关系的属性。 该属性不标识该资源类型的任何特定资源。 基于指定集群依赖关系的属性来管理资源之间的依赖关系。

    Self-testable HA framework library infrastructure
    4.
    发明授权
    Self-testable HA framework library infrastructure 有权
    自检HA框架库基础设施

    公开(公告)号:US09069619B2

    公开(公告)日:2015-06-30

    申请号:US12688735

    申请日:2010-01-15

    摘要: A method for a self-testing clusterware agent is provided. A clusterware agent that includes clusterware-side components and application-side components is configured to interface between a cluster manager and an application. The application-side components are invoked by clusterware-side components via an application programming interface, or API that includes API functions that are invocable by a cluster manager. Without any cluster manager invoking the clusterware agent, one or more of the API functions are invoked.

    摘要翻译: 提供了一种自检集群代理的方法。 包括集群件组件和应用程序侧组件的集群代理配置为在集群管理器和应用程序之间进行接口。 应用程序侧组件通过应用程序编程接口或包含可由集群管理器调用的API函数的API调用集群件组件。 没有任何集群管理器调用集群件代理,一个或多个API函数被调用。

    Dependency on a resource type
    5.
    发明授权
    Dependency on a resource type 有权
    对资源类型的依赖

    公开(公告)号:US08438573B2

    公开(公告)日:2013-05-07

    申请号:US12688739

    申请日:2010-01-15

    IPC分类号: G06F9/46 G06F15/173

    CPC分类号: G06F9/5061

    摘要: A clusterware manager on a cluster of nodes interprets a resource profile. The resource profile defines resource profile attributes. The attributes include at least one attribute that defines a cluster dependency based on resource type. The attribute does not identify any particular resource of that resource type. Dependencies between resources are managed based on the attribute that specifies the cluster dependency.

    摘要翻译: 节点集群上的集群管理器解释资源配置文件。 资源配置文件定义资源配置文件属性。 属性包括至少一个基于资源类型定义集群依赖关系的属性。 该属性不标识该资源类型的任何特定资源。 基于指定集群依赖关系的属性来管理资源之间的依赖关系。

    Automated integrated high availability of the in-memory database cache and the backend enterprise database
    6.
    发明授权
    Automated integrated high availability of the in-memory database cache and the backend enterprise database 有权
    自动集成高可用性的内存数据库缓存和后端企业数据库

    公开(公告)号:US08306951B2

    公开(公告)日:2012-11-06

    申请号:US12719264

    申请日:2010-03-08

    IPC分类号: G06F17/30

    摘要: A cluster manager is configured to manage a plurality of copies of a mid-tier database as a mid-tier database cluster. The cluster manager may concurrently manage a backend database system. The cluster manager is configured to monitor for and react to failures of mid-tier database nodes. The cluster manager may react to a mid-tier database failure by, for example, assigning a new active node, creating a new standby node, creating new copies of the mid-tier databases, implementing new replication or backup schemes, reassigning the node's virtual address to another node, or relocating applications that were directly linked to the mid-tier database to another host. Each node or an associated agent may configure the cluster manager to behave in this fashion during initialization, based on common cluster configuration information. Each copy of the mid-tier database may be, for example, a memory resident database. Thus, a node must reload the entire database into memory to recover a copy of the database.

    摘要翻译: 集群管理器被配置为将中间层数据库的多个副本作为中间层数据库集群进行管理。 集群管理器可以同时管理后端数据库系统。 集群管理器配置为监视中层数据库节点的故障并对其做出反应。 集群管理器可以通过例如分配新的活动节点,创建新的备用节点,创建中间层数据库的新副本,实现新的复制或备份方案来对中间层数据库故障作出反应,重新分配节点的虚拟 地址到另一个节点,或将直接链接到中间层数据库的应用程序重新定位到另一个主机。 基于常见的集群配置信息,每个节点或相关联的代理可以在初始化期间将集群管理器配置为以这种方式行事。 中间层数据库的每个副本可以是例如内存驻留数据库。 因此,节点必须将整个数据库重新加载到内存中以恢复数据库的副本。

    AUTOMATED INTEGRATED HIGH AVAILABILITY OF THE IN-MEMORY DATABASE CACHE AND THE BACKEND ENTERPRISE DATABASE
    7.
    发明申请
    AUTOMATED INTEGRATED HIGH AVAILABILITY OF THE IN-MEMORY DATABASE CACHE AND THE BACKEND ENTERPRISE DATABASE 有权
    内存数据库高速缓存和后端企业数据库的自动化高可用性

    公开(公告)号:US20110071981A1

    公开(公告)日:2011-03-24

    申请号:US12719264

    申请日:2010-03-08

    IPC分类号: G06F17/30

    摘要: A cluster manager is configured to manage a plurality of copies of a mid-tier database as a mid-tier database cluster. The cluster manager may concurrently manage a backend database system. The cluster manager is configured to monitor for and react to failures of mid-tier database nodes. The cluster manager may react to a mid-tier database failure by, for example, assigning a new active node, creating a new standby node, creating new copies of the mid-tier databases, implementing new replication or backup schemes, reassigning the node's virtual address to another node, or relocating applications that were directly linked to the mid-tier database to another host. Each node or an associated agent may configure the cluster manager to behave in this fashion during initialization, based on common cluster configuration information. Each copy of the mid-tier database may be, for example, a memory resident database. Thus, a node must reload the entire database into memory to recover a copy of the database.

    摘要翻译: 集群管理器被配置为将中间层数据库的多个副本作为中间层数据库集群进行管理。 集群管理器可以同时管理后端数据库系统。 集群管理器配置为监视中层数据库节点的故障并对其做出反应。 集群管理器可以通过例如分配新的活动节点,创建新的备用节点,创建中间层数据库的新副本,实现新的复制或备份方案来对中间层数据库故障作出反应,重新分配节点的虚拟 地址到另一个节点,或将直接链接到中间层数据库的应用程序重新定位到另一个主机。 基于常见的集群配置信息,每个节点或相关联的代理可以在初始化期间将集群管理器配置为以这种方式行事。 中间层数据库的每个副本可以是例如内存驻留数据库。 因此,节点必须将整个数据库重新加载到内存中以恢复数据库的副本。

    SELF-TESTABLE HA FRAMEWORK LIBRARY INFRASTRUCTURE
    8.
    发明申请
    SELF-TESTABLE HA FRAMEWORK LIBRARY INFRASTRUCTURE 有权
    自我测试框架图书馆基础设施

    公开(公告)号:US20110179428A1

    公开(公告)日:2011-07-21

    申请号:US12688735

    申请日:2010-01-15

    IPC分类号: G06F9/46

    摘要: A method for a self-testing clusterware agent is provided. A clusterware agent that includes clusterware-side components and application-side components is configured to interface between a cluster manager and an application. The application-side components are invoked by clusterware-side components via an application programming interface, or API that includes API functions that are invocable by a cluster manager. Without any cluster manager invoking the clusterware agent, one or more of the API functions are invoked.

    摘要翻译: 提供了一种自检集群代理的方法。 包括集群件组件和应用程序侧组件的集群代理配置为在集群管理器和应用程序之间进行接口。 应用程序侧组件通过应用程序编程接口或包含可由集群管理器调用的API函数的API调用集群件组件。 没有任何集群管理器调用集群件代理,一个或多个API函数被调用。

    Method to avoid continuous application failovers in a cluster
    9.
    发明申请
    Method to avoid continuous application failovers in a cluster 有权
    避免集群中连续应​​用程序故障转移的方法

    公开(公告)号:US20080244307A1

    公开(公告)日:2008-10-02

    申请号:US11728663

    申请日:2007-03-26

    IPC分类号: G06F11/00

    CPC分类号: G06F11/1482 G06F11/2028

    摘要: A method and mechanism for failing over applications in a clustered computing system is provided. In an embodiment, the methodology is implemented by a high-availability failover mechanism. Upon detecting a failure of an application that is currently designated to be executing on a particular node of the system, the mechanism may attempt to failover the application onto a different node. The mechanism keeps track of a number of nodes on which a failover of the application is attempted. Then, based on one or more factors including the number of nodes on which a failover of the application is attempted, the mechanism may cease to attempt to failover the application onto a node of the system.

    摘要翻译: 提供了一种在集群计算系统中失败应用程序的方法和机制。 在一个实施例中,该方法由高可用性故障切换机制来实现。 当检测到当前被指定为在系统的特定节点上执行的应用程序的故障时,机制可以尝试将应用故障转移到不同的节点上。 该机制跟踪尝试应用程序故障转移的多个节点。 然后,基于一个或多个因素,包括尝试应用程序的故障切换的节点数,该机制可能会停止将应用程序故障转移到系统的节点上。

    Method to avoid continuous application failovers in a cluster
    10.
    发明授权
    Method to avoid continuous application failovers in a cluster 有权
    避免集群中连续应​​用程序故障转移的方法

    公开(公告)号:US07802128B2

    公开(公告)日:2010-09-21

    申请号:US11728663

    申请日:2007-03-26

    IPC分类号: G06F11/00

    CPC分类号: G06F11/1482 G06F11/2028

    摘要: A method and mechanism for failing over applications in a clustered computing system is provided. In an embodiment, the methodology is implemented by a high-availability failover mechanism. Upon detecting a failure of an application that is currently designated to be executing on a particular node of the system, the mechanism may attempt to failover the application onto a different node. The mechanism keeps track of a number of nodes on which a failover of the application is attempted. Then, based on one or more factors including the number of nodes on which a failover of the application is attempted, the mechanism may cease to attempt to failover the application onto a node of the system.

    摘要翻译: 提供了一种在集群计算系统中失败应用程序的方法和机制。 在一个实施例中,该方法由高可用性故障切换机制来实现。 当检测到当前被指定为在系统的特定节点上执行的应用程序的故障时,机制可以尝试将应用故障转移到不同的节点上。 该机制跟踪尝试应用程序故障转移的多个节点。 然后,基于一个或多个因素,包括尝试应用程序的故障切换的节点数,该机制可能会停止将应用程序故障转移到系统的节点上。