摘要:
A method and system to add a storage controller in a SAN includes receiving a storage controller node to add to a logical storage controller in the storage area network having a logical nodename and a sequence of logical ports, adopting the logical nodename from the logical storage controller in place of the predetermined nodename associated with the storage controller and renumbering a set of ports associated with the storage controller to extend the sequence of logical ports associated with the logical storage controller. Removing the storage controller includes removing a storage controller node from the logical storage controller in the storage area network having a logical nodename and a sequence of logical ports, deleting the set of ports associated with the removed storage controller from the sequence of logical ports associated with the logical storage controller and renumbering the sequence of logical ports to accommodate the deleted ports.
摘要:
Failover processing in storage server system utilizes policies for managing fault tolerance (FT) and high availability (HA) configurations. The approach encapsulates the knowledge of failover recovery between components within a storage server and between storage server systems. This knowledge includes information about what components are participating in a Failover Set, how they are configured for failover, what is the Fail-Stop policy, and what are the steps to perform when “failing-over” a component.
摘要:
A cluster management system dynamically assigns a category to a server node based on the server node's attributes. The system can detect a change in value of a server attribute of a first server node. Then, in response to detecting the change in value of the server attribute, the system identifies a plurality of server attributes for the first server node, and dynamically assigns to the first server node a category that matches the server attributes. Further, the system can add the server node to a server pool that has a matching category with the server node.
摘要:
A clusterware manager on a cluster of nodes interprets a resource profile. The resource profile defines resource profile attributes. The attributes include at least one attribute that defines a cluster dependency based on resource type. The attribute does not identify any particular resource of that resource type. Dependencies between resources are managed based on the attribute that specifies the cluster dependency.
摘要:
A method for a self-testing clusterware agent is provided. A clusterware agent that includes clusterware-side components and application-side components is configured to interface between a cluster manager and an application. The application-side components are invoked by clusterware-side components via an application programming interface, or API that includes API functions that are invocable by a cluster manager. Without any cluster manager invoking the clusterware agent, one or more of the API functions are invoked.
摘要:
A cluster management system dynamically assigns a category to a server node based on the server node's attributes. The system can detect a change in value of a server attribute of a first server node. Then, in response to detecting the change in value of the server attribute, the system identifies a plurality of server attributes for the first server node, and dynamically assigns to the first server node a category that matches the server attributes. Further, the system can add the server node to a server pool that has a matching category with the server node.
摘要:
A method and apparatus is provided for automatically performing an operation for one or more resources of a computing cluster when a conditional dependency is satisfied. The conditional dependency may be based on the operating state, load, performance metric, or performance statistic of one or more other resources. A resource profile for a resource stores a conditional dependency that, when satisfied, causes a centralized policy engine to send a command to the resource or an agent for the resource. The policy engine receives notifications of operating state changes from agents that manage resources in the cluster. The policy engine determines that one or more conditional dependencies is satisfied when one or more resources change state to satisfy conditions specified by the conditional dependencies. The policy engine responds to detecting that a conditional dependency is satisfied by sending a command that causes the dependent resource to change its operating state.
摘要:
A method and apparatus for resource dispersion in a clusterware system is provided. The resource profiles for resources within a cluster of nodes are stored. Each resource profile includes attributes for a resource. One or more of the attributes in one of the resource profiles includes a dispersion dependency attribute that specifies that the resource is preferably not co-located with another resource. Resources are placed within the cluster based, at least in part, on the dispersion preference.
摘要:
A method and computer-readable storage representing resources in a cluster by a plurality of attribute-value pairs that together are part of a “resource profile,” in which each attribute-value pair defines all, or a portion of, a management policy that applies to the resource. Included is a built-in resource type called a “Local Resource” type. Local resources that are attached to the server are instantiated with a combination of the resource profile and values of the Local Resource type.
摘要:
A cluster management system can facilitate determining a response to a hypothetical event by a computer cluster. During operation, the system can receive a request for a policy outcome from a user, such that the request can indicate a hypothetical event within a computer cluster. Next, the system determines a cluster state that includes state information from a plurality of server nodes in the cluster. The system then generates a response plan to the hypothetical event based on the cluster state, without executing the response plan in the cluster. The response plan indicates actions that are to be performed by server nodes in the cluster in response to the hypothetical event. The system then provides the response plan to the user.