摘要:
A distributed computing system conforms to a multi-level, hierarchical organizational model. One or more control nodes provide for the efficient and automated allocation and management of computing functions and resources within the distributed computing system in accordance with the organization model. A user, such as a system administrator, interacts with the control nodes to logically define the hierarchical organization of distributed computing system. The control node includes an automation subsystem having one or more rule engines that provide autonomic control of the application nodes in accordance with a set of one or more rules. A pluggable, application-specific application governor is selected and installed within the control node to provide an application-independent interface through which the rule engines interact to control the deployment, execution and monitoring of the applications within the distributed computing system.
摘要:
In general, the techniques of this invention are directed to determining whether a component failure in a distributed computing system is genuine. In particular, embodiments of this invention analyze monitoring data from other application nodes in a distributed computing system to determine whether the component failure is genuine. If the component failure is not genuine, the embodiments may adjust a fault tolerance parameter that caused the component failure to be perceived.
摘要:
A distributed computing system conforms to a multi-level, hierarchical organizational model. One or more control nodes provide for the efficient and automated allocation and management of computing functions and resources within the distributed computing system in accordance with the organization model. A user, such as a system administrator, interacts with the control nodes to logically define the hierarchical organization of distributed computing system. The control node includes an automation subsystem having one or more rule engines that provide autonomic control of the application nodes in accordance with a set of one or more rules. A pluggable, application-specific application matrix is selected and installed within the control node. The application matrix contains a logical definition of the applications, and parameters for controlling the deployment of the applications within the distributed computing system.
摘要:
In general, the techniques of this invention are directed to determining whether a component failure in a distributed computing system is genuine. In particular, embodiments of this invention analyze monitoring data from other application nodes in a distributed computing system to determine whether the component failure is genuine. If the component failure is not genuine, the embodiments may adjust a fault tolerance parameter that caused the component failure to be perceived.
摘要:
In general, the techniques of this invention are directed to autonomic management of autonomic management systems. In particular, the embodiments of this invention use a measure, analyze, and respond model to autonomically manage one or more autonomic management systems. By understanding specific state information of these autonomic management systems, embodiments of the invention may achieve target performance for the autonomic management systems through operations monitoring, analyzing current system state against target state, and modifying the configurations or resources of the autonomic management systems.
摘要:
Systems and methods include receiving notification of an anomaly in a particular component within a data center environment. Such systems and methods include receiving configuration information identifying positions of tags disposed proximate to and representing respective infrastructure components within the data center environment. Such systems and methods include detecting a first tag representing a first infrastructure component and, in response thereto, determining, using the configuration information, a first positional relationship between the first tag and a second tag representing the particular component. Such systems and methods include determining a second positional relationship between the mobile device and the particular component based on the first positional relationship and providing, based on the second positional relationship, directional guidance from the first infrastructure component to the particular component. Such systems and methods include detecting, by the sensor, the second tag and, in response thereto, providing information identifying the particular component.
摘要:
In general, the techniques of this invention are directed to determining whether a component failure in a distributed computing system is genuine. In particular, embodiments of this invention analyze monitoring data from other application nodes in a distributed computing system to determine whether the component failure is genuine. If the component failure is not genuine, the embodiments may adjust a fault tolerance parameter that caused the component failure to be perceived.
摘要:
A distributed computing system conforms to a multi-level, hierarchical organizational model. One or more control nodes provide for the efficient and automated allocation and management of computing functions and resources within the distributed computing system in accordance with the organization model. The model includes four distinct levels: fabric, domains, tiers and nodes that provide for the logical abstraction and containment of the physical components as well as system and service application software of the enterprise. A user, such as a system administrator, interacts with the control nodes to logically define the hierarchical organization of distributed computing system. The control node includes an automation subsystem having one or more rule engines that provide autonomic control of the application nodes in accordance with a set of one or more rules.
摘要:
Systems and methods include receiving notification of an anomaly in a particular component within a data center environment. Such systems and methods include receiving configuration information identifying positions of tags disposed proximate to and representing respective infrastructure components within the data center environment. Such systems and methods include detecting a first tag representing a first infrastructure component and, in response thereto, determining, using the configuration information, a first positional relationship between the first tag and a second tag representing the particular component. Such systems and methods include determining a second positional relationship between the mobile device and the particular component based on the first positional relationship and providing, based on the second positional relationship, directional guidance from the first infrastructure component to the particular component. Such systems and methods include detecting, by the sensor, the second tag and, in response thereto, providing information identifying the particular component.
摘要:
In general, the techniques of this invention are directed to autonomic management of autonomic management systems. In particular, the embodiments of this invention use a measure, analyze, and respond model to autonomically manage one or more autonomic management systems. By understanding specific state information of these autonomic management systems, embodiments of the invention may achieve target performance for the autonomic management systems through operations monitoring, analyzing current system state against target state, and modifying the configurations or resources of the autonomic management systems.