摘要:
To achieve consistent health service measures, a method and an apparatus combine various metrics from both internal sources and external sources that relate to the service under observation. This service health information is generated independently from specific provider applications and performance monitoring tool sets, thereby allowing shorter time-to-market for service management solutions. The output of the method may be in the form of a programmatic or scriptable interface to be used by high-level performance monitoring tools that are capable of reporting status of many disparate computer services. The performance monitoring tools may reside on different systems and architectures and may be supplied by different vendors. As a result, the programmatic or scriptable interfaces are designed to be generic and flexible.
摘要:
One embodiment includes a method for tracking usage of system components such that for each system component to be tracked the value of that component is measured on a successive time unit basis and the measured value is stored together with the number of successive time units that value repeats. Another embodiment includes a system for handling data representative of system conditions in which rapidly changing data values are received from at least one of the monitored sources such that each data value is representative of system conditions with respect to a small period of time; and such that the data is compressed while preserving the data values for each of the small periods of time.
摘要:
An apparatus and method to automatically activate a reserve resource when the load on a number of active resources (e.g., a number of CPUs or servers) meets a threshold. A resource usage policy specifying at least one threshold and having a corresponding task is compared to a monitored load on the active resource. When the monitored load meets the threshold as specified in the resource usage policy, the corresponding task is performed. For instance, the corresponding task can be to signal an event manager (e.g., a pager), to activate a reserve resource, etc. Once a reserve resource is activated, the load is balanced among the number of active resources and the activated reserve resource. When the load drops below the threshold, the processor can deactivate the reserve resource. In a preferred embodiment, the processor updates a configuration profile for tracking each of the resources. Another embodiment includes charging a fee for monitoring and activation.
摘要:
Apparatus, methods, and other embodiments associated with providing a correlation between a power distribution unit(s) and a device(s) are described. One example method includes storing first time series data that identifies, on a per power distribution unit (PDU) basis, current drawn from a set of PDUs. The example method may include storing second time series data that identifies, on a per device basis, power used by a set of related devices. With the two time series data available, the method may then provide a PDU-device correlation signal that identifies a correlation between current drawn from a PDU and power used by a device.
摘要:
Systems, methods, and other embodiments associated with event based correlation of power events are described. One example method includes storing a power distribution unit (PDU) event data that identifies an occurrence of a suspected power event associated with a device. The method can then provide a PDU-server correlation signal that identifies a connection between a PDU and a device.
摘要:
Apparatus, methods, and other embodiments associated with providing a correlation between a power distribution unit(s) and a device(s) are described. One example method includes storing first time series data that identifies, on a per power distribution unit (PDU) basis, current drawn from a set of PDUs. The example method may include storing second time series data that identifies, on a per device basis, power used by a set of related devices. With the two time series data available, the method may then provide a PDU-device correlation signal that identifies a correlation between current drawn from a PDU and power used by a device.
摘要:
Systems, methods, and other embodiments associated with event based correlation of power events are described. One example method includes storing a power distribution unit (PDU) event data that identifies an occurrence of a suspected power event associated with a device. The method can then provide a PDU-server correlation signal that identifies a connection between a PDU and a device.