摘要:
A method of sampling sensor data from a computing system is presented. The computing system includes a plurality of components and a sensor network for monitoring the computing system. The sensor network includes primary sensor nodes operable to obtain primary parameter data from a measurement of a primary parameter of the components, and secondary sensor nodes operable to obtain secondary parameter data from a measurement of secondary parameters of the components. The method includes: a) obtaining secondary parameter data from secondary sensor nodes relating to components; b) processing, in a computing device, the secondary parameter data; c) determining, based upon determined or pre-determined relationships between the secondary parameters and the primary parameter, a sample rate for the primary parameter data for the components; and d) obtaining primary parameter data from the primary sensor nodes relating to components at the determined sample rate.
摘要:
A method of sampling sensor data from a computing system is presented. The computing system includes a plurality of components and a sensor network for monitoring the computing system. The sensor network includes primary sensor nodes operable to obtain primary parameter data from a measurement of a primary parameter of the components, and secondary sensor nodes operable to obtain secondary parameter data from a measurement of secondary parameters of the components. The method includes: a) obtaining secondary parameter data from secondary sensor nodes relating to components; b) processing, in a computing device, the secondary parameter data; c) determining, based upon determined or pre-determined relationships between the secondary parameters and the primary parameter, a sample rate for the primary parameter data for the components; and d) obtaining primary parameter data from the primary sensor nodes relating to components at the determined sample rate.
摘要:
A method of monitoring the performance of a device is presented, each storage device being operable to receive storage requests and includes a queue for incoming storage requests and a physical storage medium, the method includes: a) obtaining data relating to the device parameters of: throughput of storage requests to the storage device; queue time for one or more requests on the storage device; and the time required to complete one or more storage requests on the storage device; b) generating a model relating to the expected distribution of the device parameters on the storage device during normal operation; c) monitoring the parameters during operation of the storage device; d) comparing the distribution of the monitored parameters during operation of the storage device with the expected distribution of the parameters; and e) identifying a fault on the storage device based on the comparison.