Abstract:
One embodiment of the present invention provides a system that estimates the relative humidity inside a computer system. During operation, a set of performance parameters of the computer system and an external relative humidity outside of the computer system are monitored. Then, the relative humidity inside the computer system is estimated based on the set of performance parameters, the external relative humidity, and a relative humidity model, wherein training of the relative humidity model includes measuring an external training relative humidity outside of the computer system and a training relative humidity inside the computer system while monitoring the set of performance parameters of the computer system.
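The train-then-estimate flow described above can be sketched as follows. This is only an illustration: the abstract does not say what form the relative humidity model takes, so the ordinary least-squares linear model, the function names, and the choice of a single performance parameter (a temperature reading) are all assumptions.

```python
def solve_linear(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def train_rh_model(features, internal_rh):
    """Fit a linear model (assumed form) by least squares.

    Each feature row is (external_rh, param_1, ...), recorded while the
    internal relative humidity was measured during training.
    """
    rows = [[1.0] + list(f) for f in features]  # prepend intercept term
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(rows, internal_rh)) for i in range(k)]
    return solve_linear(xtx, xty)


def estimate_rh(coeffs, feature_row):
    """Estimate internal relative humidity without an internal sensor."""
    return coeffs[0] + sum(c * v for c, v in zip(coeffs[1:], feature_row))


# Hypothetical training data: (external RH %, component temperature in C)
train_x = [(40, 50), (45, 60), (50, 55), (55, 70), (60, 52)]
train_y = [5 + 0.8 * e - 0.3 * t for e, t in train_x]  # synthetic targets
model = train_rh_model(train_x, train_y)
print(estimate_rh(model, (48, 58)))
```

After training, only the external humidity and the monitored parameters are needed at run time, which matches the abstract's point that the internal humidity is estimated rather than measured.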
Abstract:
Some embodiments provide a system that analyzes telemetry data from a computer system. During operation, the system obtains the telemetry data as a set of telemetric signals from the computer system and validates the telemetric signals using a nonlinear, nonparametric regression technique. Next, the system assesses the integrity of a power supply unit (PSU) in the computer system by comparing the telemetric signals to one or more reference telemetric signals associated with the computer system. If the assessed integrity falls below a threshold, the system performs a remedial action for the computer system.
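As a rough illustration of the comparison against reference signals, the sketch below uses Nadaraya-Watson kernel regression, which is one nonlinear, nonparametric regression technique; the abstract does not name the technique it uses. The signal pairing (load current versus output voltage), the function names, the scoring rule, and all numeric values are hypothetical.

```python
import math


def kernel_estimate(x_ref, y_ref, x, bandwidth):
    """Nadaraya-Watson estimate of y at x from reference pairs (Gaussian kernel)."""
    weights = [math.exp(-0.5 * ((x - xi) / bandwidth) ** 2) for xi in x_ref]
    total = sum(weights)
    if total == 0.0:
        raise ValueError("x is too far from every reference point")
    return sum(w * yi for w, yi in zip(weights, y_ref)) / total


def integrity_score(x_ref, y_ref, x_obs, y_obs, bandwidth, tolerance):
    """Fraction of observations within `tolerance` of the reference estimate.

    This scoring rule is an assumption made for illustration only.
    """
    within = 0
    for x, y in zip(x_obs, y_obs):
        expected = kernel_estimate(x_ref, y_ref, x, bandwidth)
        if abs(y - expected) <= tolerance:
            within += 1
    return within / len(x_obs)


# Hypothetical reference telemetry: load current (A) vs. output voltage (V)
ref_current = [float(i) for i in range(11)]
ref_voltage = [12.0 - 0.02 * i for i in ref_current]

observed_voltage = [v - 1.0 for v in ref_voltage]  # simulated degradation
score = integrity_score(ref_current, ref_voltage,
                        ref_current, observed_voltage,
                        bandwidth=0.5, tolerance=0.2)
if score < 0.9:  # hypothetical integrity threshold
    print("integrity below threshold: trigger remedial action")
```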
Abstract:
Embodiments of a computer system that includes a vibration-cancelling mode, and a related method and computer-program product (e.g., software) for use with the computer system, are described. During operation, a processor monitors operations in the computer system, and may select either the vibration-cancelling mode or an inactive mode based on the monitored operations. For example, the processor may select the vibration-cancelling mode when there are input/output (I/O)-intensive workloads to an array of one or more hard disk drives (HDDs) in the computer system. In this way, the processor may reduce the energy consumption associated with vibration-induced retries to the HDDs (and reduced I/O throughput) without increasing the energy consumption associated with active vibration damping at other times, such as when the computer system is idle or during processor-intensive workloads.
Abstract:
A system for generating a power consumption model of at least one server includes one or more computers configured to obtain n time series telemetry signals indicative of operating parameters of the at least one server, obtain a time series power signal indicative of power consumed by the at least one server, and correlate each of the n time series telemetry signals with the time series power signal. The one or more computers are further configured to select a set of the n time series telemetry signals having an overall correlation with the time series power signal greater than a predetermined threshold, and generate a power consumption model of the at least one server based on at least the set of the n time series telemetry signals.
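The correlate-and-select step can be sketched as below. The abstract speaks of a set of signals with an "overall correlation" above the threshold; this sketch makes the simplifying assumption that each signal is screened individually by the absolute value of its Pearson correlation with the power signal. The signal names and data are hypothetical, and the final model-fitting step is not shown.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length series."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0  # a constant series carries no correlation information
    return cov / math.sqrt(vx * vy)


def select_signals(signals, power, threshold):
    """Names of telemetry signals whose |r| with power exceeds threshold."""
    return [name for name, series in signals.items()
            if abs(pearson(series, power)) > threshold]


# Hypothetical time series sampled at the same instants
power = [100, 120, 140, 160, 180]          # watts
telemetry = {
    "cpu_util": [10, 20, 30, 40, 50],      # tracks power closely
    "fan_rpm": [3000, 3000, 3000, 3000, 3000],
    "misc": [5, 1, 4, 2, 3],
}
print(select_signals(telemetry, power, threshold=0.8))
```

The selected signals would then serve as inputs to whatever regression the power consumption model uses.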
Abstract:
One embodiment provides a system that analyzes telemetry data from a computer system. During operation, the system periodically obtains the telemetry data from the computer system. Next, the system preprocesses the telemetry data using a sequential-analysis technique. If a statistical deviation is found in the telemetry data using the sequential-analysis technique, the system identifies a subset of the telemetry data associated with the statistical deviation and applies a root-cause-analysis technique to the subset of the telemetry data to determine a source of the statistical deviation. Finally, the system uses the source of the statistical deviation to perform a remedial action for the computer system, which involves correcting a fault in the computer system corresponding to the source of the statistical deviation.
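One well-known sequential-analysis technique is Wald's sequential probability ratio test (SPRT); the abstract does not say which technique is used, so treating it as SPRT is an assumption. The minimal sketch below tests for a shift in the mean of a Gaussian signal and reports the sample index at which a deviation is declared. The function name and parameter values are illustrative.

```python
import math


def sprt_first_alarm(samples, mu0, mu1, sigma, alpha=0.01, beta=0.01):
    """Index of the first sample at which the SPRT accepts H1, or None.

    H0: samples ~ N(mu0, sigma^2) (normal behavior)
    H1: samples ~ N(mu1, sigma^2) (shifted mean)
    alpha and beta are the target false-alarm and missed-alarm rates.
    """
    upper = math.log((1 - beta) / alpha)   # accept H1 at or above this
    lower = math.log(beta / (1 - alpha))   # accept H0 at or below this
    llr = 0.0
    for i, x in enumerate(samples):
        # Log-likelihood ratio increment for a Gaussian mean shift
        llr += (mu1 - mu0) / sigma ** 2 * (x - (mu0 + mu1) / 2)
        if llr >= upper:
            return i
        if llr <= lower:
            llr = 0.0  # H0 accepted; restart the test
    return None


# Hypothetical telemetry: nominal readings followed by a step change
readings = [0.0] * 20 + [1.0] * 20
alarm = sprt_first_alarm(readings, mu0=0.0, mu1=1.0, sigma=0.5)
print(alarm)
```

In the flow the abstract describes, the samples around the alarm index would form the subset of telemetry data passed on to root-cause analysis.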
Abstract:
One embodiment of the present invention provides a system that mitigates the effects of multiple vibration sources on a set of hard disk drives (HDDs) within a computer system. During operation, the system identifies a target HDD in the set of HDDs, wherein the performance of the target HDD is affected by mechanical vibrations. The system also identifies one or more primary vibration sources from the multiple vibration sources that affect the performance of the target HDD. Next, for each of the primary vibration sources, the system measures a first time-domain signal associated with the operation of the primary vibration source using a first vibration transducer associated with the primary vibration source. The system also measures a second time-domain signal associated with the target HDD using a second vibration transducer associated with the target HDD. Then, for each of the primary vibration sources, the system computes a cross-power-spectral-density (CPSD) between the first and the second time-domain signals. Finally, the system selectively mitigates the primary vibration sources based on the CPSDs between the primary vibration sources and the target HDD.
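The CPSD computation can be illustrated with a single-segment periodogram estimate built on a direct discrete Fourier transform. This is a simplified sketch: it omits the windowing and segment averaging (Welch's method) that a practical estimator would normally use, and it does not double the one-sided spectrum. Ranking sources by peak CPSD magnitude is an assumed selection rule, and the source names and signals are hypothetical.

```python
import cmath
import math


def dft(x):
    """Direct DFT for non-negative frequency bins (O(n^2), fine for short signals)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n // 2 + 1)]


def cpsd(x, y, fs):
    """Single-segment periodogram estimate of the cross-power spectral density."""
    if len(x) != len(y):
        raise ValueError("signals must have equal length")
    n = len(x)
    fx, fy = dft(x), dft(y)
    freqs = [k * fs / n for k in range(len(fx))]
    pxy = [fx[k].conjugate() * fy[k] / (fs * n) for k in range(len(fx))]
    return freqs, pxy


def rank_sources(source_signals, hdd_signal, fs):
    """Order sources by peak |CPSD| with the HDD signal (assumed criterion)."""
    peaks = {}
    for name, sig in source_signals.items():
        _, pxy = cpsd(sig, hdd_signal, fs)
        peaks[name] = max(abs(p) for p in pxy[1:])  # skip the DC bin
    return sorted(peaks, key=peaks.get, reverse=True)


# Hypothetical transducer signals sampled at 64 Hz for one second
fs, n = 64, 64
fan = [math.sin(2 * math.pi * 8 * t / fs) for t in range(n)]
pump = [math.sin(2 * math.pi * 20 * t / fs) for t in range(n)]
hdd = [0.5 * f + 0.05 * p for f, p in zip(fan, pump)]  # HDD mostly follows the fan

print(rank_sources({"fan": fan, "pump": pump}, hdd, fs))
```

A source whose CPSD with the target HDD shows a strong peak shares vibration energy with the drive at that frequency, which makes it a candidate for mitigation ahead of weakly coupled sources.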
Abstract:
A system that determines whether components are not present in a computer system is presented. During operation, the system receives telemetry signals from sensors within the computer system. Next, the system dynamically generates a temperature map for the computer system based on the telemetry signals. The system then analyzes the temperature map to determine whether components are not present in the computer system.
Abstract:
One embodiment of the present invention provides a system that generates a synthetic workload to test power utilization in a computer system. During operation, the system monitors power utilization of a reference computer system while the reference computer system executes a workload-of-interest, wherein the monitoring process produces a power profile. Next, the system determines characteristics of the workload-of-interest from the power profile. Finally, the system uses the determined characteristics to construct the synthetic workload, wherein the synthetic workload has similar power utilization to the workload-of-interest.