摘要:
One embodiment of the present invention provides a technique for detecting anomalies during operation of a test computer system. Initially, a golden system and the test system are equipped with the same hardware configuration, wherein the golden system has gone through extensive qualification testing and is presumed to be operating correctly. Next, a deterministic load is executed on the golden system, and values for performance parameters from the golden system are monitored while the deterministic load is executing. Similarly, the deterministic load is also executed on the test system, and values for performance parameters from the test system are monitored while the deterministic load is executing. Next, pairwise differences are computed between values for performance parameters received from the test system and values for performance parameters received from the golden system. Finally, change detection techniques are applied to the pairwise differences to detect anomalies during operation of the test system.
摘要:
Embodiments of a computer system that includes a vibration-cancelling mode, and a related method and computer-program product (e.g., software) for use with the computer system, are described. During operation, a processor monitors operations in the computer system, and may select either the vibration-cancelling mode or an inactive mode based on the monitored operations. For example, the processor may select the vibration-cancelling mode when there are input/output-(I/O) intensive workloads to an array of one or more hard disk drives (HDDs) in the computer system. In this way, the processor may reduce the energy consumption associated with vibration-induced retries to the HDDs (and reduced I/O throughput) without increasing the energy consumption associated with active vibration damping at other times, such as when the computer system is idle or during processor-intensive workloads.
摘要:
Some embodiments of the present invention provide a system that controls temperature variations in a computer system. During operation, a telemetry variable of the computer system is monitored. Next, a future temperature of the computer system is predicted based on the telemetry variable. A signal is then generated in response to the future temperature. Then, the signal is sent to a cooling device in the computer system to control temperature variations of the computer system.
摘要:
A system that controls the temperature of a power supply in a computer system is presented. A state of the computer system is monitored. Next a signal is generated in response to the state of the computer system. The signal is then used to control the temperature of the power supply. Controlling the temperature of the power supply can involve heating the power supply.
摘要:
One embodiment of the present invention provides a system that mitigates the effects of multiple vibration sources on a set of hard disk drives (HDDs) within a computer system. During operation, the system identifies a target HDD in the set of HDDs, wherein the performance of the target HDD is affected by mechanical vibrations. The system also identifies one or more primary vibration sources from the multiple vibration sources that affect the performance of the target HDD. Next, for each of the primary vibration sources, the system measures a first time-domain signal associated with the operation of the primary vibration source using a first vibration transducer associated with the primary vibration source. The system also measures a second time-domain signal associated with the target HDD using a second vibration transducer associated with the target HDD. Next, for each of the primary vibration sources, the system then computes a cross-power-spectral-density (CPSD) between the first and the second time-domain signals. The system then selectively mitigates the primary vibration sources based on the CPSDs between the primary vibration sources and the target HDD.
摘要:
One embodiment of the present invention provides a system for predicting a remaining useful life (RUL) for a computer system. The system starts by collecting values for at least one telemetry metric from the computer system while the computer system is operating. The system then uses the collected values to generate a RUL prediction for the computer system or a component within the computer system.
摘要:
Embodiments of a system that determines a condition associated with an integrated circuit disposed on a circuit board are described. During operation, the system receives electromagnetic-interference (EMI) signals from one or more antennas while the integrated circuit is operating, where the one or more antennas are disposed on the circuit board. Then, the system analyzes the received EMI signals to determine the condition.
摘要:
A system that controls the temperature of a power supply in a computer system is presented. A state of the computer system is monitored. Next a signal is generated in response to the state of the computer system. The signal is then used to control the temperature of the power supply.
摘要:
One embodiment of the present invention provides a system that mitigates the effects of multiple vibration sources on a set of hard disk drives (HDDs) within a computer system. During operation, the system identifies a target HDD in the set of HDDs, wherein the performance of the target HDD is affected by mechanical vibrations. The system also identifies one or more primary vibration sources from the multiple vibration sources that affect the performance of the target HDD. Next, for each of the primary vibration sources, the system measures a first time-domain signal associated with the operation of the primary vibration source using a first vibration transducer associated with the primary vibration source. The system also measures a second time-domain signal associated with the target HDD using a second vibration transducer associated with the target HDD. Next, for each of the primary vibration sources, the system then computes a cross-power-spectral-density (CPSD) between the first and the second time-domain signals. The system then selectively mitigates the primary vibration sources based on the CPSDs between the primary vibration sources and the target HDD.
摘要:
One embodiment of the present invention provides a system that determines fan speeds for a set of fans in a computer system. During operation, the system receives time-series data collected by a vibration transducer associated with the computer system, wherein the vibration transducer is configured to record mechanical vibrations from the set of fans. Next, the system performs a spectral analysis on the time-series data to obtain frequency peaks associated with fan speeds for individual fans in the set of fans. The system then identifies fan speeds for the individual fans in the set of fans from the frequency peaks.