摘要:
A system that monitors electromagnetic interference (EMI) signals to facilitate proactive fault monitoring in a computer system is presented. During operation, the system receives EMI signals from one or more antennas located in close proximity to the computer system. The system then analyzes the received signals to proactively detect anomalies during operation of the computer system.
摘要:
A system that monitors electromagnetic interference (EMI) signals to facilitate proactive fault monitoring in a computer system is presented. During operation, the system receives EMI signals from one or more antennas located in close proximity to the computer system. The system then analyzes the received signals to proactively detect anomalies during operation of the computer system.
摘要:
Embodiments of a system that determines a condition associated with an integrated circuit disposed on a circuit board are described. During operation, the system receives electromagnetic-interference (EMI) signals from one or more antennas while the integrated circuit is operating, where the one or more antennas are disposed on the circuit board. Then, the system analyzes the received EMI signals to determine the condition.
摘要:
Some embodiments of the present invention provide a system that generates a utilization charge for a computer system. First, a set of performance parameters of the computer system are monitored. Next, a power utilization of the computer system is inferred based on the set of performance parameters and a power-utilization model. Then, a utilization charge is generated based on the power utilization of the computer system.
摘要:
A system that balances thermal variations within a set of computer systems in a datacenter. During operation, the system obtains a thermal flux map for the set of computer systems. The system then analyzes the thermal flux map to determine whether imbalances exist in the thermal flux across the set of computer systems. If so, the system can adjust: (1) the scheduling of loads across the set of computer systems, and/or (2) air conditioning within the datacenter, so that the thermal flux is more balanced across the set of computer systems.
摘要:
A system that generates a dynamic power-flux map for a set of computer systems. During operation the system determines the locations of the computer systems. Next, the system receives dynamic traces of power consumption for the computer systems, wherein a dynamic trace of power consumption for a given computer system is generated based on dynamic traces of monitored inferential variables for the given computer system. The system then correlates the locations of the computer systems with the dynamic traces of power consumption for the computer systems, and generates the dynamic power-flux map for the set of computer systems based on the correlated locations and the dynamic traces for the computer systems.
摘要:
A system that generates a dynamic power-flux map for a set of computer systems. During operation the system determines the locations of the computer systems. Next, the system receives dynamic traces of power consumption for the computer systems, wherein a dynamic trace of power consumption for a given computer system is generated based on dynamic traces of monitored inferential variables for the given computer system. The system then correlates the locations of the computer systems with the dynamic traces of power consumption for the computer systems, and generates the dynamic power-flux map for the set of computer systems based on the correlated locations and the dynamic traces for the computer systems.
摘要:
Some embodiments provide a system that analyzes telemetry data from a monitored system. During operation, the system obtains the telemetry data as a set of telemetric signals from the monitored system and groups the telemetry data into one or more clusters of correlated telemetric signals from the telemetric signals. Next, the system increases a bandwidth associated with monitoring the telemetric signals. To increase the bandwidth, the system omits one or more of the correlated telemetric signals from each of the clusters during sampling of the telemetric signals and estimates the omitted correlated telemetric signals by applying a nonlinear, nonparametric regression technique to the sampled telemetric signals.
摘要:
One embodiment provides a system that mitigates vibrations caused by cooling fans in a computer system. More specifically, the system includes a cooling fan mechanically coupled to the chassis of the computer system, wherein vibrations generated by the cooling fan are coupled to the chassis. The system also includes an actuation mechanism that creates a relative displacement between the cooling fan and the chassis when a control signal is applied to the actuation mechanism. The system additionally includes a detection mechanism which detects the relative displacement and generates a feedback signal which represents the relative displacement. The system further includes a control signal generation mechanism which converts the feedback signal into the control signal, which is subsequently applied to the actuation mechanism. When the control signal is applied to the actuation mechanism, the relative displacement between the cooling fan and the chassis vibrationally decouples the cooling fan from the chassis.
摘要:
Some embodiments provide a system that analyzes telemetry data from a monitored system. During operation, the system obtains the telemetry data as a set of telemetric signals from the monitored system and groups the telemetry data into one or more clusters of correlated telemetric signals from the telemetric signals. Next, the system increases a bandwidth associated with monitoring the telemetric signals. To increase the bandwidth, the system omits one or more of the correlated telemetric signals from each of the clusters during sampling of the telemetric signals and estimates the omitted correlated telemetric signals by applying a nonlinear, nonparametric regression technique to the sampled telemetric signals.