摘要:
A system that monitors telemetry from a host computer system to detect degradation in a remote storage device. During operation, the system monitors performance parameters from a host computer system which accesses the remote storage device, wherein the performance parameters relate to the interactions between the host computer system and the remote storage device. The system then determines whether the monitored performance parameters have deviated from predicted values for the performance parameters. If so, the system generates a signal indicating that the remote storage device has degraded.
摘要:
A system that generates a model for an electronic prognostics system which is used to monitor a computer system while the computer system is operating correctly. During operation, the system receives telemetry signals generated by the computer system. Next, the system builds the model based on the telemetry signals, wherein the model uses correlations between different telemetry signals to identify anomalies in the telemetry signals. The system then validates the model against telemetry signals from the computer system.
摘要:
A computer system to schedule loads across a set of processor cores is described. During operation, the computer system receives a process to be executed. Next, the computer system obtains one or more thermodynamic process characteristics associated with the process and one or more thermodynamic processor-core characteristics associated with operation of the set of processor cores. Then, the computer system schedules the process to be executed by at least one of the processor cores based on the one or more thermodynamic process characteristics and the one or more thermodynamic processor-core characteristics.
摘要:
One embodiment provides a system that analyzes a target electromagnetic signal radiating from a monitored system. During operation, the system monitors the target electromagnetic signal using a set of antennas to obtain a set of received target electromagnetic signals from the monitored system. Next, the system calculates a weighted mean of the received target electromagnetic signals using a first pattern-recognition model. The system then subtracts the received target electromagnetic signals from the weighted mean of the received target electromagnetic signals to obtain a set of noise-reduced signals for the monitored system. Finally, the system assesses the integrity of the monitored system by analyzing the noise-reduced signals using a second pattern-recognition model.
摘要:
Some embodiments of the present invention provide a system that controls the temperature of a computer system. First, a performance parameter of the computer system is monitored. Next, a future temperature of the computer system is predicted based on the performance parameter. Then, exhaust air from the computer system is mixed with ambient air from outside the computer system based on the predicted future temperature, and the mixed exhaust air and ambient air are channeled through the computer system to control the temperature of the computer system.
摘要:
Some embodiments of the present invention provide a system that characterizes a computer system using a pattern-recognition model. First, values for an environmental parameter are monitored from a set of sensors associated with the computer system. Then, a baseline for the environmental parameter is calculated based on the monitored values from a subset of the set of sensors. Next, the baseline is subtracted from the monitored values from sensors in the set of sensors to produce compensated values. Then, the compensated values are used as inputs to the pattern-recognition model, which produces estimates for the compensated values based on correlations between the compensated values learned during a training phase. Next, residuals are calculated by subtracting the estimates for the compensated values from the compensated values. Then, the residuals are analyzed to characterize the computer system.
摘要:
One embodiment of the present invention provides a system that proactively monitors and detects metal whisker growth in a target area within a computer system. During operation, the system collects target electromagnetic interference (EMI) signals using one or more antennas positioned in the vicinity of the target area. Next, the system analyzes the target EMI signals to proactively detect the onset of metal whisker growth in the target area.
摘要:
Some embodiments of the present invention provide a system that controls temperature variations in a computer system. During operation, a telemetry variable of the computer system is monitored. Next, a future temperature of the computer system is predicted based on the telemetry variable. A signal is then generated in response to the future temperature. Then, the signal is sent to a cooling device in the computer system to control temperature variations of the computer system.
摘要:
One embodiment of the present invention provides a system for predicting a remaining useful life (RUL) for a component in a set of components within a computer system. The system starts by collecting values of at least one degradation-related parameter associated with the operation of a monitored component within the computer system. Note that the degradation-related parameter is a direct measurement of a degree of degradation of the monitored component. The system additionally collects values of at least one stress-based parameter from the computer system. Note that the stress-based parameter measures an accumulative stress in the operating environment of the set of components which can cause degradation of the set of components. The system then uses the values of the at least one degradation-related parameter and the values of the at least one stress-based parameter to predict an RUL for a component in the set of components.
摘要:
A system that mitigates quantization effects in quantized telemetry signals. During operation, the system monitors a set of quantized telemetry signals. For a given quantized telemetry signal in the set of quantized telemetry signals, the system uses a set of models to generate a set of estimates for the given quantized telemetry signal from the other monitored quantized telemetry signals, wherein each model in the set of models was initialized using a different randomly selected subset of a training dataset. The system then averages the set of estimates to produce an estimated signal for the given quantized telemetry signal.