Abstract:
A system that detects a change point in a time series of telemetry signals from a computer system. During operation, the system receives the time series of telemetry signals from the computer system. For each element in the time series, the system (1) inserts the element into a data structure that tracks how many of its elements have a value greater than, and how many have a value less than, the value of the inserted element; and (2) uses the information stored in the data structure to add the inserted element's contribution to a trend statistic for the time series. The system then uses the trend statistic to select a hypothesis for the trend in the time series.
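The per-element update described in this abstract can be sketched as follows. This is a minimal illustration only: it assumes a Mann-Kendall-style statistic (count of earlier smaller values minus count of earlier larger values), which the abstract does not name, and it uses a sorted Python list as the data structure. The function names and the `threshold` parameter are hypothetical.

```python
from bisect import bisect_left, bisect_right, insort


def trend_statistic(series):
    """Accumulate a Mann-Kendall-style statistic S (an assumed choice).

    For each new element, count how many previously inserted elements are
    smaller and how many are larger, then add (smaller - larger) to S.
    """
    seen = []  # previously inserted values, kept in sorted order
    s = 0
    for x in series:
        less = bisect_left(seen, x)                  # earlier values < x
        greater = len(seen) - bisect_right(seen, x)  # earlier values > x
        s += less - greater
        insort(seen, x)                              # insert x, keeping order
    return s


def select_hypothesis(s, threshold):
    """Pick a trend hypothesis; `threshold` is a hypothetical cutoff."""
    if s > threshold:
        return "increasing"
    if s < -threshold:
        return "decreasing"
    return "no trend"


print(trend_statistic([1, 3, 2]))  # pairs: (1,3)+1, (1,2)+1, (3,2)-1
```

The counts are found by binary search, but `insort` on a list still shifts elements in linear time; a balanced tree or Fenwick tree would be the natural choice if logarithmic insertion matters.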
Abstract:
One embodiment of the present invention provides a system that efficiently conducts vibrational characterizations for a computer system having variable component configurations. During operation, the system receives a given component configuration associated with the computer system. Next, the system looks up the given component configuration in a resonant spectra library, which contains structural resonant frequencies for a number of possible component configurations for the computer system. If the given component configuration is found in the resonant spectra library, the system retrieves a set of structural resonant frequencies associated with the given component configuration. The system subsequently controls one or more vibration sources within the computer system to avoid the set of structural resonant frequencies.
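The lookup-then-avoid flow in this abstract might look like the sketch below. Everything concrete here is an assumption made for illustration: the library is modeled as a dictionary keyed by a frozen set of component names, the vibration source is a fan with candidate operating frequencies in Hz, and `guard_hz` is a hypothetical safety margin around each resonance.

```python
def lookup_resonances(library, configuration):
    """Return the resonant frequencies stored for a configuration, or None."""
    return library.get(frozenset(configuration))


def safe_frequencies(candidates_hz, resonances_hz, guard_hz=2.0):
    """Keep only candidate frequencies farther than guard_hz from every resonance."""
    return [
        f for f in candidates_hz
        if all(abs(f - r) > guard_hz for r in resonances_hz)
    ]


# Hypothetical library with a single known configuration.
library = {frozenset({"disk_a", "fan_b"}): [50.0, 120.0]}

resonances = lookup_resonances(library, ["fan_b", "disk_a"])
if resonances is not None:
    print(safe_frequencies([40.0, 49.0, 60.0, 121.0], resonances))
```

Using a frozen set as the key makes the lookup independent of the order in which components are listed; a configuration that is not in the library returns `None`, which is the case where a fresh vibrational characterization would be needed.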
Abstract:
One embodiment of the present invention provides a system that facilitates detecting an anomaly in a signal, wherein the signal is sampled to produce a set of possible quantized signal values. During operation, the system constructs a “reference distribution” for an “occurrence frequency” of a specific quantized signal value from the set of possible quantized signal values. The system then obtains a “deviant distribution” associated with the reference distribution, wherein the deviant distribution has an offset from the reference distribution to indicate an anomaly in the signal. Next, in response to a new occurrence of the specific quantized signal value, the system updates a mean and a variance of the reference distribution for the specific quantized signal value. The system also adjusts the deviant distribution for the specific quantized signal value based on the updated mean and the updated variance of the reference distribution for the specific quantized signal value.
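One way to picture the incremental update is the sketch below. It assumes Welford's online algorithm for the running mean and variance, and it models the deviant distribution as the reference distribution shifted by a fixed number of standard deviations. Neither choice is stated in the abstract, and the class and the `offset_sigmas` parameter are hypothetical.

```python
import math


class ReferenceDistribution:
    """Running mean/variance of an occurrence-frequency observation."""

    def __init__(self, offset_sigmas=3.0):
        self.n = 0
        self.mean = 0.0
        self._m2 = 0.0  # running sum of squared deviations from the mean
        self.offset_sigmas = offset_sigmas  # assumed offset of the deviant distribution

    def update(self, x):
        """Fold one new observation into the mean and variance (Welford)."""
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self._m2 += delta * (x - self.mean)

    @property
    def variance(self):
        """Sample variance; zero until at least two observations exist."""
        return self._m2 / (self.n - 1) if self.n > 1 else 0.0

    def deviant_mean(self):
        """Mean of the deviant distribution, recomputed from current statistics."""
        return self.mean + self.offset_sigmas * math.sqrt(self.variance)


ref = ReferenceDistribution()
for observation in [2, 4, 4, 4, 5, 5, 7, 9]:
    ref.update(observation)
print(ref.mean, ref.variance, ref.deviant_mean())
```

Because the deviant mean is derived from the current statistics each time it is read, it tracks the reference distribution automatically after every update.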
Abstract:
One embodiment of the present invention provides a system that optimizes subset selection to facilitate parallel training of a support vector machine (SVM). During operation, the system receives a dataset comprised of data points. Next, the system evaluates the data points to produce a class separability measure, and uses the class separability measure to partition the data points in the dataset into N batches. The system then performs SVM training computations on the N batches in parallel to produce support vectors for each of the N batches. Finally, the system performs a final SVM training computation using an agglomeration of support vectors computed for each of the N batches to obtain a substantially optimal solution to the SVM training problem for the entire dataset.
Abstract:
Embodiments of the present invention provide a system that dynamically controls a temperature profile within a computer system by generating computer system activity. The system starts by receiving a desired temperature profile. The system then generates a load profile based on the desired temperature profile, wherein the load profile specifies operations to be performed by the computer system. The system next executes the load profile on the computer system to generate computer system activity, wherein the computer system activity produces the desired temperature profile in the computer system.
Abstract:
One embodiment of the present invention provides a system that dynamically adjusts data resolution during proactive fault monitoring in a computer system. During operation, the system temporarily stores high-resolution data for a telemetry signal from the computer system in a buffer. The system then generates low-resolution data for the telemetry signal from the high-resolution data. Next, the system monitors the low-resolution data and, while doing so, determines whether an anomaly exists in the low-resolution data. If an anomaly exists in the low-resolution data, the system records the high-resolution data from the buffer on a storage device.
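The buffering scheme in this abstract can be sketched as below. The details are assumptions for illustration: the buffer is a fixed-size ring buffer, the low-resolution data is the mean of each block of `decimation` samples, the anomaly test is a caller-supplied predicate, and the "storage device" is simply a list of recorded snapshots.

```python
from collections import deque
from statistics import mean


def monitor(samples, buffer_size, decimation, is_anomalous):
    """Watch low-resolution data; snapshot the high-resolution buffer on anomalies."""
    buffer = deque(maxlen=buffer_size)  # most recent high-resolution samples
    window = []                         # samples awaiting down-sampling
    recorded = []                       # stand-in for the storage device
    for sample in samples:
        buffer.append(sample)
        window.append(sample)
        if len(window) == decimation:
            low_res_value = mean(window)  # one low-resolution data point
            window.clear()
            if is_anomalous(low_res_value):
                recorded.append(list(buffer))  # keep the high-resolution context
    return recorded


snapshots = monitor(
    samples=[1, 1, 1, 1, 1, 1, 9, 9],
    buffer_size=4,
    decimation=2,
    is_anomalous=lambda value: value > 5,
)
print(snapshots)
```

The `deque` with `maxlen` discards the oldest sample automatically, so the high-resolution data costs a fixed amount of memory until an anomaly makes it worth keeping.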
Abstract:
One embodiment of the present invention provides a system that cools integrated circuit (IC) chips within a computer system. During operation, the system converts heat generated by a heat-generating device within the computer system into thermoelectric power. The system then supplies the thermoelectric power to an IC chip as cooling power to reduce the operating temperature of the IC chip, thereby recycling wasted energy within the computer system.
Abstract:
One embodiment of the present invention provides a system that determines the complexity of a transaction between a first entity and a second entity. The system starts by determining a first probability of success for a portion of the transaction which is associated with actions of the first entity. The system then determines a second probability of success for a portion of the transaction which is associated with actions of the second entity. The system next calculates a transactional complexity for the transaction based on the first probability of success and the second probability of success.
Abstract:
One embodiment of the present invention provides a system that estimates residual life of a software system under a software-based failure mechanism. During operation, the system first constructs a prognostic database for the software-based failure mechanism based on a plurality of software systems of the same type as the software system, wherein the prognostic database includes a set of prognostic readings associated with the software-based failure mechanism from the plurality of software systems. Note that a given prognostic reading in the prognostic database comprises: (1) a symptom index, which is a function of one or more variables associated with the software-based failure mechanism; and (2) a residual life, which is the remaining time to a failure under the software-based failure mechanism. Next, the system obtains a symptom index value from the software system which is being monitored. The system then estimates a residual life for the software system under the software-based failure mechanism by comparing the symptom index value with the prognostic readings in the prognostic database.
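The comparison step could be sketched as follows. The abstract does not say how the symptom index value is compared with the prognostic readings, so this example assumes a simple k-nearest-neighbor average over (symptom index, residual life) pairs. The function name, the `k` parameter, and the sample readings are all hypothetical.

```python
def estimate_residual_life(database, symptom_index, k=3):
    """Average the residual life of the k readings closest in symptom index.

    `database` is a list of (symptom_index, residual_life) pairs gathered
    from other systems of the same type.
    """
    if not database:
        raise ValueError("prognostic database is empty")
    nearest = sorted(database, key=lambda reading: abs(reading[0] - symptom_index))[:k]
    return sum(life for _, life in nearest) / len(nearest)


# Made-up readings: (symptom index, residual life in hours).
readings = [(0.1, 100.0), (0.5, 60.0), (0.6, 50.0), (0.9, 10.0)]
print(estimate_residual_life(readings, symptom_index=0.55, k=2))
```

Averaging several neighbors smooths out noise in individual readings; interpolation or a fitted regression would be reasonable alternatives under the same database layout.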
Abstract:
One embodiment of the present invention provides a system that systematically monitors and records performance information in a computer system. During operation, the system periodically measures a number of performance parameters in the computer system. The system then stores data representing values for the measured performance parameters in a long-term storage space. Next, the system recurrently compresses data stored in the long-term storage space, thereby allowing additional data representing newly collected performance information to be stored in the long-term storage space.
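A toy version of the recurrent compression is sketched below. The abstract does not specify the compression scheme, so this example assumes a lossy one: whenever the fixed-capacity store is full, adjacent pairs of stored values are averaged, which halves the space used. The function names and the `capacity` parameter are hypothetical.

```python
def compress(values):
    """Halve the stored data by averaging adjacent pairs (assumed scheme)."""
    compressed = []
    for i in range(0, len(values), 2):
        pair = values[i:i + 2]  # the last chunk may hold a single value
        compressed.append(sum(pair) / len(pair))
    return compressed


def store(storage, value, capacity):
    """Append a measurement, compressing the store first if it is full."""
    if len(storage) >= capacity:
        storage[:] = compress(storage)
    storage.append(value)


storage = []
for measurement in [1, 3, 5, 7, 9]:
    store(storage, measurement, capacity=4)
print(storage)
```

In this sketch older data loses resolution each time the store fills, while the newest measurements are kept at full resolution; a real implementation would also need to record the time span each stored value covers.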