PREDICTING REMAINING USEFUL LIFE FOR A COMPUTER SYSTEM USING A STRESS-BASED PREDICTION TECHNIQUE
    1.
    发明申请
    PREDICTING REMAINING USEFUL LIFE FOR A COMPUTER SYSTEM USING A STRESS-BASED PREDICTION TECHNIQUE 有权
    使用基于应力的预测技术预测计算机系统的有用寿命

    公开(公告)号:US20110246093A1

    公开(公告)日:2011-10-06

    申请号:US12752767

    申请日:2010-04-01

    IPC分类号: G06F19/00 G06F15/00

    CPC分类号: G06F11/008

    摘要: One embodiment of the present invention provides a system for predicting a remaining useful life (RUL) for a component in a set of components within a computer system. The system starts by collecting values of at least one degradation-related parameter associated with the operation of a monitored component within the computer system. Note that the degradation-related parameter is a direct measurement of a degree of degradation of the monitored component. The system additionally collects values of at least one stress-based parameter from the computer system. Note that the stress-based parameter measures an accumulative stress in the operating environment of the set of components which can cause degradation of the set of components. The system then uses the values of the at least one degradation-related parameter and the values of the at least one stress-based parameter to predict an RUL for a component in the set of components.

    摘要翻译: 本发明的一个实施例提供一种用于预测计算机系统内的一组组件中的组件的剩余使用寿命(RUL)的系统。 该系统通过收集与计算机系统内的被监测组件的操作相关联的至少一个退化相关参数的值来开始。 请注意,降解相关参数是对被监测组分的降解程度的直接测量。 该系统还从计算机系统收集至少一个基于应力的参数的值。 注意,基于应力的参数测量组件组的操作环境中的累积应力,这可能导致该组件的劣化。 然后,系统使用至少一个退化相关参数的值和至少一个基于应力的参数的值来预测组件组中的组件的RUL。

    Enhancing throughput and fault-tolerance in a parallel-processing system
    2.
    发明授权
    Enhancing throughput and fault-tolerance in a parallel-processing system 有权
    提高并行处理系统的吞吐量和容错能力

    公开(公告)号:US07543180B2

    公开(公告)日:2009-06-02

    申请号:US11371998

    申请日:2006-03-08

    IPC分类号: G06F11/00

    摘要: One embodiment of the present invention provides a system that enhances throughput and fault-tolerance in a parallel-processing system. During operation, the system first receives a task. Next, the system partitions N computing nodes into M set-aside nodes and N-M primary computing nodes, wherein M≧1. The system then processes the task in parallel across the N-M primary computing nodes. While doing so, the system proactively monitors the health of each of the N-M primary computing nodes. If the system detects a node in the N-M primary computing nodes to be at risk of failure, the system copies the portion of the task associated with the at-risk node to a subset of the M set-aside nodes. The system then processes the portion of the task in parallel across the subset of the M set-aside nodes while the N-M primary computing nodes continue executing.

    摘要翻译: 本发明的一个实施例提供一种提高并行处理系统中的吞吐量和容错能力的系统。 在操作过程中,系统首先接收到一个任务。 接下来,系统将N个计算节点划分为M个置换节点和N-M个主要计算节点,其中M> = 1。 然后,系统在N-M主计算节点上并行处理任务。 在这样做的同时,系统主动监控每个N-M主计算节点的运行状况。 如果系统检测到N-M主计算节点中的节点处于故障风险,则系统将与风险中节点相关联的任务的一部分复制到M个备用节点的子集。 然后,在N-M主计算节点继续执行的同时,系统跨M个备用节点的子集并行地处理任务的该部分。

    Predicting remaining useful life for a computer system using a stress-based prediction technique
    3.
    发明授权
    Predicting remaining useful life for a computer system using a stress-based prediction technique 有权
    使用基于压力的预测技术预测计算机系统的剩余使用寿命

    公开(公告)号:US08340923B2

    公开(公告)日:2012-12-25

    申请号:US12752767

    申请日:2010-04-01

    IPC分类号: G06F19/00

    CPC分类号: G06F11/008

    摘要: One embodiment of the present invention provides a system for predicting a remaining useful life (RUL) for a component in a set of components within a computer system. The system starts by collecting values of at least one degradation-related parameter associated with the operation of a monitored component within the computer system. Note that the degradation-related parameter is a direct measurement of a degree of degradation of the monitored component. The system additionally collects values of at least one stress-based parameter from the computer system. Note that the stress-based parameter measures an accumulative stress in the operating environment of the set of components which can cause degradation of the set of components. The system then uses the values of the at least one degradation-related parameter and the values of the at least one stress-based parameter to predict an RUL for a component in the set of components.

    摘要翻译: 本发明的一个实施例提供了一种用于预测计算机系统内的一组组件中的组件的剩余使用寿命(RUL)的系统。 该系统通过收集与计算机系统内的被监测组件的操作相关联的至少一个退化相关参数的值来开始。 请注意,降解相关参数是对被监测组分的降解程度的直接测量。 该系统还从计算机系统收集至少一个基于应力的参数的值。 注意,基于应力的参数测量组件组的操作环境中的累积应力,这可能导致该组件的劣化。 然后,系统使用至少一个退化相关参数的值和至少一个基于应力的参数的值来预测组件组中的组件的RUL。

    Method and apparatus for generating a model for an electronic prognostics system
    4.
    发明授权
    Method and apparatus for generating a model for an electronic prognostics system 有权
    用于生成电子预测系统的模型的方法和装置

    公开(公告)号:US08706451B1

    公开(公告)日:2014-04-22

    申请号:US11640017

    申请日:2006-12-15

    IPC分类号: G06F17/10

    摘要: A system that generates a model for an electronic prognostics system which is used to monitor a computer system while the computer system is operating correctly. During operation, the system receives telemetry signals generated by the computer system. Next, the system builds the model based on the telemetry signals, wherein the model uses correlations between different telemetry signals to identify anomalies in the telemetry signals. The system then validates the model against telemetry signals from the computer system.

    摘要翻译: 一种生成电子预测系统模型的系统,该系统用于在计算机系统正常运行时监视计算机系统。 在操作期间,系统接收由计算机系统产生的遥测信号。 接下来,系统基于遥测信号建立模型,其中该模型使用不同遥测信号之间的相关性来识别遥测信号中的异常。 系统然后根据来自计算机系统的遥测信号验证该模型。

    Temperature-aware and energy-aware scheduling in a computer system
    5.
    发明授权
    Temperature-aware and energy-aware scheduling in a computer system 有权
    计算机系统中的温度感知和能量感知调度

    公开(公告)号:US08555283B2

    公开(公告)日:2013-10-08

    申请号:US11871369

    申请日:2007-10-12

    IPC分类号: G06F9/45 G06F1/00

    摘要: A computer system to schedule loads across a set of processor cores is described. During operation, the computer system receives a process to be executed. Next, the computer system obtains one or more thermodynamic process characteristics associated with the process and one or more thermodynamic processor-core characteristics associated with operation of the set of processor cores. Then, the computer system schedules the process to be executed by at least one of the processor cores based on the one or more thermodynamic process characteristics and the one or more thermodynamic processor-core characteristics.

    摘要翻译: 描述了一组用于在一组处理器核心上调度负载的计算机系统。 在操作期间,计算机系统接收要执行的进程。 接下来,计算机系统获得与该过程相关联的一个或多个热力学过程特征以及与该组处理器核心的操作相关联的一个或多个热力学处理器核心特性。 然后,计算机系统基于一个或多个热力学过程特性和一个或多个热力学处理器核心特征来调度由至少一个处理器核心执行的处理。

    Facilitating power supply unit management using telemetry data analysis
    6.
    发明授权
    Facilitating power supply unit management using telemetry data analysis 有权
    使用遥测数据分析促进供电单元管理

    公开(公告)号:US08253588B2

    公开(公告)日:2012-08-28

    申请号:US12611008

    申请日:2009-11-02

    IPC分类号: G08C19/16

    摘要: Some embodiments provide a system that analyzes telemetry data from a computer system. During operation, the system obtains the telemetry data as a set of telemetric signals from the computer system and validates the telemetric signals using a nonlinear, nonparametric regression technique. Next, the system assesses the integrity of a power supply unit (PSU) in the computer system by comparing the telemetric signals to one or more reference telemetric signals associated with the computer system. If the assessed integrity falls below a threshold, the system performs a remedial action for the computer system.

    摘要翻译: 一些实施例提供一种从计算机系统分析遥测数据的系统。 在操作过程中,系统将遥测数据作为一组来自计算机系统的遥测信号,并使用非线性非参数回归技术对遥测信号进行验证。 接下来,系统通过将遥测信号与与计算机系统相关联的一个或多个参考遥测信号进行比较来评估计算机系统中的电源单元(PSU)的完整性。 如果评估的完整性低于阈值,则系统对计算机系统执行补救措施。

    Noise reduction technique for monitoring electromagnetic signals
    7.
    发明授权
    Noise reduction technique for monitoring electromagnetic signals 有权
    用于监控电磁信号的降噪技术

    公开(公告)号:US08244493B2

    公开(公告)日:2012-08-14

    申请号:US12494709

    申请日:2009-06-30

    IPC分类号: G01R23/00

    摘要: One embodiment provides a system that analyzes a target electromagnetic signal radiating from a monitored system. During operation, the system monitors the target electromagnetic signal using a set of antennas to obtain a set of received target electromagnetic signals from the monitored system. Next, the system calculates a weighted mean of the received target electromagnetic signals using a first pattern-recognition model. The system then subtracts the received target electromagnetic signals from the weighted mean of the received target electromagnetic signals to obtain a set of noise-reduced signals for the monitored system. Finally, the system assesses the integrity of the monitored system by analyzing the noise-reduced signals using a second pattern-recognition model.

    摘要翻译: 一个实施例提供了一种系统,其分析从受监视系统辐射的目标电磁信号。 在操作期间,系统使用一组天线监视目标电磁信号,以从被监视的系统获得一组接收到的目标电磁信号。 接下来,系统使用第一模式识别模型计算接收到的目标电磁信号的加权平均值。 然后,系统从接收的目标电磁信号的加权平均值中减去接收到的目标电磁信号,以获得被监视系统的一组降噪信号。 最后,系统通过使用第二模式识别模型分析降噪信号来评估监控系统的完整性。

    Controlling the temperature of a computer system
    8.
    发明授权
    Controlling the temperature of a computer system 有权
    控制计算机系统的温度

    公开(公告)号:US08180501B2

    公开(公告)日:2012-05-15

    申请号:US12204433

    申请日:2008-09-04

    IPC分类号: G05D23/00

    CPC分类号: G05D23/1919

    摘要: Some embodiments of the present invention provide a system that controls the temperature of a computer system. First, a performance parameter of the computer system is monitored. Next, a future temperature of the computer system is predicted based on the performance parameter. Then, exhaust air from the computer system is mixed with ambient air from outside the computer system based on the predicted future temperature, and the mixed exhaust air and ambient air are channeled through the computer system to control the temperature of the computer system.

    摘要翻译: 本发明的一些实施例提供一种控制计算机系统的温度的系统。 首先,监视计算机系统的性能参数。 接下来,基于性能参数预测计算机系统的未来温度。 然后,根据预测的未来温度,将来自计算机系统的排气与计算机系统外部的环境空气混合,混合的排气和环境空气通过计算机系统引导以控制计算机系统的温度。

    Estimating relative humidity inside a computer system
    9.
    发明授权
    Estimating relative humidity inside a computer system 有权
    估计计算机系统内的相对湿度

    公开(公告)号:US08155765B2

    公开(公告)日:2012-04-10

    申请号:US12114363

    申请日:2008-05-02

    IPC分类号: G05B13/04 G06F11/00

    CPC分类号: G06F11/00 G06F11/3495

    摘要: One embodiment of the present invention provides a system that estimates the relative humidity inside a computer system. During operation, a set of performance parameters of the computer system and an external relative humidity outside of the computer system are monitored. Then, the relative humidity inside the computer system is estimated based on the set of performance parameters, the external relative humidity, and a relative humidity model, wherein training of the relative humidity model includes measuring an external training relative humidity outside of the computer system and a training relative humidity inside the computer system while monitoring the set of performance parameters of the computer system.

    摘要翻译: 本发明的一个实施例提供一种估计计算机系统内的相对湿度的系统。 在操作期间,监视计算机系统的一组性能参数和计算机系统外部的外部相对湿度。 然后,基于性能参数,外部相对湿度和相对湿度模型的集合来估计计算机系统内的相对湿度,其中相对湿度模型的训练包括测量计算机系统外部的外部训练相对湿度,以及 在计算机系统内训练相对湿度,同时监视计算机系统的一组性能参数。

    Characterizing a computer system using a pattern-recognition model
    10.
    发明授权
    Characterizing a computer system using a pattern-recognition model 有权
    使用模式识别模型来表征计算机系统

    公开(公告)号:US08150655B2

    公开(公告)日:2012-04-03

    申请号:US12240211

    申请日:2008-09-29

    IPC分类号: G06F11/30

    摘要: Some embodiments of the present invention provide a system that characterizes a computer system using a pattern-recognition model. First, values for an environmental parameter are monitored from a set of sensors associated with the computer system. Then, a baseline for the environmental parameter is calculated based on the monitored values from a subset of the set of sensors. Next, the baseline is subtracted from the monitored values from sensors in the set of sensors to produce compensated values. Then, the compensated values are used as inputs to the pattern-recognition model, which produces estimates for the compensated values based on correlations between the compensated values learned during a training phase. Next, residuals are calculated by subtracting the estimates for the compensated values from the compensated values. Then, the residuals are analyzed to characterize the computer system.

    摘要翻译: 本发明的一些实施例提供一种表征使用模式识别模型的计算机系统的系统。 首先,从与计算机系统相关联的一组传感器监测环境参数的值。 然后,基于来自传感器组的子集的监视值来计算环境参数的基线。 接下来,从传感器组中的传感器的监视值中减去基线,以产生补偿值。 然后,补偿值被用作模式识别模型的输入,其基于在训练阶段期间学习的补偿值之间的相关性而产生对补偿值的估计。 接下来,通过从补偿值中减去补偿值的估计值来计算残差。 然后,分析残差以表征计算机系统。