DIAGNOSTIC SYSTEM AND METHOD
    1.
    Type: Patent application
    Status: Active

    Publication No.: US20080195369A1

    Publication date: 2008-08-14

    Application No.: US11674198

    Filing date: 2007-02-13

    IPC classification: G06F9/455 G06F17/18 G06F15/00

    Abstract: Disclosed are system and method embodiments for determining the root causes of a performance objective violation, such as an end-to-end service level objective (SLO) violation, in a large-scale system with multi-tiered applications. This determination is made using a hybrid of component-level snapshots of the state of the system during a period in which an abnormal event occurred (i.e., black-box mapping) and of known events and their causes (i.e., white-box mapping). Specifically, in response to a query about a violation (e.g., why did the response time for application a1 increase from r1 to r2), a processor will access and correlate the black-box and white-box mappings to determine a short list of probable causes for the violation.

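The correlation step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the `Rule` type, the relative-change threshold, the scoring by fraction of matched symptoms, and all metric names and values are assumptions made for illustration only. Black-box knowledge is modeled as two snapshots of per-component metrics, and white-box knowledge as a list of known causes with the symptoms each is expected to produce.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """Hypothetical white-box entry: a known cause and the symptoms it produces."""
    cause: str
    symptoms: frozenset  # (component, metric) pairs expected to deviate


def deviating_metrics(normal, abnormal, threshold=0.5):
    """Black-box step: (component, metric) pairs whose relative change
    between the two snapshots exceeds `threshold`."""
    changed = set()
    for key, base in normal.items():
        current = abnormal.get(key, base)
        if base != 0 and abs(current - base) / abs(base) > threshold:
            changed.add(key)
    return changed


def shortlist_causes(normal, abnormal, rules, threshold=0.5):
    """Rank known causes by the fraction of their symptoms seen in the snapshots."""
    changed = deviating_metrics(normal, abnormal, threshold)
    scored = []
    for rule in rules:
        matched = len(rule.symptoms & changed)
        if matched:
            scored.append((matched / len(rule.symptoms), rule.cause))
    # Highest score first; ties broken alphabetically for a stable order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [cause for _, cause in scored]


# Illustrative data only; none of these names or numbers come from the patent.
normal = {("db", "latency_ms"): 10.0, ("db", "queue_len"): 4.0,
          ("web", "cpu"): 0.30, ("san", "iops"): 1000.0}
abnormal = {("db", "latency_ms"): 40.0, ("db", "queue_len"): 12.0,
            ("web", "cpu"): 0.32, ("san", "iops"): 1000.0}
rules = [
    Rule("database lock contention",
         frozenset({("db", "latency_ms"), ("db", "queue_len")})),
    Rule("storage saturation",
         frozenset({("db", "latency_ms"), ("san", "iops")})),
    Rule("web tier overload", frozenset({("web", "cpu")})),
]
causes = shortlist_causes(normal, abnormal, rules)
```

With this data, only the two database metrics deviate, so the fully matched rule ranks first, the half-matched rule second, and the unmatched rule is dropped from the short list.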

    Diagnostic system and method
    2.
    Type: Granted patent
    Status: Active

    Publication No.: US08655623B2

    Publication date: 2014-02-18

    Application No.: US11674198

    Filing date: 2007-02-13

    IPC classification: G21C17/003

    Abstract: Disclosed are system and method embodiments for determining the root causes of a performance objective violation, such as an end-to-end service level objective (SLO) violation, in a large-scale system with multi-tiered applications. This determination is made using a hybrid of component-level snapshots of the state of the system during a period in which an abnormal event occurred (i.e., black-box mapping) and of known events and their causes (i.e., white-box mapping). Specifically, in response to a query about a violation (e.g., why did the response time for application a1 increase from r1 to r2), a processor will access and correlate the black-box and white-box mappings to determine a short list of probable causes for the violation.


    System and method for recording behavior history for abnormality detection
    3.
    Type: Granted patent
    Status: Lapsed

    Publication No.: US07711520B2

    Publication date: 2010-05-04

    Application No.: US12116506

    Filing date: 2008-05-07

    IPC classification: G06F11/30

    Abstract: Disclosed is an autonomic abnormality detection device having a plurality of agents, a server with one or more processors, a data storage device and a corrective actions engine. The device is adapted to detect and diagnose abnormalities in system components. Particularly, the device uses agents to track performance/workload measurements of system components and dynamically compiles a history of those performance/workload measurements for each component. In order to detect abnormalities, a processor compares current performance/workload measurements for a component to the compiled histories for that component and for other components. The processor can further be adapted to determine possible causes of a detected abnormality and to report the abnormality, including the possible causes, to a corrective actions engine.

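The history-based comparison in the abstract can be sketched in Python as follows. This is an illustration under stated assumptions, not the patented device: the `BehaviorHistory` class, the k-standard-deviation test, and the rule that a value must deviate from both the component's own history and the pooled history of other components are all hypothetical choices, as are the component names and sample values.

```python
from collections import defaultdict
from statistics import mean, pstdev


class BehaviorHistory:
    """Hypothetical per-component store of performance/workload samples."""

    def __init__(self):
        self._samples = defaultdict(list)

    def record(self, component, value):
        """Append one measurement reported by an agent for `component`."""
        self._samples[component].append(value)

    @staticmethod
    def _deviates(value, samples, k):
        """True if `value` lies more than k standard deviations from the mean."""
        if len(samples) < 2:
            return False  # too little history to judge
        mu, sigma = mean(samples), pstdev(samples)
        if sigma == 0:
            return value != mu
        return abs(value - mu) > k * sigma

    def is_abnormal(self, component, value, k=3.0):
        """Compare `value` with the component's own history and, when enough
        peer data exists, with the pooled history of all other components."""
        own = self._samples.get(component, [])
        if not self._deviates(value, own, k):
            return False
        peers = [v for name, vals in self._samples.items()
                 if name != component for v in vals]
        return len(peers) < 2 or self._deviates(value, peers, k)


# Illustrative data only: response times for two similar disks.
history = BehaviorHistory()
for sample in (10.0, 11.0, 9.0, 10.0, 10.0):
    history.record("disk1", sample)
for sample in (10.0, 12.0, 11.0, 9.0, 10.0):
    history.record("disk2", sample)

spike = history.is_abnormal("disk1", 30.0)
typical = history.is_abnormal("disk1", 11.0)
```

Here a reading of 30.0 on `disk1` is far outside both histories and is flagged, while 11.0 stays within normal variation. Determining possible causes and reporting to a corrective actions engine are outside the scope of this sketch.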

    Technique for mapping goal violations to anamolies within a system
    4.
    Type: Granted patent
    Status: Active

    Publication No.: US07673189B2

    Publication date: 2010-03-02

    Application No.: US11347999

    Filing date: 2006-02-06

    IPC classification: G06F11/00

    Abstract: A computer and method for problem detection and determination for automated system management in a system, wherein the method comprises monitoring system state, workload, and performance parameters of the system; comparing the monitored parameters against normal system performance behavior of the system, wherein the normal system performance behavior is maintained as a mapping of a system state and workload-to-performance parameters; summarizing performance abnormalities at a specified layer in the system as computation and data-processing attributes, wherein the performance abnormalities comprise deviations from the normal system performance behavior; correlating the performance abnormalities across multiple layers in the system using an attribute-based framework; and communicating a root-cause of the performance abnormalities.

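The comparison against normal behavior, kept as a mapping from system state and workload to performance, can be sketched in Python. The key layout `(layer, state, workload)`, the relative tolerance, and the layer names and numbers are assumptions for illustration. The sketch covers only the detection of deviations per layer; it does not attempt the attribute-based correlation across layers or the root-cause step.

```python
def find_abnormal_layers(normal_map, observations, tolerance=0.2):
    """Return the layers whose observed performance deviates from the expected
    value for the same (layer, state, workload) key by more than `tolerance`
    (relative). Expected values are assumed to be positive."""
    abnormal = []
    for layer, (state, workload, observed) in observations.items():
        expected = normal_map.get((layer, state, workload))
        if expected is None:
            continue  # no baseline for this combination; nothing to compare
        if abs(observed - expected) > tolerance * expected:
            abnormal.append(layer)
    return sorted(abnormal)


# Illustrative baseline: expected response time in ms per layer.
normal_map = {
    ("application", "steady", "high"): 200.0,
    ("database", "steady", "high"): 50.0,
    ("storage", "steady", "high"): 8.0,
}
# Illustrative observations: layer -> (state, workload, observed value).
observations = {
    "application": ("steady", "high", 420.0),
    "database": ("steady", "high", 55.0),
    "storage": ("steady", "high", 20.0),
}
abnormal_layers = find_abnormal_layers(normal_map, observations)
```

With these values the application and storage layers exceed the 20% tolerance, while the database layer stays within it.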

    SYSTEM AND METHOD FOR RECORDING BEHAVIOR HISTORY FOR ABNORMALITY DETECTION
    5.
    Type: Patent application
    Status: Lapsed

    Publication No.: US20080209027A1

    Publication date: 2008-08-28

    Application No.: US12116506

    Filing date: 2008-05-07

    IPC classification: G06F15/173

    Abstract: Disclosed is an autonomic abnormality detection device having a plurality of agents, a server with one or more processors, a data storage device and a corrective actions engine. The device is adapted to detect and diagnose abnormalities in system components. Particularly, the device uses agents to track performance/workload measurements of system components and dynamically compiles a history of those performance/workload measurements for each component. In order to detect abnormalities, a processor compares current performance/workload measurements for a component to the compiled histories for that component and for other components. The processor can further be adapted to determine possible causes of a detected abnormality and to report the abnormality, including the possible causes, to a corrective actions engine.


    System and method for recording behavior history for abnormality detection
    6.
    Type: Granted patent
    Status: Lapsed

    Publication No.: US07395187B2

    Publication date: 2008-07-01

    Application No.: US11348010

    Filing date: 2006-02-06

    IPC classification: G06F11/30

    Abstract: Disclosed is an autonomic abnormality detection device having a plurality of agents, a server with one or more processors, a data storage device and a corrective actions engine. The device is adapted to detect and diagnose abnormalities in system components. Particularly, the device uses agents to track performance/workload measurements of system components and dynamically compiles a history of those performance/workload measurements for each component. In order to detect abnormalities, a processor compares current performance/workload measurements for a component to the compiled histories for that component and for other components. The processor can further be adapted to determine possible causes of a detected abnormality and to report the abnormality, including the possible causes, to a corrective actions engine.


    Self-modulation in a model-based automated management framework
    7.
    Type: Granted patent
    Status: Lapsed

    Publication No.: US07444272B2

    Publication date: 2008-10-28

    Application No.: US11250066

    Filing date: 2005-10-13

    IPC classification: G06F17/10 G06E1/00

    CPC classification: G05B17/02 G05B13/042

    Abstract: Embodiments herein present a method, system, computer program product, etc. for automated management using a hybrid of prediction models and feedback-based systems. The method begins by calculating confidence values of models. Next, the method selects a first model based on the confidence values and processes the first model through a constraint solver to produce first workload throttling values. Following this, workloads are repeatedly processed through a feedback-based execution engine, wherein the feedback-based execution engine is controlled by the first workload throttling values. The first workload throttling values are applied incrementally to the feedback-based execution engine, during repetitions of the processing of the workloads, with a step-size that is proportional to the confidence values. The processing of the workloads is repeated until an objective function is maximized, wherein the objective function specifies performance goals of the workloads.

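The incremental application of throttling values, with a step size proportional to model confidence, can be sketched in Python. This is a simplified illustration, not the patented method: the function names, the `max_step` parameter, and the model names are hypothetical, and a fixed `target` stands in for the output of the constraint solver. The feedback-driven objective function is replaced by simply stopping once the target is reached.

```python
def select_model(confidences):
    """Pick the model name with the highest confidence value."""
    return max(confidences, key=confidences.get)


def step_toward(current, target, confidence, max_step=10.0):
    """Move `current` toward `target` by at most max_step * confidence."""
    step = max_step * confidence
    delta = target - current
    if abs(delta) <= step:
        return target  # close enough to land exactly on the target
    return current + step if delta > 0 else current - step


def throttle_schedule(start, target, confidence, max_step=10.0):
    """List the throttle value after each repetition until `target` is reached."""
    if confidence <= 0:
        raise ValueError("confidence must be positive to make progress")
    values = [start]
    while values[-1] != target:
        values.append(step_toward(values[-1], target, confidence, max_step))
    return values


# Illustrative confidences for two hypothetical prediction models.
confidences = {"regression": 0.4, "queueing": 0.8}
chosen = select_model(confidences)

# Throttle a workload from 100.0 to a solver-provided target of 70.0 (assumed).
confident_path = throttle_schedule(100.0, 70.0, confidences[chosen])
cautious_path = throttle_schedule(100.0, 70.0, confidences["regression"])
```

The higher-confidence model reaches the target in four steps of 8.0 units or less, while the lower-confidence model takes eight smaller steps, which is the intended effect of tying step size to confidence.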