System and method for online unsupervised event pattern extraction and holistic root cause analysis for distributed systems

    公开(公告)号:US10831585B2

    公开(公告)日:2020-11-10

    申请号:US15937362

    申请日:2018-03-27

    Applicant: Xiaohui Gu

    Inventor: Xiaohui Gu

    Abstract: An unsupervised pattern extraction system and method for extracting user interested patterns from various kinds of data such as system-level metric values, system call traces, and semi-structured or free form text log data and performing holistic root cause analysis for distributed systems. The distributed system includes a plurality of computer machines or smart devices. The system consists of both real time data collection and analytics functions. The analytics functions automatically extract event patterns and recognize recurrent events in real time by analyzing collected data streams from different sources. A root cause analysis component analyzes the extracted events and identifies both correlation and causality relationships among different components to pinpoint root cause of a networked-system anomaly. Furthermore, an anomaly impact prediction component estimates the impact scope of the detected anomaly and raises early alarms about impending service outages or application performance degradations based on the identified correlation and causality relationships.

    Peer-to-peer multi-party voice-over-IP services
    2.
    发明授权
    Peer-to-peer multi-party voice-over-IP services 有权
    点对点多方语音IP服务

    公开(公告)号:US07849138B2

    公开(公告)日:2010-12-07

    申请号:US12038386

    申请日:2008-02-27

    CPC classification number: H04L12/1818 H04L45/16 H04L45/48

    Abstract: A system and computer program product for establishing multi-party VoIP conference audio calls in a distributed, peer-to-peer network where any number of nodes are able to arbitrarily and asynchronously start or stop producing audio output to be mixed into a single composite audio stream that is distributed to all nodes. A single distribution tree is used that has optimal communications characteristics to distribute the composite audio signal to all nodes. An audio mixing tree is established and maintained by adaptively and dynamically adding and merging intermediate mixing nodes operating between user nodes and the root of the single distribution tree. The intermediate mixing nodes and the root of the single distribution tree are all hosted, in an exemplary embodiment, on user nodes that are endpoints of the distribution tree.

    Abstract translation: 一种用于在分布式对等网络中建立多方VoIP会议音频呼叫的系统和计算机程序产品,其中任何数量的节点能够任意地和异步地开始或停止产生要混合到单个复合音频中的音频输出 分发给所有节点的流。 使用具有最佳通信特性以将复合音频信号分配给所有节点的单个分发树。 通过自适应地动态地添加和合并在用户节点和单个分发树的根之间运行的中间混合节点来建立和维护音频混合树。 在示例性实施例中,分发树的中间混合节点和根分别在作为分发树的端点的用户节点上托管。

    System and Method for Scalable Processing of Multi-Way Data Stream Correlations
    3.
    发明申请
    System and Method for Scalable Processing of Multi-Way Data Stream Correlations 失效
    用于多路数据流相关性的可扩展处理的系统和方法

    公开(公告)号:US20090248749A1

    公开(公告)日:2009-10-01

    申请号:US12478627

    申请日:2009-06-04

    Abstract: A computer implemented method, apparatus, and computer usable program code for processing multi-way stream correlations. Stream data are received for correlation. A task is formed for continuously partitioning a multi-way stream correlation workload into smaller workload pieces. Each of the smaller workload pieces may be processed by a single host. The stream data are sent to different hosts for correlation processing.

    Abstract translation: 一种用于处理多路流相关性的计算机实现的方法,装置和计算机可用程序代码。 接收流数据进行相关。 形成一个任务,用于将多路流相关工作负载连续划分成较小的工作负载。 每个较小的工作负载片段可以由单个主机处理。 流数据被发送到不同的主机进行相关处理。

    Method for providing load diffusion in data stream correlations
    4.
    发明授权
    Method for providing load diffusion in data stream correlations 失效
    在数据流相关中提供负载扩散的方法

    公开(公告)号:US07487206B2

    公开(公告)日:2009-02-03

    申请号:US11183149

    申请日:2005-07-15

    CPC classification number: G06F17/30985 Y10S707/99932

    Abstract: A computer implemented method for performing load diffusion to process data stream pairs. A data stream pair is received for correlation. The data stream pair is partitioned into portions to meet correlation constraints for correlating data in the data stream pair to form a partitioned data stream pair. The partitioned data stream pair is sent to a set of nodes for correlation processing to perform the load diffusion.

    Abstract translation: 一种用于执行负载扩散以处理数据流对的计算机实现的方法。 接收数据流对以进行相关。 将数据流对划分成部分以满足用于使数据流对中的数据相关的相关约束,以形成分区数据流对。 分区数据流对被发送到一组节点进行相关处理以执行负载扩散。

    System and Method for Machine Learning Driven Automated Incident Prevention for Distributed Systems

    公开(公告)号:US20240121254A1

    公开(公告)日:2024-04-11

    申请号:US17960204

    申请日:2022-10-05

    Applicant: Xiaohui Gu

    Inventor: Xiaohui Gu

    CPC classification number: H04L63/1425 G06N5/022 H04L63/1441

    Abstract: An unsupervised pattern extraction system and method for extracting incident and root cause patterns from various kinds of machine data such as system-level metric values, system call traces, and semi-structured or free form text log data and performing holistic root cause analysis for distributed systems. The system utilizing Natural Language Processing and machine learning techniques to extract incident and root cause information from received incident reports and other system data. The system consists of both real time data collection (104) and analytics functions (200). The previously reported incident data is used to discover and apply remediation techniques to utilize prior remediation efforts to automatically classify and correct incidents. The system may then annotate a remediation data file with the technique applied. The system will utilize prior known remediation techniques for identified categories to predict and prevent future issues.

    System and Method for Online Unsupervised Event Pattern Extraction and Holistic Root Cause Analysis for Distributed Systems

    公开(公告)号:US20190324831A1

    公开(公告)日:2019-10-24

    申请号:US15937362

    申请日:2018-03-27

    Applicant: Xiaohui Gu

    Inventor: Xiaohui Gu

    Abstract: An unsupervised pattern extraction system and method for extracting user interested patterns from various kinds of data such as system-level metric values, system call traces, and semi-structured or free form text log data and performing holistic root cause analysis for distributed systems. The distributed system includes a plurality of computer machines or smart devices. The system consists of both real time data collection and analytics functions. The analytics functions automatically extract event patterns and recognize recurrent events in real time by analyzing collected data streams from different sources. A root cause analysis component analyzes the extracted events and identifies both correlation and causality relationships among different components to pinpoint root cause of a networked-system anomaly. Furthermore, an anomaly impact prediction component estimates the impact scope of the detected anomaly and raises early alarms about impending service outages or application performance degradations based on the identified correlation and causality relationships.

    Systems and methods for optimal component composition in a stream processing system
    7.
    发明授权
    Systems and methods for optimal component composition in a stream processing system 有权
    流处理系统中最佳组件组成的系统和方法

    公开(公告)号:US08286153B2

    公开(公告)日:2012-10-09

    申请号:US12061284

    申请日:2008-04-02

    CPC classification number: H04L12/4641

    Abstract: A system and method are provided for optimizing component composition in a distributed stream-processing environment having a plurality of nodes capable of being associated with one or more of a plurality of stream processing components. The system includes an adaptive composition probing (ACP) module and a hierarchical state manager. The ACP module probes a subset of the plurality of stream processing components to determine the optimal component composition in response to a stream processing request. The hierarchical state manager manages local and global information for use by said ACP module in determining the optimal component composition.

    Abstract translation: 提供了一种用于在分布式流处理环境中优化组件组成的系统和方法,其具有能够与多个流处理组件中的一个或多个相关联的多个节点。 该系统包括自适应组合探测(ACP)模块和分级状态管理器。 ACP模块探测多个流处理组件的子集,以响应于流处理请求来确定最佳组件组成。 层级状态管理器管理本地和全局信息,供所述ACP模块在确定最佳组件组成时使用。

    Method and system for indexing and serializing data
    8.
    发明授权
    Method and system for indexing and serializing data 失效
    索引和序列化数据的方法和系统

    公开(公告)号:US07752192B2

    公开(公告)日:2010-07-06

    申请号:US11681486

    申请日:2007-03-02

    CPC classification number: G06F17/30911

    Abstract: The present invention provides a computer implemented method, an apparatus, and a computer usable program product for indexing data. A controller identifies a set of data to be indexed, wherein a set of data structure trees represents the set of data. The controller merges the set of data structure trees to form a unified tree, wherein the unified tree contains a node for each unit of data in the set of data. The controller assigns an identifier to the node for each unit of data in the set of data that describes the node within the unified tree. The controller then serializes the unified tree to form a set of sequential series that represents the set of data structure trees, wherein the set of sequential series forms an index for the set of data.

    Abstract translation: 本发明提供了一种用于索引数据的计算机实现的方法,装置和计算机可用程序产品。 控制器识别要索引的一组数据,其中一组数据结构树表示该组数据。 控制器将数据结构树组合成一个统一的树,其中统一树包含一组数据中每个数据单元的节点。 控制器为描述统一树中节点的数据集中的每个数据单元向节点分配一个标识符。 然后,控制器对统一树进行序列化以形成一组代表数据结构树的顺序序列,其中,该顺序序列集合形成该组数据的索引。

    Systems and methods for optimal component composition in a stream processing system
    9.
    发明授权
    Systems and methods for optimal component composition in a stream processing system 失效
    流处理系统中最佳组件组成的系统和方法

    公开(公告)号:US07562355B2

    公开(公告)日:2009-07-14

    申请号:US11068785

    申请日:2005-03-01

    CPC classification number: H04L12/4641

    Abstract: A system and method are provided for optimizing component composition in a distributed stream-processing environment having a plurality of nodes capable of being associated with one or more of a plurality of stream processing components. The system includes an adaptive composition probing (ACP) module and a hierarchical state manager. The ACP module probes a subset of the plurality of stream processing components to determine the optimal component composition in response to a stream processing request. The hierarchical state manager manages local and global information for use by said ACP module in determining the optimal component composition.

    Abstract translation: 提供了一种用于在分布式流处理环境中优化组件组成的系统和方法,其具有能够与多个流处理组件中的一个或多个相关联的多个节点。 该系统包括自适应组合探测(ACP)模块和分级状态管理器。 ACP模块探测多个流处理组件的子集,以响应于流处理请求来确定最佳组件组成。 层级状态管理器管理本地和全局信息,供所述ACP模块在确定最佳组件组成时使用。

    Model-based self-optimizing distributed information management
    10.
    发明授权
    Model-based self-optimizing distributed information management 有权
    基于模型的自优化分布式信息管理

    公开(公告)号:US07720841B2

    公开(公告)日:2010-05-18

    申请号:US11538525

    申请日:2006-10-04

    Abstract: Disclosed are a method, information processing system, and computer readable medium for managing data collection in a distributed processing system. The method includes dynamically collecting at least one statistical query pattern associated with a selected group of information processing nodes. The statistical query pattern is dynamically collected from a plurality of information processing nodes in a distributed processing system. At least one operating attribute distribution associated with an operating attribute that has been queried for the selected group is dynamically monitored. The selected group is dynamically configured, based on the query pattern and the operating attribute distribution, to periodically push a set of attributes associated with the each information processing node in the selected group.

    Abstract translation: 公开了一种用于管理分布式处理系统中的数据收集的方法,信息处理系统和计算机可读介质。 该方法包括动态地收集与所选择的一组信息处理节点相关联的至少一个统计查询模式。 统计查询模式是从分布式处理系统中的多个信息处理节点动态收集的。 动态地监视与被选择组查询的操作属性相关联的至少一个操作属性分布。 基于查询模式和操作属性分布动态地配置所选择的组,以周期性地推送与所选择的组中的每个信息处理节点相关联的一组属性。

Patent Agency Ranking