AUTOMATED SYSTEM PROBLEM DIAGNOSING
    1.
    发明申请
    AUTOMATED SYSTEM PROBLEM DIAGNOSING 有权
    自动化系统问题诊断

    公开(公告)号:US20110185233A1

    公开(公告)日:2011-07-28

    申请号:US12693373

    申请日:2010-01-25

    IPC分类号: G06F11/07 G06F17/30

    CPC分类号: G06F17/30675 G06F11/079

    摘要: Embodiments of the invention relate to automated system problem diagnosing. An index is created with problem description information of previously diagnosed problems, a diagnosis for each problem, and a solution to each diagnosis. System states, traces and logs are extracted from a source system with a new problem. The problem diagnosis system generates problem description information of the new problem from the system states, traces and logs. Problem description information of the new problem is compared with problem description information in the problem description index. A search score is computed for each document in the problem description index. The search score is a measure of similarity between each document in the index and the description of the new problem. A matching score is assigned to each previously diagnosed problems based on the search score. The matching score is a measure of similarity between the new problem and each previously diagnosed problem. The system determines a diagnosis and solution of the new problem based on a diagnosis and solution of one of the previously diagnosed problems.

    摘要翻译: 本发明的实施例涉及自动化系统问题诊断。 创建索引,其中包含先前诊断的问题的问题描述信息,每个问题的诊断以及每个诊断的解决方案。 系统状态,跟踪和日志是从具有新问题的源系统中提取出来的。 问题诊断系统从系统状态,跟踪和日志生成新问题的问题描述信息。 新问题的问题描述信息与问题描述索引中的问题描述信息进行比较。 对问题描述索引中的每个文档计算搜索分数。 搜索分数是索引中的每个文档与新问题的描述之间的相似度的量度。 根据搜索得分将匹配得分分配给每个先前诊断的问题。 匹配分数是新问题和每个先前诊断的问题之间的相似性的量度。 该系统基于以前诊断的问题之一的诊断和解决方案来确定新问题的诊断和解决方案。

    Structural data classification
    2.
    发明授权
    Structural data classification 有权
    结构数据分类

    公开(公告)号:US08121967B2

    公开(公告)日:2012-02-21

    申请号:US12141251

    申请日:2008-06-18

    IPC分类号: G06F17/00 G06N5/02

    CPC分类号: G06N99/005

    摘要: Techniques for classifying structural data with skewed distribution are disclosed. By way of example, a method classifying structural input data comprises a computer system performing the following steps. Multiple classifiers are constructed, wherein each classifier is constructed on a subset of training data, using one or more selected composite features from the subset of training data. A consensus among the multiple classifiers is computed in accordance with a voting scheme such that at least a portion of the structural input data is assigned to a particular class in accordance with the computed consensus. Such techniques for structured data classification are capable of handling skewed class distribution and partial feature coverage issues.

    摘要翻译: 公开了分布具有偏斜分布的结构数据的技术。 作为示例,分类结构输入数据的方法包括执行以下步骤的计算机系统。 构建多个分类器,其中使用来自训练数据的子集的一个或多个选定的复合特征,在训练数据的子集上构建每个分类器。 根据投票方案计算多个分类器之间的共识,使得至少一部分结构输入数据根据所计算的一致性被分配给特定类别。 这种用于结构化数据分类的技术能够处理倾斜的类分布和部分特征覆盖问题。

    System and method for efficiently performing similarity searches of structural data
    3.
    发明申请
    System and method for efficiently performing similarity searches of structural data 有权
    有效执行结构数据相似性检索的系统和方法

    公开(公告)号:US20060224562A1

    公开(公告)日:2006-10-05

    申请号:US11096165

    申请日:2005-03-31

    申请人: Xifeng Yan Philip Yu

    发明人: Xifeng Yan Philip Yu

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30536 G06F19/705

    摘要: Techniques for similarity searching are provided. In one aspect, a method of searching structural data in a database against one or more structural queries comprises the following steps. A desired minimum degree of similarity between the one or more queries and the structural data in the database is first specified. One or more indices are then used to exclude from consideration any structural data in the database that does not share the minimum degree of similarity with one or more of the queries.

    摘要翻译: 提供了相似搜索的技术。 在一个方面,一种在一个或多个结构性查询中搜索数据库中的结构数据的方法包括以下步骤。 首先指定一个或多个查询与数据库中的结构数据之间期望的最小相似程度。 然后使用一个或多个索引从考虑中排除不与一个或多个查询共享最小相似度的数据库中的任何结构数据。

    System and method for efficiently performing similarity searches of structural data
    4.
    发明授权
    System and method for efficiently performing similarity searches of structural data 有权
    有效执行结构数据相似性检索的系统和方法

    公开(公告)号:US09165042B2

    公开(公告)日:2015-10-20

    申请号:US11096165

    申请日:2005-03-31

    IPC分类号: G06F7/00 G06F17/30 G06F19/00

    CPC分类号: G06F17/30536 G06F19/705

    摘要: Techniques for similarity searching are provided. Structural data in a database is searched against one or more structural queries. A desired minimum degree of similarity between the one or more queries and the structural data in the database is first specified. One or more indices are then used to exclude from consideration any structural data in the database that does not share the minimum degree of similarity with one or more of the queries.

    摘要翻译: 提供了相似搜索的技术。 针对一个或多个结构查询搜索数据库中的结构数据。 首先指定一个或多个查询与数据库中的结构数据之间期望的最小相似程度。 然后使用一个或多个索引从考虑中排除不与一个或多个查询共享最小相似度的数据库中的任何结构数据。

    Automated system problem diagnosing
    5.
    发明授权
    Automated system problem diagnosing 有权
    自动化系统故障诊断

    公开(公告)号:US08112667B2

    公开(公告)日:2012-02-07

    申请号:US12693373

    申请日:2010-01-25

    IPC分类号: G06F11/00

    CPC分类号: G06F17/30675 G06F11/079

    摘要: Embodiments of the invention relate to automated system problem diagnosing. An index is created with problem description information of previously diagnosed problems, a diagnosis for each problem, and a solution to each diagnosis. System states, traces and logs are extracted from a source system with a new problem. The problem diagnosis system generates problem description information of the new problem from the system states, traces and logs. Problem description information of the new problem is compared with problem description information in the problem description index. A search score is computed for each document in the problem description index. The search score is a measure of similarity between each document in the index and the description of the new problem. A matching score is assigned to each previously diagnosed problems based on the search score. The matching score is a measure of similarity between the new problem and each previously diagnosed problem. The system determines a diagnosis and solution of the new problem based on a diagnosis and solution of one of the previously diagnosed problems.

    摘要翻译: 本发明的实施例涉及自动化系统问题诊断。 创建索引,其中包含先前诊断的问题的问题描述信息,每个问题的诊断以及每个诊断的解决方案。 系统状态,跟踪和日志是从具有新问题的源系统中提取出来的。 问题诊断系统从系统状态,跟踪和日志生成新问题的问题描述信息。 新问题的问题描述信息与问题描述索引中的问题描述信息进行比较。 对问题描述索引中的每个文档计算搜索分数。 搜索分数是索引中的每个文档与新问题的描述之间的相似度的量度。 根据搜索得分将匹配得分分配给每个先前诊断的问题。 匹配分数是新问题和每个先前诊断的问题之间的相似性的量度。 该系统基于以前诊断的问题之一的诊断和解决方案来确定新问题的诊断和解决方案。

    METHOD AND APPARATUS FOR STRUCTURAL DATA CLASSIFICATION
    6.
    发明申请
    METHOD AND APPARATUS FOR STRUCTURAL DATA CLASSIFICATION 有权
    用于结构数据分类的方法和装置

    公开(公告)号:US20090319457A1

    公开(公告)日:2009-12-24

    申请号:US12141251

    申请日:2008-06-18

    IPC分类号: G06N5/02

    CPC分类号: G06N99/005

    摘要: Techniques for classifying structural data with skewed distribution are disclosed. By way of example, a method classifying structural input data comprises a computer system performing the following steps. Multiple classifiers are constructed, wherein each classifier is constructed on a subset of training data, using one or more selected composite features from the subset of training data. A consensus among the multiple classifiers is computed in accordance with a voting scheme such that at least a portion of the structural input data is assigned to a particular class in accordance with the computed consensus. Such techniques for structured data classification are capable of handling skewed class distribution and partial feature coverage issues.

    摘要翻译: 公开了分布具有偏斜分布的结构数据的技术。 作为示例,分类结构输入数据的方法包括执行以下步骤的计算机系统。 构建多个分类器,其中使用来自训练数据的子集的一个或多个选定的复合特征,在训练数据的子集上构建每个分类器。 根据投票方案计算多个分类器之间的共识,使得至少一部分结构输入数据根据所计算的一致性被分配给特定类别。 这种用于结构化数据分类的技术能够处理倾斜的类分布和部分特征覆盖问题。

    System for entity search and a method for entity scoring in a linked document database
    7.
    发明授权
    System for entity search and a method for entity scoring in a linked document database 有权
    用于实体搜索的系统和在链接的文档数据库中实体评分的方法

    公开(公告)号:US08117208B2

    公开(公告)日:2012-02-14

    申请号:US12233812

    申请日:2008-09-19

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30867

    摘要: A system has a processor coupled to access a document database that indexes keywords and instances of entities having entity types in a plurality of documents. The processor is programmed to receive an input query including one or more keywords and one or more entity types, and search the database for documents having the keywords and entities with the entity types of the input query. The processor is programmed for aggregating a respective score for each of a plurality of entity tuples across the plurality of documents. The aggregated scores are normalized. Each respective normalized score provides a ranking of a respective entity tuple, relative to other entity tuples, as an answer to the input query. The processor has an interface to a storage or display device or network for outputting a list including a subset of the entity tuples having the highest normalized scores among the plurality of entity tuples.

    摘要翻译: 系统具有处理器,其耦合到访问文档数据库,所述文档数据库在多个文档中对具有实体类型的关键字和实体的实例进行索引。 处理器被编程为接收包括一个或多个关键字和一个或多个实体类型的输入查询,并且搜索数据库中具有关键词和具有输入查询的实体类型的实体的文档。 处理器被编程用于聚集跨多个文档的多个实体元组中的每一个的相应分数。 综合得分归一化。 每个相应的归一化分数提供相对于其他实体元组的相应实体元组作为输入查询的答案的排序。 处理器具有到存储或显示设备或网络的接口,用于输出包括多个实体元组中具有最高归一化分数的实体元组的子集的列表。

    System and method for graph indexing
    8.
    发明授权
    System and method for graph indexing 失效
    图索引的系统和方法

    公开(公告)号:US07974978B2

    公开(公告)日:2011-07-05

    申请号:US10835729

    申请日:2004-04-30

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30247 G06Q30/0201

    摘要: Techniques for graph indexing are provided. In one aspect, a method for indexing graphs in a database, the graphs comprising graphic data, comprises the following steps. Frequent subgraphs among one or more of the graphs in the database are identified, the frequent subgraphs appearing in at least a threshold number of the graphs in the database. One or more of the frequent subgraphs are used to create an index of the graphs in the database.

    摘要翻译: 提供图索引技术。 一方面,一种用于索引数据库中的图形的方法,包括图形数据的图形包括以下步骤。 识别数据库中一个或多个图形中的频繁子图,频繁的子图出现在数据库中至少阈值数量的图形中。 一个或多个频繁子图用于创建数据库中图形的索引。

    SYSTEM FOR ENTITY SEARCH AND A METHOD FOR ENTITY SCORING IN A LINKED DOCUMENT DATABASE
    9.
    发明申请
    SYSTEM FOR ENTITY SEARCH AND A METHOD FOR ENTITY SCORING IN A LINKED DOCUMENT DATABASE 有权
    用于实体搜索的系统和用于在链接的文档数据库中进行实体分类的方法

    公开(公告)号:US20090083262A1

    公开(公告)日:2009-03-26

    申请号:US12233812

    申请日:2008-09-19

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30867

    摘要: A system has a processor coupled to access a document database that indexes keywords and instances of entities having entity types in a plurality of documents. The processor is programmed to receive an input query including one or more keywords and one or more entity types, and search the database for documents having the keywords and entities with the entity types of the input query. The processor is programmed for aggregating a respective score for each of a plurality of entity tuples across the plurality of documents. The aggregated scores are normalized. Each respective normalized score provides a ranking of a respective entity tuple, relative to other entity tuples, as an answer to the input query. The processor has an interface to a storage or display device or network for outputting a list including a subset of the entity tuples having the highest normalized scores among the plurality of entity tuples.

    摘要翻译: 系统具有处理器,其耦合到访问文档数据库,所述文档数据库在多个文档中对具有实体类型的关键字和实体的实例进行索引。 处理器被编程为接收包括一个或多个关键字和一个或多个实体类型的输入查询,并且搜索数据库中具有关键词和具有输入查询的实体类型的实体的文档。 处理器被编程用于聚集跨多个文档的多个实体元组中的每一个的相应分数。 综合得分归一化。 每个相应的归一化分数提供相对于其他实体元组的相应实体元组作为输入查询的答案的排序。 处理器具有到存储或显示设备或网络的接口,用于输出包括多个实体元组中具有最高归一化分数的实体元组的子集的列表。

    System and method for graph indexing
    10.
    发明申请
    System and method for graph indexing 失效
    图索引的系统和方法

    公开(公告)号:US20060036564A1

    公开(公告)日:2006-02-16

    申请号:US10835729

    申请日:2004-04-30

    申请人: Xifeng Yan Philip Yu

    发明人: Xifeng Yan Philip Yu

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30247 G06Q30/0201

    摘要: Techniques for graph indexing are provided. In one aspect, a method for indexing graphs in a database, the graphs comprising graphic data, comprises the following steps. Frequent subgraphs among one or more of the graphs in the database are identified, the frequent subgraphs appearing in at least a threshold number of the graphs in the database. One or more of the frequent subgraphs are used to create an index of the graphs in the database.

    摘要翻译: 提供图索引技术。 一方面,一种用于索引数据库中的图形的方法,包括图形数据的图形包括以下步骤。 识别数据库中一个或多个图形中的频繁子图,频繁的子图出现在数据库中至少阈值数量的图形中。 一个或多个频繁子图用于创建数据库中图形的索引。