Method and system for detecting frequent association patterns
    1.
    发明授权
    Method and system for detecting frequent association patterns 失效
    检测频繁关联模式的方法和系统

    公开(公告)号:US06618725B1

    公开(公告)日:2003-09-09

    申请号:US09699661

    申请日:2000-10-30

    IPC分类号: G06F1730

    摘要: A text-mining system and method automatically extracts useful information from a large set of tree-structured data by generating successive sets of candidate tree-structured association patterns for comparison with the tree-structured data. The number of times is counted that each of the candidate association patterns matches with a tree in the set of tree-structured data in order to determine which of the candidate association patterns frequently matches with a tree in the data set. Each successive set of candidate association patterns is generated from the frequent association patterns determined from the previous set of candidate association patterns.

    摘要翻译: 文本挖掘系统和方法通过生成连续的候选树结构关联模式集合来与大量树结构数据自动提取有用信息,以便与树结构数据进行比较。 计算次数,使得每个候选关联模式与树结构数据集合中的树匹配,以便确定候选关联模式中的哪一个频繁地与数据集中的树匹配。 每个连续的一组候选关联模式是从从先前的一组候选关联模式确定的频繁关联模式生成的。

    Method for executing aggregate queries, and computer system
    2.
    发明授权
    Method for executing aggregate queries, and computer system 失效
    执行汇总查询的方法和计算机系统

    公开(公告)号:US06182061B2

    公开(公告)日:2001-01-30

    申请号:US09057520

    申请日:1998-04-09

    IPC分类号: G06F1730

    摘要: To provide a method for performing a plurality of aggregations in parallel and at a high speed, in a computer system so constructed that each of a plurality of processors connected across a network can use a memory area for itself and a part of the database for itself that includes data categorized into one or a plurality of groups, a method comprising the steps of: (a) ensuring space for storing results of M aggregate queries of the N aggregate queries (M is an integer equal to or less than N) in the memory area for itself in each processor; (b) executing all of the M aggregate queries for the part of the database for itself in each processor; (c) transmitting the results of the M aggregate queries executed by each processor to another processor for counting up and calculating of a final result for counting up; and (d) repeating the steps (a) to (c) until execution of the N aggregate queries is completed by each processor.

    摘要翻译: 为了提供并行和高速地执行多个聚合的方法,在计算机系统中,构造为跨越网络连接的多个处理器中的每一个可以将自身的存储区域和数据库的一部分本身使用 其包括分类为一个或多个组的数据,一种方法包括以下步骤:(a)确保存储N个聚合查询的M个聚合查询的结果的空间(M是等于或小于N的整数) 每个处理器本身的存储区域; (b)在每个处理器中执行数据库部分的所有M个聚合查询; (c)将由每个处理器执行的M个聚合查询的结果传送到另一个处理器,以计数和计算最后结果; 和(d)重复步骤(a)至(c),直到每个处理器完成N个聚合查询的执行。

    Drawing candidate line segments extraction system, drawing candidate line segments extraction method, solid model synthesis system, and solid model synthesis method
    3.
    发明授权
    Drawing candidate line segments extraction system, drawing candidate line segments extraction method, solid model synthesis system, and solid model synthesis method 失效
    绘制候选线段提取系统,绘制候选线段提取方法,实体模型合成系统和实体模型合成方法

    公开(公告)号:US06400363B1

    公开(公告)日:2002-06-04

    申请号:US09522861

    申请日:1998-05-26

    IPC分类号: G06T1500

    CPC分类号: G06T17/10

    摘要: The two-dimensional coordinates of a vertex are extracted in each of a top view and front view and, if their X-coordinates are equal to each other, the combination of their Y-coordinate values is determined to be the two-dimensional coordinates of a candidate vertex in a side view. Then, candidate line segments for the side view are extracted from the line segments connecting two candidate vertices, excepting not only those line segments for which no corresponding line segment exists in the top and front views, but a so those line segments for which corresponding horizontal or vertical line segments exist in the top and front views, and which a e not horizontal or vertical in the side view.

    摘要翻译: 在顶视图和前视图中的每一个中提取顶点的二维坐标,并且如果它们的X坐标彼此相等,则将它们的Y坐标值的组合确定为 侧视图中的候选顶点。 然后,从连接两个候选顶点的线段提取用于侧视图的候选线段,除了不仅在顶视图和前视图中不存在对应的线段的那些线段,而且对于相应的水平线 或垂直线段存在于顶视图和正视图中,以及侧视图中不是水平或垂直的。

    Method and system for obtaining a combination of faulty parts from a dispersed parts tree
    5.
    发明授权
    Method and system for obtaining a combination of faulty parts from a dispersed parts tree 有权
    从分散的部分树中获得有缺陷的部分的组合的方法和系统

    公开(公告)号:US07769706B2

    公开(公告)日:2010-08-03

    申请号:US12489244

    申请日:2009-06-22

    IPC分类号: G06F15/00 G06F15/18

    摘要: It is an object of the present invention to find out parts to be a highly possible cause of failure without searching all of part data of all of products. Dispersed parts data on a parts tree are sequentially accessed from a set of known failed products, and part attribute values each having a higher support in the faulty product are extracted. In this process, a subset of parts used in the faulty product is also obtained simultaneously. The part attribute values having higher supports and the subset of parts used in the faulty product are represented as a tree in which a parts type serves as a node. Next, an information gain of a rule that having the two part attribute values is a cause of failure is calculated on two part attribute values having higher supports on the tree of the parts type. This calculation is locally performed on a common parent part of two parts and parts having a certain information gain is outputted as a cause of failure. How to select these two part attributes is performed in such a way that part attributes located closer to each other on the tree are first evaluated, and first found part attributes are made a candidate of a cause of failure.

    摘要翻译: 本发明的一个目的是在不搜索所有产品的全部零件数据的情况下,发现零件是非常可能的故障原因。 从一组已知的故障产品中顺序访问零件树上的分散零件数据,并提取每个在故障产品中具有较高支持度的零件属性值。 在此过程中,也可以同时获得在故障产品中使用的部件的子集。 具有较高支持的部件属性值和在故障产品中使用的部件的子集被表示为其中部件类型用作节点的树。 接下来,对具有两部分属性值的规则的信息增益作为故障原因,对具有较高支持度的零件类型的树上的两部分属性值进行计算。 该计算在两部分的公共父部分执行,并且具有某一信息增益的部分作为故障的原因被输出。 首先评估如何选择这两个部分属性,使得首先评估在树上彼此更靠近的部分属性,并且首先发现零件属性成为故障原因的候选者。

    Method and system for obtaining a combination of faulty parts from a dispersed parts tree
    6.
    发明授权
    Method and system for obtaining a combination of faulty parts from a dispersed parts tree 失效
    从分散的部分树中获得有缺陷的部分的组合的方法和系统

    公开(公告)号:US07567948B2

    公开(公告)日:2009-07-28

    申请号:US11865199

    申请日:2007-10-01

    IPC分类号: G06F15/18 G06F15/00

    摘要: It is an object of the present invention to find out parts to be a highly possible cause of failure without searching all of part data of all of products.Dispersed parts data on a parts tree are sequentially accessed from a set of known failed products, and part attribute values each having a higher support in the faulty product are extracted. In this process, a subset of parts used in the faulty product is also obtained simultaneously. The part attribute values having higher supports and the subset of parts used in the faulty product are represented as a tree in which a parts type serves as a node. Next, an information gain of a rule that having the two part attribute values is a cause of failure is calculated on two part attribute values having higher supports on the tree of the parts type. This calculation is locally performed on a common parent part of two parts and parts having a certain information gain is outputted as a cause of failure. How to select these two part attributes is performed in such a way that part attributes located closer to each other on the tree are first evaluated, and first found part attributes are made a candidate of a cause of failure.

    摘要翻译: 本发明的一个目的是在不搜索所有产品的全部零件数据的情况下,发现零件是非常可能的故障原因。 从一组已知的故障产品中顺序访问零件树上的分散零件数据,并提取每个在故障产品中具有较高支持度的零件属性值。 在此过程中,也可以同时获得在故障产品中使用的部件的子集。 具有较高支持的部件属性值和在故障产品中使用的部件的子集被表示为其中部件类型用作节点的树。 接下来,对具有两部分属性值的规则的信息增益作为故障原因,对具有较高支持度的零件类型的树上的两部分属性值进行计算。 该计算在两部分的公共父部分执行,并且具有某一信息增益的部分作为故障的原因被输出。 首先评估如何选择这两个部分属性,使得首先评估在树上彼此更靠近的部分属性,并且首先找到零件属性成为故障原因的候选者。

    Candidate synonym support device for generating candidate synonyms that can handle abbreviations, mispellings, and the like
    7.
    发明授权
    Candidate synonym support device for generating candidate synonyms that can handle abbreviations, mispellings, and the like 失效
    候选同义词支持装置,用于生成可以处理缩写,误导等的候选同义词

    公开(公告)号:US07483829B2

    公开(公告)日:2009-01-27

    申请号:US10484333

    申请日:2002-07-19

    IPC分类号: G06F17/27 G06F17/20 G06F7/00

    摘要: A candidate synonym acquisition device acquires a set of candidate synonyms similar to an input word for each writer from data for each writer, and acquires a set of candidate synonyms similar to the input word from a collective data. A generated candidate synonym set is inputted to a candidate synonym determination device to evaluate the candidate synonyms of the collective data. In the evaluation, the status of “absolute” is given to a word matching a word ranked first in the candidate synonyms for each writer and the status of “negative” is given to words matching words ranked second and lower therein.

    摘要翻译: 候选同义词获取装置从每个写入器的数据中获取与每个写入器的输入字类似的一组候选同义词,并且从集合数据获取类似于输入字的一组候选同义词。 将生成的候选同义词集合输入到候选同义词确定装置,以评估集合数据的候选同义词。 在评价中,对于与每个作者的候选同义词首先排列的单词匹配的单词被赋予“绝对”的状态,并且将“否定”的状态赋予与第二和第二排列的词相匹配的单词。

    Graphics image generation and data analysis
    8.
    发明申请
    Graphics image generation and data analysis 审中-公开
    图形图像生成和数据分析

    公开(公告)号:US20070185904A1

    公开(公告)日:2007-08-09

    申请号:US10933657

    申请日:2004-09-02

    IPC分类号: G06F7/00

    CPC分类号: G06F16/9024

    摘要: Provides graphics display apparatus, systems and methods for effectively presenting information obtained by data mining, and to improve the visibility of the display of individual data elements and attributes of data included in a particular category while allowing an overview of whole large-scale hierarchical data to be provided. An example embodiment includes an aggregation unit for performing aggregation of attributes of nodes in the hierarchical data according to given aggregation criteria; a filtering unit for filtering the result of aggregation performed by the aggregation unit according to given filtering criteria to select nodes to be displayed from the hierarchical data; and a visualization unit for generating a graphics image that includes the nodes to be displayed selected by the filtering unit and reflects the hierarchical structure of the hierarchical data.

    摘要翻译: 提供用于有效地呈现通过数据挖掘获得的信息的图形显示装置,系统和方法,并且提高单个数据元素的显示和特定类别中包括的数据的属性的可视性,同时允许将整个大规模分层数据概述到 提供。 示例性实施例包括:聚合单元,用于根据给定的聚合标准执行分层数据中的节点的属性的聚合; 过滤单元,用于根据给定的过滤标准过滤由聚合单元执行的聚合结果,以从分层数据中选择要显示的节点; 以及可视化单元,用于生成包括由所述过滤单元选择的要显示的节点的图形图像,并且反映所述分层数据的分层结构。

    Change analysis
    9.
    发明授权
    Change analysis 有权
    变化分析

    公开(公告)号:US08417648B2

    公开(公告)日:2013-04-09

    申请号:US12372545

    申请日:2009-02-17

    IPC分类号: G06E1/00 G06F15/00

    CPC分类号: G06K9/623 G06N99/005

    摘要: Different virtual labels, for example, like +1 and −1, are assigned to two data sets. A change analysis problem for the two data sets is reduced to a supervised learning problem by using the virtual labels. Specifically, a classifier such as logical regression, decision tree and SVM is prepared and is trained by use of a data set obtained by merging the two data sets assigned the virtual labels. A feature selection function of the resultant classifier is used to rank and output both every attribute contributing to classification and its contribution rate.

    摘要翻译: 不同的虚拟标签,例如+1和-1,分配给两个数据集。 通过使用虚拟标签将两个数据集的变化分析问题简化为监督学习问题。 具体地说,准备了诸如逻辑回归,决策树和SVM的分类器,并且通过使用通过合并分配了虚拟标签的两个数据集获得的数据集进行训练。 使用得到的分类器的特征选择功能对贡献于分类的每个属性及其贡献率进行排序和输出。

    Apparatus, method and program for refreshing a summary table
    10.
    发明授权
    Apparatus, method and program for refreshing a summary table 失效
    用于刷新汇总表的装置,方法和程序

    公开(公告)号:US08271440B2

    公开(公告)日:2012-09-18

    申请号:US11876520

    申请日:2007-10-22

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30377

    摘要: An apparatus is provided with base table storage sections that store base tables and delta tables for the base tables, a summary table storage section that stores a summary table for storing results of queries to a plurality of base tables and delta information about the summary table, delta data processing sections that insert delta data of the base tables into the delta tables, and a delta computation processing section that generates delta information about the summary table. The delta computation processing section is provided with a generation section that generates delta information about a specified base table on the basis of an update that has been performed for the base table, in a situation where a subsequent update of the specified base table is permitted; and a control section that performs control so that, when a different base table is specified, delta information is generated in a different transaction.

    摘要翻译: 提供了一种设备,用于存储用于基表的基表和增量表的基表存储部分,汇总表存储部分,其将用于存储查询结果的汇总表存储到多个基表和关于汇总表的增量信息, 增量数据处理部分,其将基本表的增量数据插入增量表;以及增量计算处理部分,其生成关于汇总表的增量信息。 增量计算处理部分具有生成部,其在允许指定的基表的后续更新的情况下,基于对基表进行的更新,生成关于指定的基表的增量信息; 以及控制部,其执行控制,使得当指定不同的基表时,在不同的事务中生成增量信息。