Using data mining algorithms including association rules and tree classifications to discover data rules
    12.
    发明授权
    Using data mining algorithms including association rules and tree classifications to discover data rules 有权
    使用包括关联规则和树分类的数据挖掘算法来发现数据规则

    公开(公告)号:US07836004B2

    公开(公告)日:2010-11-16

    申请号:US11609307

    申请日:2006-12-11

    IPC分类号: G06N5/02

    CPC分类号: G06F17/30303

    摘要: Provided are a method, system, and article of manufacture for using a data mining algorithm to discover data rules. A data set including multiple records is processed to generate data rules for the data set. Each record has a record format including a plurality of fields and each rule provides a predicted condition for one field based on at least one predictor condition in at least one other field. The generated data rules are provided to a user interface to enable a user to edit the generated data rules. The data rules are stored in a rule repository to be available to use to validate data sets having the record format.

    摘要翻译: 提供了使用数据挖掘算法来发现数据规则的方法,系统和制品。 处理包括多个记录的数据集以生成数据集的数据规则。 每个记录具有包括多个字段的记录格式,并且每个规则基于至少一个其他字段中的至少一个预测变量条件提供一个字段的预测条件。 生成的数据规则被提供给用户界面,以使用户能够编辑所生成的数据规则。 数据规则存储在可用于验证具有记录格式的数据集的规则库中。

    Using a data mining algorithm to generate rules used to validate a selected region of a predicted column
    13.
    发明授权
    Using a data mining algorithm to generate rules used to validate a selected region of a predicted column 失效
    使用数据挖掘算法生成用于验证预测列的选定区域的规则

    公开(公告)号:US08171001B2

    公开(公告)日:2012-05-01

    申请号:US11769634

    申请日:2007-06-27

    IPC分类号: G06F17/00

    CPC分类号: G06N5/022 G06F17/30303

    摘要: Provided are an article of manufacture, system, and method for using a data mining algorithm to generate rules used to validate a selected region of a predicted column. A data set has a plurality of columns and records providing data for each of the columns. Selection is received of at least one predicted column for which rules are to be generated and at least one region of the selected at least one predicted column, wherein each region specifies data positions in the column. The data set is processed to determine association relationships among data in at least one predictor column and subsequences in the selected at least one region of the at least one predicted column. At least one rule is generated from the relationships specifying a condition involving at least one predictor column that predicts at least one value in the selected region of the at least one predicted column.

    摘要翻译: 提供了使用数据挖掘算法来生成用于验证预测列的选定区域的规则的制造,系统和方法。 数据集具有多个列和用于为每个列提供数据的记录。 接收到要为其生成规则的至少一个预测列和所选择的至少一个预测列的至少一个区域的选择,其中每个区域指定列中的数据位置。 处理数据集以确定至少一个预测列中的数据与所选择的至少一个预测列的所选择的至少一个区域中的子序列之间的关联关系。 从关系中产生至少一个规则,所述关系规定了涉及至少一个预测列的条件,所述预测器列预测所述至少一个预测列的所选区域中的至少一个值。

    SCHEMA MAPPING SPECIFICATION FRAMEWORK
    14.
    发明申请
    SCHEMA MAPPING SPECIFICATION FRAMEWORK 失效
    SCHEMA映射规范框架

    公开(公告)号:US20080256124A1

    公开(公告)日:2008-10-16

    申请号:US12143875

    申请日:2008-06-23

    IPC分类号: G06F17/30

    摘要: A method, system and program product for specifying, in a schema mapping framework, a mapping between a source schema and a target schema. The source and target schemas are schemas included in respective groups of registered, heterogeneous schemas. The source and target schemas may be of different types. Serialized versions of the source and target schemas include source objects and target objects, respectively. A mapping model is serialized into mapping objects that include logical references representing the source objects and logical references representing the target objects. The logical references are resolved to the source objects and target objects, thereby storing pointers to the source objects and to the target objects. After resolving the logical references, the mapping model includes the logical references and the pointers to the source and target objects.

    摘要翻译: 一种用于在模式映射框架中指定源模式和目标模式之间的映射的方法,系统和程序产品。 源和目标模式是包括在注册的,异构模式的相应组中的模式。 源和目标模式可能有不同的类型。 源和目标模式的序列化版本分别包括源对象和目标对象。 将映射模型序列化为映射对象,其中包含表示源对象的逻辑引用和表示目标对象的逻辑引用。 逻辑引用被解析为源对象和目标对象,从而存储指向源对象和目标对象的指针。 在解析逻辑引用之后,映射模型包括逻辑引用和指向源对象和目标对象的指针。

    Discovering transformations applied to a source table to generate a target table

    公开(公告)号:US09720971B2

    公开(公告)日:2017-08-01

    申请号:US12165549

    申请日:2008-06-30

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30507

    摘要: Provided are a method, system, and article of manufacture for discovering transformations applied to a source table to generate a target table. Selection is made of a source table comprising a plurality of rows and a target table resulting from a transformation applied to the rows of the source table. A first pre-processing method is applied with respect to columns in the source and target tables to produce first category pre-processing output. The first category pre-processing output is used to determine first category transformation rules with respect to at least one source table column and at least one target table column. For each unpredicted target column in the target table not predicted by the determined first category transformation rules, a second pre-processing method is applied to columns in the source table and unpredicted target columns to produce second category pre-processing output. The second category pre-processing output is used to determine second category transformation rules with respect to at least one source table column and at least one target table column.

    DISCOVERING TRANSFORMATIONS APPLIED TO A SOURCE TABLE TO GENERATE A TARGET TABLE
    16.
    发明申请
    DISCOVERING TRANSFORMATIONS APPLIED TO A SOURCE TABLE TO GENERATE A TARGET TABLE 有权
    发现适用于源表的变换以产生目标表

    公开(公告)号:US20090327208A1

    公开(公告)日:2009-12-31

    申请号:US12165549

    申请日:2008-06-30

    IPC分类号: G06N5/04

    CPC分类号: G06F17/30507

    摘要: Provided are a method, system, and article of manufacture for discovering transformations applied to a source table to generate a target table. Selection is made of a source table comprising a plurality of rows and a target table resulting from a transformation applied to the rows of the source table. A first pre-processing method is applied with respect to columns in the source and target tables to produce first category pre-processing output. The first category pre-processing output is used to determine first category transformation rules with respect to at least one source table column and at least one target table column. For each unpredicted target column in the target table not predicted by the determined first category transformation rules, a second pre-processing method is applied to columns in the source table and unpredicted target columns to produce second category pre-processing output. The second category pre-processing output is used to determine second category transformation rules with respect to at least one source table column and at least one target table column.

    摘要翻译: 提供了用于发现应用于源表以生成目标表的变换的方法,系统和制品。 选择包括多个行的源表和由应用于源表的行的变换产生的目标表。 对源表和目标表中的列应用第一预处理方法以产生第一类预处理输出。 第一类预处理输出用于确定关于至少一个源表列和至少一个目标表列的第一类转换规则。 对于目标表中未被确定的第一类别变换规则预测的每个未预测的目标列,将第二预处理方法应用于源表中的列和不可预测的目标列以产生第二类预处理输出。 第二类预处理输出用于确定关于至少一个源表列和至少一个目标表列的第二类转换规则。

    Schema mapping specification framework
    17.
    发明授权
    Schema mapping specification framework 失效
    模式映射规范框架

    公开(公告)号:US07519606B2

    公开(公告)日:2009-04-14

    申请号:US11343503

    申请日:2006-01-31

    IPC分类号: G06F7/00

    摘要: A method and system for specifying, in a schema mapping framework, a mapping between a source schema and a target schema. The source and target schemas are schemas included in respective groups of registered, heterogeneous schemas. The source and target schemas may be of different types. Serialized versions of the source and target schemas include source objects and target objects, respectively. A mapping model is serialized into mapping objects that include logical references representing the source objects and logical references representing the target objects. The logical references are resolved to the source objects and target objects, thereby storing pointers to the source objects and to the target objects. After resolving the logical references, the mapping model includes the logical references and the pointers to the source and target objects.

    摘要翻译: 在模式映射框架中指定源模式和目标模式之间的映射的方法和系统。 源和目标模式是包括在注册的,异构模式的相应组中的模式。 源和目标模式可能有不同的类型。 源和目标模式的序列化版本分别包括源对象和目标对象。 将映射模型序列化为映射对象,其中包含表示源对象的逻辑引用和表示目标对象的逻辑引用。 逻辑引用被解析为源对象和目标对象,从而存储指向源对象和目标对象的指针。 在解析逻辑引用之后,映射模型包括逻辑引用和指向源对象和目标对象的指针。

    Method, system, and program for invoking methods between computer languages
    18.
    发明授权
    Method, system, and program for invoking methods between computer languages 有权
    用于在计算机语言之间调用方法的方法,系统和程序

    公开(公告)号:US07437741B2

    公开(公告)日:2008-10-14

    申请号:US10795644

    申请日:2004-03-08

    IPC分类号: G06F13/00

    CPC分类号: G06F9/548 G06F9/4484

    摘要: Provided are a method, system, and program for invoking methods between computer languages. A plurality of subclasses of a class in a first computer language are instantiated and a plurality of subclasses of a class in a second computer language are instantiated, wherein methods in the subclasses in the second computer language implement functionality of methods in the subclasses in the first computer language. A call is received from an application in the first computer language to one method in the subclasses of the first computer language, wherein the call to the method in the subclass of the first computer language invokes a corresponding method in the subclasses of the second computer language that implements the functionality of the called method in the first computer language.

    摘要翻译: 提供一种用于在计算机语言之间调用方法的方法,系统和程序。 实例化第一计算机语言中的类的多个子类,并且实例化第二计算机语言中的类的多个子类,其中第二计算机语言中的子类中的方法实现第一计算机语言中的子类中的方法的功能 电脑语言 从第一计算机语言的应用程序接收到第一计算机语言的子类中的一种方法的调用,其中对第一计算机语言的子类中的方法的调用在第二计算机语言的子类中调用相应的方法 它以第一种计算机语言实现被叫方法的功能。

    Deriving a Data Model From a Hierarchy Of Related Terms, And
Deriving a Hierarchy Of Related Terms From a Data Model
    19.
    发明申请

    公开(公告)号:US20080091690A1

    公开(公告)日:2008-04-17

    申请号:US11549556

    申请日:2006-10-13

    IPC分类号: G06F7/00

    CPC分类号: G06F8/10

    摘要: Various embodiments of a method, system and computer program product generate a data model based on a glossary model. The glossary model comprises categories and terms. At least one category of the glossary model comprises at least one term of the terms. The categories have a hierarchical relationship. The categories are mapped to objects of a data model. The terms are mapped to attributes of the data model. The attributes are associated with the objects of the data model, wherein a particular attribute of the attributes is associated with a particular object of the objects that is mapped from a particular category of the categories that comprises a particular term of the terms from which the particular attribute is mapped. The objects are associated in a hierarchical relationship based on the hierarchical relationship of the categories. In other embodiments, a method, system and computer program product generate a glossary model based on a data model.

    摘要翻译: 方法,系统和计算机程序产品的各种实施例基于词汇表模型生成数据模型。 词汇表模型包括类别和术语。 术语表模型的至少一个类别包括术语的至少一个术语。 类别具有层次关系。 类别被映射到数据模型的对象。 这些术语被映射到数据模型的属性。 这些属性与数据模型的对象相关联,其中属性的特定属性与对象的特定对象相关联,所述对象的特定对象从包括特定术语的类别的特定类别映射, 属性被映射。 基于类别的层次关系,对象以层次关系关联。 在其他实施例中,方法,系统和计算机程序产品基于数据模型生成词汇表模型。