Using a data mining algorithm to generate format rules used to validate data sets
    1.
    Granted patent
    Status: lapsed

    Publication No.: US08166000B2

    Publication date: 2012-04-24

    Application No.: US11769639

    Filing date: 2007-06-27

    IPC classification: G06F17/00

    CPC classification: G06N5/025 G06F17/30303

    Abstract: Provided are a method, system, and article of manufacture for using a data mining algorithm to generate format rules used to validate data sets. A data set has a plurality of columns and records providing data for each of the columns. Selection is received of at least one format column for which format rules are to be generated and selection is received of at least one predictor column. A format mask column is generated for each selected format column. For records in the data set, a value in the at least one format column is converted to a format mask representing a format of the value in the format column and storing the format mask in the format mask column in the record for which the format mask was generated. The at least one predictor column and the at least one format mask column are processed to generate at least one format rule. Each format rule specifies a format mask associated with at least one condition in the at least one predictor column.
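
    The mask-and-rule idea in this abstract can be illustrated with a minimal sketch. It is not the patented method: the mask alphabet ('9' for digits, 'A' for letters), the function names, the sample columns, and the simple frequency count that stands in for a data mining algorithm are all assumptions made for illustration.

```python
from collections import Counter, defaultdict

def to_format_mask(value: str) -> str:
    """Hypothetical mask alphabet: digits -> '9', letters -> 'A', other characters kept."""
    out = []
    for ch in value:
        if ch.isdigit():
            out.append("9")
        elif ch.isalpha():
            out.append("A")
        else:
            out.append(ch)
    return "".join(out)

def derive_format_rules(records, predictor_col, format_col, min_confidence=0.8):
    """Assumed stand-in for the mining step: for each predictor value, keep the
    most frequent mask if it covers at least min_confidence of the records."""
    masks_by_condition = defaultdict(Counter)
    for rec in records:
        mask = to_format_mask(rec[format_col])  # the generated format mask column
        masks_by_condition[rec[predictor_col]][mask] += 1
    rules = {}
    for condition, counts in masks_by_condition.items():
        mask, hits = counts.most_common(1)[0]
        if hits / sum(counts.values()) >= min_confidence:
            rules[condition] = mask
    return rules

# Illustrative data: 'country' is the predictor column, 'zip' the format column.
records = [
    {"country": "US", "zip": "94105"},
    {"country": "US", "zip": "10001"},
    {"country": "CA", "zip": "K1A 0B1"},
    {"country": "CA", "zip": "M5V 2T6"},
]
rules = derive_format_rules(records, "country", "zip")
```

    Each resulting rule pairs a condition on the predictor column with a format mask, so a later record can be checked by comparing its mask against the rule for its predictor value.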


    System and method for virtualization of relational stored procedures in non-native relational database systems
    2.
    Patent application
    Status: in force

    Publication No.: US20080016080A1

    Publication date: 2008-01-17

    Application No.: US11484971

    Filing date: 2006-07-12

    IPC classification: G06F17/30

    CPC classification: G06F17/30415

    Abstract: A system, method, and program product are provided that identifies a remote stored procedure stored in a remote database management system, and automatically generates a local stored procedure stored in a local database management system. To automatically generate the local stored procedure, local and remote metadata are gathered corresponding, respectively, to a local database management system and a remote database management system. The remote metadata is used to create a call statement to the remote stored procedure. The created invocation method maps input values, input to the local stored procedure, to input parameters of the remote stored procedure. Results in the local stored procedure are set by mapping the data returned from the remote stored procedure to the corresponding return values (e.g., parameters and result sets) in the local stored procedure values.
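
    A rough sketch of the wrapper-generation idea follows, under stated assumptions: the `Param` record, the `?` placeholder syntax, the procedure name, and the `execute_remote` callable are hypothetical and stand in for real database metadata and a real connection. The abstract does not specify any of them.

```python
from dataclasses import dataclass

@dataclass
class Param:
    name: str
    mode: str  # "IN" or "OUT"; a simplified, assumed form of remote metadata

def build_call_statement(proc_name, params):
    """Create a call statement for the remote procedure from its metadata."""
    placeholders = ", ".join("?" for _ in params)
    return f"CALL {proc_name}({placeholders})"

def make_local_procedure(proc_name, params, execute_remote):
    """Generate a local callable that forwards to the remote procedure.
    execute_remote(sql, args) is a hypothetical transport supplied by the caller."""
    call_sql = build_call_statement(proc_name, params)
    in_names = [p.name for p in params if p.mode == "IN"]
    out_names = [p.name for p in params if p.mode == "OUT"]

    def local_procedure(**inputs):
        # Map local input values onto the remote input parameters, in order.
        args = [inputs[name] for name in in_names]
        returned = execute_remote(call_sql, args)
        # Map the returned data back onto the local return values.
        return dict(zip(out_names, returned))

    return local_procedure

# A fake remote system used only to exercise the sketch.
def fake_remote(sql, args):
    return [args[0] * 2]

get_balance = make_local_procedure(
    "REMOTE.GET_BALANCE",
    [Param("account_id", "IN"), Param("balance", "OUT")],
    fake_remote,
)
result = get_balance(account_id=21)
```

    The caller sees only the local procedure; the call statement and the parameter mapping are derived once from the metadata.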


    System and method for virtualization of relational stored procedures in non-native relational database systems
    4.
    Granted patent
    Status: in force

    Publication No.: US07739296B2

    Publication date: 2010-06-15

    Application No.: US11484971

    Filing date: 2006-07-12

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30415

    Abstract: A system, method, and program product are provided that identifies a remote stored procedure stored in a remote database management system, and automatically generates a local stored procedure stored in a local database management system. To automatically generate the local stored procedure, local and remote metadata are gathered corresponding, respectively, to a local database management system and a remote database management system. The remote metadata is used to create a call statement to the remote stored procedure. The created invocation method maps input values, input to the local stored procedure, to input parameters of the remote stored procedure. Results in the local stored procedure are set by mapping the data returned from the remote stored procedure to the corresponding return values (e.g., parameters and result sets) in the local stored procedure values.


    Managing validation models and rules to apply to data sets
    6.
    Granted patent
    Status: in force

    Publication No.: US08401987B2

    Publication date: 2013-03-19

    Application No.: US11779251

    Filing date: 2007-07-17

    IPC classification: G06N5/00

    Abstract: Provided are a method, system, and article of manufacture for managing validation models and rules to apply to data sets. A schema definition describing a structure of at least one column in a first data set having a plurality of columns and records providing data for each of the columns is received. At least one model is generated, wherein each model asserts conditions for at least one column in a record of the first data set. The schema definition and the at least one model are stored in a data quality model. Selection is received of a second data set and the data quality model. A determination is made as to whether a structure of the second data set is compatible with the schema definition in the selected data quality model. Each model in the data quality model is applied to the records in the second data set to validate the records in the second data set in response to determining that the structure of the second data set and the schema definition are compatible.
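
    The check-then-validate flow in this abstract might look like the sketch below. The dictionary layout of the data quality model, the type-based notion of schema compatibility, and the sample columns are assumptions chosen for illustration; the abstract does not define them.

```python
def is_compatible(schema, records):
    """Assumed compatibility test: every schema column exists in every record
    and holds a value of the expected Python type."""
    return all(
        col in rec and isinstance(rec[col], expected_type)
        for rec in records
        for col, expected_type in schema.items()
    )

def validate(data_quality_model, records):
    """Apply each model to each record, but only after the structure check passes."""
    schema = data_quality_model["schema"]
    models = data_quality_model["models"]
    if not is_compatible(schema, records):
        raise ValueError("data set structure is incompatible with the schema definition")
    return [all(model(rec) for model in models) for rec in records]

# Hypothetical data quality model built from a first data set.
data_quality_model = {
    "schema": {"age": int, "country": str},
    "models": [
        lambda rec: 0 <= rec["age"] <= 130,      # condition on one column
        lambda rec: len(rec["country"]) == 2,    # condition on another column
    ],
}

# A second data set to validate against the stored model.
second_data_set = [
    {"age": 34, "country": "US"},
    {"age": 250, "country": "CA"},
]
results = validate(data_quality_model, second_data_set)
```

    Storing the schema next to the models lets the structure check run before any rule is evaluated, so rules never see a column they were not written for.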


    Data navigation system and method employing data transformation lineage model
    7.
    Granted patent
    Status: lapsed

    Publication No.: US07725433B1

    Publication date: 2010-05-25

    Application No.: US09221542

    Filing date: 1998-12-28

    IPC classification: G06G7/00 G06G17/00

    CPC classification: G06F17/30297

    Abstract: A method, apparatus, and article of manufacture for a transformation lineage model. Data stored on a data storage device connected to a computer is navigated. In response to receiving user input, a target object in an information catalog is selected. Then, information about a source from which the target object was derived is provided.
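
    As a small illustration of navigating from a target object back to its sources, the sketch below walks a catalog of derivations. The catalog layout, the object names, and the single-source assumption are hypothetical; the abstract does not describe how lineage is stored.

```python
def trace_sources(catalog, target):
    """Follow 'derived from' links from the selected target back to its origin.
    Returns (source, transformation) pairs, nearest source first."""
    chain = []
    seen = set()
    current = target
    while current in catalog and current not in seen:
        seen.add(current)  # guards against cycles in a malformed catalog
        entry = catalog[current]
        chain.append((entry["source"], entry["transformation"]))
        current = entry["source"]
    return chain

# Hypothetical information catalog: each target records one source and one transformation.
catalog = {
    "sales_summary": {"source": "sales_clean", "transformation": "aggregate by region"},
    "sales_clean": {"source": "sales_raw", "transformation": "remove duplicates"},
}
lineage = trace_sources(catalog, "sales_summary")
```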


    Multiple task wait system for use in a data warehouse environment
    8.
    Granted patent
    Status: lapsed

    Publication No.: US07174551B2

    Publication date: 2007-02-06

    Application No.: US10151381

    Filing date: 2002-05-20

    IPC classification: G06F9/46

    CPC classification: G06F9/4843 G06F9/5038

    Abstract: A multiple task wait system and associated method allow a client application to wait for multiple tasks to be successfully or conditionally implemented before running subsequent tasks. Two mechanisms can be used to accomplish this multiple wait process: The first mechanism uses a multi-wait grouping process that is visible to the client, and the second mechanism uses a graphical representation to identify the tasks to be completed. The multi-wait grouping process allows a client to group a related set of tasks together for both control and documentation purposes. The client can add as many tasks as the resources of the computer allow to a group while defining the data flows and control flows between the tasks in the group using various graphical tools. The multi-wait system allows the client to define the constraints and conditions for a set of tasks to be considered complete, and further allows the system to define the constraints and conditions for considering all the tasks within the group to be completed. By utilizing the group concept, the system can selectively control the tasks to be included in the completion decision based on predefined rules.
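
    The grouping-plus-completion-rule idea can be sketched with Python's standard `concurrent.futures` module. This is only an analogy to the described system: the function `run_after_group`, the task names, and the rule callables are hypothetical, and thread-based execution is an assumption.

```python
from concurrent.futures import ThreadPoolExecutor, wait, ALL_COMPLETED

def run_after_group(tasks, is_complete, next_task):
    """Run every task in the group, wait for all of them to finish, then run
    next_task only if the caller's completion rule accepts the outcomes."""
    with ThreadPoolExecutor() as pool:
        futures = {name: pool.submit(fn) for name, fn in tasks.items()}
        wait(list(futures.values()), return_when=ALL_COMPLETED)
    # True means the task finished without raising an exception.
    succeeded = {name: f.exception() is None for name, f in futures.items()}
    return next_task() if is_complete(succeeded) else None

# Hypothetical warehouse tasks forming one group.
def load_customers():
    return "customers loaded"

def load_orders():
    return "orders loaded"

def refresh_cache():
    raise RuntimeError("cache unavailable")

group = {
    "load_customers": load_customers,
    "load_orders": load_orders,
    "refresh_cache": refresh_cache,
}

# Conditional rule: only the two load tasks count toward completion.
required_only = lambda ok: ok["load_customers"] and ok["load_orders"]
# Strict rule: every task in the group must succeed.
all_tasks = lambda ok: all(ok.values())

result_conditional = run_after_group(group, required_only, lambda: "report built")
result_strict = run_after_group(group, all_tasks, lambda: "report built")
```

    Passing the completion rule as a function mirrors the abstract's point that the group decides which tasks take part in the completion decision.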


    Collaborative derivation of an interface and partial implementation of programming code
    9.
    Granted patent
    Status: lapsed

    Publication No.: US08176470B2

    Publication date: 2012-05-08

    Application No.: US11549475

    Filing date: 2006-10-13

    IPC classification: G06F9/44 G06F9/45

    CPC classification: G06F8/10

    Abstract: A method, system and computer program product provide an implementation of software. A control flow of a software component is constructed based on a specification model. In various embodiments, the specification model comprises at least one input and at least one requirement referencing the at least one input. At least a partial implementation of the software component is generated based on the control flow and the at least one input and the at least one requirement of the specification model. In some embodiments, the specification model further comprises at least one output, and the at least a partial implementation of the software component is also based on the at least one output.


    WEB 2.0 SYSTEM AND METHOD FOR DYNAMIC CATEGORIZATION OF HETEROGENEOUS AND REGULATED ENTERPRISE ASSETS
    10.
    Patent application
    Status: in force

    Publication No.: US20090144296A1

    Publication date: 2009-06-04

    Application No.: US11949680

    Filing date: 2007-12-03

    IPC classification: G06F17/30

    CPC classification: G06F17/30722

    Abstract: A system and method for the dynamic categorization of heterogeneous, regulated enterprise information assets. In one embodiment of the invention a system includes a computer network controlled by an enterprise and a database including a plurality of enterprise data entities. A user interface, through which a plurality of enterprise users may access the enterprise data entities, is also used by the plurality of users to assign user-defined categories to the enterprise data entities. The user interface is configured to enable a plurality of the users to access and assign additional user-defined categories to enterprise data entities having user-defined categories previously assigned by other users.
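
    The shared categorization described here resembles collaborative tagging. The sketch below keeps, for each data entity, the user-defined categories and who assigned them. The class name `CategoryStore`, its methods, and the sample entity are hypothetical, and an in-memory dictionary stands in for the enterprise database.

```python
from collections import defaultdict

class CategoryStore:
    """In-memory stand-in for the database of enterprise data entities."""

    def __init__(self):
        # entity -> category -> set of users who assigned that category
        self._tags = defaultdict(lambda: defaultdict(set))

    def assign(self, user, entity, category):
        """Any user may add a category, including to entities others already categorized."""
        self._tags[entity][category].add(user)

    def categories(self, entity):
        """Return each category on the entity with the sorted list of assigning users."""
        by_category = self._tags.get(entity, {})
        return {category: sorted(users) for category, users in by_category.items()}

store = CategoryStore()
store.assign("alice", "contract_2007_114", "vendor")
store.assign("bob", "contract_2007_114", "vendor")      # second user confirms the category
store.assign("bob", "contract_2007_114", "needs-review")  # and adds a new one
tags = store.categories("contract_2007_114")
```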
