Detecting inconsistent data records
    1.
    Granted patent
    Detecting inconsistent data records (in force)

    Publication number: US09037550B2

    Publication date: 2015-05-19

    Application number: US13434647

    Filing date: 2012-03-29

    Abstract: A computer-implemented method for detecting a set of inconsistent data records in a database including multiple records comprises selecting a data quality rule representing a functional dependency for the database, transforming the data quality rule into at least one rule vector with hashed components, selecting a set of attributes of the database, transforming at least one record of the database, selected on the basis of the selected attributes, into a record vector with hashed components, and computing a dot product of the rule and record vectors to generate a measure representing violation of the data quality rule by the record.
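
    The abstract does not say how the vectors are built, so the following is only a minimal sketch under stated assumptions: the rule is a constant dependency with a single right-hand attribute, each (attribute, value) pair is hashed to one slot of a sparse vector, and the rule vector carries +1 for left-hand pairs and -1 for the right-hand pair. The names `slot`, `rule_vector`, `record_vector`, `violation_score` and `violates`, and the size `DIM`, are hypothetical and do not come from the patent.

```python
import hashlib

DIM = 2 ** 20  # assumed vector length; a larger value makes hash collisions rarer


def slot(attr, value):
    """Hash an (attribute, value) pair to a vector index (stable across runs)."""
    digest = hashlib.sha256(f"{attr}={value}".encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % DIM


def rule_vector(lhs, rhs):
    """Sparse rule vector: +1 for each left-hand pair, -1 for the right-hand pair."""
    vec = {}
    for attr, value in lhs.items():
        idx = slot(attr, value)
        vec[idx] = vec.get(idx, 0) + 1
    idx = slot(*rhs)
    vec[idx] = vec.get(idx, 0) - 1
    return vec


def record_vector(record, attributes):
    """Sparse record vector: 1 at the hashed slot of each selected attribute value."""
    return {slot(attr, record[attr]): 1 for attr in attributes}


def dot(a, b):
    """Dot product of two sparse vectors stored as {index: weight} dicts."""
    return sum(weight * b.get(idx, 0) for idx, weight in a.items())


def violation_score(record, lhs, rhs):
    """Dot product of the rule vector and the record vector."""
    attributes = list(lhs) + [rhs[0]]
    return dot(rule_vector(lhs, rhs), record_vector(record, attributes))


def violates(record, lhs, rhs):
    # The score equals len(lhs) only when every left-hand pair matches
    # and the right-hand pair does not (barring hash collisions).
    return violation_score(record, lhs, rhs) == len(lhs)
```

    For example, with the assumed rule `lhs = {"zip": "90210"}` and `rhs = ("city", "Beverly Hills")`, a record with that zip code and a different city is flagged, while records that satisfy the rule or do not match the left-hand side are not.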

    Pre-locking scheme for allowing consistent and concurrent workflow process execution in a workflow management system
    2.
    Granted patent
    Pre-locking scheme for allowing consistent and concurrent workflow process execution in a workflow management system (expired)

    Publication number: US6078982A

    Publication date: 2000-06-20

    Application number: US47248

    Filing date: 1998-03-24

    IPC classification: G06F17/30 G06Q10/10 G06F13/14

    Abstract: A system for allowing consistent execution of a workflow process in a computer-enabled workflow management system is described. The system includes a workflow process database accessible by the workflow process. The workflow process includes at least one sequence of workflow actions, has at least one set of parallel workflow actions, and is configured as a plurality of nodes interconnected by arcs. Each node defines at least one of the workflow actions and reads and writes data items when executing the workflow actions. Before execution of the workflow process commences, a first module locks all data items in the workflow process database that are specified for access by the workflow process, so that other workflow processes cannot access them during execution. After the workflow process has been executed, a second module releases all the locked data items, so that execution consistency and concurrency of the workflow process are maintained. A computer-implemented method for allowing consistent execution of a workflow process in a computer-enabled workflow management system is also described.
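
    The two modules described above (lock everything up front, release everything afterwards) can be sketched roughly as follows. This is an illustration under assumptions, not the patented system: data items are keys of an in-memory dict, each workflow is a flat list of `(reads, writes, action)` nodes run in order (parallel branches and arcs are left out), and locks are acquired in sorted key order to avoid deadlock, which the abstract does not mention. `LockTable`, `prelock` and `run_workflow` are hypothetical names.

```python
import threading
from contextlib import contextmanager


class LockTable:
    """One lock per data item, created on first use."""

    def __init__(self):
        self._locks = {}
        self._guard = threading.Lock()

    def _lock_for(self, item):
        with self._guard:
            return self._locks.setdefault(item, threading.Lock())

    @contextmanager
    def prelock(self, items):
        # "First module": lock every item the workflow will touch before it starts.
        # Sorting gives all workflows the same acquisition order (an assumption
        # added here to avoid deadlock between concurrent workflows).
        held = [self._lock_for(item) for item in sorted(set(items))]
        for lock in held:
            lock.acquire()
        try:
            yield
        finally:
            # "Second module": release everything once the workflow has finished.
            for lock in reversed(held):
                lock.release()


def run_workflow(nodes, db, locks):
    """Run a workflow given as a list of (reads, writes, action) nodes."""
    items = [item for reads, writes, _ in nodes for item in (*reads, *writes)]
    with locks.prelock(items):
        for _reads, _writes, action in nodes:
            action(db)
```

    Workflows whose item sets do not overlap never wait on each other, which is where the concurrency comes from; workflows that share an item are serialized for their whole duration.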

    Data cleaning
    3.
    Granted patent
    Data cleaning (in force)

    Publication number: US08805798B2

    Publication date: 2014-08-12

    Application number: US13468938

    Filing date: 2012-05-10

    IPC classification: G06F7/02 G06F17/30

    CPC classification: G06F17/30303

    Abstract: A computer-implemented method comprising partitioning data representing an input instance of a database including multiple tuples into multiple fragments of tuples, detecting tuples which violate a data quality specification in respective ones of the fragments, selecting a data cleaning asset on the basis of characteristics of errors in detected tuples for a fragment and based on declared asset capabilities, assigning a selected data cleaning asset to the fragment, the selected data cleaning asset to provide a set of candidate corrections for the detected tuples in the fragment, and providing data representing an output instance of the database in which detected tuples are replaced with selected candidate corrections.
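
    The pipeline in this abstract (partition, detect, select an asset, collect candidate corrections, emit a cleaned instance) might look like the sketch below. Everything concrete here is an assumption made for illustration: tuples are dicts, the data quality specification is a dict of named validity checks, an asset is a dict with a declared `handles` set and a `suggest` function, the asset covering the most detected errors wins, and the first candidate correction is always chosen. None of these choices or names comes from the patent.

```python
from collections import Counter


def partition(rows, size):
    """Split the input instance into fixed-size fragments of tuples."""
    return [rows[i:i + size] for i in range(0, len(rows), size)]


def detect(fragment, checks):
    """Return {position: error kind} for tuples failing a check (first failure only)."""
    errors = {}
    for pos, row in enumerate(fragment):
        for kind, is_valid in checks.items():
            if not is_valid(row):
                errors[pos] = kind
                break
    return errors


def select_asset(errors, assets):
    """Pick the asset whose declared capabilities cover the most detected errors."""
    kinds = Counter(errors.values())
    return max(
        assets,
        key=lambda asset: sum(n for kind, n in kinds.items() if kind in asset["handles"]),
    )


def clean(rows, checks, assets, size=2):
    """Return an output instance with detected tuples replaced by a candidate correction."""
    output = []
    for fragment in partition(rows, size):
        repaired = list(fragment)
        errors = detect(fragment, checks)
        if errors:
            asset = select_asset(errors, assets)  # one asset is assigned per fragment
            for pos, kind in errors.items():
                if kind in asset["handles"]:
                    candidates = asset["suggest"](fragment[pos])
                    if candidates:
                        repaired[pos] = candidates[0]  # naive choice: first candidate
        output.extend(repaired)
    return output
```

    Errors that the assigned asset does not declare support for are left untouched in this sketch; a fuller design would need some policy for them.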

    DATA CLEANING
    4.
    Patent application
    DATA CLEANING (in force)

    Publication number: US20130275393A1

    Publication date: 2013-10-17

    Application number: US13468938

    Filing date: 2012-05-10

    IPC classification: G06F17/30

    CPC classification: G06F17/30303

    Abstract: A computer-implemented method comprising partitioning data representing an input instance of a database including multiple tuples into multiple fragments of tuples, detecting tuples which violate a data quality specification in respective ones of the fragments, selecting a data cleaning asset on the basis of characteristics of errors in detected tuples for a fragment and based on declared asset capabilities, assigning a selected data cleaning asset to the fragment, the selected data cleaning asset to provide a set of candidate corrections for the detected tuples in the fragment, and providing data representing an output instance of the database in which detected tuples are replaced with selected candidate corrections.
