Just-in-time data quality assessment for best record creation

    公开(公告)号:US11093521B2

    公开(公告)日:2021-08-17

    申请号:US13929475

    申请日:2013-06-27

    IPC分类号: G06F16/27

    摘要: Systems and methods for just-in-time data quality assessment of best records created during data migration are disclosed. A data steward includes tools for creating and editing a best record creation strategy that defines how records from multiple systems will be integrated into target systems. At design time, the data steward can generate best record creation and validation rules based on the best record creation strategy. The data steward can apply the best record creation and validation rules to a sample of matched records from multiple data sources to generate a sample set of best records. The efficacy of the best record creation rules can be evaluated by assessing the number of fields in the sample set that fail the validation rules. During review, the validation rules can be applied to edits to the best records received from a human reviewer to ensure compliance with the best record creation strategy.

    Just-in-Time Data Quality Assessment for Best Record Creation
    2.
    发明申请
    Just-in-Time Data Quality Assessment for Best Record Creation 审中-公开
    即时数据质量评估最佳记录创建

    公开(公告)号:US20150006491A1

    公开(公告)日:2015-01-01

    申请号:US13929475

    申请日:2013-06-27

    IPC分类号: G06F17/30

    CPC分类号: G06F16/27

    摘要: Systems and methods for just-in-time data quality assessment of best records created during data migration are disclosed. A data steward includes tools for creating and editing a best record creation strategy that defines how records from multiple systems will be integrated into target systems. At design time, the data steward can generate best record creation and validation rules based on the best record creation strategy. The data steward can apply the best record creation and validation rules to a sample of matched records from multiple data sources to generate a sample set of best records. The efficacy of the best record creation rules can be evaluated by assessing the number of fields in the sample set that fail the validation rules. During review, the validation rules can be applied to edits to the best records received from a human reviewer to ensure compliance with the best record creation strategy.

    摘要翻译: 披露了在数据迁移期间创建的最佳记录的即时数据质量评估的系统和方法。 数据管家包括用于创建和编辑最佳记录创建策略的工具,该策略定义如何将来自多个系统的记录集成到目标系统中。 在设计时,数据管理员可以根据最佳记录创建策略生成最佳记录创建和验证规则。 数据管理员可以将最佳记录创建和验证规则应用于来自多个数据源的匹配记录的样本,以生成一组最佳记录。 可以通过评估样本集中失败验证规则的字段数来评估最佳记录创建规则的有效性。 在审查期间,验证规则可以应用于编辑从人类审阅者收到的最佳记录,以确保符合最佳记录创建策略。

    Entity expansion and grouping
    3.
    发明授权
    Entity expansion and grouping 有权
    实体扩展和分组

    公开(公告)号:US08312018B2

    公开(公告)日:2012-11-13

    申请号:US12908781

    申请日:2010-10-20

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30707 G06F17/278

    摘要: A computer readable storage medium includes executable instructions to convert an entity to a standard form including normalized attributes, a tag reference and a feature. The entity is expanded with corresponding variants. The standard form and corresponding variants are combined to form an annotated entity in a first processing step. The entity is assigned to a group in a second processing step that accesses the annotated entity. The entity is processed in a single pass comprising the first processing step and the second processing step.

    摘要翻译: 计算机可读存储介质包括可执行指令,以将实体转换成包括标准化属性,标签引用和特征的标准形式。 实体扩展为相应的变体。 在第一处理步骤中组合标准形式和相应变体以形成注释实体。 在访问注释实体的第二处理步骤中将实体分配给组。 该实体以包含第一处理步骤和第二处理步骤的单次处理。

    APPARATUS AND METHOD FOR ENTITY EXPANSION AND GROUPING
    4.
    发明申请
    APPARATUS AND METHOD FOR ENTITY EXPANSION AND GROUPING 有权
    用于实体扩展和分组的装置和方法

    公开(公告)号:US20120102031A1

    公开(公告)日:2012-04-26

    申请号:US12908781

    申请日:2010-10-20

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30707 G06F17/278

    摘要: A computer readable storage medium includes executable instructions to convert an entity to a standard form including normalized attributes, a tag reference and a feature. The entity is expanded with corresponding variants. The standard form and corresponding variants are combined to form an annotated entity in a first processing step. The entity is assigned to a group in a second processing step that accesses the annotated entity. The entity is processed in a single pass comprising the first processing step and the second processing step.

    摘要翻译: 计算机可读存储介质包括可执行指令,以将实体转换成包括标准化属性,标签引用和特征的标准形式。 实体扩展为相应的变体。 在第一处理步骤中组合标准形式和相应变体以形成注释实体。 在访问注释实体的第二处理步骤中将实体分配给组。 该实体以包含第一处理步骤和第二处理步骤的单次处理。