Automated learning of document data fields

    公开(公告)号:US11631265B2

    公开(公告)日:2023-04-18

    申请号:US13479736

    申请日:2012-05-24

    摘要: Methods and systems for transforming at least a portion of a physical document into digital data. One method includes obtaining a first plurality of data items automatically extracted from a first physical document and a validated value for a data field. The method also includes automatically identifying a first linked data item included in the first plurality of data items that is linked to the validated value and setting a physical position included in a rule to the physical position of the first linked data item. In addition, the method includes obtaining a second plurality of data items automatically extracted from a second physical document and automatically identifying a candidate data item included in the second plurality of data items based on the rule. Furthermore, the method includes automatically populating a value for the data field for the second physical document based on the candidate data item.

    AUTOMATED LEARNING OF DOCUMENT DATA FIELDS
    2.
    发明申请
    AUTOMATED LEARNING OF DOCUMENT DATA FIELDS 审中-公开
    自动学习文档数据字段

    公开(公告)号:US20130318426A1

    公开(公告)日:2013-11-28

    申请号:US13479736

    申请日:2012-05-24

    IPC分类号: G06F17/24

    摘要: Methods and systems for transforming at least a portion of a physical document into digital data. One method includes obtaining a first plurality of data items automatically extracted from a first physical document and a validated value for a data field. The method also includes automatically identifying a first linked data item included in the first plurality of data items that is linked to the validated value and setting a physical position included in a rule to the physical position of the first linked data item. In addition, the method includes obtaining a second plurality of data items automatically extracted from a second physical document and automatically identifying a candidate data item included in the second plurality of data items based on the rule. Furthermore, the method includes automatically populating a value for the data field for the second physical document based on the candidate data item.

    摘要翻译: 用于将物理文档的至少一部分转换为数字数据的方法和系统。 一种方法包括获得从第一物理文档自动提取的第一多个数据项和数据字段的验证值。 该方法还包括自动识别包括在与验证值相关联的第一多个数据项中的第一链接数据项,并将包括在规则中的物理位置设置为第一链接数据项的物理位置。 此外,该方法包括获得从第二物理文档自动提取的第二多个数据项,并且基于规则自动识别包括在第二多个数据项中的候选数据项。 此外,该方法包括基于候选数据项自动填充第二物理文档的数据字段的值。