Creating flexible structure descriptions
    1.
    发明授权
    Creating flexible structure descriptions 有权
    创建灵活的结构描述

    公开(公告)号:US08908969B2

    公开(公告)日:2014-12-09

    申请号:US13562791

    申请日:2012-07-31

    摘要: In one embodiment, the invention provides a method, comprising detecting data fields on a scanned document image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training or modifying the flexible document description using, for example, a search algorithm to detect the data fields on additional training images based on the set of search elements.

    摘要翻译: 在一个实施例中,本发明提供一种方法,包括检测扫描的文档图像上的数据字段; 基于检测到的数据字段生成灵活的文档描述,包括为每个数据字段创建一组搜索元素,每个搜索元素具有相关联的搜索准则; 以及使用例如搜索算法来训练或修改柔性文档描述,以基于搜索元素集来检测附加训练图像上的数据字段。

    Creating Flexible Structure Descriptions
    2.
    发明申请
    Creating Flexible Structure Descriptions 有权
    创建灵活的结构描述

    公开(公告)号:US20130198615A1

    公开(公告)日:2013-08-01

    申请号:US13562791

    申请日:2012-07-31

    IPC分类号: G06F17/21

    摘要: In one embodiment, the invention provides a method, comprising detecting data fields on a scanned document image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training or modifying the flexible document description using, for example, a search algorithm to detect the data fields on additional training images based on the set of search elements.

    摘要翻译: 在一个实施例中,本发明提供一种方法,包括检测扫描的文档图像上的数据字段; 基于检测到的数据字段生成灵活的文档描述,包括为每个数据字段创建一组搜索元素,每个搜索元素具有相关联的搜索准则; 以及使用例如搜索算法来训练或修改柔性文档描述,以基于搜索元素集来检测附加训练图像上的数据字段。

    Method of pre-analysis of a machine-readable form image
    3.
    发明授权
    Method of pre-analysis of a machine-readable form image 有权
    机器可读形式图像的预分析方法

    公开(公告)号:US08805093B2

    公开(公告)日:2014-08-12

    申请号:US12977016

    申请日:2010-12-22

    IPC分类号: G06K9/62 G06K9/00

    摘要: In one embodiment, the invention provides a method for a machine to perform machine-readable form pre-recognition analysis. The method comprises preliminarily assigning at least one graphic image in a form for identification of form type, preliminarily creating at least one model of the said graphic image for identification of the form type, parsing a form image into regions, determining an image form type for the form image, comprising: (a) detecting on the form image at least one of said graphic images for identification of the form type, (b) performing a primary identification of the form image type based on a comparison of the detected graphic image with the said model, and(c) performing a profound analysis using a supplementary data said-primary identification results in multiple possibilities for the form image type.

    摘要翻译: 在一个实施例中,本发明提供了一种用于机器执行机器可读形式预识别分析的方法。 该方法包括以形式类型的形式预先分配至少一个图形图像,预先创建所述图形图像的至少一个模型以识别形式类型,将形式图像解析为区域,确定图像形式类型 所述形式图像包括:(a)在形式图像上检测至少一个所述图形图像以识别形式类型,(b)基于检测到的图形图像与 所述模型,以及(c)使用补充数据进行深刻分析,所述主要识别导致形式图像类型的多种可能性。

    METHOD OF PRE-ANALYSIS OF A MACHINE-READABLE FORM IMAGE
    4.
    发明申请
    METHOD OF PRE-ANALYSIS OF A MACHINE-READABLE FORM IMAGE 有权
    机器可读形式图像预分析方法

    公开(公告)号:US20110091109A1

    公开(公告)日:2011-04-21

    申请号:US12977016

    申请日:2010-12-22

    IPC分类号: G06K9/34

    摘要: In one embodiment, the invention provides a method for a machine to perform machine-readable form pre-recognition analysis. The method comprises preliminarily assigning at least one graphic image in a form for identification of form type, preliminarily creating at least one model of the said graphic image for identification of the form type, parsing a form image into regions, determining an image form type for the form image, comprising: (a) detecting on the form image at least one of said graphic images for identification of the form type, (b) performing a primary identification of the form image type based on a comparison of the detected graphic image with the said model, and(c) performing a profound analysis using a supplementary data said-primary identification results in multiple possibilities for the form image type.

    摘要翻译: 在一个实施例中,本发明提供了一种用于机器执行机器可读形式预识别分析的方法。 该方法包括以形式类型的形式预先分配至少一个图形图像,预先创建所述图形图像的至少一个模型以识别形式类型,将形式图像解析为区域,确定图像形式类型 所述形式图像包括:(a)在形式图像上检测至少一个所述图形图像以识别形式类型,(b)基于检测到的图形图像与 所述模型,以及(c)使用补充数据进行深刻分析,所述主要识别导致形式图像类型的多种可能性。

    Method and system for creating flexible structure descriptions
    5.
    发明授权
    Method and system for creating flexible structure descriptions 有权
    创建灵活结构描述的方法和系统

    公开(公告)号:US08233714B2

    公开(公告)日:2012-07-31

    申请号:US12364266

    申请日:2009-02-02

    IPC分类号: G06K9/34

    摘要: A method related to data capture from forms involving optical character recognition comprises detecting data fields on a scanned image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training the flexible document description using a search algorithm to detect the data fields on additional training images based on the set of search elements.

    摘要翻译: 与涉及光学字符识别的形式的数据捕获相关的方法包括检测扫描图像上的数据字段; 基于检测到的数据字段生成灵活的文档描述,包括为每个数据字段创建一组搜索元素,每个搜索元素具有相关联的搜索准则; 并使用搜索算法训练灵活的文档描述,以基于搜索元素集来检测附加训练图像上的数据字段。

    Method of pre-analysis of a machine-readable form image
    6.
    发明授权
    Method of pre-analysis of a machine-readable form image 有权
    机器可读形式图像的预分析方法

    公开(公告)号:US07881561B2

    公开(公告)日:2011-02-01

    申请号:US10603215

    申请日:2003-06-26

    IPC分类号: G06K9/36 G06F17/00

    摘要: The present invention relates generally to an optical character recognition of machine-readable forms, and in particular to a verification of a direction of spatial orientation and a definition of a form type of the document electronic image. The goals of the invention are achieved by preliminarily assigning one or more form objects as elements composing a graphic image unambiguously defining its direction of spatial orientation. Similarly, one or more form objects are preliminarily assigned as elements composing a graphic image unambiguously defining its type. The direction of spatial orientation and the type of the form are verified via identification of said images. The models of graphic images either for verification the direction of spatial orientation or for defining the form type are stored in a special data storage means, one of the embodiment of which is form model description.

    摘要翻译: 本发明一般涉及机器可读形式的光学字符识别,特别涉及对空间取向的方向的验证以及文档电子图像的形式类型的定义。 本发明的目的是通过预先分配一个或多个形式对象作为构成图形图像的元素来明确地定义其空间取向的方向来实现。 类似地,一个或多个表单对象被预先分配为构成图形图像的元素,明确地定义其类型。 通过所述图像的识别来验证空间取向的方向和形式的类型。 用于验证空间取向的方向或用于定义形式类型的图形图像的模型存储在特殊数据存储装置中,其中一个实施例是形式模型描述。

    Method and System for Creating Flexible Structure Descriptions
    7.
    发明申请
    Method and System for Creating Flexible Structure Descriptions 有权
    创建灵活结构描述的方法和系统

    公开(公告)号:US20090175532A1

    公开(公告)日:2009-07-09

    申请号:US12364266

    申请日:2009-02-02

    IPC分类号: G06K9/62 G06F3/048

    摘要: In one embodiment, the invention provides a method, comprising detecting data fields on a scanned image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training the flexible document description using a search algorithm to detect the data fields on additional training images based on the set of search elements.

    摘要翻译: 在一个实施例中,本发明提供一种方法,包括检测扫描图像上的数据字段; 基于检测到的数据字段生成灵活的文档描述,包括为每个数据字段创建一组搜索元素,每个搜索元素具有相关联的搜索准则; 并使用搜索算法训练灵活的文档描述,以基于搜索元素集来检测附加训练图像上的数据字段。

    Method of pre-analysis of a machine-readable form image
    8.
    发明申请
    Method of pre-analysis of a machine-readable form image 有权
    机器可读形式图像的预分析方法

    公开(公告)号:US20060274941A1

    公开(公告)日:2006-12-07

    申请号:US10603215

    申请日:2003-06-26

    IPC分类号: G06K9/00 G06K9/34

    摘要: The present invention relates generally to an optical character recognition of machine-readable forms, and in particular to a verification of a direction of spatial orientation and a definition of a form type of the document electronic image. The goals of the invention are achieved by preliminarily assigning one or more form objects as elements composing a graphic image unambiguously defining its direction of spatial orientation. Similarly, one or more form objects are preliminarily assigned as elements composing a graphic image unambiguously defining its type. The direction of spatial orientation and the type of the form are verified via identification of said images. The models of graphic images either for verification the direction of spatial orientation or for defining the form type are stored in a special data storage means, one of the embodiment of which is form model description.

    摘要翻译: 本发明一般涉及机器可读形式的光学字符识别,特别涉及对空间取向的方向的验证以及文档电子图像的形式类型的定义。 本发明的目的是通过预先分配一个或多个形式对象作为构成图形图像的元素来明确地定义其空间取向的方向来实现。 类似地,一个或多个表单对象被预先分配为构成图形图像的元素,明确地定义其类型。 通过所述图像的识别来验证空间取向的方向和形式的类型。 用于验证空间取向的方向或用于定义形式类型的图形图像的模型存储在特殊数据存储装置中,其中一个实施例是形式模型描述。

    Data capture from multi-page documents
    9.
    发明授权
    Data capture from multi-page documents 有权
    从多页文档中获取数据

    公开(公告)号:US08538162B2

    公开(公告)日:2013-09-17

    申请号:US13431767

    申请日:2012-03-27

    IPC分类号: G06K9/46

    摘要: A method for processing a batch of scanned images is disclosed. The method includes processing the scanned images into documents. For documents of multiple pages, the method maintains a page-based coordinate system to specify a location of structures within a page and joins the pages to form a multi-page sheet associated with a sheet-based coordinate system to specify a location of structures within the multi-page sheet. Data may be extracted from each document through a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.

    摘要翻译: 公开了一种用于处理一批扫描图像的方法。 该方法包括将扫描的图像处理成文档。 对于多页的文档,该方法维护基于页面的坐标系,以指定页面内的结构的位置并加入页面以形成与基于页面的坐标系相关联的多页表格,以指定结构的位置 多页表。 可以通过页面模式从每个文档中提取数据,其中使用基于页面的坐标系在各个页面上检测结构,并且使用基于纸张的坐标系统在整个文档内检测结构的文档模式。

    Flexible Structure Descriptions for Multi-Page Documents
    10.
    发明申请
    Flexible Structure Descriptions for Multi-Page Documents 有权
    多页文档的灵活结构描述

    公开(公告)号:US20120243055A1

    公开(公告)日:2012-09-27

    申请号:US13242653

    申请日:2011-09-23

    IPC分类号: H04N1/40

    摘要: A method for processing a batch of scanned images is provided. The method comprises processing the scanned images into documents. For documents of multiple pages, the method comprises maintaining a page-based coordinate system to specify a location of structures within a page and joining the pages to form a multi-page sheet having a sheet-based coordinate system to specify a location of structures within the multi-page sheet. The method comprises performing a data extraction operation to extract data from each document, said data extraction operation including a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.

    摘要翻译: 提供了一批处理扫描图像的方法。 该方法包括将扫描的图像处理成文档。 对于多页的文档,该方法包括维护基于页面的坐标系统以指定页面内的结构的位置并且连接页面以形成具有基于纸张的坐标系的多页面表格,以指定页面内的结构的位置 多页表。 该方法包括执行数据提取操作以从每个文档提取数据,所述数据提取操作包括页面模式,其中使用基于页面的坐标系统在各个页面上检测结构,以及文档模式,其中在整个文档内检测到结构使用 基于表的坐标系。