Creating flexible structure descriptions
    1.
    发明授权
    Creating flexible structure descriptions 有权
    创建灵活的结构描述

    公开(公告)号:US08908969B2

    公开(公告)日:2014-12-09

    申请号:US13562791

    申请日:2012-07-31

    摘要: In one embodiment, the invention provides a method, comprising detecting data fields on a scanned document image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training or modifying the flexible document description using, for example, a search algorithm to detect the data fields on additional training images based on the set of search elements.

    摘要翻译: 在一个实施例中,本发明提供一种方法,包括检测扫描的文档图像上的数据字段; 基于检测到的数据字段生成灵活的文档描述,包括为每个数据字段创建一组搜索元素,每个搜索元素具有相关联的搜索准则; 以及使用例如搜索算法来训练或修改柔性文档描述,以基于搜索元素集来检测附加训练图像上的数据字段。

    Method of pre-analysis of a machine-readable form image
    2.
    发明授权
    Method of pre-analysis of a machine-readable form image 有权
    机器可读形式图像的预分析方法

    公开(公告)号:US08805093B2

    公开(公告)日:2014-08-12

    申请号:US12977016

    申请日:2010-12-22

    IPC分类号: G06K9/62 G06K9/00

    摘要: In one embodiment, the invention provides a method for a machine to perform machine-readable form pre-recognition analysis. The method comprises preliminarily assigning at least one graphic image in a form for identification of form type, preliminarily creating at least one model of the said graphic image for identification of the form type, parsing a form image into regions, determining an image form type for the form image, comprising: (a) detecting on the form image at least one of said graphic images for identification of the form type, (b) performing a primary identification of the form image type based on a comparison of the detected graphic image with the said model, and(c) performing a profound analysis using a supplementary data said-primary identification results in multiple possibilities for the form image type.

    摘要翻译: 在一个实施例中,本发明提供了一种用于机器执行机器可读形式预识别分析的方法。 该方法包括以形式类型的形式预先分配至少一个图形图像,预先创建所述图形图像的至少一个模型以识别形式类型,将形式图像解析为区域,确定图像形式类型 所述形式图像包括:(a)在形式图像上检测至少一个所述图形图像以识别形式类型,(b)基于检测到的图形图像与 所述模型,以及(c)使用补充数据进行深刻分析,所述主要识别导致形式图像类型的多种可能性。

    METHOD OF PRE-ANALYSIS OF A MACHINE-READABLE FORM IMAGE
    3.
    发明申请
    METHOD OF PRE-ANALYSIS OF A MACHINE-READABLE FORM IMAGE 有权
    机器可读形式图像预分析方法

    公开(公告)号:US20110091109A1

    公开(公告)日:2011-04-21

    申请号:US12977016

    申请日:2010-12-22

    IPC分类号: G06K9/34

    摘要: In one embodiment, the invention provides a method for a machine to perform machine-readable form pre-recognition analysis. The method comprises preliminarily assigning at least one graphic image in a form for identification of form type, preliminarily creating at least one model of the said graphic image for identification of the form type, parsing a form image into regions, determining an image form type for the form image, comprising: (a) detecting on the form image at least one of said graphic images for identification of the form type, (b) performing a primary identification of the form image type based on a comparison of the detected graphic image with the said model, and(c) performing a profound analysis using a supplementary data said-primary identification results in multiple possibilities for the form image type.

    摘要翻译: 在一个实施例中,本发明提供了一种用于机器执行机器可读形式预识别分析的方法。 该方法包括以形式类型的形式预先分配至少一个图形图像,预先创建所述图形图像的至少一个模型以识别形式类型,将形式图像解析为区域,确定图像形式类型 所述形式图像包括:(a)在形式图像上检测至少一个所述图形图像以识别形式类型,(b)基于检测到的图形图像与 所述模型,以及(c)使用补充数据进行深刻分析,所述主要识别导致形式图像类型的多种可能性。

    Method and system for creating flexible structure descriptions
    4.
    发明授权
    Method and system for creating flexible structure descriptions 有权
    创建灵活结构描述的方法和系统

    公开(公告)号:US08233714B2

    公开(公告)日:2012-07-31

    申请号:US12364266

    申请日:2009-02-02

    IPC分类号: G06K9/34

    摘要: A method related to data capture from forms involving optical character recognition comprises detecting data fields on a scanned image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training the flexible document description using a search algorithm to detect the data fields on additional training images based on the set of search elements.

    摘要翻译: 与涉及光学字符识别的形式的数据捕获相关的方法包括检测扫描图像上的数据字段; 基于检测到的数据字段生成灵活的文档描述,包括为每个数据字段创建一组搜索元素,每个搜索元素具有相关联的搜索准则; 并使用搜索算法训练灵活的文档描述,以基于搜索元素集来检测附加训练图像上的数据字段。

    Method of pre-analysis of a machine-readable form image
    5.
    发明授权
    Method of pre-analysis of a machine-readable form image 有权
    机器可读形式图像的预分析方法

    公开(公告)号:US07881561B2

    公开(公告)日:2011-02-01

    申请号:US10603215

    申请日:2003-06-26

    IPC分类号: G06K9/36 G06F17/00

    摘要: The present invention relates generally to an optical character recognition of machine-readable forms, and in particular to a verification of a direction of spatial orientation and a definition of a form type of the document electronic image. The goals of the invention are achieved by preliminarily assigning one or more form objects as elements composing a graphic image unambiguously defining its direction of spatial orientation. Similarly, one or more form objects are preliminarily assigned as elements composing a graphic image unambiguously defining its type. The direction of spatial orientation and the type of the form are verified via identification of said images. The models of graphic images either for verification the direction of spatial orientation or for defining the form type are stored in a special data storage means, one of the embodiment of which is form model description.

    摘要翻译: 本发明一般涉及机器可读形式的光学字符识别,特别涉及对空间取向的方向的验证以及文档电子图像的形式类型的定义。 本发明的目的是通过预先分配一个或多个形式对象作为构成图形图像的元素来明确地定义其空间取向的方向来实现。 类似地,一个或多个表单对象被预先分配为构成图形图像的元素,明确地定义其类型。 通过所述图像的识别来验证空间取向的方向和形式的类型。 用于验证空间取向的方向或用于定义形式类型的图形图像的模型存储在特殊数据存储装置中,其中一个实施例是形式模型描述。

    Method and System for Creating Flexible Structure Descriptions
    6.
    发明申请
    Method and System for Creating Flexible Structure Descriptions 有权
    创建灵活结构描述的方法和系统

    公开(公告)号:US20090175532A1

    公开(公告)日:2009-07-09

    申请号:US12364266

    申请日:2009-02-02

    IPC分类号: G06K9/62 G06F3/048

    摘要: In one embodiment, the invention provides a method, comprising detecting data fields on a scanned image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training the flexible document description using a search algorithm to detect the data fields on additional training images based on the set of search elements.

    摘要翻译: 在一个实施例中,本发明提供一种方法,包括检测扫描图像上的数据字段; 基于检测到的数据字段生成灵活的文档描述,包括为每个数据字段创建一组搜索元素,每个搜索元素具有相关联的搜索准则; 并使用搜索算法训练灵活的文档描述,以基于搜索元素集来检测附加训练图像上的数据字段。

    Method of pre-analysis of a machine-readable form image
    7.
    发明申请
    Method of pre-analysis of a machine-readable form image 有权
    机器可读形式图像的预分析方法

    公开(公告)号:US20060274941A1

    公开(公告)日:2006-12-07

    申请号:US10603215

    申请日:2003-06-26

    IPC分类号: G06K9/00 G06K9/34

    摘要: The present invention relates generally to an optical character recognition of machine-readable forms, and in particular to a verification of a direction of spatial orientation and a definition of a form type of the document electronic image. The goals of the invention are achieved by preliminarily assigning one or more form objects as elements composing a graphic image unambiguously defining its direction of spatial orientation. Similarly, one or more form objects are preliminarily assigned as elements composing a graphic image unambiguously defining its type. The direction of spatial orientation and the type of the form are verified via identification of said images. The models of graphic images either for verification the direction of spatial orientation or for defining the form type are stored in a special data storage means, one of the embodiment of which is form model description.

    摘要翻译: 本发明一般涉及机器可读形式的光学字符识别,特别涉及对空间取向的方向的验证以及文档电子图像的形式类型的定义。 本发明的目的是通过预先分配一个或多个形式对象作为构成图形图像的元素来明确地定义其空间取向的方向来实现。 类似地,一个或多个表单对象被预先分配为构成图形图像的元素,明确地定义其类型。 通过所述图像的识别来验证空间取向的方向和形式的类型。 用于验证空间取向的方向或用于定义形式类型的图形图像的模型存储在特殊数据存储装置中,其中一个实施例是形式模型描述。

    Creating Flexible Structure Descriptions
    8.
    发明申请
    Creating Flexible Structure Descriptions 有权
    创建灵活的结构描述

    公开(公告)号:US20130198615A1

    公开(公告)日:2013-08-01

    申请号:US13562791

    申请日:2012-07-31

    IPC分类号: G06F17/21

    摘要: In one embodiment, the invention provides a method, comprising detecting data fields on a scanned document image; generating a flexible document description based on the detected data fields, including creating a set of search elements for each data field, each search element having associated search criteria; and training or modifying the flexible document description using, for example, a search algorithm to detect the data fields on additional training images based on the set of search elements.

    摘要翻译: 在一个实施例中,本发明提供一种方法,包括检测扫描的文档图像上的数据字段; 基于检测到的数据字段生成灵活的文档描述,包括为每个数据字段创建一组搜索元素,每个搜索元素具有相关联的搜索准则; 以及使用例如搜索算法来训练或修改柔性文档描述,以基于搜索元素集来检测附加训练图像上的数据字段。

    Method of describing the structure of graphical objects
    9.
    发明授权
    Method of describing the structure of graphical objects 有权
    描述图形对象结构的方法

    公开(公告)号:US08171391B2

    公开(公告)日:2012-05-01

    申请号:US11556196

    申请日:2006-11-03

    IPC分类号: G06F17/27 G06K9/34

    CPC分类号: G06F17/30958

    摘要: The proposed technical solution allows processing of machine-readable forms of unfixed format. It comprises a method of specifying the logical structure of a document characterized by: preliminary specification of the list and descriptions of varieties of elements which may be present in the form, specifying an algorithm of setting the search constraints for every element, description of at least the following characteristics of search for every simple or compound element—the spatial characteristics of the search area and the parametric characteristics of the element, description of the method of identification of obtained elements, testing the type of the element, testing the properties which are typical of the type, testing the completeness of composition of the parts of the element.

    摘要翻译: 所提出的技术解决方案允许处理非固定格式的机器可读形式。 它包括指定文档的逻辑结构的方法,其特征在于:初始指定列表的形式和可以以形式存在的元素的种类的描述,指定为每个元素设置搜索约束的算法,至少描述 搜索每个简单或复合元素的特征 - 搜索区域的空间特征和元素的参数特征,所获取元素的识别方法的描述,测试元素的类型,测试典型的属性 的类型,测试元件部件的组成的完整性。

    Flexible Structure Descriptions for Multi-Page Documents
    10.
    发明申请
    Flexible Structure Descriptions for Multi-Page Documents 有权
    多页文档的灵活结构描述

    公开(公告)号:US20120243055A1

    公开(公告)日:2012-09-27

    申请号:US13242653

    申请日:2011-09-23

    IPC分类号: H04N1/40

    摘要: A method for processing a batch of scanned images is provided. The method comprises processing the scanned images into documents. For documents of multiple pages, the method comprises maintaining a page-based coordinate system to specify a location of structures within a page and joining the pages to form a multi-page sheet having a sheet-based coordinate system to specify a location of structures within the multi-page sheet. The method comprises performing a data extraction operation to extract data from each document, said data extraction operation including a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.

    摘要翻译: 提供了一批处理扫描图像的方法。 该方法包括将扫描的图像处理成文档。 对于多页的文档,该方法包括维护基于页面的坐标系统以指定页面内的结构的位置并且连接页面以形成具有基于纸张的坐标系的多页面表格,以指定页面内的结构的位置 多页表。 该方法包括执行数据提取操作以从每个文档提取数据,所述数据提取操作包括页面模式,其中使用基于页面的坐标系统在各个页面上检测结构,以及文档模式,其中在整个文档内检测到结构使用 基于表的坐标系。