Flexible Structure Descriptions for Multi-Page Documents
    1.
    发明申请
    Flexible Structure Descriptions for Multi-Page Documents 有权
    多页文档的灵活结构描述

    公开(公告)号:US20120243055A1

    公开(公告)日:2012-09-27

    申请号:US13242653

    申请日:2011-09-23

    IPC分类号: H04N1/40

    摘要: A method for processing a batch of scanned images is provided. The method comprises processing the scanned images into documents. For documents of multiple pages, the method comprises maintaining a page-based coordinate system to specify a location of structures within a page and joining the pages to form a multi-page sheet having a sheet-based coordinate system to specify a location of structures within the multi-page sheet. The method comprises performing a data extraction operation to extract data from each document, said data extraction operation including a page mode wherein structures are detected on individual pages using the page-based coordinate system and a document mode wherein structures are detected within the entire document using the sheet-based coordinate system.

    摘要翻译: 提供了一批处理扫描图像的方法。 该方法包括将扫描的图像处理成文档。 对于多页的文档,该方法包括维护基于页面的坐标系统以指定页面内的结构的位置并且连接页面以形成具有基于纸张的坐标系的多页面表格,以指定页面内的结构的位置 多页表。 该方法包括执行数据提取操作以从每个文档提取数据,所述数据提取操作包括页面模式,其中使用基于页面的坐标系统在各个页面上检测结构,以及文档模式,其中在整个文档内检测到结构使用 基于表的坐标系。

    Method for object recognition and describing structure of graphical objects
    2.
    发明授权
    Method for object recognition and describing structure of graphical objects 有权
    用于对象识别和描述图形对象结构的方法

    公开(公告)号:US09224040B2

    公开(公告)日:2015-12-29

    申请号:US13242218

    申请日:2011-09-23

    IPC分类号: G06F3/00 G06K9/00

    CPC分类号: G06K9/00463 G06K9/00456

    摘要: The invention involves a method for processing of machine-readable forms or documents of non-fixed format. The method makes use of, for example, a structural description of characteristics of document elements, a description of a logical structure of the document, and methods of searching for document elements by using the structural description. A structural description of the spatial and parametric characteristics of document elements and the logical connections between elements may include a hierarchical logical structure of the elements, specification of an algorithm of determining the search constraints, specification of characteristics of every searched element, and specification of a set of parameters for a compound element identified on the basis of the aggregate of its components. The method of describing the logical structure of a document and methods of searching for elements of a document may be based on the use of the structural description.

    摘要翻译: 本发明涉及一种用于处理非固定格式的机器可读形式或文档的方法。 该方法利用例如文档元素的特征的结构描述,文档的逻辑结构的描述以及通过使用结构描述搜索文档元素的方法。 文档元素的空间和参数特征以及元素之间的逻辑连接的结构描述可以包括元素的分层逻辑结构,确定搜索约束的算法的规范,每个搜索元素的特征的规范,以及 基于其组件的集合确定的复合元素的参数集合。 描述文档的逻辑结构的方法和搜索文档的元素的方法可以基于结构描述的使用。