Method and apparatus to convert bitmapped images for use in a structured text/graphics editor
    3.
    发明授权
    Method and apparatus to convert bitmapped images for use in a structured text/graphics editor 有权
    用于转换位图图像以用于结构化文本/图形编辑器的方法和装置

    公开(公告)号:US07576753B2

    公开(公告)日:2009-08-18

    申请号:US11375809

    申请日:2006-03-15

    IPC分类号: G09G5/02

    CPC分类号: G06K9/00463

    摘要: An image analysis and conversion method and system. Bitmapped ink images are converted to structured object representations of the bitmapped images, which may be read and edited by a structured text/graphics editor. The structured object representations correlate to perceptually salient areas of the bitmapped images. The structured object representations are editable by the structured text/graphics editor to allow a user to generate alternative interpretations of the bitmapped images.

    摘要翻译: 一种图像分析与转换方法及系统。 位图油墨图像被转换为​​位图图像的结构化对象表示,其可由结构化文本/图形编辑器读取和编辑。 结构化对象表示与位图图像的感知显着区域相关。 结构化对象表示可由结构化文本/图形编辑器编辑,以允许用户生成位图图像的替代解释。

    System and method for forms recognition by synthesizing corrected localization of data fields
    7.
    发明授权
    System and method for forms recognition by synthesizing corrected localization of data fields 有权
    通过合成数据字段的校正定位来进行表单识别的系统和方法

    公开(公告)号:US09536141B2

    公开(公告)日:2017-01-03

    申请号:US13537729

    申请日:2012-06-29

    申请人: Eric Saund

    发明人: Eric Saund

    IPC分类号: G06F17/00 G06K9/00 G06F17/24

    摘要: A method and system generates an idealized image of a form. An image of a form and a template model of the form are received. The form includes data fields. Word boxes of the image are identified. The word boxes are assigned to corresponding data fields of the form. An idealized image of the from is generated based on the assignments and the template model.

    摘要翻译: 一种方法和系统产生一个形式的理想化图像。 接收表单的图像和表单的模板模型。 表单包括数据字段。 识别图像的字框。 单词框被分配给表单的相应数据字段。 基于分配和模板模型生成来自的理想化图像。

    Method for generating a graph lattice from a corpus of one or more data graphs

    公开(公告)号:US08872828B2

    公开(公告)日:2014-10-28

    申请号:US12883464

    申请日:2010-09-16

    申请人: Eric Saund

    发明人: Eric Saund

    IPC分类号: G06T17/20 G06T11/20

    CPC分类号: G06T11/206

    摘要: A document recognition system and method, where images are represented as a collection of primitive features whose spatial relations are represented as a graph. Useful subsets of all the possible subgraphs representing different portions of images are represented over a corpus of many images. The data structure is a lattice of subgraphs, and algorithms are provided means to build and use the graph lattice efficiently and effectively.

    System and method for forms classification by line-art alignment
    9.
    发明授权
    System and method for forms classification by line-art alignment 有权
    通过线条对齐形式分类的系统和方法

    公开(公告)号:US08792715B2

    公开(公告)日:2014-07-29

    申请号:US13539941

    申请日:2012-07-02

    IPC分类号: G06K9/00

    CPC分类号: G06K9/00449

    摘要: A system and method to classify forms. An image representing a form of an unknown document type is received. The image includes line-art. Further, a plurality of template models corresponding to a plurality of different document types is received. The plurality of different document types is intended to include the correct document type of the unknown document. A subset of the plurality of template models are selected as candidate template models. The candidate template models include line-art junctions best matching line-art junctions of the received image. One of the candidate template models is selected as a best candidate template model. The best candidate template model includes horizontal and vertical lines best matching horizontal and vertical lines of the received image, respectively, aligned to the best candidate template model.

    摘要翻译: 一种用于分类表单的系统和方法。 接收到表示未知文档类型的形式的图像。 图像包括线条艺术。 此外,接收对应于多个不同文档类型的多个模板模型。 多个不同的文档类型旨在包括未知文档的正确文档类型。 选择多个模板模型的子集作为候选模板模型。 候选模板模型包括最佳匹配接收图像的线艺术结的线艺术结。 选择候选模板模型之一作为最佳候选模板模型。 最佳候选模板模型包括分别与最佳候选模板模型对齐的最佳匹配接收图像的水平和垂直线的水平和垂直线。

    System and method for localizing data fields on structured and semi-structured forms
    10.
    发明授权
    System and method for localizing data fields on structured and semi-structured forms 有权
    用于本地化结构化和半结构化形式的数据字段的系统和方法

    公开(公告)号:US08781229B2

    公开(公告)日:2014-07-15

    申请号:US13537630

    申请日:2012-06-29

    申请人: Eric Saund

    发明人: Eric Saund

    IPC分类号: G06K9/34

    摘要: A method and system to localize data fields of a form. An image of a form is received, where the form includes data fields. Word boxes of the image are identified. The word boxes are grouped into candidate zones, where each of the candidate zones includes one or more of the word boxes. Hypotheses are formed from the data fields and the candidate zones, where each hypothesis assigns one of the candidate zones to one of the data fields or a null data field. A constrained optimization search of the hypotheses is performed for an optimal set of hypotheses. The optimal set of hypotheses assigns word box groups to corresponding data fields.

    摘要翻译: 本地化表单数据字段的方法和系统。 收到表单的图像,其中表单包括数据字段。 识别图像的字框。 单词框被分组成候选区域,其中每个候选区域包括一个或多个单词框。 假设从数据字段和候选区域形成,其中每个假设将一个候选区域分配给数据字段之一或空数据字段。 对于最优假设集执行假设的约束优化搜索。 最佳假设集合将字框组分配给相应的数据字段。