Assisted OCR
    1.
    发明申请

    公开(公告)号:US20150063698A1

    公开(公告)日:2015-03-05

    申请号:US14012143

    申请日:2013-08-28

    CPC classification number: G06K9/18 G06K9/344

    Abstract: A method including determining a position of each glyph in an image of a text document, identifying word boundaries in the document thereby implying the existence of a first plurality of words, preparing a first array of word lengths based on the first plurality of words, preparing a second array of word lengths based on a second plurality of words of a text file including a certain text, comparing at least part of the first array to at least part of the second array to find a best alignment between the first and second array, deriving a layout of at least part of the certain text as arranged in the image of the text document at least based on the best alignment and the position of at least some of the glyphs in the image. Related apparatus and methods are also described.

    Assisted OCR
    2.
    发明授权
    Assisted OCR 有权
    辅助OCR

    公开(公告)号:US09092688B2

    公开(公告)日:2015-07-28

    申请号:US14012143

    申请日:2013-08-28

    CPC classification number: G06K9/18 G06K9/344

    Abstract: A method including determining a position of each glyph in an image of a text document, identifying word boundaries in the document thereby implying the existence of a first plurality of words, preparing a first array of word lengths based on the first plurality of words, preparing a second array of word lengths based on a second plurality of words of a text file including a certain text, comparing at least part of the first array to at least part of the second array to find a best alignment between the first and second array, deriving a layout of at least part of the certain text as arranged in the image of the text document at least based on the best alignment and the position of at least some of the glyphs in the image. Related apparatus and methods are also described.

    Abstract translation: 一种方法,包括确定文本文档的图像中的每个字形的位置,识别文档中的字边界,从而意味着存在第一多个单词,基于第一多个单词准备第一个字长数组,准备 基于包括特定文本的文本文件的第二多个单词的第二长度字阵列,将所述第一阵列的至少一部分与所述第二阵列的至少一部分进行比较以找到所述第一和第二阵列之间的最佳对准, 至少基于图像中的至少一些字形的最佳对齐和位置,导出布置在文本文档的图像中的至少部分某些文本的布局。 还描述了相关装置和方法。

Patent Agency Ranking