Apparatus and method of analyzing layout of document, and computer product
    1.
    Granted patent
    Status: lapsed

    Publication No.: US07257253B2

    Publication date: 2007-08-14

    Application No.: US10350180

    Filing date: 2003-01-24

    IPC classification: G06K9/34

    CPC classification: G06K9/00463

    Abstract: In an apparatus for analyzing a layout of a document, a character candidate element generator generates character candidate elements from black pixel linkage components of a document image. A horizontally oriented line rectangle generator sets a plurality of character candidate elements as a line candidate rectangle, among character candidate elements aligned in horizontal line orientation, when each amount of displacement of the set character candidate elements in a vertical orientation with respect to the horizontal line orientation, is smaller than or equal to a threshold value. A horizontally oriented paragraph-box generator sets a plurality of line candidate elements having approximately the same length as each other in the vertical orientation, as a paragraph candidate element.
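The line-grouping step in this abstract can be illustrated with a minimal sketch. It is not the patented method: the box layout `(x, y, width, height)`, the function names, and the greedy rule of comparing each box only with the last box of a line are all assumptions made for illustration.

```python
from typing import List, Tuple

# Hypothetical box layout: (x, y, width, height) in pixels.
Box = Tuple[int, int, int, int]


def group_into_lines(boxes: List[Box], max_dy: float) -> List[List[Box]]:
    """Greedily group character boxes into horizontal line candidates.

    A box joins a line when the vertical distance between its centre and
    the centre of the line's last box is at most max_dy (the threshold).
    """
    lines: List[List[Box]] = []
    for box in sorted(boxes, key=lambda b: b[0]):  # scan left to right
        centre_y = box[1] + box[3] / 2
        for line in lines:
            last = line[-1]
            last_centre_y = last[1] + last[3] / 2
            if abs(centre_y - last_centre_y) <= max_dy:
                line.append(box)
                break
        else:
            lines.append([box])  # no existing line is close enough
    return lines


def bounding_rect(line: List[Box]) -> Box:
    """Return the rectangle enclosing every box of a line candidate."""
    x0 = min(b[0] for b in line)
    y0 = min(b[1] for b in line)
    x1 = max(b[0] + b[2] for b in line)
    y1 = max(b[1] + b[3] for b in line)
    return (x0, y0, x1 - x0, y1 - y0)
```

Grouping line rectangles of similar extent into paragraph candidates could follow the same pattern with a tolerance on rectangle size instead of vertical offset.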

    STORAGE MEDIUM STORING DOCUMENT RECOGNITION PROGRAM, DOCUMENT RECOGNITION APPARATUS AND METHOD THEREOF
    2.
    Patent application
    Status: in force

    Publication No.: US20090226089A1

    Publication date: 2009-09-10

    Application No.: US12392798

    Filing date: 2009-02-25

    IPC classification: G06K9/18

    CPC classification: G06K9/00463

    Abstract: A program causes a computer to function as a document recognition apparatus, having an extraction unit for extracting connected components of pixels from an input image, a generation unit for generating a reference element that is connected components of pixels extracted by the extraction unit and combined elements obtained by combining the reference element and connected components of pixels adjacent to the reference element as an element to be estimated, a calculation unit for calculating a degree of certainty that indicates how much the element to be estimated generated by the generation unit seems to be a character, and a determination unit for identifying elements that seem to be characters among the elements to be estimated based on the degree of certainty calculated by the calculation unit.
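A rough sketch of the pipeline described here: extract connected components, form a combined candidate from a reference element and a neighbour, and keep whichever candidate scores higher. The abstract does not say how the degree of certainty is computed, so the `squareness` score below is purely a placeholder assumption, as are all function names.

```python
from collections import deque


def connected_components(image):
    """Return bounding boxes (x0, y0, x1, y1) of 8-connected pixels equal to 1."""
    height = len(image)
    width = len(image[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y in range(height):
        for x in range(width):
            if image[y][x] != 1 or seen[y][x]:
                continue
            seen[y][x] = True
            queue = deque([(x, y)])
            x0 = x1 = x
            y0 = y1 = y
            while queue:  # breadth-first flood fill of one component
                cx, cy = queue.popleft()
                x0, x1 = min(x0, cx), max(x1, cx)
                y0, y1 = min(y0, cy), max(y1, cy)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        nx, ny = cx + dx, cy + dy
                        if (0 <= nx < width and 0 <= ny < height
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((nx, ny))
            boxes.append((x0, y0, x1, y1))
    return boxes


def merge(a, b):
    """Bounding box of two boxes: the 'combined element'."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))


def squareness(box):
    """Placeholder certainty (an assumption): 1.0 for a square box, lower if elongated."""
    w = box[2] - box[0] + 1
    h = box[3] - box[1] + 1
    return min(w, h) / max(w, h)


def best_candidate(reference, neighbour, score=squareness):
    """Pick the higher-scoring of the reference alone and the combined element."""
    return max([reference, merge(reference, neighbour)], key=score)
```

In practice the score would more plausibly come from a character classifier; the structure of "generate candidates, score, select" is the only part taken from the abstract.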

    Program for correcting image distortion, apparatus for correcting image distortion, method for correcting image distortion, and recording medium storing program for correcting image distortion
    3.
    Patent application
    Status: lapsed

    Publication No.: US20060140504A1

    Publication date: 2006-06-29

    Application No.: US11359096

    Filing date: 2006-02-22

    IPC classification: G06K9/40

    Abstract: A projection set of geodesic lines which are parallel with each other on a curved surface of a paper face is extracted from an image in which a paper face has been imaged by an image-pickup device, using the paper face contents as a clue; and also a projection set of ruling lines which form a ruled surface corresponding to the curved surface of the paper face is extracted from the projection set of geodesic lines. Then, the curved surface of the paper face is estimated from the projection set of the geodesic lines and ruling lines, and distortion of the image is corrected based on this curved surface of the paper face. If this is done, correspondence with various types of diverse distortions becomes possible, and distortion correction can be performed even when only one part of the paper face appears in the image.

    Ruled-line-projection extracting apparatus, ruled-line projection extracting method, and computer product
    4.
    Granted patent
    Status: in force

    Publication No.: US07903874B2

    Publication date: 2011-03-08

    Application No.: US11894188

    Filing date: 2007-08-20

    IPC classification: G06K9/34

    Abstract: A set of straight lines that associate a top parallel geodesic projection positioned at an upper end with a bottom parallel geodesic projection positioned at a lower end, among sets of parallel geodesic projections, is extracted as a set of ruled-line candidate projections as a search target of a set of ruled line projections. A deviation of neighborhood, which is a distance between a cross ratio vector of the ruled-line candidate projection and a cross ratio vector of a neighboring line obtained by shifting the ruled-line candidate projection by a predetermined interval, is calculated for each ruled-line candidate projection. A set of straight lines having the smallest sum total of deviations of neighborhood, in the set of straight lines, which do not intersect with each other, among the sets of ruled-line projection candidates is extracted as a set of ruled line projections by continuous dynamic programming.
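The cross ratio mentioned here is the standard projective invariant of four collinear points, which is why it can be compared between a line and its shifted neighbour in a perspective image. The sketch below shows the invariant and a plain Euclidean distance between two vectors of such ratios. How the patent assembles the cross ratio vector is not stated in the abstract, so `deviation` is only an assumed form of the "deviation of neighborhood", and the dynamic-programming search is not shown.

```python
import math
from fractions import Fraction


def cross_ratio(a, b, c, d):
    """Cross ratio (a, b; c, d) of four collinear points given by 1-D coordinates."""
    return ((c - a) * (d - b)) / ((c - b) * (d - a))


def projective_map(x, p, q, r, s):
    """1-D projective transformation x -> (p*x + q) / (r*x + s)."""
    return (p * x + q) / (r * x + s)


def deviation(ratios_a, ratios_b):
    """Assumed 'deviation of neighborhood': Euclidean distance between two vectors."""
    return math.dist([float(v) for v in ratios_a], [float(v) for v in ratios_b])


# The cross ratio is unchanged by a projective map (exact arithmetic with Fraction).
points = [Fraction(v) for v in (0, 1, 2, 3)]
mapped = [projective_map(x, 2, 1, 1, 3) for x in points]
before = cross_ratio(*points)
after = cross_ratio(*mapped)
```

Here `before` and `after` are both 4/3, so a line whose cross ratios differ strongly from those of its neighbour is less likely to be a true ruled-line projection.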

    Apparatus, method, and computer program for analyzing document layout

    Publication No.: US20060204096A1

    Publication date: 2006-09-14

    Application No.: US11175127

    Filing date: 2005-07-05

    IPC classification: G06K9/34

    CPC classification: G06K9/00463

    Abstract: A document layout analysis program capable of extracting an appropriate set of text blocks from a given document image even in the case where the document layout is so complicated that conventional extraction methods with a single extraction condition would not work well. A plurality of different extraction conditions are stored in an extraction condition memory for use in extracting text blocks from a given document image. In accordance with those extraction conditions, a text block extractor extracts a plurality of sets of text blocks from the document image. A text block consolidator produces a consolidated set of text blocks by performing character recognition on each extracted text block, evaluating validity of each text block based on a result of the character recognition, and selecting most valid text blocks from among the plurality of sets of text blocks.

    Storage medium, apparatus and method for recognizing characters in a document image using document recognition
    7.
    Granted patent
    Status: in force

    Publication No.: US08515175B2

    Publication date: 2013-08-20

    Application No.: US12392798

    Filing date: 2009-02-25

    IPC classification: G06K9/18

    CPC classification: G06K9/00463

    Abstract: A program causes a computer to function as a document recognition apparatus, having an extraction unit for extracting connected components of pixels from an input image, a generation unit for generating a reference element that is connected components of pixels extracted by the extraction unit and combined elements obtained by combining the reference element and connected components of pixels adjacent to the reference element as an element to be estimated, a calculation unit for calculating a degree of certainty that indicates how much the element to be estimated generated by the generation unit seems to be a character, and a determination unit for identifying elements that seem to be characters among the elements to be estimated based on the degree of certainty calculated by the calculation unit.

    RECORDING MEDIUM FOR RECORDING LOGICAL STRUCTURE MODEL CREATION ASSISTANCE PROGRAM, LOGICAL STRUCTURE MODEL CREATION ASSISTANCE DEVICE AND LOGICAL STRUCTURE MODEL CREATION ASSISTANCE METHOD
    8.
    Patent application
    Status: in force

    Publication No.: US20090148049A1

    Publication date: 2009-06-11

    Application No.: US12328442

    Filing date: 2008-12-04

    IPC classification: G06K9/46

    CPC classification: G06F17/243

    Abstract: A method for assisting in the creation of a logical structure model, which stores, from an image in which character strings associated respectively with a plurality of logical elements constituting a logical structure are described, the logical elements, character strings associated with the logical elements, and the logical structure, wherein character strings in an input image and the logical structure among the character strings in the input image are extracted, a logical element is selected among the plurality of logical elements according to the degrees of similarity between the extracted character strings and the character string associated respectively with the plurality of logical elements stored in the logical structure model, a character string associated with the selected logical element and a character string in the input image associated with the logical element based on the logical structure among the extracted character strings in the input image are extracted.
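The selection step, choosing a logical element by string similarity, can be sketched with Python's standard `difflib`. The model contents, the element names, and the use of `SequenceMatcher.ratio()` as the similarity measure are illustrative assumptions; the abstract does not specify which similarity the method uses.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Case-insensitive similarity in [0, 1] based on matching subsequences."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def best_logical_element(extracted: str, model: dict) -> tuple:
    """Return (element_name, score) for the model string closest to `extracted`."""
    return max(
        ((name, similarity(extracted, label)) for name, label in model.items()),
        key=lambda pair: pair[1],
    )


# Hypothetical logical structure model: element name -> associated character string.
model = {
    "invoice_number": "Invoice No.",
    "date": "Date",
    "total": "Total amount",
}

name, score = best_logical_element("Invoice No", model)
```

With this toy model, an OCR string that drops the trailing period is still matched to `invoice_number`, which is the kind of tolerance a similarity-based selection provides.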

    Correcting device and method for perspective transformed document images
    10.
    Granted patent
    Status: in force

    Publication No.: US08170368B2

    Publication date: 2012-05-01

    Application No.: US12076122

    Filing date: 2008-03-13

    CPC classification: G06K9/3283 G06K2009/363

    Abstract: This invention provides a correcting device and a correcting method for perspective transformation of document images. The correcting device comprises a horizontal vanishing point determining unit, for detecting a horizontal vanishing point of the perspective transformed document image; a vertical vanishing point determining unit, for detecting a vertical vanishing point of the perspective transformed document image; and a perspective transformation correcting and converting unit, for correcting the perspective transformed document image; wherein the horizontal vanishing point determining unit comprises a direct horizontal line segment detecting unit, an indirect horizontal line segment detecting unit and a horizontal vanishing point detecting unit, and wherein the horizontal vanishing point detecting unit detects a horizontal vanishing point in accordance with a direct horizontal line segment detected by the direct horizontal line segment detecting unit and an indirect horizontal line segment detected by the indirect horizontal line segment detecting unit.
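As background for the vanishing-point units, the sketch below intersects two detected line segments using homogeneous coordinates, which is the textbook way to estimate where image lines that are parallel on the page converge. It is an assumption-laden illustration: the patent combines direct and indirect segments in a way the abstract does not detail, and a practical detector would fit many segments rather than two.

```python
def cross(u, v):
    """Cross product of two 3-vectors."""
    return (
        u[1] * v[2] - u[2] * v[1],
        u[2] * v[0] - u[0] * v[2],
        u[0] * v[1] - u[1] * v[0],
    )


def line_through(p, q):
    """Homogeneous line through two image points (x, y)."""
    return cross((p[0], p[1], 1), (q[0], q[1], 1))


def vanishing_point(segment_a, segment_b):
    """Intersect the lines through two segments.

    Returns (x, y), or None when the lines are parallel in the image
    (the vanishing point is then at infinity).
    """
    x, y, w = cross(line_through(*segment_a), line_through(*segment_b))
    if w == 0:
        return None
    return (x / w, y / w)


# Two text baselines that converge to the right of the page.
point = vanishing_point(((0, 0), (2, 1)), ((0, 4), (2, 3)))
```

Once horizontal and vertical vanishing points are known, they constrain the homography that maps the perspective-distorted page back to a fronto-parallel view.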
