Apparatus and method of analyzing layout of document, and computer product
    1.
    发明授权
    Apparatus and method of analyzing layout of document, and computer product 失效
    分析文件布局和计算机产品的装置和方法

    公开(公告)号:US07257253B2

    公开(公告)日:2007-08-14

    申请号:US10350180

    申请日:2003-01-24

    IPC分类号: G06K9/34

    CPC分类号: G06K9/00463

    摘要: In an apparatus for analyzing a layout of a document, a character candidate element generator generates character candidate elements from black pixel linkage components of a document image. A horizontally oriented line rectangle generator sets a plurality of character candidate elements as a line candidate rectangle, among character candidate elements aligned in horizontal line orientation, when each amount of displacement of the set character candidate elements in a vertical orientation with respect to the horizontal line orientation, is smaller than or equal to a threshold value. A horizontally oriented paragraph-box generator sets a plurality of line candidate elements having approximately the same length as each other in the vertical orientation, as a paragraph candidate element.

    摘要翻译: 在用于分析文档的布局的装置中,字符候选元素生成器从文档图像的黑色像素连接分量生成角色候选元素。 当水平方向的线矩形发生器在垂直方向上相对于水平线的每个位移量时,将多个字符候选元素设置为在水平行方向对齐的字符候选元素中的行候选矩形 取向小于或等于阈值。 水平定向的段落框生成器将在垂直方向上彼此具有大致相同长度的多个行候选元素设置为段落候选元素。

    Program for correcting image distortion, apparatus for correcting image distortion, method for correcting image distortion, and recording medium storing program for correcting image distortion
    2.
    发明申请
    Program for correcting image distortion, apparatus for correcting image distortion, method for correcting image distortion, and recording medium storing program for correcting image distortion 失效
    用于校正图像失真的程序,用于校正图像失真的装置,用于校正图像失真的方法,以及用于校正图像失真的记录介质存储程序

    公开(公告)号:US20060140504A1

    公开(公告)日:2006-06-29

    申请号:US11359096

    申请日:2006-02-22

    IPC分类号: G06K9/40

    摘要: A projection set of geodesic lines which are parallel with each other on a curved surface of a paper face is extracted from an image in which a paper face has been imaged by an image-pickup device, using the paper face contents as a clue; and also a projection set of ruling lines which form a ruled surface corresponding to the curved surface of the paper face is extracted from the projection set of geodesic lines. Then, the curved surface of the paper face is estimated from the projection set of the geodesic lines and ruling lines, and distortion of the image is corrected based on this curved surface of the paper face. If this is done, correspondence with various types of diverse distortions becomes possible, and distortion correction can be performed even when only one part of the paper face appears in the image.

    摘要翻译: 使用纸面内容作为线索,从通过图像拾取装置成像的纸面的图像中提取在纸面的弯曲表面上彼此平行的测地线的投影集; 并且从测地线的投影组中提取形成对应于纸面的弯曲表面的刻线表面的投影线的投影组。 然后,从测地线和划线的投影组估计纸面的弯曲表面,并且基于纸面的弯曲表面校正图像的变形。 如果这样做,就可以进行与各种各样的变形的对应关系,并且即使只有一部分纸面出现在图像中,也可以执行失真校正。

    Ruled-line-projection extracting apparatus, ruled-line projection extracting method, and computer product
    3.
    发明授权
    Ruled-line-projection extracting apparatus, ruled-line projection extracting method, and computer product 有权
    规则投影提取装置,规则投影提取方法和计算机产品

    公开(公告)号:US07903874B2

    公开(公告)日:2011-03-08

    申请号:US11894188

    申请日:2007-08-20

    IPC分类号: G06K9/34

    摘要: A set of straight lines that associate a top parallel geodesic projection positioned at an upper end with a bottom parallel geodesic projection positioned at a lower end, among sets of parallel geodesic projections, is extracted as a set of ruled-line candidate projections as a search target of a set of ruled line projections. A deviation of neighborhood, which is a distance between a cross ratio vector of the ruled-line candidate projection and a cross ratio vector of a neighboring line obtained by shifting the ruled-line candidate projection by a predetermined interval, is calculated for each ruled-line candidate projection. A set of straight lines having the smallest sum total of deviations of neighborhood, in the set of straight lines, which do not intersect with each other, among the sets of ruled-line projection candidates is extracted as a set of ruled line projections by continuous dynamic programming.

    摘要翻译: 将位于上端的顶部平行测地线突起与位于下方的平行测地线投影的底部平行测地线投影相关联的一组直线作为一组划线候选投影提取为搜索 一套规则线的预测目标。 对于每个被划线的候选投影,通过将划线候选投影移动预定间隔而获得的相邻行的交叉比矢量之间的距离为邻域的偏差, 线候选投影。 一组直线投影候选之间的直线投影的组合,在一组直线上彼此不相交的相邻偏差的总和最小的一组直线被提取为一组连续的线条投影 动态规划。

    Apparatus, method, and computer program for analyzing document layout

    公开(公告)号:US20060204096A1

    公开(公告)日:2006-09-14

    申请号:US11175127

    申请日:2005-07-05

    IPC分类号: G06K9/34

    CPC分类号: G06K9/00463

    摘要: A document layout analysis program capable of extracting an appropriate set of text blocks from a given document image even in the case where the document layout is so complicated that conventional extraction methods with a single extraction condition would not work well. A plurality of different extraction conditions are stored in an extraction condition memory for use in extracting text blocks from a given document image. In accordance with those extraction conditions, a text block extractor extracts a plurality of sets of text blocks from the document image. A text block consolidator produces a consolidated set of text blocks by performing character recognition on each extracted text block, evaluating validity of each text block based on a result of the character recognition, and selecting most valid text blocks from among the plurality of sets of text blocks.

    Program for correcting image distortion, apparatus for correcting image distortion, method for correcting image distortion, and recording medium storing program for correcting image distortion
    5.
    发明授权
    Program for correcting image distortion, apparatus for correcting image distortion, method for correcting image distortion, and recording medium storing program for correcting image distortion 失效
    用于校正图像失真的程序,用于校正图像失真的装置,用于校正图像失真的方法,以及用于校正图像失真的记录介质存储程序

    公开(公告)号:US07471848B2

    公开(公告)日:2008-12-30

    申请号:US11359096

    申请日:2006-02-22

    IPC分类号: G06K9/40

    摘要: A projection set of geodesic lines which are parallel with each other on a curved surface of a paper face is extracted from an image in which a paper face has been imaged by an image-pickup device, using the paper face contents as a clue; and also a projection set of ruling lines which form a ruled surface corresponding to the curved surface of the paper face is extracted from the projection set of geodesic lines. Then, the curved surface of the paper face is estimated from the projection set of the geodesic lines and ruling lines, and distortion of the image is corrected based on this curved surface of the paper face. If this is done, correspondence with various types of diverse distortions becomes possible, and distortion correction can be performed even when only one part of the paper face appears in the image.

    摘要翻译: 使用纸面内容作为线索,从通过图像拾取装置成像的纸面的图像中提取在纸面的弯曲表面上彼此平行的测地线的投影集; 并且从测地线的投影组中提取形成对应于纸面的弯曲表面的刻线表面的投影线的投影组。 然后,从测地线和划线的投影组估计纸面的弯曲表面,并且基于纸面的弯曲表面校正图像的变形。 如果这样做,就可以进行与各种各样的变形的对应关系,并且即使只有一部分纸面出现在图像中,也可以执行失真校正。

    Correcting device and method for perspective transformed document images
    6.
    发明申请
    Correcting device and method for perspective transformed document images 有权
    用于透视变换的文档图像的校正装置和方法

    公开(公告)号:US20080226171A1

    公开(公告)日:2008-09-18

    申请号:US12076122

    申请日:2008-03-13

    IPC分类号: G06K9/34

    CPC分类号: G06K9/3283 G06K2009/363

    摘要: This invention provides a correcting device and a correcting method for perspective transformation of document images. The correcting device comprises a horizontal vanishing point determining unit, for detecting a horizontal vanishing point of the perspective transformed document image; a vertical vanishing point determining unit, for detecting a vertical vanishing point of the perspective transformed document image; and a perspective transformation correcting and converting unit, for correcting the perspective transformed document image; wherein the horizontal vanishing point determining unit comprises a direct horizontal line segment detecting unit, an indirect horizontal line segment detecting unit and a horizontal vanishing point detecting unit, and wherein the horizontal vanishing point detecting unit detects a horizontal vanishing point in accordance with a direct horizontal line segment detected by the direct horizontal line segment detecting unit and an indirect horizontal line segment detected by the indirect horizontal line segment detecting unit.

    摘要翻译: 本发明提供了一种用于文件图像的透视变换的校正装置和校正方法。 校正装置包括水平消失点确定单元,用于检测透视变换文档图像的水平消失点; 垂直消失点确定单元,用于检测透视变换文档图像的垂直消失点; 以及透视变换校正和转换单元,用于校正透视变换的文档图像; 其中,所述水平消失点确定单元包括直接水平线段检测单元,间接水平线段检测单元和水平消失点检测单元,并且其中所述水平消失点检测单元根据直接水平检测水平消失点 由直接水平线段检测单元检测的线段和由间接水平线段检测单元检测的间接水平线段。

    Correcting device and method for perspective transformed document images
    7.
    发明授权
    Correcting device and method for perspective transformed document images 有权
    用于透视变换的文档图像的校正装置和方法

    公开(公告)号:US08170368B2

    公开(公告)日:2012-05-01

    申请号:US12076122

    申请日:2008-03-13

    CPC分类号: G06K9/3283 G06K2009/363

    摘要: This invention provides a correcting device and a correcting method for perspective transformation of document images. The correcting device comprises a horizontal vanishing point determining unit, for detecting a horizontal vanishing point of the perspective transformed document image; a vertical vanishing point determining unit, for detecting a vertical vanishing point of the perspective transformed document image; and a perspective transformation correcting and converting unit, for correcting the perspective transformed document image; wherein the horizontal vanishing point determining unit comprises a direct horizontal line segment detecting unit, an indirect horizontal line segment detecting unit and a horizontal vanishing point detecting unit, and wherein the horizontal vanishing point detecting unit detects a horizontal vanishing point in accordance with a direct horizontal line segment detected by the direct horizontal line segment detecting unit and an indirect horizontal line segment detected by the indirect horizontal line segment detecting unit.

    摘要翻译: 本发明提供了一种用于文件图像的透视变换的校正装置和校正方法。 校正装置包括水平消失点确定单元,用于检测透视变换文档图像的水平消失点; 垂直消失点确定单元,用于检测透视变换文档图像的垂直消失点; 以及透视变换校正和转换单元,用于校正透视变换的文档图像; 其中所述水平消失点确定单元包括直接水平线段检测单元,间接水平线段检测单元和水平消失点检测单元,并且其中所述水平消失点检测单元根据直接水平检测水平消失点 由直接水平线段检测单元检测的线段和由间接水平线段检测单元检测的间接水平线段。

    Apparatus, method, and computer program for analyzing document layout
    8.
    发明授权
    Apparatus, method, and computer program for analyzing document layout 有权
    用于分析文件布局的装置,方法和计算机程序

    公开(公告)号:US07627176B2

    公开(公告)日:2009-12-01

    申请号:US11175127

    申请日:2005-07-05

    IPC分类号: G06K9/34

    CPC分类号: G06K9/00463

    摘要: A document layout analysis program capable of extracting an appropriate set of text blocks from a given document image even in the case where the document layout is so complicated that conventional extraction methods with a single extraction condition would not work well. A plurality of different extraction conditions are stored in an extraction condition memory for use in extracting text blocks from a given document image. In accordance with those extraction conditions, a text block extractor extracts a plurality of sets of text blocks from the document image. A text block consolidator produces a consolidated set of text blocks by performing character recognition on each extracted text block, evaluating validity of each text block based on a result of the character recognition, and selecting most valid text blocks from among the plurality of sets of text blocks.

    摘要翻译: 一种文档布局分析程序,即使在文档布局如此复杂的情况下,能够从给定文档图像中提取适当的文本块集合,使得具有单个提取条件的常规提取方法将不能正常工作。 多个不同的提取条件存储在提取条件存储器中,用于从给定的文档图像中提取文本块。 根据这些提取条件,文本块提取器从文档图像中提取多组文本块。 文本块整合器通过对每个提取的文本块执行字符识别来生成一组合并的文本块,基于字符识别的结果来评估每个文本块的有效性,以及从多个文本集合中选择最有效的文本块 块。

    Ruled-line-projection extracting apparatus, ruled-line projection extracting method, and computer product
    9.
    发明申请
    Ruled-line-projection extracting apparatus, ruled-line projection extracting method, and computer product 有权
    规则投影提取装置,规则投影提取方法和计算机产品

    公开(公告)号:US20080112619A1

    公开(公告)日:2008-05-15

    申请号:US11894188

    申请日:2007-08-20

    IPC分类号: G06K9/34

    摘要: A set of straight lines that associate a top parallel geodesic projection positioned at an upper end with a bottom parallel geodesic projection positioned at a lower end, among sets of parallel geodesic projections, is extracted as a set of ruled-line candidate projections as a search target of a set of ruled line projections. A deviation of neighborhood, which is a distance between a cross ratio vector of the ruled-line candidate projection and a cross ratio vector of a neighboring line obtained by shifting the ruled-line candidate projection by a predetermined interval, is calculated for each ruled-line candidate projection. A set of straight lines having the smallest sum total of deviations of neighborhood, in the set of straight lines, which do not intersect with each other, among the sets of ruled-line projection candidates is extracted as a set of ruled line projections by continuous dynamic programming.

    摘要翻译: 将位于上端的顶部平行测地线突起与位于下方的平行测地线投影的底部平行测地线投影相关联的一组直线作为一组划线候选投影提取为搜索 一套规则线的预测目标。 对于每个被划线的候选投影,通过将划线候选投影移动预定间隔而获得的相邻行的交叉比矢量之间的距离为邻域的偏差, 线候选投影。 一组直线投影候选之间的直线投影的组合,在一组直线上彼此不相交的相邻偏差的总和最小的一组直线被提取为一组连续的线条投影 动态规划。

    Character area extracting device, imaging device having character area extracting function, recording medium saving character area extracting programs, and character area extracting method
    10.
    发明授权
    Character area extracting device, imaging device having character area extracting function, recording medium saving character area extracting programs, and character area extracting method 有权
    字符区域提取装置,具有字符区域提取功能的成像装置,记录介质保存字符区域提取程序和字符区域提取方法

    公开(公告)号:US08447113B2

    公开(公告)日:2013-05-21

    申请号:US13067133

    申请日:2011-05-11

    IPC分类号: G06K9/18

    摘要: A character area extracting device includes a reflective and non-reflective area separation unit separating image data into reflective and non-reflective areas, and binarizing the image data by changing a first threshold value when it is inappropriate; a reflective area binarizing unit separating the reflective area into character and background areas, and binarizing it by changing a second threshold value when it is inappropriate; a non-reflective area binarizing unit separating the non-reflective area into the character and background areas, and binarizing it by changing a third threshold value when it is inappropriate; a reflective and non-reflective area separation evaluation unit; and a line extracting unit connecting the character areas of the reflective and non-reflective areas and extracting positional information of the connected character areas in the image data.

    摘要翻译: 字符区域提取装置包括将图像数据分离成反射和非反射区域的反射和非反射区域分离单元,并且当不合适时通过改变第一阈值来二值化图像数据; 反射区域二值化单元,将反射区域分离成字符和背景区域,并且当不适当时通过改变第二阈值来对其进行二值化; 非反射区域二值化单元,将非反射区域分离成字符和背景区域,并且当不合适时通过改变第三阈值来二值化; 反射和非反射区域分离评估单元; 以及线提取单元,连接反射区域和非反射区域的字符区域,并提取图像数据中连接的字符区域的位置信息。