Program for correcting image distortion, apparatus for correcting image distortion, method for correcting image distortion, and recording medium storing program for correcting image distortion
    21.
    Invention application (status: lapsed)

    Publication No.: US20060140504A1

    Publication date: 2006-06-29

    Application No.: US11359096

    Filing date: 2006-02-22

    IPC classification: G06K9/40

    Abstract: A projection set of geodesic lines that are parallel to each other on the curved surface of a paper face is extracted from an image of the paper face captured by an image-pickup device, using the contents of the paper face as a clue. A projection set of ruling lines, which form a ruled surface corresponding to the curved surface of the paper face, is then extracted from the projection set of geodesic lines. The curved surface of the paper face is estimated from the projection sets of geodesic lines and ruling lines, and distortion of the image is corrected based on this estimated surface. This makes it possible to handle many different types of distortion and to correct distortion even when only part of the paper face appears in the image.

    Storage medium, apparatus and method for recognizing characters in a document image using document recognition
    22.
    Granted invention patent (status: in force)

    Publication No.: US08515175B2

    Publication date: 2013-08-20

    Application No.: US12392798

    Filing date: 2009-02-25

    IPC classification: G06K9/18

    CPC classification: G06K9/00463

    Abstract: A program causes a computer to function as a document recognition apparatus having: an extraction unit for extracting connected components of pixels from an input image; a generation unit for generating, as elements to be estimated, a reference element consisting of connected components of pixels extracted by the extraction unit and combined elements obtained by combining the reference element with connected components of pixels adjacent to it; a calculation unit for calculating a degree of certainty that indicates how likely each element to be estimated is to be a character; and a determination unit for identifying, based on the degree of certainty calculated by the calculation unit, which of the elements to be estimated are likely to be characters.
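
The first step named in this abstract, extracting connected components of pixels, can be illustrated with a minimal sketch. It is not the patented implementation: the binary list-of-lists image format, the function names `connected_components` and `bounding_box`, and the choice of breadth-first search are all assumptions made for illustration.

```python
from collections import deque


def connected_components(image, connectivity=8):
    """Return the foreground components of a binary image.

    `image` is a list of rows holding 0 (background) or 1 (foreground).
    Each component is returned as a list of (row, col) pixels.
    """
    rows = len(image)
    cols = len(image[0]) if rows else 0
    if connectivity == 8:
        offsets = [(dr, dc) for dr in (-1, 0, 1) for dc in (-1, 0, 1)
                   if (dr, dc) != (0, 0)]
    else:  # 4-connectivity: only horizontal and vertical neighbours
        offsets = [(-1, 0), (1, 0), (0, -1), (0, 1)]

    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if not image[r][c] or seen[r][c]:
                continue
            # Breadth-first flood fill starting from an unvisited pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            pixels = []
            while queue:
                y, x = queue.popleft()
                pixels.append((y, x))
                for dy, dx in offsets:
                    ny, nx = y + dy, x + dx
                    if (0 <= ny < rows and 0 <= nx < cols
                            and image[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            components.append(pixels)
    return components


def bounding_box(pixels):
    """Return (min_row, min_col, max_row, max_col) of a component."""
    ys = [p[0] for p in pixels]
    xs = [p[1] for p in pixels]
    return (min(ys), min(xs), max(ys), max(xs))
```

In a pipeline like the one described, the bounding boxes of neighbouring components would presumably be merged to form the combined elements, and each candidate would then be scored for how character-like it is; that scoring step is not specified in the abstract and is left out here.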

    Area extraction program, character recognition program, and character recognition device
    23.
    Granted invention patent (status: in force)

    Publication No.: US08300942B2

    Publication date: 2012-10-30

    Application No.: US12366004

    Filing date: 2009-02-05

    IPC classification: G06K9/46 G06K9/72

    Abstract: An area extraction method including obtaining a character lattice showing a connection relation between unit areas, which are obtained by separating a character string pattern in an image into patterns each recognized as corresponding to a single character, judging whether or not all combinations of each of the unit areas in the obtained character lattice and each of the unit areas in a regular lattice defining a regular connection relation between the unit areas are likely to be established, generating a path coupling between nodes corresponding to the combination of the unit areas which is determined as likely to be established, determining an optimum path from the generated paths based on a degree of coincidence with the regular lattice or the character lattice, and extracting from an image the unit areas in the character lattice corresponding to the determined optimum path.

    Ruled-line-projection extracting apparatus, ruled-line projection extracting method, and computer product
    24.
    Granted invention patent (status: in force)

    Publication No.: US07903874B2

    Publication date: 2011-03-08

    Application No.: US11894188

    Filing date: 2007-08-20

    IPC classification: G06K9/34

    Abstract: A set of straight lines that associate a top parallel geodesic projection positioned at an upper end with a bottom parallel geodesic projection positioned at a lower end, among sets of parallel geodesic projections, is extracted as a set of ruled-line candidate projections as a search target of a set of ruled line projections. A deviation of neighborhood, which is a distance between a cross ratio vector of the ruled-line candidate projection and a cross ratio vector of a neighboring line obtained by shifting the ruled-line candidate projection by a predetermined interval, is calculated for each ruled-line candidate projection. A set of straight lines having the smallest sum total of deviations of neighborhood, in the set of straight lines, which do not intersect with each other, among the sets of ruled-line projection candidates is extracted as a set of ruled line projections by continuous dynamic programming.
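
The cross ratio that this abstract relies on is the classical projective invariant of four collinear points. The sketch below shows only that invariant and a simple distance between two cross ratio vectors. The function names, the 1-D parameterisation of points along a line, and the use of Euclidean distance for the "deviation of neighborhood" are assumptions; the abstract does not specify how the cross ratio vector is built or which distance is used.

```python
import math


def cross_ratio(a, b, c, d):
    """Cross ratio (a, b; c, d) of four collinear points.

    Each argument is the 1-D coordinate of a point along the line.
    """
    return ((c - a) * (d - b)) / ((c - b) * (d - a))


def projective_map(t, h):
    """Apply the 1-D projective map t -> (h00*t + h01) / (h10*t + h11)."""
    (h00, h01), (h10, h11) = h
    return (h00 * t + h01) / (h10 * t + h11)


def neighborhood_deviation(u, v):
    """Distance between two cross ratio vectors (Euclidean, by assumption)."""
    return math.dist(u, v)


if __name__ == "__main__":
    points = [0.0, 1.0, 2.0, 3.0]
    h = ((2.0, 1.0), (0.1, 1.0))  # arbitrary non-degenerate projective map
    mapped = [projective_map(t, h) for t in points]
    # The two values agree up to floating-point rounding.
    print(cross_ratio(*points), cross_ratio(*mapped))
```

Because the cross ratio survives perspective projection, points where a true ruling line crosses the parallel geodesics should yield nearly the same cross ratios as those of a slightly shifted neighbouring line, which is presumably why a small deviation marks a good candidate.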

    Layout analysis program, layout analysis apparatus and layout analysis method
    25.
    Invention application (status: in force)

    Publication No.: US20070140560A1

    Publication date: 2007-06-21

    Application No.: US11384327

    Filing date: 2006-03-21

    IPC classification: G06K9/34

    CPC classification: G06K9/00463

    Abstract: A layout analysis program, a layout analysis apparatus, a layout analysis method and a medium can highly accurately extract a text block from an image if the image is a color image. The layout analysis program causes a computer to execute: a divided region extracting step that extracts a region partitioned by a pattern according to a binary image and uses the outcome of extraction as a divided region; a character element set extracting step that, for each extracted divided region, extracts a set of the character elements extracted by a first binary image layout analysis process and uses the outcome of extraction as a set of character elements; a text block extracting step that extracts a region including the extracted set of character elements in each divided region, so as to avoid overlapping the non-character elements extracted by a second binary image layout analysis process, and uses the outcome of extraction as a text block; and a layout information generating step that generates layout information according to the text block and the non-character elements extracted by the second binary image layout analysis process.

    Apparatus, method, and computer program for analyzing document layout

    Publication No.: US20060204096A1

    Publication date: 2006-09-14

    Application No.: US11175127

    Filing date: 2005-07-05

    IPC classification: G06K9/34

    CPC classification: G06K9/00463

    Abstract: A document layout analysis program capable of extracting an appropriate set of text blocks from a given document image even in the case where the document layout is so complicated that conventional extraction methods with a single extraction condition would not work well. A plurality of different extraction conditions are stored in an extraction condition memory for use in extracting text blocks from a given document image. In accordance with those extraction conditions, a text block extractor extracts a plurality of sets of text blocks from the document image. A text block consolidator produces a consolidated set of text blocks by performing character recognition on each extracted text block, evaluating validity of each text block based on a result of the character recognition, and selecting most valid text blocks from among the plurality of sets of text blocks.

    Apparatus and method for analyzing and determining correlation of information in a document
    27.
    Granted invention patent (status: in force)

    Publication No.: US08224090B2

    Publication date: 2012-07-17

    Application No.: US12005527

    Filing date: 2007-12-27

    IPC classification: G06K9/34

    CPC classification: G06K9/00463

    Abstract: According to an aspect of an embodiment, an apparatus for analyzing and determining correlation of information contained in a given form containing blocks, at least one of the blocks containing data indicative of a header, the rest of the blocks containing data in association with header information, comprising: a memory for storing templates having nodes, character data associated with said nodes respectively, and relative position information between said nodes; and a processor for analyzing and determining correlation of the information according to a process comprising: obtaining data contained in said blocks in the given form, determining relative position of said blocks to produce relative position information, analyzing the data obtained from the blocks and the relative position information of the blocks in comparison with the character data and the relative position information of said nodes of said templates, and determining correlation of the data contained in said blocks.

    Correcting device and method for perspective transformed document images
    28.
    Granted invention patent (status: in force)

    Publication No.: US08170368B2

    Publication date: 2012-05-01

    Application No.: US12076122

    Filing date: 2008-03-13

    CPC classification: G06K9/3283 G06K2009/363

    Abstract: This invention provides a correcting device and a correcting method for perspective transformation of document images. The correcting device comprises a horizontal vanishing point determining unit, for detecting a horizontal vanishing point of the perspective transformed document image; a vertical vanishing point determining unit, for detecting a vertical vanishing point of the perspective transformed document image; and a perspective transformation correcting and converting unit, for correcting the perspective transformed document image; wherein the horizontal vanishing point determining unit comprises a direct horizontal line segment detecting unit, an indirect horizontal line segment detecting unit and a horizontal vanishing point detecting unit, and wherein the horizontal vanishing point detecting unit detects a horizontal vanishing point in accordance with a direct horizontal line segment detected by the direct horizontal line segment detecting unit and an indirect horizontal line segment detected by the indirect horizontal line segment detecting unit.
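
As a rough illustration of what detecting a vanishing point from line segments involves, the sketch below intersects the lines through two segments using homogeneous coordinates. It is a simplification under stated assumptions: the function names are hypothetical, only two segments are used, and the patent's distinction between direct and indirect horizontal line segments is not modelled. A practical detector would combine many segments and tolerate noise.

```python
def line_through(p, q):
    """Homogeneous line (a, b, c), with a*x + b*y + c = 0, through p and q."""
    (x1, y1), (x2, y2) = p, q
    # Cross product of the homogeneous points (x1, y1, 1) and (x2, y2, 1).
    return (y1 - y2, x2 - x1, x1 * y2 - x2 * y1)


def intersect(l1, l2, eps=1e-12):
    """Intersection of two homogeneous lines, or None if they are parallel."""
    a1, b1, c1 = l1
    a2, b2, c2 = l2
    # Cross product of the two line vectors gives the homogeneous point.
    x = b1 * c2 - b2 * c1
    y = c1 * a2 - c2 * a1
    w = a1 * b2 - a2 * b1
    if abs(w) < eps:
        return None  # parallel lines meet at infinity
    return (x / w, y / w)


def vanishing_point(segment_a, segment_b):
    """Estimate a vanishing point from two segments, each a pair of points."""
    return intersect(line_through(*segment_a), line_through(*segment_b))


if __name__ == "__main__":
    # Two text-line-like segments that converge to the right of the page.
    top = ((0, 0), (2, 1))
    bottom = ((0, 4), (2, 3))
    print(vanishing_point(top, bottom))
```

With both a horizontal and a vertical vanishing point in hand, a perspective correction can be derived; that final step is outside this sketch.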

    Ruled line extracting program, ruled line extracting apparatus and ruled line extracting method
    29.
    Granted invention patent (status: in force)

    Publication No.: US07769234B2

    Publication date: 2010-08-03

    Application No.: US11607758

    Filing date: 2006-11-30

    IPC classification: G06K9/46

    Abstract: A ruled line extracting apparatus, a ruled line extracting program and a ruled line extracting method re-extract a ruled line by changing the predetermined requirements to be met by ruled lines when a ruled line candidate extracted according to the requirements shows a low reliability. A ruled line extracting program that causes a computer to extract a ruled line in an image of a document comprises: an extraction step that extracts a ruled line candidate from the image of a document according to the first requirement predefined to be met by the figures of the elements of the ruled lines; a judgment step that judges whether the ruled line candidate is stable or unstable according to the structural stability of the ruled line candidate extracted in the extraction step; a requirement determination step that determines the second requirement to be met by the figures of the elements of the ruled line, different from the first requirement, according to the ruled line candidate judged as stable in the judgment step and the first requirement; and a re-extraction step that re-extracts a ruled line candidate according to the second requirement determined in the requirement determination step.

    Layout analysis program, layout analysis apparatus and layout analysis method
    30.
    Granted invention patent (status: in force)

    Publication No.: US07711189B2

    Publication date: 2010-05-04

    Application No.: US11384327

    Filing date: 2006-03-21

    IPC classification: G06K9/34 G06K9/46 G06K9/00

    CPC classification: G06K9/00463

    Abstract: A layout analysis program, a layout analysis apparatus, a layout analysis method and a medium can highly accurately extract a text block from an image if the image is a color image. The layout analysis program causes a computer to execute: a divided region extracting step that extracts a region partitioned by a pattern according to a binary image and uses the outcome of extraction as a divided region; a character element set extracting step that, for each extracted divided region, extracts a set of the character elements extracted by a first binary image layout analysis process and uses the outcome of extraction as a set of character elements; a text block extracting step that extracts a region including the extracted set of character elements in each divided region, so as to avoid overlapping the non-character elements extracted by a second binary image layout analysis process, and uses the outcome of extraction as a text block; and a layout information generating step that generates layout information according to the text block and the non-character elements extracted by the second binary image layout analysis process.
