Character recognition method, character recognition device, and computer product
    11.
    发明申请
    Character recognition method, character recognition device, and computer product 有权
    字符识别方法,字符识别装置和计算机产品

    公开(公告)号:US20080069447A1

    公开(公告)日:2008-03-20

    申请号:US11654180

    申请日:2007-01-16

    IPC分类号: G06K9/18

    CPC分类号: G06K9/346 G06K2209/01

    摘要: Upon receiving, for example, document data including a character string from outside, a character recognition device detects a line from a line-touching character-string image in which at least one character (such as number, alphabet letter, kana character, and Chinese character) touches (or overlaps) a line in the document data, tentatively removes the line, and estimates a character region. The character recognition device extracts a line-touching character image from the line-touching character-string image (original image) based on the estimated character region. The character recognition device creates a line-added reference character image by adding a quasi-line to a reference character image stored in advance.

    摘要翻译: 例如,在从外部接收到包括字符串的文档数据的情况下,字符识别装置从至少一个字符(例如号码,字母,假名字符和中文)的线条触摸字符串图像中检测线 字符)触摸(或重叠)文档数据中的一行,暂时删除该行,并估计字符区域。 字符识别装置基于估计的字符区域从线条触摸字符串图像(原始图像)中提取线条触摸字符图像。 字符识别装置通过向预先存储的参考字符图像添加准行来创建线条添加的参考文字图像。

    Apparatus and method of analyzing layout of document, and computer product
    12.
    发明授权
    Apparatus and method of analyzing layout of document, and computer product 失效
    分析文件布局和计算机产品的装置和方法

    公开(公告)号:US07257253B2

    公开(公告)日:2007-08-14

    申请号:US10350180

    申请日:2003-01-24

    IPC分类号: G06K9/34

    CPC分类号: G06K9/00463

    摘要: In an apparatus for analyzing a layout of a document, a character candidate element generator generates character candidate elements from black pixel linkage components of a document image. A horizontally oriented line rectangle generator sets a plurality of character candidate elements as a line candidate rectangle, among character candidate elements aligned in horizontal line orientation, when each amount of displacement of the set character candidate elements in a vertical orientation with respect to the horizontal line orientation, is smaller than or equal to a threshold value. A horizontally oriented paragraph-box generator sets a plurality of line candidate elements having approximately the same length as each other in the vertical orientation, as a paragraph candidate element.

    摘要翻译: 在用于分析文档的布局的装置中,字符候选元素生成器从文档图像的黑色像素连接分量生成角色候选元素。 当水平方向的线矩形发生器在垂直方向上相对于水平线的每个位移量时,将多个字符候选元素设置为在水平行方向对齐的字符候选元素中的行候选矩形 取向小于或等于阈值。 水平定向的段落框生成器将在垂直方向上彼此具有大致相同长度的多个行候选元素设置为段落候选元素。

    Image distortion correcting method and apparatus, and storage medium
    14.
    发明授权
    Image distortion correcting method and apparatus, and storage medium 有权
    图像失真校正方法和装置以及存储介质

    公开(公告)号:US07418126B2

    公开(公告)日:2008-08-26

    申请号:US10609575

    申请日:2003-07-01

    IPC分类号: G06K9/00 G06K9/36

    摘要: An image distortion correcting apparatus is provided with an image input section to input an image of a flat rectangular paper surface imaged by an imaging section, as an input image, an imaging position estimating section to estimate a relative imaging position of the imaging section with respect to the paper surface from four vertexes of the rectangular paper surface within the input image, a rectangular paper surface estimating section to estimate four vertexes of the rectangular paper surface within a three-dimensional space based on the imaging position, and an image correcting section to correct a perspective transformation distortion in the paper surface within the input image based on the imaging position and the four vertexes within the three-dimensional space, so as to output an output image.

    摘要翻译: 图像失真校正装置设置有图像输入部分,以输入由成像部分成像的平面矩形纸表面的图像作为输入图像,成像位置估计部分,用于估计成像部分的相对成像位置, 在输入图像内从矩形纸表面的四个顶点到纸张表面的矩形纸表面估计部分,基于成像位置来估计三维空间内的矩形纸表面的四个顶点,以及图像校正部分 基于成像位置和三维空间内的四个顶点来校正输入图像内的纸张表面的透视变换失真,以输出输出图像。

    Apparatus, method, and computer program for analyzing document layout

    公开(公告)号:US20060204096A1

    公开(公告)日:2006-09-14

    申请号:US11175127

    申请日:2005-07-05

    IPC分类号: G06K9/34

    CPC分类号: G06K9/00463

    摘要: A document layout analysis program capable of extracting an appropriate set of text blocks from a given document image even in the case where the document layout is so complicated that conventional extraction methods with a single extraction condition would not work well. A plurality of different extraction conditions are stored in an extraction condition memory for use in extracting text blocks from a given document image. In accordance with those extraction conditions, a text block extractor extracts a plurality of sets of text blocks from the document image. A text block consolidator produces a consolidated set of text blocks by performing character recognition on each extracted text block, evaluating validity of each text block based on a result of the character recognition, and selecting most valid text blocks from among the plurality of sets of text blocks.

    Image processing apparatus and method generating binary image from a multilevel image
    17.
    发明授权
    Image processing apparatus and method generating binary image from a multilevel image 有权
    从多级图像生成二值图像的图像处理装置和方法

    公开(公告)号:US07146047B2

    公开(公告)日:2006-12-05

    申请号:US09956933

    申请日:2001-09-21

    IPC分类号: G06K9/46

    CPC分类号: G06K9/346 G06K2209/01

    摘要: An information processing apparatus extracts a plurality of strokes from a multilevel image, and generates a stroke binary image. Next, the image processing apparatus extracts feature amounts indicating the thickness and the smoothed graylevel of a stroke in a neighboring region of a target pixel by using each pixel belonging to each of the strokes as the target pixel. Then, the apparatus generates a target stroke binary image from the stroke binary image based on the distribution of the extracted feature amounts.

    摘要翻译: 信息处理装置从多级图像提取多个笔画,并生成笔画二进制图像。 接下来,图像处理装置通过使用属于每个笔画的每个像素作为目标像素,提取指示目标像素的相邻区域中的笔画的厚度和平滑灰度的特征量。 然后,该装置基于提取的特征量的分布从笔划二进制图像生成目标笔划二进制图像。

    Character recognition method, character recognition device, and computer product
    19.
    发明授权
    Character recognition method, character recognition device, and computer product 有权
    字符识别方法,字符识别装置和计算机产品

    公开(公告)号:US07796817B2

    公开(公告)日:2010-09-14

    申请号:US11654180

    申请日:2007-01-16

    IPC分类号: G06K9/00 G06K9/18

    CPC分类号: G06K9/346 G06K2209/01

    摘要: Upon receiving, for example, document data including a character string from outside, a character recognition device detects a line from a line-touching character-string image in which at least one character (such as number, alphabet letter, kana character, and Chinese character) touches (or overlaps) a line in the document data, tentatively removes the line, and estimates a character region. The character recognition device extracts a line-touching character image from the line-touching character-string image (original image) based on the estimated character region. The character recognition device creates a line-added reference character image by adding a quasi-line to a reference character image stored in advance.

    摘要翻译: 例如,在从外部接收到包括字符串的文档数据的情况下,字符识别装置从至少一个字符(例如号码,字母,假名字符和中文)的线条触摸字符串图像中检测线 字符)触摸(或重叠)文档数据中的一行,暂时删除该行,并估计字符区域。 字符识别装置基于估计的字符区域从线条触摸字符串图像(原始图像)中提取线条触摸字符图像。 字符识别装置通过向预先存储的参考字符图像添加准行来创建线条添加的参考文字图像。