Image recognition apparatus, image recognition method, and storage medium recording image recognition program
    4.
    发明授权
    Image recognition apparatus, image recognition method, and storage medium recording image recognition program 有权
    图像识别装置,图像识别方法和存储介质记录图像识别程序

    公开(公告)号:US08503784B2

    公开(公告)日:2013-08-06

    申请号:US12250302

    申请日:2008-10-13

    IPC分类号: G06K9/00

    摘要: An image recognition apparatus recognizes the correspondence between character strings and logical elements composing a logical structure in an image in which the character strings are described as the logical elements to recognize each logical element. The image recognition apparatus includes outputting means for outputting the recognized logical elements when the correspondence is recognized or re-recognized; first determining means for determining a certain logical element to be correct when input of a determination request to determine the logical element is received from a user; second determining means for determining the correctness of all the logical elements output before the logical element determined by the first determining means and is positioned according to confirmation by the user; and re-recognizing means for re-recognizing the correspondence between logical elements that have not been determined to be correct and the character strings on the basis of the determination content for each logical element.

    摘要翻译: 图像识别装置识别字符串和组成逻辑结构的逻辑元件之间的对应关系,其中描述了字符串作为识别每个逻辑元件的逻辑元件的图像。 所述图像识别装置包括:输出装置,用于当所述对应被识别或重新识别时输出所识别的逻辑元件; 第一确定装置,用于当从用户接收到确定逻辑元件的确定请求的输入时,确定某个逻辑元件是正确的; 第二确定装置,用于确定在由第一确定装置确定的逻辑元件之前输出的所有逻辑元件的正确性,并且根据用户的确认定位; 以及重新识别装置,用于基于每个逻辑元素的确定内容来重新识别尚未被确定为正确的逻辑元素与字符串之间的对应关系。

    Recording medium for recording logical structure model creation assistance program, logical structure model creation assistance device and logical structure model creation assistance method
    6.
    发明授权
    Recording medium for recording logical structure model creation assistance program, logical structure model creation assistance device and logical structure model creation assistance method 有权
    用于记录逻辑结构模型创建辅助程序,逻辑结构模型创建辅助装置和逻辑结构模型创建辅助方法的记录介质

    公开(公告)号:US08249351B2

    公开(公告)日:2012-08-21

    申请号:US12328442

    申请日:2008-12-04

    IPC分类号: G06K9/00 G06F7/00 G06F17/00

    CPC分类号: G06F17/243

    摘要: A method for assisting in the creation of a logical structure model, which stores, from an image in which character strings associated respectively with a plurality of logical elements constituting a logical structure are described, the logical elements, character strings associated with the logical elements, and the logical structure, wherein character strings in an input image and the logical structure among the character strings in the input image are extracted, a logical element is selected among the plurality of logical elements according to the degrees of similarity between the extracted character strings and the character string associated respectively with the plurality of logical elements stored in the logical structure model, a character string associated with the selected logical element and a character string in the input image associated with the logical element based on the logical structure among the extracted character strings in the input image are extracted.

    摘要翻译: 一种辅助创建逻辑结构模型的方法,该逻辑结构模型存储从其中描述了分别与构成逻辑结构的多个逻辑元件相关联的字符串的图像,逻辑元素,与逻辑元素相关联的字符串, 以及逻辑结构,其中输入图像中的字符串和输入图像中的字符串之间的逻辑结构被提取,根据提取的字符串之间的相似度和多个逻辑元素之间的相似度来选择逻辑元素, 分别与存储在逻辑结构模型中的多个逻辑元素相关联的字符串,与所选择的逻辑元素相关联的字符串和基于提取的字符串中的逻辑结构与逻辑元素相关联的输入图像中的字符串 在输入图像中提取。

    IMAGE RECOGNITION APPARATUS, IMAGE RECOGNITION METHOD, AND STORAGE MEDIUM RECORDING IMAGE RECOGNITION PROGRAM
    8.
    发明申请
    IMAGE RECOGNITION APPARATUS, IMAGE RECOGNITION METHOD, AND STORAGE MEDIUM RECORDING IMAGE RECOGNITION PROGRAM 有权
    图像识别装置,图像识别方法和存储媒体记录图像识别程序

    公开(公告)号:US20090110282A1

    公开(公告)日:2009-04-30

    申请号:US12250302

    申请日:2008-10-13

    IPC分类号: G06K9/00

    摘要: An image recognition apparatus recognizes the correspondence between character strings and logical elements composing a logical structure in an image in which the character strings are described as the logical elements to recognize each logical element. The image recognition apparatus includes outputting means for outputting the recognized logical elements when the correspondence is recognized or re-recognized; first determining means for determining a certain logical element to be correct when input of a determination request to determine the logical element is received from a user; second determining means for determining the correctness of all the logical elements output before the logical element determined by the first determining means and is positioned according to confirmation by the user; and re-recognizing means for re-recognizing the correspondence between logical elements that have not been determined to be correct and the character strings on the basis of the determination content for each logical element.

    摘要翻译: 图像识别装置识别字符串和组成逻辑结构的逻辑元件之间的对应关系,其中描述了字符串作为识别每个逻辑元件的逻辑元件的图像。 所述图像识别装置包括:输出装置,用于当所述对应被识别或重新识别时输出所识别的逻辑元件; 第一确定装置,用于当从用户接收到确定逻辑元件的确定请求的输入时,确定某个逻辑元件是正确的; 第二确定装置,用于确定在由第一确定装置确定的逻辑元件之前输出的所有逻辑元件的正确性,并且根据用户的确认定位; 以及重新识别装置,用于基于每个逻辑元素的确定内容来重新识别尚未被确定为正确的逻辑元素与字符串之间的对应关系。

    STORAGE MEDIUM STORING DOCUMENT RECOGNITION PROGRAM, DOCUMENT RECOGNITION APPARATUS AND METHOD THEREOF
    9.
    发明申请
    STORAGE MEDIUM STORING DOCUMENT RECOGNITION PROGRAM, DOCUMENT RECOGNITION APPARATUS AND METHOD THEREOF 有权
    存储媒体存储文件识别程序,文档识别装置及其方法

    公开(公告)号:US20090226089A1

    公开(公告)日:2009-09-10

    申请号:US12392798

    申请日:2009-02-25

    IPC分类号: G06K9/18

    CPC分类号: G06K9/00463

    摘要: A program causes a computer to function as a document recognition apparatus, having an extraction unit for extracting connected components of pixels from an input image, a generation unit for generating a reference element that is connected components of pixels extracted by the extraction unit and combined elements obtained by combining the reference element and connected components of pixels adjacent to the reference element as an element to be estimated, a calculation unit for calculating a degree of certainty that indicates how much the element to be estimated generated by the generation unit seems to be a character, and a determination unit for identifying elements that seem to be characters among the elements to be estimated based on the degree of certainty calculated by the calculation unit.

    摘要翻译: 一种程序使计算机作为文件识别装置起作用,具有用于从输入图像中提取像素的连接分量的提取单元,生成单元,用于生成由提取单元提取的像素的连接分量和组合元素 通过组合参考元素和与参考元素相邻的像素的连接分量作为要估计的元素获得的计算单元,用于计算确定性程度的计算单元,其表示由生成单元生成的要估计的元素多少是 字符和确定单元,用于基于由计算单元计算出的确定性程度来识别要估计的要素中的字符的元素。

    Apparatus and method of analyzing layout of document, and computer product
    10.
    发明授权
    Apparatus and method of analyzing layout of document, and computer product 失效
    分析文件布局和计算机产品的装置和方法

    公开(公告)号:US07257253B2

    公开(公告)日:2007-08-14

    申请号:US10350180

    申请日:2003-01-24

    IPC分类号: G06K9/34

    CPC分类号: G06K9/00463

    摘要: In an apparatus for analyzing a layout of a document, a character candidate element generator generates character candidate elements from black pixel linkage components of a document image. A horizontally oriented line rectangle generator sets a plurality of character candidate elements as a line candidate rectangle, among character candidate elements aligned in horizontal line orientation, when each amount of displacement of the set character candidate elements in a vertical orientation with respect to the horizontal line orientation, is smaller than or equal to a threshold value. A horizontally oriented paragraph-box generator sets a plurality of line candidate elements having approximately the same length as each other in the vertical orientation, as a paragraph candidate element.

    摘要翻译: 在用于分析文档的布局的装置中,字符候选元素生成器从文档图像的黑色像素连接分量生成角色候选元素。 当水平方向的线矩形发生器在垂直方向上相对于水平线的每个位移量时,将多个字符候选元素设置为在水平行方向对齐的字符候选元素中的行候选矩形 取向小于或等于阈值。 水平定向的段落框生成器将在垂直方向上彼此具有大致相同长度的多个行候选元素设置为段落候选元素。