Character area extracting device, imaging device having character area extracting function, recording medium saving character area extracting programs, and character area extracting method
    11.
    Patent application
    Status: Active

    Publication No.: US20110255785A1

    Publication date: 2011-10-20

    Application No.: US13067133

    Filing date: 2011-05-11

    IPC classification: G06K9/18

    Abstract: A character area extracting device includes a reflective and non-reflective area separation unit that separates image data into reflective and non-reflective areas and binarizes the image data, changing a first threshold value when that value is inappropriate; a reflective area binarizing unit that separates the reflective area into character and background areas and binarizes it, changing a second threshold value when that value is inappropriate; a non-reflective area binarizing unit that separates the non-reflective area into character and background areas and binarizes it, changing a third threshold value when that value is inappropriate; a reflective and non-reflective area separation evaluation unit; and a line extracting unit that connects the character areas of the reflective and non-reflective areas and extracts positional information of the connected character areas in the image data.
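
The region-specific binarization described in this abstract can be illustrated with a minimal Python sketch. It assumes a grayscale image stored as nested lists, characters that are darker than their local background, and fixed threshold values; the function names `extract_character_mask` and `bounding_box` are hypothetical. The abstract does not say how a threshold is judged inappropriate, so the threshold-adjustment step is left out.

```python
def extract_character_mask(image, t_split, t_reflective, t_non_reflective):
    """Return a 0/1 mask marking pixels classified as character pixels."""
    mask = []
    for row in image:
        mask_row = []
        for value in row:
            # Pixels at or above t_split are treated as glare (reflective area).
            threshold = t_reflective if value >= t_split else t_non_reflective
            # Assumption: characters are darker than their local background.
            mask_row.append(1 if value < threshold else 0)
        mask.append(mask_row)
    return mask


def bounding_box(mask):
    """Return (top, left, bottom, right) of all character pixels, or None."""
    coords = [(r, c) for r, row in enumerate(mask) for c, v in enumerate(row) if v]
    if not coords:
        return None
    rows = [r for r, _ in coords]
    cols = [c for _, c in coords]
    return (min(rows), min(cols), max(rows), max(cols))


# Left half is normally lit; right half is washed out by glare.
image = [
    [120, 120, 120, 240, 240, 240],
    [120,  30, 120, 240, 180, 240],
    [120, 120, 120, 240, 240, 240],
]
mask = extract_character_mask(image, t_split=150, t_reflective=210, t_non_reflective=75)
box = bounding_box(mask)  # positional information covering both areas
```

A single global threshold of 150 would classify the whole left half as character pixels, which is the failure the separate thresholds avoid in this toy image.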

    Logical structure analyzing apparatus, method, and computer product
    12.
    Granted patent
    Status: Active

    Publication No.: US08010564B2

    Publication date: 2011-08-30

    Application No.: US12180202

    Filing date: 2008-07-25

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06K9/00469

    Abstract: A logical structure analyzing apparatus includes an extracting unit that extracts word candidates from a form, a first generating unit that classifies each of the word candidates into a group of heading candidates or a group of data candidates to generate, based on positions of the word candidates on the form, first candidate sets each including one heading candidate and one data candidate identifiable by the heading candidate, and a second generating unit that combines the first candidate sets to generate second candidate sets that each include plural heading candidates that differ and one data candidate. The apparatus also includes a removing unit that, based on positions of the heading candidates and the data word candidate in each second candidate set, removes from among the second candidate sets, a determined set including a data item and headings identifying the data item, and an output unit that outputs the determined set.

    LOGICAL STRUCTURE ANALYZING APPARATUS, METHOD, AND COMPUTER PRODUCT
    13.
    Patent application
    Status: Active

    Publication No.: US20090112797A1

    Publication date: 2009-04-30

    Application No.: US12180202

    Filing date: 2008-07-25

    IPC classification: G06F17/30

    CPC classification: G06K9/00469

    Abstract: A logical structure analyzing apparatus includes an extracting unit that extracts word candidates from a form, a first generating unit that classifies each of the word candidates into a group of heading candidates or a group of data candidates to generate, based on positions of the word candidates on the form, first candidate sets each including one heading candidate and one data candidate identifiable by the heading candidate, and a second generating unit that combines the first candidate sets to generate second candidate sets that each include plural heading candidates that differ and one data candidate. The apparatus also includes a removing unit that, based on positions of the heading candidates and the data word candidate in each second candidate set, removes from among the second candidate sets, a determined set including a data item and headings identifying the data item, and an output unit that outputs the determined set.

    Method and apparatus for recognizing boundary line in an image information
    14.
    Patent application
    Status: Active

    Publication No.: US20080199082A1

    Publication date: 2008-08-21

    Application No.: US12071050

    Filing date: 2008-02-14

    IPC classification: G06K9/48

    Abstract: According to an aspect of an embodiment, a method of detecting boundary line information contained in image information comprising a plurality of pixels, each in one of first and second states, comprises: detecting a first group of pixels in the first state disposed continuously in said image information to determine first line information, and detecting a second group of pixels in the first state disposed adjacent to each other and surrounded by pixels in the second state to determine edge information based on the contour of the second group of pixels; and determining the boundary line information on the basis of the relative positions of the line information and the edge information and the sizes of the first and second groups of pixels.

    Method and apparatus for recognizing boundary line in an image information
    15.
    Granted patent
    Status: Active

    Publication No.: US08582888B2

    Publication date: 2013-11-12

    Application No.: US12071050

    Filing date: 2008-02-14

    IPC classification: G06K9/18

    Abstract: According to an aspect of an embodiment, a method of detecting boundary line information contained in image information comprising a plurality of pixels, each in one of first and second states, comprises: detecting a first group of pixels in the first state disposed continuously in said image information to determine first line information, and detecting a second group of pixels in the first state disposed adjacent to each other and surrounded by pixels in the second state to determine edge information based on the contour of the second group of pixels; and determining the boundary line information on the basis of the relative positions of the line information and the edge information and the sizes of the first and second groups of pixels.

    Document type identifying method and document type identifying apparatus
    16.
    Granted patent
    Status: Active

    Publication No.: US08275792B2

    Publication date: 2012-09-25

    Application No.: US12585155

    Filing date: 2009-09-04

    IPC classification: G06F17/30

    CPC classification: G06K9/2054 G06K2209/01

    Abstract: A document type identifying apparatus is provided in advance with a database that stores keywords used as keys for identifying document types, each in association with its document type. The apparatus aligns word strings written on a document and, using the keywords stored in the database, generates partial keyword strings for each keyword. The partial keyword strings are checked for matches against the word strings written on the document. The apparatus then checks the grouped and aligned word strings against the partial keyword strings and obtains, for each keyword, the number of matched words with the highest matching rate between the successfully matched grouped word strings and the partial keyword strings. Each number of matched words is then used to calculate an evaluation value, from which the document type is determined.
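
The matching of partial keyword strings against document text can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented procedure: "words" are treated as characters, partial keyword strings are taken to be contiguous substrings of at least two characters, and the evaluation value is the mean matched fraction per document type. The names `partial_strings`, `best_match_length`, and `identify_document_type` are hypothetical, and the grouping and alignment of word strings is omitted.

```python
def partial_strings(keyword, min_len=2):
    """All contiguous substrings of keyword with at least min_len characters, longest first."""
    n = len(keyword)
    parts = {keyword[i:j] for i in range(n) for j in range(i + min_len, n + 1)}
    return sorted(parts, key=len, reverse=True)


def best_match_length(keyword, text):
    """Length of the longest partial keyword string found in text (0 if none)."""
    for part in partial_strings(keyword):
        if part in text:
            return len(part)
    return 0


def identify_document_type(text, keywords_by_type):
    """Return (best_type, scores), scoring each type by its mean matched fraction."""
    scores = {}
    for doc_type, keywords in keywords_by_type.items():
        fractions = [best_match_length(k, text) / len(k) for k in keywords]
        scores[doc_type] = sum(fractions) / len(fractions)
    return max(scores, key=scores.get), scores


# Hypothetical keyword database: document type -> identifying keywords.
keywords_by_type = {
    "invoice": ["INVOICE", "AMOUNT DUE"],
    "receipt": ["RECEIPT", "CHANGE"],
}
# Simulated OCR output in which the first letter of "INVOICE" was lost.
text = "NVOICE NO 1042 AMOUNT DUE 300"
best_type, scores = identify_document_type(text, keywords_by_type)
```

Matching on partial strings lets a keyword still contribute to the score when recognition drops or corrupts some of its characters.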

    Form processing method, form processing device, and computer product
    17.
    Patent application
    Status: Active

    Publication No.: US20080025618A1

    Publication date: 2008-01-31

    Application No.: US11599685

    Filing date: 2006-11-15

    IPC classification: G06K9/46 G06K9/72 G06K9/66

    CPC classification: G06K9/00449

    Abstract: A form processing apparatus extracts layout information and character information from a form document. A candidate extracting unit extracts word candidates from the character information. A frequency digitizing unit calculates an emission probability of a word candidate from each element. A relation digitizing unit calculates a transition probability that a relationship between word candidates is established. An evaluating unit calculates an evaluation value indicative of a probability of appearance of word candidates in respective logical elements. Based on the evaluation value, a determining unit determines the element and a word candidate thereof as the element and a character string thereof in the form document.
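
Emission and transition probabilities are the ingredients of a hidden-Markov-style model, so one plausible way to combine them into an evaluation value is a Viterbi search over a chain of word candidates. The abstract does not name the algorithm; the sketch below is an assumption, and the element names and all probability values are invented for illustration.

```python
def viterbi(words, elements, start_p, trans_p, emit_p):
    """Return (best_probability, best_element_sequence) for the word sequence."""
    # best[e] = (probability, path) of the best assignment ending in element e
    best = {e: (start_p[e] * emit_p[e].get(words[0], 0.0), [e]) for e in elements}
    for word in words[1:]:
        step = {}
        for e in elements:
            # Pick the predecessor element that maximizes the path probability.
            prob, path = max(
                ((best[prev][0] * trans_p[prev][e], best[prev][1]) for prev in elements),
                key=lambda item: item[0],
            )
            step[e] = (prob * emit_p[e].get(word, 0.0), path + [e])
        best = step
    return max(best.values(), key=lambda item: item[0])


elements = ["heading", "data"]  # hypothetical logical elements
start_p = {"heading": 0.8, "data": 0.2}
trans_p = {
    "heading": {"heading": 0.1, "data": 0.9},
    "data": {"heading": 0.6, "data": 0.4},
}
emit_p = {
    "heading": {"Name": 0.7, "Alice": 0.05},
    "data": {"Name": 0.1, "Alice": 0.6},
}
probability, assignment = viterbi(["Name", "Alice"], elements, start_p, trans_p, emit_p)
```

Here the evaluation value is the joint probability of the best assignment, and the assignment itself pairs each word candidate with a logical element.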

    Form processing method, form processing device, and computer product
    18.
    Granted patent
    Status: Active

    Publication No.: US07792369B2

    Publication date: 2010-09-07

    Application No.: US11599685

    Filing date: 2006-11-15

    IPC classification: G06K9/72

    CPC classification: G06K9/00449

    Abstract: A form processing apparatus extracts layout information and character information from a form document. A candidate extracting unit extracts word candidates from the character information. A frequency digitizing unit calculates an emission probability of a word candidate from each element. A relation digitizing unit calculates a transition probability that a relationship between word candidates is established. An evaluating unit calculates an evaluation value indicative of a probability of appearance of word candidates in respective logical elements. Based on the evaluation value, a determining unit determines the element and a word candidate thereof as the element and a character string thereof in the form document.

    Image processing apparatus and image processing method
    19.
    Granted patent
    Status: Active

    Publication No.: US08913117B2

    Publication date: 2014-12-16

    Application No.: US13541228

    Filing date: 2012-07-03

    Abstract: An image processing apparatus 10 acquires an image and extracts, from the acquired image, a domain characterizing an object of gesture recognition. The image processing apparatus 10 then maps domains between frames of the image and extracts a moving direction of the domains. It outputs the moving direction when a moving distance of the domains is greater than a predetermined threshold, and then updates the threshold using a moving distance exceeding the threshold.
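
The output-then-update behavior of the threshold can be sketched as below. The abstract does not give the update rule, so this sketch assumes exponential smoothing toward a fixed fraction of the observed distance; the class name `GestureDirectionDetector` and the `ratio` and `smoothing` parameters are hypothetical. Domain extraction and frame-to-frame mapping are replaced by a precomputed displacement `(dx, dy)`.

```python
import math


class GestureDirectionDetector:
    def __init__(self, threshold, ratio=0.5, smoothing=0.5):
        self.threshold = threshold
        self.ratio = ratio          # hypothetical: target threshold as a fraction of gesture size
        self.smoothing = smoothing  # hypothetical: weight given to the newest gesture

    def update(self, dx, dy):
        """Return 'left'/'right'/'up'/'down' for a large enough move, else None."""
        distance = math.hypot(dx, dy)
        if distance <= self.threshold:
            return None
        if abs(dx) >= abs(dy):
            direction = "right" if dx > 0 else "left"
        else:
            direction = "down" if dy > 0 else "up"  # image y axis points down
        # Adapt the threshold using the distance that exceeded it.
        target = self.ratio * distance
        self.threshold = (1 - self.smoothing) * self.threshold + self.smoothing * target
        return direction


detector = GestureDirectionDetector(threshold=20.0)
first = detector.update(5, 3)     # small jitter, below the threshold
second = detector.update(80, 10)  # large swipe, reported and used to adapt
```

With this rule, a user who makes large gestures gradually raises the threshold, which makes small unintended movements less likely to be reported.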

    APPARATUS FOR AND METHOD OF GENERATING CLASSIFIER FOR DETECTING SPECIFIC OBJECT IN IMAGE
    20.
    Patent application
    Status: Under examination (published)

    Publication No.: US20120163708A1

    Publication date: 2012-06-28

    Application No.: US13335077

    Filing date: 2011-12-22

    IPC classification: G06K9/62

    Abstract: Provided are an apparatus for and a method of generating a classifier for detecting a specific object in an image. The apparatus includes: a region dividing section for dividing, from a sample image, at least one square region having a side length equal to or shorter than the length of the shorter side of the sample image; a feature extracting section for extracting an image feature from at least a part of the square regions divided by the region dividing section; and a training section for performing training based on the extracted image feature to generate a classifier. Using the apparatus and method, it becomes possible to make full use of the recognizable regions of objects with variable aspect ratios and to improve the speed and accuracy of recognition in complex backgrounds.
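
The region-dividing step can be illustrated with a short sketch that cuts squares out of a sample image of arbitrary aspect ratio. It assumes the side length equals the shorter image side (the abstract also allows shorter sides) and that squares slide along the longer axis; the function name `square_regions` and the `stride` parameter are hypothetical. Feature extraction and training are not shown.

```python
def square_regions(width, height, stride=None):
    """Return (x, y, side) squares with side = min(width, height) covering the image."""
    side = min(width, height)
    stride = stride or side
    longer = max(width, height)
    positions = list(range(0, longer - side + 1, stride))
    if positions[-1] != longer - side:
        positions.append(longer - side)  # last square sits flush with the far edge
    if width >= height:
        return [(x, 0, side) for x in positions]
    return [(0, y, side) for y in positions]


# A wide 100x40 sample yields three 40x40 squares; the last one overlaps the second.
regions = square_regions(width=100, height=40)
```

Because every region is square, samples with different aspect ratios produce inputs of one shape, which is what lets a single classifier be trained on them.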
