Document processing device and document processing method
    3.
    Granted patent
    Document processing device and document processing method (status: lapsed)

    Publication No.: US07680331B2

    Publication date: 2010-03-16

    Application No.: US11071311

    Filing date: 2005-03-04

    IPC classification: G06K9/00 G06K9/46

    CPC classification: G06K9/6857

    Abstract: The present invention provides a document processing device including: a general feature vector memory that stores feature vectors of a shape for each of plural characters; an input unit that optically reads in a document; an extracting unit that extracts feature vectors from the shapes of characters in the document read in by the input unit; a general shape recognition unit that estimates the character to which each extracted shape corresponds, based on the feature vectors extracted by the extracting unit and the content stored in the general feature vector memory; and a specific feature vector memory that stores the feature vectors extracted by the extracting unit in association with an estimation result of the general shape recognition unit.
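
The data flow in this abstract (general memory, extraction, estimation, specific memory) can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not say how the recognition unit compares vectors, so nearest-neighbour matching by Euclidean distance is an assumption here, and the names `GENERAL_MEMORY`, `estimate_character`, and `process_document` as well as all vector values are hypothetical. Optical input and feature extraction are omitted; the sketch starts from already extracted vectors.

```python
import math

# Hypothetical "general feature vector memory": one reference vector per character.
# The values are made up for illustration.
GENERAL_MEMORY = {
    "A": (1.0, 0.0, 0.5),
    "B": (0.0, 1.0, 0.5),
    "C": (0.5, 0.5, 1.0),
}


def estimate_character(features, general_memory):
    """Return the character whose stored vector is closest to `features`.

    Assumption: closeness is Euclidean distance; the abstract does not
    specify the matching rule.
    """
    return min(general_memory, key=lambda ch: math.dist(features, general_memory[ch]))


def process_document(extracted_vectors, general_memory):
    """Estimate a character for each extracted vector and record the pairing.

    Returns (recognized_text, specific_memory). specific_memory plays the role
    of the "specific feature vector memory": it maps each estimated character
    to the vectors actually observed in this document.
    """
    specific_memory = {}
    recognized = []
    for features in extracted_vectors:
        ch = estimate_character(features, general_memory)
        recognized.append(ch)
        specific_memory.setdefault(ch, []).append(features)
    return "".join(recognized), specific_memory


# Vectors as they might come out of the extracting unit for three character shapes.
observed = [(0.9, 0.1, 0.4), (0.1, 0.9, 0.6), (0.8, 0.0, 0.5)]
text, specific = process_document(observed, GENERAL_MEMORY)
print(text)
print(specific)
```

Keeping the observed vectors next to their estimated characters gives the device a document-specific record of how each character actually looks in the scanned material, separate from the general reference set.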

    Document processing device and document processing method
    7.
    Patent application
    Document processing device and document processing method (status: lapsed)

    Publication No.: US20050265602A1

    Publication date: 2005-12-01

    Application No.: US11071311

    Filing date: 2005-03-04

    CPC classification: G06K9/6857

    Abstract: The present invention provides a document processing device including: a general feature vector memory that stores feature vectors of a shape for each of plural characters; an input unit that optically reads in a document; an extracting unit that extracts feature vectors from the shapes of characters in the document read in by the input unit; a general shape recognition unit that estimates the character to which each extracted shape corresponds, based on the feature vectors extracted by the extracting unit and the content stored in the general feature vector memory; and a specific feature vector memory that stores the feature vectors extracted by the extracting unit in association with an estimation result of the general shape recognition unit.

    Document processing device, document processing method, and storage medium recording program therefor
    10.
    Patent application
    Document processing device, document processing method, and storage medium recording program therefor (status: under examination, published)

    Publication No.: US20060062492A1

    Publication date: 2006-03-23

    Application No.: US11080924

    Filing date: 2005-03-16

    IPC classification: G06K9/54 H04N1/00 G06K9/00

    CPC classification: G06F17/271

    Abstract: The invention provides a document processing device including: a memory that stores syntax data expressing the syntax of character strings whose probability of being a title of a document is high or of character strings whose probability of being a title of a document is low; an input unit that inputs document data obtained by digitizing a document; an extraction unit that analyzes the input document data and extracts character string data expressing character strings; a syntax analyzing unit that analyzes the extracted character string data and specifies the syntax of each character string contained in the document corresponding to the document data; and a specifying unit that specifies, from among the extracted character string data, character string data expressing a title of the document corresponding to the document data, based on results of specification by the syntax analyzing unit and content stored in the memory.
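
The title-specification flow in this abstract can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not define what the syntax data looks like, so the three coarse syntax labels, the numeric scores in `SYNTAX_MEMORY`, and the helper names `analyze_syntax` and `specify_title` are all hypothetical. A real syntax analyzing unit would presumably perform proper linguistic parsing rather than the punctuation and date checks used here.

```python
import re

# Hypothetical syntax memory: coarse syntax label -> likelihood of being a title.
# Both the labels and the scores are assumptions for illustration.
SYNTAX_MEMORY = {
    "noun_phrase": 0.9,   # e.g. "Quarterly Sales Report": high title probability
    "sentence": 0.2,      # full sentences are rarely titles
    "date_line": 0.05,    # bare dates are almost never titles
}


def analyze_syntax(string):
    """Toy stand-in for the syntax analyzing unit: assign a coarse label."""
    s = string.strip()
    if re.fullmatch(r"\d{4}-\d{2}-\d{2}", s):
        return "date_line"
    if s.endswith((".", "?", "!")):
        return "sentence"
    return "noun_phrase"


def specify_title(strings, syntax_memory):
    """Return the string whose syntax has the highest stored title likelihood.

    Ties go to the string that appears first; returns None for empty input.
    """
    if not strings:
        return None
    return max(strings, key=lambda s: syntax_memory.get(analyze_syntax(s), 0.0))


# Character strings as they might come out of the extraction unit.
extracted = [
    "2005-03-16",
    "Quarterly Sales Report",
    "This report summarizes sales for the third quarter.",
    "Prepared by the sales team",
]
print(specify_title(extracted, SYNTAX_MEMORY))
```

Storing both high-probability and low-probability patterns, as the abstract describes, lets the specifying unit rule strings out as well as rule them in; the sketch models that with a single score per label.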