Document processing device, document processing method, and storage medium recording program therefor
    41.
    发明申请
    Document processing device, document processing method, and storage medium recording program therefor 审中-公开
    文件处理装置,文件处理方法和存储介质记录程序

    公开(公告)号:US20060039045A1

    公开(公告)日:2006-02-23

    申请号:US11080621

    申请日:2005-03-16

    IPC分类号: H04N1/387

    CPC分类号: G06K9/00469

    摘要: The present invention provides a document processing device including: an inputting unit that inputs page image data corresponding to images of pages of a document; an extracting unit that analyzes the page image data input by the inputting unit, specifies the content of each item contained in the document corresponding to that page image data, and extracts item data, the item data being character strings expressing that content; a generating unit that links the item data extracted by the extracting unit and generates name data, the name data being a character string expressing a name to be attached to the document; and a writing unit that associates the name data generated by the generating unit with the page image data input by the inputting unit and writes the name data and the page image data to a memory.

    摘要翻译: 本发明提供了一种文件处理装置,包括:输入单元,其输入与文档的页面的图像相对应的页面图像数据; 分析由输入单元输入的页面图像数据的提取单元,指定与该页面图像数据相对应的文档中包含的每个项目的内容,并提取项目数据,该项目数据是表示该内容的字符串; 链接由提取单元提取的项目数据并生成名称数据的生成单元,所述名称数据是表示要附加到文档的名称的字符串; 以及写入单元,其将由生成单元生成的名称数据与由输入单元输入的页面图像数据相关联,并将名称数据和页面图像数据写入存储器。

    Associate document retrieving apparatus and storage medium for storing
associate document retrieving program
    42.
    发明授权
    Associate document retrieving apparatus and storage medium for storing associate document retrieving program 失效
    相关文件检索装置和用于存储协理文件检索程序的存储介质

    公开(公告)号:US6076086A

    公开(公告)日:2000-06-13

    申请号:US41620

    申请日:1998-03-13

    IPC分类号: G06F17/30

    摘要: The present invention provides an associate document retrieving apparatus capable of associate document retrieval which reflects the relation among keywords connected by logical operators in a retrieval expression. In the apparatus, a document information storing element associates each of the documents with a keyword extracted from the document and stores the associated documents. A retrieval expression obtaining element receives a retrieval expression containing retrieval keywords that may be connected by logical operators. A number of documents calculating element specifies objective keywords from within the extracted keywords stored in the document information storing element and calculates several numbers of different kinds of documents. A degree of similarity determining element determines the degree of similarity between the retrieval expression received by the retrieval expression obtaining element and each of the objective keywords in accordance with a relationship between several numbers of documents calculated by the number of documents calculating element. A degree of association determining element obtains associate document information of a document containing any of the objective keywords and determines the degree of association between the retrieval expression and each of the documents based on the degree of similarity for each of the objective keywords and the associate document information.

    摘要翻译: 本发明提供一种能够将反映由检索表达式中的逻辑运算符连接的关键词之间的关系的文档检索的关联文档检索装置。 在该装置中,文档信息存储单元将每个文档与从文档中提取的关键字相关联并存储相关文档。 检索表达式获取元件接收包含可由逻辑运算符连接的检索关键字的检索表达式。 多个文档计算单元从存储在文档信息存储单元中的提取的关键字中指定客观关键词,并计算多个不同种类的文档。 相似度确定元素的程度根据由文件数计算单元计算出的多个文档之间的关系,确定由检索表达式获取元素接收的检索表达式与每个客观关键词之间的相似度。 关联度确定元素获得包含任何目标关键词的文档的关联文档信息,并且基于每个客观关键词和相关文档的相似度来确定检索表达式和每个文档之间的关联程度 信息。