-
公开(公告)号:US20140085323A1
公开(公告)日:2014-03-27
申请号:US14060942
申请日:2013-10-23
发明人: Lever Wang , Glenn Ricart , Cynthia Ann Thompson , Keith Wishon , Sheldon Laube
IPC分类号: G06K9/00
CPC分类号: G06K9/00442
摘要: A document processing system for accurately and efficiently analyzing documents and methods for making and using same. Each incoming document includes at least one section of textual content and is provided in an electronic form or as a paper-based document that is converted into an electronic form. Since many categories of documents, such as legal and accounting documents, often include one or more common text sections with similar textual content, the document processing system compares the documents to identify and classify the common text sections. The document comparison can be further enhanced by dividing the document into document segments and comparing the document segments; whereas, the conversion of paper-based documents likewise can be improved by comparing the resultant electronic document with a library of standard phrases, sentences, and paragraphs. The document processing system thereby enables an image of the document to be manipulated, as desired, to facilitate its review.
摘要翻译: 一种文件处理系统,用于准确高效地分析文件和制作和使用方法。 每个传入的文档包括文本内容的至少一部分,并以电子形式或转换为电子表格的纸质文档提供。 由于许多类型的文件(如法律和会计凭证)通常包括一个或多个具有相似文本内容的常见文本段落,所以文档处理系统比较文档以识别和分类普通文本段。 通过将文档分割成文档段并比较文档段,可以进一步增强文档比较; 而通过将得到的电子文件与标准短语,句子和段落的图书馆进行比较,可以改进纸质文件的转换。 因此,文档处理系统使得能够根据需要操纵文档的图像,以便于其审查。