Document storage and retrieval system for storing and retrieving
document image and full text data
    1.
    发明授权
    Document storage and retrieval system for storing and retrieving document image and full text data 失效
    用于存储和检索文档图像和全文数据的文档存储和检索系统

    公开(公告)号:US5628003A

    公开(公告)日:1997-05-06

    申请号:US111511

    申请日:1993-08-24

    IPC分类号: G06F17/30

    摘要: A document storage and retrieval system is provided with means for storing a document body in the form of image, means for storing text information in the form of a character code string for retrieval, means for executing a retrieval with reference to the text information, and means for displaying a document image relating thereto on a retrieval terminal according to the retrieval result. Such a form of the system is available for retrieving the full contents of a document and also for displaying the document body printed in a format easy to read straight in the form of image. Accordingly, users are capable of retrieving documents with arbitrary words and also capable of reading even such a document as is complicated to include mathematical expressions and charts through a terminal in the form of image, the same as on paper. Further, the invention provides a system wherein the text information for retrieval is extracted automatically from the document image through character recognition. Since a precision of the character recognition has not been satisfactory hitherto, a visual retrieval and correction have been carried out without fail by operators. However, there is no necessity for the operators to attend therefor according to the invention. Thus, the text information for retrieval can be generated at the cost of practical time and money even in case of volumes of documents.

    摘要翻译: 文件存储和检索系统提供有用于以图像形式存储文件主体的装置,用于存储用于检索的字符代码串形式的文本信息的装置,参考文本信息执行检索的装置,以及 用于根据检索结果在检索终端上显示与其相关的文档图像的装置。 系统的这种形式可用于检索文档的全部内容,并且还用于以图像的形式直接显示以易于阅读的格式打印的文档主体。 因此,用户能够以任意的单词检索文档,并且还能够读取甚至这样一个文件的复杂的文档,包括通过图像形式的终端的数学表达和图表,与纸上相同。 此外,本发明提供一种系统,其中通过字符识别从文档图像中自动提取用于检索的文本信息。 由于字符识别的精确度迄今尚未令人满意,因此操作者已经进行了视觉检索和校正。 然而,根据本发明,操作者不需要参加。 因此,即使在文件量的情况下,也可以以实际的时间和金钱为代价来生成用于检索的文本信息。

    Document retrieval system for displaying document image data with
inputted bibliographic items and character string selected from
multiple character candidates
    2.
    发明授权
    Document retrieval system for displaying document image data with inputted bibliographic items and character string selected from multiple character candidates 失效
    用于显示具有输入的书目项目的文档图像数据和从多个字符候选中选择的字符串的文档检索系统

    公开(公告)号:US5265242A

    公开(公告)日:1993-11-23

    申请号:US139781

    申请日:1987-12-30

    摘要: A document storage and retrieval system for storing a document body in the form of image, means for storing text information in the form of a character code string for retrieval, apparatus for executing a retrieval with reference to the text information, and apparatus for displaying a document image relating thereto on a retrieval terminal according to the retrieval result. Such a form of the system is available for retrieving the full contents of a document and also for displaying the document body printed in a format easy to read straight in the form of image. Users are capable of retrieving documents with arbitrary words and also capable of reading even such a document as is complicated to include mathematical expressions and charts through a terminal in the form of image, the same as on paper. A system is provided wherein the text information for retrieval is extracted automatically from the document image through character recognition. Since a precision of the character recognition has not been satisfactory hitherto, a visual retrieval and correction have been carried out without fail by operators. However, there is no necessity for the operators to attend therefor.

    摘要翻译: 用于以图像的形式存储文件主体的文件存储和检索系统,用于存储用于检索的字符代码串的形式的文本信息的装置,参考文本信息执行检索的装置,以及用于显示 根据检索结果在与检索终端相关的文档图像。 系统的这种形式可用于检索文档的全部内容,并且还用于以图像的形式直接显示以易于阅读的格式打印的文档主体。 用户能够以任意单词检索文档,并且能够读取甚至是复杂的文档,包括通过图像形式的终端的数学表达式和图表,与纸上相同。 提供了一种系统,其中通过字符识别从文档图像中自动提取用于检索的文本信息。 由于字符识别的精确度迄今尚未令人满意,因此操作者已经进行了视觉检索和校正。 但是,运营商没有必要参加。

    Document storage and retrieval system
    3.
    发明授权
    Document storage and retrieval system 失效
    文件存储和检索系统

    公开(公告)号:US4985863A

    公开(公告)日:1991-01-15

    申请号:US559994

    申请日:1990-07-30

    IPC分类号: G06K9/00 G06F17/30

    摘要: A document storage and retrieval system stores a document body in the form of an image, storing text information in the form of a character code string for retrieval, and executing a retrieval with reference to the text information, followed by displaying a document image relating thereto on a retrieval terminal according to the retrieval result. Such a form of the system is available for retrieving the full contents of a document and also for displaying the document body printed in a format easy to read straight in the form of an image.

    摘要翻译: 文件存储和检索系统以图像的形式存储文档主体,存储用于检索的字符代码串的形式的文本信息,并参照文本信息执行检索,然后显示与其相关的文档图像 根据检索结果在检索终端上。 这种系统的形式可用于检索文档的全部内容,并且还用于以图像的形式直接显示以易于阅读的格式打印的文档主体。

    Method and device for detecting the similarity between standard and
unknown patterns
    4.
    发明授权
    Method and device for detecting the similarity between standard and unknown patterns 失效
    用于检测标准和未知模式之间的相似性的方法和装置

    公开(公告)号:US4153897A

    公开(公告)日:1979-05-08

    申请号:US815825

    申请日:1977-07-14

    CPC分类号: G06K9/6203

    摘要: In a pattern recognition device for recognizing an unknown pattern in accordance with the magnitude of the similarities between the unknown pattern and a plurality of standard patterns, the similarity between the unknown pattern and one of the standard patterns is detected as follows.Similarities are detected at first in respective shifting conditions where the unknown and standard patterns are relatively shifted from each other over the first limited extent, including the condition without the shift. The maximum value of these similarities is then detected. The similarities are further detected in respective shifting conditions where the unknown and standard patterns are relatively shifted from each other over the second extend larger than the first limited extent, when the shifting condition which gave the maximum value is that without relative shift.

    摘要翻译: 在根据未知图案和多个标准图案之间的相似度的大小来识别未知图案的图案识别装置中,如下检测未知图案与其中一个标准图案之间的相似度。

    Image understanding system
    5.
    发明授权
    Image understanding system 失效
    图像理解系统

    公开(公告)号:US4907285A

    公开(公告)日:1990-03-06

    申请号:US253445

    申请日:1988-10-05

    IPC分类号: G06K9/20 G06K9/68

    CPC分类号: G06K9/6885 G06K9/00469

    摘要: An image understanding system of this invention uses a grammer describing a document image, and represents the structure of an unknown input image by parsing a statement (the structure of the grammar) written in accordance with this grammer. In other words, the grammer describes an image as substructures and the relative relation between them, and when the substructures and their relative relation are identified in parsing, search is then made whether or not the substructures and their relative relation exist in an unknown input image. The structure of the unknown input image is represented on the basis of the result of this search.

    摘要翻译: 本发明的图像理解系统使用描述文档图像的语法,并且通过解析根据该语法书写的语句(语法的结构)来表示未知输入图像的结构。 换句话说,语法描述了一个图像作为子结构和它们之间的相对关系,并且当在解析中识别子结构及其相关关系时,就进行子结构及其相关关系是否存在于未知输入图像中 。 基于该搜索的结果来表示未知输入图像的结构。

    Document image entry system
    6.
    发明授权
    Document image entry system 失效
    文件图像输入系统

    公开(公告)号:US4893188A

    公开(公告)日:1990-01-09

    申请号:US208116

    申请日:1988-06-17

    IPC分类号: G06T7/00 H04N1/40

    CPC分类号: H04N1/40062

    摘要: For halftone digital image data, an edge portion of the image is extracted for each pixel, and based on a density of each pixel or an average density of each group of a plurality of adjacent or neighboring pixels, there are extracted pixels of a background other than the edge portion of the image. Thereafter, the image data is subdivided into a plurality of blocks each including a predetermined number of pixels. Based on distribution states of the pixels of the block judged to belong to the edge and those of the block judged to belong to the background, a domain recognition is conducted to determine whether the block is a binary block or a halftone block. In addition, for each block, a state of areas in blocks encircling the block is examined such that depending on the state, an expansion/contraction processing is repeatedly achieved a predetermined times for the halftone domain, thereby separating the image data into a binary domain and a halftone domain in a real-time fashion.

    摘要翻译: 对于半色调数字图像数据,为每个像素提取图像的边缘部分,并且基于每个像素的密度或多个相邻或相邻像素中的每组的平均密度,提取背景其他的像素 比图像的边缘部分。 此后,图像数据被细分为多个块,每个块包括预定数量的像素。 基于被判定为属于边缘的块的像素的分布状态和被判定为属于背景的块的像素的分布状态,进行域识别以确定块是二进制块还是半色调块。 此外,对于每个块,检查环绕该块的块的区域的状态,使得根据该状态,针对半色调域重复实现预定次数的扩展/缩小处理,从而将图像数据分离成二进制域 和实时的半色调域。

    Knowledge based information retrieval system
    9.
    发明授权
    Knowledge based information retrieval system 失效
    基于知识的信息检索系统

    公开(公告)号:US5404506A

    公开(公告)日:1995-04-04

    申请号:US831093

    申请日:1992-02-10

    摘要: An information retrieval system with good human-interface methods to give the system ease-of-use having two distinctive features with the first being visual interface and the second being natural language interpretation. The visual interface provides for visual interaction for local search and natural language interpretation provides for linguistic interaction for global search. The visual interface provides versatile views onto the contents of the knowledge base that the system has, controlling mechanisms for browsing through the knowledge base, a capability of showing relevant information for the users, and a mechanism for editing a query expression that describes information to retrieve. By using the visual interface for information retrieval, the users can easily create query expressions, by consulting and reacting with the system. The natural language interpretation makes use of a conceptual network as a knowledge-base that stores important concepts and relationships among these concepts. Based on knowledge and information represented in the conceptual network, the meaning of a noun phrase or a nominal compound which is a string of adjectives and nouns with some prepositions can be inferred. The inferred interpretation of such a noun phrase is paraphrased into an expression that the information retrieval system can handle. Therefore, the user of the system can simply describe the desired information in a language to get the desired information.

    摘要翻译: 一种具有良好人机界面方法的信息检索系统,使系统易用性具有两个独特的特征,第一个是视觉界面,第二个是自然语言解释。 视觉界面提供用于本地搜索的视觉交互,并且自然语言解释为全球搜索提供语言交互。 视觉界面提供了系统具有的知识库内容的多种视图,控制浏览知识库的机制,显示用户相关信息的能力,以及编辑描述要检索的信息的查询表达式的机制 。 通过使用可视化界面进行信息检索,用户可以通过与系统进行协商和反应,轻松创建查询表达式。 自然语言解释使用概念网络作为存储这些概念之间的重要概念和关系的知识库。 基于概念网络中所代表的知识和信息,可以推断名词短语或名词复合词的含义,它是具有一些介词的形容词和名词串。 这种名词短语的推断解释被转换成信息检索系统可以处理的表达式。 因此,系统的用户可以简单地用语言描述期望的信息以获得期望的信息。

    System for character stream search using finite state automaton technique
    10.
    发明授权
    System for character stream search using finite state automaton technique 失效
    使用有限状态自动机技术的字符流搜索系统

    公开(公告)号:US5051886A

    公开(公告)日:1991-09-24

    申请号:US205923

    申请日:1988-06-13

    IPC分类号: G06F17/21 G06F17/30

    摘要: A character stream search system using an FSA for determining at a time whether or not a plurality of character streams as search objects exist in a search character stream which undergoes a search operation and which comprises a plurality of characters expressed with codes. In the system, a collation is conducted between the search character stream and a search object character. In a case where there exists a matched search object character as a result of the collation, a state transition is carried out of a predetermined state indicated by the FSA. In a case where there does not exist a matched search object character, a failure processing to effect a state transition to a transistion destination which is determined in association with the configuration of the FSA. The following processing is completed at a count which is a predetermined upper-limit value for each character undergone the search operation.

    摘要翻译: 一种使用FSA的字符流搜索系统,用于一次确定在经历搜索操作的搜索字符流中是否存在作为搜索对象的多个字符流,并且包括用代码表示的多个字符。 在系统中,在搜索字符流和搜索对象字符之间进行归类。 在作为对照的结果存在匹配的搜索对象字符的情况下,由FSA指示的预定状态执行状态转换。 在不存在匹配的搜索对象字符的情况下,执行与FSA的配置相关联地确定的转移目的地的状态转换的失败处理。 以对于每个经过搜索操作的字符的预定上限值的计数完成以下处理。