Data sheet identification device
    4.
    Granted invention patent
    Status: Lapsed

    Publication No.: US06778712B1

    Publication date: 2004-08-17

    Application No.: US09650762

    Filing date: 2000-08-29

    IPC classification: G06K9/54

    CPC classification: G06K9/2054 G06K2209/01

    Abstract: A data sheet identification device of the invention includes: a character/graphics extracting section, an identical shape deciding section, a graphics collating section, an identification code/data sheet ID identifying section for collating characters that have been decided to have the same shape with an identification code/data sheet ID database in which a plurality of characters showing features of a plurality of data sheets respectively have been registered, and an identifying section for uniquely identifying the data sheet based on a result of the collation by the graphics collating section and a result of the collation by the identification code/data sheet ID identifying section.

    Image collating apparatus and image collating method
    6.
    Granted invention patent
    Status: In force

    Publication No.: US06694065B2

    Publication date: 2004-02-17

    Application No.: US09733077

    Filing date: 2000-12-11

    IPC classification: G06K9/60

    CPC classification: G06K9/2054 G06K2209/01

    Abstract: An image collating apparatus and an image collating method are disclosed in which a document larger than the image reader can be collated against a preregistered image even when the document is read by that image reader. The image of the document read by the image reader is collated sequentially with document images of different sizes preregistered in a database. The document size is determined from the document image input from the image reader. When the document size is smaller than the preregistered image size, the preregistered image is retrieved in accordance with the document size, and document features are extracted and matched between the read document image and the registered image once the two are set to the same size, thereby collating the read image with the preregistered image.

    Format recognition method, apparatus and storage medium
    7.
    Granted invention patent
    Status: In force

    Publication No.: US06567545B1

    Publication date: 2003-05-20

    Application No.: US09421481

    Filing date: 1999-10-20

    IPC classification: G06K9/34

    CPC classification: G06K9/00449

    Abstract: Disclosed are a format recognition method, an apparatus, and a storage medium for automatically recognizing the format of a form, whereby the format is determined automatically by examining the arrangement of the smallest rectangles. According to the present invention, the smallest rectangles are extracted from a form, and the positional relationship of these rectangles is obtained. The attribute of each smallest rectangle is determined from the positional relationship. In accordance with the attribute, the smallest rectangles are sorted into a headline portion and a data portion, and a character string in the data portion is recognized.

    Method of recognizing characters
    8.
    Granted invention patent
    Status: Lapsed

    Publication No.: US06549662B1

    Publication date: 2003-04-15

    Application No.: US09084356

    Filing date: 1998-05-27

    IPC classification: G06K9/34

    CPC classification: G06K9/00469

    Abstract: Characters of data on a document are recognized by automatically determining the definitions of characters of the data from the arrangement of character strings of the data. Character strings on the document are extracted by reading the document, and headers and data on the document are distinguished from each other by determining the positional relationship between the character strings. Character attributes of the data are determined by recognizing characters of the character strings of the headers using a header recognition dictionary. Characters of the character strings of the data are recognized according to the determined character attributes of the data. Because the headers and the data are first distinguished from each other by the layout of the document, and the character attributes of the data are then determined from the recognized characters of the headers, the character attributes of the data can be entered automatically.

    Method of and apparatus for extracting dotted line, and storage medium thereof
    9.
    Granted invention patent
    Status: Lapsed

    Publication No.: US06556701B1

    Publication date: 2003-04-29

    Application No.: US09421283

    Filing date: 1999-10-20

    IPC classification: G06K9/00

    Abstract: Disclosed are a method of and an apparatus for extracting a dotted line from a binary image of a document, and a storage medium thereof. Isolated points are extracted from the binary image. The isolated points forming a candidate for the dotted line are extracted based on a positional relationship between the extracted isolated points. The validity of the isolated points forming the candidate is checked. The dotted line is then extracted from a positional relationship between groups of the extracted isolated points of the candidate. The dotted line can thereby be extracted precisely even if some isolated points are lost due to under-density of the image, etc.
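
    The four steps named in this abstract can be sketched roughly as follows. This is a minimal illustration and not the patented method: it assumes horizontal dotted lines only, treats any connected component of at most two pixels as an isolated point, and uses hypothetical function names and thresholds (max_size, max_gap, min_points, bridge_gap) that do not come from the patent.

```python
from collections import deque


def isolated_points(image, max_size=2):
    """Step 1: centroids (row, col) of 8-connected components with <= max_size pixels."""
    h, w = len(image), len(image[0])
    seen = [[False] * w for _ in range(h)]
    points = []
    for r in range(h):
        for c in range(w):
            if image[r][c] != 1 or seen[r][c]:
                continue
            seen[r][c] = True
            queue, pixels = deque([(r, c)]), []
            while queue:  # breadth-first flood fill of one component
                y, x = queue.popleft()
                pixels.append((y, x))
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            if len(pixels) <= max_size:  # larger blobs (solid lines, text) are dropped
                cy = round(sum(p[0] for p in pixels) / len(pixels))
                cx = round(sum(p[1] for p in pixels) / len(pixels))
                points.append((cy, cx))
    return points


def candidate_groups(points, max_gap=3):
    """Step 2: group points on the same row whose horizontal gap is <= max_gap."""
    rows = {}
    for y, x in points:
        rows.setdefault(y, []).append(x)
    groups = []
    for y, xs in rows.items():
        xs.sort()
        current = [xs[0]]
        for x in xs[1:]:
            if x - current[-1] <= max_gap:
                current.append(x)
            else:
                groups.append((y, current))
                current = [x]
        groups.append((y, current))
    return groups


def is_valid(xs, min_points=3):
    """Step 3: a candidate needs enough points and a roughly constant pitch."""
    if len(xs) < min_points:
        return False
    gaps = [b - a for a, b in zip(xs, xs[1:])]
    return max(gaps) - min(gaps) <= 1


def extract_dotted_lines(image, max_gap=3, bridge_gap=8):
    """Step 4: join valid groups on one row across gaps left by lost points."""
    valid = [g for g in candidate_groups(isolated_points(image), max_gap)
             if is_valid(g[1])]
    valid.sort(key=lambda g: (g[0], g[1][0]))
    lines = []  # each entry is (row, first_col, last_col)
    for y, xs in valid:
        if lines and lines[-1][0] == y and xs[0] - lines[-1][2] <= bridge_gap:
            lines[-1] = (y, lines[-1][1], xs[-1])
        else:
            lines.append((y, xs[0], xs[-1]))
    return lines


rows = [
    "000000000000000000",
    "101010100000101010",  # dotted line with two dots lost in the middle
    "000000000000000000",
    "111111111111111111",  # solid line: one large component, ignored
    "000000000000000000",
    "000010000000000000",  # lone noise pixel: fails the validity check
]
image = [[int(ch) for ch in row] for row in rows]
print(extract_dotted_lines(image))  # [(1, 0, 16)]
```

    In this toy image the two dot groups on row 1 are each valid on their own, and the final step joins them into a single line even though the dots between columns 6 and 12 are missing, which mirrors the abstract's claim about tolerating lost points.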

    Document identifying device and method
    10.
    Granted invention patent
    Status: In force

    Publication No.: US07110600B1

    Publication date: 2006-09-19

    Application No.: US10049635

    Filing date: 1999-09-30

    IPC classification: G06K9/34 G06K9/00

    CPC classification: G06K9/2054 G06K2209/01

    Abstract: The invention relates to a document discriminating apparatus and a discriminating method for use in processing documents at financial institutions. A characteristic portion inherent in an optional format is cut out from image data read from a document in the optional format. Color constituents of the cut out image data are analyzed, and a color constituent exhibiting characteristics is selected from the constituents, and a color separation parameter is set for the selected color constituent. Data information is prepared which is related to the image data cut out based upon the color separation parameter. On the other hand, data information is prepared from image data obtained by reading a document to be discriminated based on the color separation parameter. Then, the data information is compared for determination with the data information stored in the document discriminating dictionary unit, whereby the document is discriminated.
