OCR-based image compression
    1.
    发明授权
    OCR-based image compression 有权
    基于OCR的图像压缩

    公开(公告)号:US06487311B1

    公开(公告)日:2002-11-26

    申请号:US09304861

    申请日:1999-05-04

    IPC分类号: G06K968

    CPC分类号: H04N1/4115

    摘要: A method for compressing a digitized image of a document using optical character recognition (OCR). The method includes performing optical character recognition (OCR) on the digitized image, identifying, based, at least in part, on a result of the performing step, a plurality of classes of characters comprised in the image, each the class of characters having an associated character value and comprising at least one character, pruning each class of characters, thereby producing information describing the plurality of classes of characters and a residual image, and utilizing the information describing the plurality of classes of characters and the residual image as a compressed digitized image in further processing. Related methods and apparatus are also disclosed.

    摘要翻译: 一种使用光学字符识别(OCR)压缩文档的数字化图像的方法。 所述方法包括对所述数字化图像执行光学字符识别(OCR),至少部分地基于所述执行步骤的结果识别所述图像中包含的多个字符类别,每个所述字符类具有 相关联的字符值并且包括至少一个字符,修剪每个类别的字符,从而产生描述多个字符类别和残留图像的信息,并且利用描述多个类别的字符的信息和残差图像作为压缩数字化 图像进一步处理。还公开了相关方法和装置。

    Fast location of address blocks on gray-scale images
    3.
    发明授权
    Fast location of address blocks on gray-scale images 失效
    地图块在灰度图像上的快速位置

    公开(公告)号:US06343139B1

    公开(公告)日:2002-01-29

    申请号:US09268137

    申请日:1999-03-12

    IPC分类号: G06K900

    摘要: A method for locating a structured field in a gray-scale image of an object, including choosing a plurality of anchor points in the image, each anchor point having a gray-scale value associated therewith. For each anchor point there is determined a horizontal variation dependent on a difference between the gray-scale value of the anchor point and the gray-scale value of a horizontally neighboring anchor point, and there is also determined a vertical variation dependent on a difference between the gray-scale value of the anchor point and the gray-scale value of a vertically neighboring anchor point. Those anchor points whose vertical and horizontal variations obey a first or a second predefined condition are defined as vertically or horizontally dominant respectively. One or more kernels are defined in the image, each such kernel comprising a group of anchor points n predetermined mutual proximity and satisfying a third predefined condition relating the number of vertically-dominant and horizontally-dominant anchor points in the group. The structured field in the image is located using one or more kernels.

    摘要翻译: 一种用于在物体的灰度级图像中定位结构化场的方法,包括选择图像中的多个锚点,每个锚点具有与其相关联的灰度值。 对于每个锚点,确定取决于锚点的灰度值与水平相邻锚点的灰度值之间的差异的水平变化,并且还确定垂直变化取决于 锚点的灰度值和垂直相邻锚点的灰度值。 其垂直和水平变化遵守第一或第二预定条件的锚定点分别被定义为垂直或水平方位。 在图像中定义一个或多个内核,每个这样的内核包括一组预定相互接近的锚定点,并且满足与组中垂直显性和水平占优势的锚点的数量相关联的第三预定义条件。 图像中的结构化字段使用一个或多个内核来定位。

    Automatic template and field definition in form processing
    4.
    发明授权
    Automatic template and field definition in form processing 有权
    自动模板和字段定义在表单处理中

    公开(公告)号:US06886136B1

    公开(公告)日:2005-04-26

    申请号:US09566058

    申请日:2000-05-05

    摘要: A method for processing a plurality of input images containing variable content that is filled into respective, fixed templates. The method includes comparing the images to collect a group of the images having a high degree of similarity therebetween, and combining the images in the group so as to distinguish the variable content from a fixed portion common to a preponderant number of the images in the group. The fixed portion is processed to reconstruct the fixed template that is common to at least some of the images among the preponderant number, and information is extracted from the images using the reconstructed template.

    摘要翻译: 一种用于处理包含可变内容的多个输入图像的方法,所述多个输入图像被填充到相应的固定模板中。 该方法包括比较图像以收集其间具有高度相似度的图像组,并且组合组中的图像,以便将变量内容与组中优先数量的公共数量相同的固定部分进行区分 。 对固定部分进行处理以重建优先权数中至少一些图像所共有的固定模板,并且使用重建的模板从图像中提取信息。

    Coding system for high data volume
    5.
    发明授权
    Coding system for high data volume 有权
    高数据量编码系统

    公开(公告)号:US06662168B1

    公开(公告)日:2003-12-09

    申请号:US09575014

    申请日:2000-05-19

    IPC分类号: G06F1518

    CPC分类号: G06N99/005

    摘要: A method for automated coding of a text phrase relative to a catalog of codes. The method includes finding a plurality of the codes that are candidates for coding of the phrase and identifying a category to which one or more of the candidate codes belong. The phrase is conveyed together with the one or more candidate codes in the identified category to a human operator specialized in the identified category, for verification by the operator of one of the candidate codes in the category for assignment to the phrase.

    摘要翻译: 相对于代码目录自动编码文本短语的方法。 该方法包括找到作为短语的编码候选的多个代码,并且识别一个或多个候选代码所属的类别。 该短语与所识别的类别中的一个或多个候选代码一起传送给专门用于识别的类别的人类操作者,以由操作者验证用于分配给该短语的类别中的候选代码之一。