Methods and apparatus for selecting semantically significant images in a
document image without decoding image content
    2.
    发明授权
    Methods and apparatus for selecting semantically significant images in a document image without decoding image content 失效
    在文件图像中选择语义有意义的图像而不对图像内容进行解码的方法和装置

    公开(公告)号:US5390259A

    公开(公告)日:1995-02-14

    申请号:US794191

    申请日:1991-11-19

    摘要: A method and apparatus for processing a document image, using a programmed general or special purpose computer, includes forming the image into image units, and at least one image unit classifier of at least one of the image units is determined, without decoding the content of the at least one of the image units. The classifier of the at least one of the image units is then compared with a classifier of another image unit. The classifier may be image unit length, width, location in the document, font, typeface, cross-section, the number of ascenders, the number of descenders, the average pixel density, the length of the top line contour, the length of the base contour, the location of image units with respect to neighboring image units, vertical position, horizontal inter-image unit spacing, and so forth. The classifier comparison can be a comparison with classifiers of image units of words in a reference table, or with classifiers of other image units in the document. Equivalent classes of image units can be generated, from which word frequency and significance can be determined. The image units can be determined by creating bounding boxes about identifiable segments or extractable units of the image, and can contain a word, a phrase, a letter, a number, a character, a glyph or the like.

    摘要翻译: 一种用于使用编程的通用或专用计算机处理文档图像的方法和装置,包括将图像形成为图像单元,并且确定至少一个图像单元的至少一个图像单元分类器,而不对 该至少一个图像单元。 然后将至少一个图像单元的分类器与另一图像单元的分类器进行比较。 分类器可以是图像单元长度,宽度,文档中的位置,字体,字体,横截面,上升数,下降数,平均像素密度,顶线轮廓的长度, 基本轮廓,图像单元相对于相邻图像单元的位置,垂直位置,水平图像间距等。 分类器比较可以是与参考表中的单词的图像单位的分类器或文档中的其他图像单元的分类器的比较。 可以生成等效的图像单位类别,从中可以确定字频率和重要性。 可以通过创建关于图像的可标识段或可提取单元的边界框来确定图像单元,并且可以包含单词,短语,字母,数字,字符,字形等。

    Detecting function words without converting a scanned document to
character codes
    4.
    发明授权
    Detecting function words without converting a scanned document to character codes 失效
    检测功能字,而不将扫描的文档转换为字符代码

    公开(公告)号:US5455871A

    公开(公告)日:1995-10-03

    申请号:US242990

    申请日:1994-05-16

    IPC分类号: G06K9/46 G06K9/00 G06K9/34

    CPC分类号: G06K9/00

    摘要: A method and apparatus detects function words in a first image of a scanned document without first converting the image to character codes. Function words include determiners, prepositions, articles, and other words that play a largely grammatical role, as opposed to words such as nouns and verbs that convey topic information. Non-content based morphological characteristics of image units are predetermined as well as the presence or omission of character ascenders and descenders in image units. Predetermined characteristics of function word image units are compared with the image units of an image and when a match occurs, the image unit is identified as a function word. Conversely when no matching characteristics occur, the image unit is identified as a non-function word. Additionally, image units are classified and identified as containing only upper case characters, only lower case characters, only digits, and mixed character types.

    摘要翻译: 方法和装置检测扫描文件的第一图像中的功能词,而无需首先将图像转换成字符代码。 功能词包括决定者,介词,文章和其他发挥主要语法作用的单词,而不是传达主题信息的名词和动词。 图像单位的基于非内容的形态特征是预先确定的,以及图像单元中角色上升器和下降器的存在或不存在。 将功能字图像单元的预定特征与图像的图像单位进行比较,并且当匹配发生时,图像单元被识别为功能字。 相反,当没有匹配特征出现时,图像单元被识别为非功能字。 此外,图像单位被分类并标识为仅包含大写字母,仅包含小写字母,仅数字和混合字符类型。

    Segmentation of text styles
    5.
    发明授权

    公开(公告)号:US5570435A

    公开(公告)日:1996-10-29

    申请号:US365251

    申请日:1994-12-28

    IPC分类号: G06K9/20 G06K9/68 G06K9/36

    CPC分类号: G06K9/00456 G06K9/6835

    摘要: A method and apparatus for differentiating and extracting handwritten annotations and machine printed text in an image. The method provides for the use of morphological operations, preferably at reduced scale, to eliminate for example, the handwritten annotations from an image. A separation mask is produced that, for example, converts all the image pixels corresponding to machine printed text, and none of the image pixels corresponding to handwritten or handprinted annotations. The separation mask is used in conjunction with the original image to produce separate handwritten annotations and machine printed text images. The invention also provides a method and apparatus for identifying the location of specialized type styles such as bold and italic is disclosed. The method erodes a binary image utilizing structuring elements which provide a relatively large number of hits in regions containing the specialized type styles. The destination image resulting from the erosion is coalesced so as to form masks which may be used to extract portions of the original image containing the specialized type styles.

    Segmentation of text styles
    6.
    发明授权
    Segmentation of text styles 失效
    细分文本样式

    公开(公告)号:US5402504A

    公开(公告)日:1995-03-28

    申请号:US750156

    申请日:1991-08-28

    CPC分类号: G06K9/00456 G06K9/6835

    摘要: A method and apparatus for differentiating and extracting handwritten annotations and machine printed text in an image. The method provides for the use of morphological operations, preferably at reduced scale, to eliminate for example, the handwritten annotations from an image. A separation mask is produced that, for example, converts all the image pixels corresponding to machine printed text, and none of the image pixels corresponding to handwritten or handprinted annotations. The separation mask is used in conjunction with the original image to produce separate handwritten annotations and machine printed text images. The invention also provides a method and apparatus for identifying the location of specialized type styles such as bold and italic is disclosed. The method erodes a binary image utilizing structuring elements which provide a relatively large number of hits in regions containing the specialized type styles. The destination image resulting from the erosion is coalesced so as to form masks which may be used to extract portions of the original image containing the specialized type styles.

    摘要翻译: 一种用于在图像中区分和提取手写注释和机器印刷文本的方法和装置。 该方法提供了使用形态学操作,优选地以较小的比例来消除例如来自图像的手写注释。 产生分离掩模,其例如转换对应于机器打印文本的所有图像像素,并且不对应于手写或手印注释的图像像素。 分离掩模与原始图像结合使用,以产生单独的手写注释和机器打印的文本图像。 本发明还提供了一种用于识别专门类型样式的位置的方法和装置,例如粗体和斜体。 该方法使用在包含专门类型样式的区域中提供相对大量命中的结构元素来侵蚀二进制图像。 由侵蚀产生的目的地图像被合并以形成可用于提取包含专门类型样式的原始图像的部分的掩模。

    Dynamic programming operation with skip mode for text line image decoding
    7.
    发明授权
    Dynamic programming operation with skip mode for text line image decoding 有权
    用于文本行图像解码的跳过模式的动态编程操作

    公开(公告)号:US06594393B1

    公开(公告)日:2003-07-15

    申请号:US09569531

    申请日:2000-05-12

    IPC分类号: G06K968

    CPC分类号: G06K9/6297 Y10S707/99936

    摘要: In a text recognition system, the computational efficiency of a text line image decoding operation is improved by utilizing the characteristic of a graph known as the cut set. The branches of the data structure that represents the image are initially labeled with estimated scores. When estimated scores are used, the decoding operation must perform iteratively on a text line before producing the best path through the data structure. After each iteration, nodes in the best path are re-scored with actual scores. The decoding operation incorporates an operating mode called skip mode. When the number of consecutive image positions for which the change value of cumulative path scores between current and prior iterations is substantially constant and exceeds a threshold, this signals the presence of a cut set, and the score change value is added to a previously computed path score until a re-scored node is encountered, thereby eliminating the expensive computation of new cumulative path scores at those image positions.

    摘要翻译: 在文本识别系统中,通过利用称为切割集的图形的特征​​来提高文本行图像解码操作的计算效率。 表示图像的数据结构的分支最初用估计分数标记。 当使用估计分数时,在通过数据结构生成最佳路径之前,解码操作必须在文本行上迭代执行。 每次迭代后,最佳路径中的节点用实际分数重新计分。 解码操作包括称为跳过模式的操作模式。 当当前迭代和以前迭代之间的累积路径得分的变化值基本上恒定并超过阈值的连续图像位置的数量时,这表示切割集合的存在,并将得分改变值添加到先前计算的路径 得分,直到遇到重新计分的节点,从而消除了在这些图像位置处的新累积路径分数的昂贵计算。

    Document copy authentication
    8.
    发明授权
    Document copy authentication 失效
    文件复印认证

    公开(公告)号:US5157726A

    公开(公告)日:1992-10-20

    申请号:US810644

    申请日:1991-12-19

    摘要: A system for authenticating a hard copy of an original document. The system employs a special copying machine at the sender's end together with a special ID card (smart card) or other user identification for activating the special machine, and a special copying machine at the receiving end. At the sender's station, the original document and ID card are inserted into the machine. The latter digitizes the document text, to produce a digital signature which incorporates unique information from the sender's ID card. This machine then produces a hard copy of the document to which is added the digital signature. The sender retains the original, but forwards the copy to the recipient or receiver. The receiver then inserts the received copy into the machine at his location, which digitizes and processes the document text and signature and indicates whether the digital signature is valid. Preferably a dual key authentication system is used, with the digital signature incorporating the sender's secret signing key, and the receiver using the related public key in the validation process.

    Data detection and optical focus error detection system for rotating
optical media
    9.
    发明授权
    Data detection and optical focus error detection system for rotating optical media 失效
    用于旋转光学介质的数据检测和光学聚焦误差检测系统

    公开(公告)号:US4801794A

    公开(公告)日:1989-01-31

    申请号:US45746

    申请日:1987-04-29

    IPC分类号: G11B7/09 G11B11/105 G01J1/20

    摘要: A magneto-optic optical disc system which uses the magneto-optic differential data detection channel, with addition only a low pass filter, to also detect focus error. The differential data detection channel includes a pair of photodetectors, the first photodetector being located a predetermined distance within the focal length of the detector lens associated with that photodetector and the second photodetector being located beyond the focal length of the detector lens associated with that photodetector. The output of a differential amplifier receiving the photodetector outputs is the data signal and the output of a low pass filter connected to the output of the differential amplifier is the focus error signal. The dual functionality of the differential data detection channel eliminates a separate optical focus channel, and relative to separate astigmatic focus and data detection channels elmininates a quadrature detector, several optical elements, several electrical elements, and the space they occupy.

    摘要翻译: 磁光盘系统使用磁光差分数据检测通道,只加一个低通滤波器,也可以检测聚焦误差。 差分数据检测通道包括一对光电检测器,第一光电检测器位于与该光电检测器相关联的检测器透镜的焦距内的预定距离处,并且第二光电检测器位于与该光电检测器相关联的检测器透镜的焦距之外。 接收光电检测器输出的差分放大器的输出是数据信号,并且连接到差分放大器的输出的低通滤波器的输出是聚焦误差信号。 差分数据检测通道的双重功能消除了单独的光学聚焦通道,并且相对于单独的散光焦点和数据检测通道消除了正交检测器,几个光学元件,几个电气元件以及它们所占据的空间。

    Methods for generating anti-aliased text and line graphics in compressed document images
    10.
    发明授权
    Methods for generating anti-aliased text and line graphics in compressed document images 有权
    在压缩文档图像中生成反锯齿文本和线图形的方法

    公开(公告)号:US07266250B2

    公开(公告)日:2007-09-04

    申请号:US11354044

    申请日:2006-02-15

    IPC分类号: G06K9/40

    CPC分类号: H04N1/46 H04N1/41

    摘要: A method and system for storing and generating anti-aliased text and lineart data from compressed document images files, using a MRC model that represents the image as an ordered set of mask/image pairs at resolutions appropriate to the content of each layer. The method and system provide the ability to generate for anti-aliased text data to improve appearance at both high and low resolution, and to avoid baseline jitter of compressed tokens.

    摘要翻译: 一种用于从压缩文档图像文件存储和生成抗锯齿文本和线条数据的方法和系统,其使用将所述图像表示为适合于每层内容的分辨率的掩模/图像对的有序集合的MRC模型。 该方法和系统提供生成抗锯齿文本数据以改善高分辨率和低分辨率外观的能力,并避免压缩令牌的基线抖动。