Identification and segmentation of finely textured and solid regions of
binary images
    41.
    发明授权
    Identification and segmentation of finely textured and solid regions of binary images 失效
    二维图像精细纹理和实体区域的识别与分类

    公开(公告)号:US5065437A

    公开(公告)日:1991-11-12

    申请号:US449624

    申请日:1989-12-08

    申请人: Dan S. Bloomberg

    发明人: Dan S. Bloomberg

    CPC分类号: G06K9/00456 H04N1/40062

    摘要: A method of separating finely textured and solid regions in a binary image from other regions such as those containing text and line graphics. The image is subjected to a first set of operations (10) that eliminates OFF pixels that are near ON pixels, which tends to thicken text and lines and solidify textured regions. The image is then subjected to a second set of operations (12) that eliminates ON pixels that are near OFF pixels. This thins out and eliminates the previously thickened text and lines, but leaves the previously solidified textured regions substantially intact.

    Comparing text pages using image features based on word positions
    42.
    发明授权
    Comparing text pages using image features based on word positions 有权
    使用基于字位置的图像特征比较文本页

    公开(公告)号:US08151186B1

    公开(公告)日:2012-04-03

    申请号:US13228352

    申请日:2011-09-08

    IPC分类号: G06F15/177

    摘要: A signature for a page of text is generated. The signature serves as an identifier of the text page. Positions of words in a text page are determined. Positions of multiple second words in the text page are determined relative to the position of a first word in the text page. A signature value is generated that describes the second word positions relative to the first word position. The signature value is stored. Additional signatures for the text page can be generated, each signature describing positions of other words in the text page relative to a word in the text page for which the signature is being generated. The signatures can be used to compare the text page to another text page and generate a measure of similarity that describes the result of the comparison.

    摘要翻译: 生成文本页面的签名。 签名作为文本页面的标识符。 确定文本页面中单词的位置。 相对于文本页面中的第一个单词的位置,确定文本页面中多个第二个单词的位置。 生成描述相对于第一个单词位置的第二个单词位置的签名值。 签名值被存储。 可以生成文本页面的附加签名,每个签名描述文本页面中其他单词相对于正在生成签名的文本页面中的单词的位置。 签名可用于将文本页面与另一个文本页面进行比较,并生成描述比较结果的相似度度量。

    System for embedded digital data that allows embedding of data around known obstructions
    43.
    发明授权
    System for embedded digital data that allows embedding of data around known obstructions 有权
    用于嵌入式数字数据的系统,允许在已知障碍物周围嵌入数据

    公开(公告)号:US06641051B1

    公开(公告)日:2003-11-04

    申请号:US09404755

    申请日:1999-09-24

    IPC分类号: G06K1906

    CPC分类号: B41M5/00

    摘要: A system for printing glyph frames around known obstructions. All frames in an area are determined to be obstructed or unobstructed, based on their location with respect to other printed areas. The unobstructed locations can be numbered and glyph data printed within. In the alternative, the good locations can be numbered modulo some number much smaller that the number of available locations to provide redundancy. The unobstructed locations can be stored in either the sync lines or in the data area of other locations known to be unobstructed. Also, the frame itself can be identified as obstructed or unobstructed to provide more redundancy.

    摘要翻译: 用于在已知障碍物周围打印字形框架的系统。 基于它们相对于其他印刷区域的位置,确定区域中的所有框架被阻挡或不阻塞。 无阻碍的位置可以编号,字形数据打印在其中。 在替代方案中,良好的位置可以被编号成一些数量小得多的可用位置的数量来提供冗余。 无阻塞的位置可以存储在已知无阻碍的其他位置的同步线或数据区中。 此外,框架本身可以被识别为阻塞或不阻塞以提供更多的冗余。

    Self-clocking glyph shape codes
    44.
    发明授权
    Self-clocking glyph shape codes 失效
    自定时字形形状代码

    公开(公告)号:US6076738A

    公开(公告)日:2000-06-20

    申请号:US240798

    申请日:1994-05-10

    摘要: This invention provides self-clocking glyph shape codes for encoding digital data in the shapes of glyphs that are suitable for printing on hardcopy recording media. Advantageously, the glyphs are selected so that they tend not to degrade into each other when they are degraded and/or distorted as a result, for example, of being photocopied, transmitted via facsimile, and/or scanned-in to an electronic document processing system. Moreover, for at least some applications, the glyphs desirably are composed of printed pixel patterns containing nearly the same number of ON pixels and nearly the same number of OFF pixels, such that the code that is rendered by printing such glyphs on substantially uniformly spaced centers appears to have a generally uniform texture. In the case of codes printed at higher spatial densities, this texture is likely to be perceived as a generally uniform gray tone. Binary image processing and convolution filtering techniques for decoding such codes also are diclosed, but this application focuses on the codes.

    摘要翻译: 本发明提供了自适应字形形状代码,用于编码适合于在硬拷贝记录介质上打印的字形的数字数据。 有利地,字形被选择为使得当它们被降级和/或变形时,它们趋向于不会相互劣化,例如,被复印,通过传真和/或扫描传输到电子文档处理 系统。 此外,对于至少一些应用,字形理想地由包含几乎相同数量的ON像素和几乎相同数量的OFF像素的印刷像素图案组成,使得通过在基本均匀间隔的中心上打印这样的字形而呈现的代码 似乎具有大致均匀的质地。 在以更高的空间密度打印的代码的情况下,该纹理可能被认为是大致均匀的灰色调。 用于对这些代码进行解码的二进制图像处理和卷积滤波技术也被分类,但该应用集中在代码上。

    Automatic method of identifying sentence boundaries in a document image
    45.
    发明授权
    Automatic method of identifying sentence boundaries in a document image 失效
    在文档图像中识别句子边界的自动方法

    公开(公告)号:US5892842A

    公开(公告)日:1999-04-06

    申请号:US572597

    申请日:1995-12-14

    申请人: Dan S. Bloomberg

    发明人: Dan S. Bloomberg

    IPC分类号: G06K9/62 G06K9/32 G06K9/34

    CPC分类号: G06K9/32 G06K2209/01

    摘要: A method of automatically identifying sentence boundaries in a document image without performing character recognition to generate an ASCII representation of the document text. The identification process begins by selecting a connected component from the multiplicity of connected components of a text line. Next, it is determined whether the selected connected component might represent a period based upon its shape. If the selected connected component is dot shaped, then it is determined whether the selected connected component might represent a colon. Finally, if the selected connected component is dot shaped and not part of a colon, the selected connected component is labeled as a sentence boundary.

    摘要翻译: 自动识别文档图像中的句子边界而不执行字符识别以生成文档文本的ASCII表示的方法。 识别过程通过从文本行的多个连接的组件中选择连接的组件开始。 接下来,确定所选择的连接分量是否可以基于其形状来表示周期。 如果所选连接的组件是点状,则确定所选择的连接组件是否可能代表冒号。 最后,如果所选择的连接组件是点状的,而不是冒号的一部分,则所选择的连接组件被标记为句子边界。

    Automatic method of generating thematic summaries from a document image
without performing character recognition
    46.
    发明授权
    Automatic method of generating thematic summaries from a document image without performing character recognition 失效
    从文档图像生成专题摘要而不进行字符识别的自动方法

    公开(公告)号:US5848191A

    公开(公告)日:1998-12-08

    申请号:US572848

    申请日:1995-12-14

    摘要: A method of automatically generating a thematic summary from a document image without performing character recognition to generate an ASCII representation of the document text. The method begins with decomposition of the document image into text blocks, and text lines. Using the median x-height of text blocks the main body of text is identified. Afterward, word image equivalence classes and sentence boundaries within the blocks of the main body of text are determined. The word image equivalence classes are used to identify thematic words. These, in turn are used to score the sentences within the main body of text, and the highest scoring sentences are selected for extraction.

    摘要翻译: 一种从文档图像自动生成专题摘要而不执行字符识别以生成文档文本的ASCII表示的方法。 该方法从文档图像分解成文本块和文本行开始。 使用文本的中间x高度块标识文本的主体。 之后,确定文本主体块内的单词图像等价类和句子边界。 单词图像等价类用于标识专题词。 这些反过来用于对文本主体内的句子进行分数,并且选择最高的评分句子用于提取。

    Method and article of manufacture for determining whether a scanned
image is an original image or fax image
    47.
    发明授权
    Method and article of manufacture for determining whether a scanned image is an original image or fax image 失效
    用于确定扫描图像是原始图像还是传真图像的方法和制品

    公开(公告)号:US5828771A

    公开(公告)日:1998-10-27

    申请号:US573617

    申请日:1995-12-15

    申请人: Dan S. Bloomberg

    发明人: Dan S. Bloomberg

    IPC分类号: G06K9/00 G06T7/40 G06T7/60

    CPC分类号: G06K9/036 G06T7/40 G06T7/60

    摘要: An efficient image processing technique automatically analyzes an image scanned at 300 or greater dpi and measures an image characteristic of the input image from which it is possible to determine whether the image has ever been previously scanned or printed at low resolution at some time in its history. The technique is effective in classifying an image that was at one time embodied in paper form and scanned at a vertical resolution of 100 dpi or less, such as a facsimile document scanned in standard mode, or at 200 pixels/inch (referred to as "fine fax mode".) The technique performs measurements on the pixels included in the vertical or horizontal edges of symbols contained in the input image, and produces a distribution of the measurements. A numerical interpretation of the measurement distribution data is used to classify the image. The invention is computationally efficient because it may be applied to only a small percentage (e.g., 7%) of a document image as long as the subimage selected contains symbols such as characters. The invention may be incorporated into a document image management system where identification of documents that contain the artifacts of low resolution document images could be used to improve subsequent processing of the image, such as, for example, in an OCR system.

    摘要翻译: 有效的图像处理技术自动分析以300或更大dpi扫描的图像,并且测量输入图像的图像特征,从该图像特征可以确定在其历史中的某个时间是否先前扫描或以低分辨率打印图像 。 该技术有效地分类了一次体现在纸张形式并以100dpi或更小的垂直分辨率扫描的图像,例如以标准模式扫描的传真文件,或以200像素/英寸(称为“ 精细传真模式“)。该技术对包含在输入图像中的符号的垂直或水平边缘中包括的像素进行测量,并产生测量的分布。 使用测量分布数据的数字解释来对图像进行分类。 本发明在计算上是有效的,因为只要所选择的子图像包含诸如字符的符号,其可以仅应用于文档图像的小百分比(例如7%)。 本发明可以并入到文档图像管理系统中,其中包含低分辨率文档图像的伪影的文档的识别可以用于改进图像的后续处理,例如在OCR系统中。

    Mark sensing on a form
    48.
    发明授权

    公开(公告)号:US5572601A

    公开(公告)日:1996-11-05

    申请号:US326121

    申请日:1994-10-19

    申请人: Dan S. Bloomberg

    发明人: Dan S. Bloomberg

    CPC分类号: G06K9/2054

    摘要: A robust technique for determining whether a field (43, 45, 47a-d) on a form (40'), which has been converted to a binary input image, contains a mark utilizes an approach of making an initial determination of the approximate location of the field, and then refining such determination. The form is assumed to have registration marks (fiducials) with the field at a known location relative to the fiducials. The fiducials are identified (50), and the approximate location of the field is determined (55) from the fiducial positions and the known relation between the fiducials and the field. At this point, a portion of the image (referred to as the subimage) is extracted (57). The subimage is typically somewhat larger than the field so that it can be assumed that the field is within the subimage. The field has machine-printed lines along at least part of the field perimeter. In order to distinguish these lines from hand-printed marks in the field, a copy of the subimage is subjected to a set of operations (60) on the actual pixels in the subimage that provides nominal information on the location of these lines. The boundaries of the subimage are then changed (62) to nominally exclude the lines.

    Identification of a blank page in an image processing system
    49.
    发明授权
    Identification of a blank page in an image processing system 失效
    识别图像处理系统中的空白页

    公开(公告)号:US5467410A

    公开(公告)日:1995-11-14

    申请号:US855375

    申请日:1992-03-20

    申请人: Dan S. Bloomberg

    发明人: Dan S. Bloomberg

    摘要: The present invention provides a robust technique for quickly determining whether a binary input image originated as a blank page. The technique provides reliable sensing in the presence of various image and scanner noise in the input image. In broad terms, the invention contemplates reducing the input image with a low threshold, labeling (by size) connected components (8-connected or 4-connected), and performing a threshold analysis. The threshold analysis typically entails size and numerical thresholds, taking into account the characteristic dimensions of expected types of noise. In specific embodiments, the reduction is performed as a textured reduction wherein the image is divided into tiles, and a single row of pixels in each tile is checked to see whether there are any ON pixels. If there are, the corresponding pixel in the reduced image is ON, otherwise it is OFF. Optional morphological operations are performed to remove expected sources of noise (e.g., pepper noise and thin horizontal lines). The invention further recognizes that a faxed page may contain vertical streaks that are not part of the original paper document. Thus, the threshold analysis typically allows a certain number of such streaks to be present without concluding that the page is not blank. A further understanding of the nature and advantages of the present invention may be realized by reference to the remaining portions of the specification and the drawings.

    摘要翻译: 本发明提供一种用于快速确定二进制输入图像是否起始为空白页的鲁棒技术。 该技术在输入图像中存在各种图像和扫描仪噪声的情况下提供可靠的感测。 在广义上,本发明考虑以低阈值,(通过尺寸标注)连接的组件(8连接或4连接)来缩小输入图像,以及执行阈值分析。 考虑到预期噪声类型的特征尺寸,阈值分析通常需要大小和数值阈值。 在具体实施例中,减少作为纹理化减少执行,其中图像被分割成瓦片,并且检查每个瓦片中的单行像素以查看是否存在任何ON像素。 如果存在,缩小图像中的对应像素为ON,否则为OFF。 执行可选的形态学操作以去除预期的噪声源(例如,胡椒噪声和细水平线)。 本发明还认识到,传真页面可以包含不是原始纸质文档的一部分的垂直条纹。 因此,阈值分析通常允许一定数量的这样的条纹存在,而不断定页面不是空白的。 可以通过参考说明书和附图的其余部分来实现对本发明的性质和优点的进一步理解。

    Word spotting in bitmap images using word bounding boxes and hidden
Markov models
    50.
    发明授权
    Word spotting in bitmap images using word bounding boxes and hidden Markov models 失效
    使用字边界框和隐马尔可夫模型在位图图像中发现字

    公开(公告)号:US5438630A

    公开(公告)日:1995-08-01

    申请号:US991913

    申请日:1992-12-17

    摘要: Font-independent spotting of user-defined keywords in a scanned image. Word identification is based on features of the entire word without the need for segmentation or OCR, and without the need to recognize non-keywords. Font-independent character models are created using hidden Markov models (HMMs) and arbitrary keyword models are built from the character HMM components. Word or text line bounding boxes are extracted from the image, a set of features based on the word shape, (and preferably also the word internal structure) within each bounding box is extracted, this set of features is applied to a network that includes one or more keyword HMMs, and a determination is made. The identification of word bounding boxes for potential keywords includes the steps of reducing the image (say by 2.times.) and subjecting the reduced image to vertical and horizontal morphological closing operations. The bounding boxes of connected components in the resulting image are then used to hypothesize word or text line bounding boxes, and the original bitmaps within the boxes are used to hypothesize words. In a particular embodiment, a range of structuring elements is used for the closing operations to accommodate the variation of inter- and intra-character spacing with font and font size.

    摘要翻译: 在扫描图像中用户定义的关键字的字体独立检测。 词识别基于整个词的特征,而不需要分割或OCR,并且不需要识别非关键字。 使用隐马尔可夫模型(HMM)创建字体无关的字符模型,并且使用字符HMM组件构建任意关键字模型。 从图像中提取词或文本行边界框,提取基于每个边界框内的单词形状(并且优选地也称为单词内部结构)的一组特征,该特征集合被应用于包括一个 或更多关键字HMM,并进行确定。 用于潜在关键词的单词界限框的识别包括以下步骤:减少图像(例如,2x),并使缩小后的图像进行垂直和水平形态关闭操作。 然后,使用所得到的图像中的连接分量的边界框来假设单词或文本行界限框,并且使用框内的原始位图来假设单词。 在特定实施例中,结构化元素的范围用于关闭操作以适应字体和字体间距与字体间距的变化。