Segmentation of handwriting and machine printed text
    11.
    Granted invention patent
    Segmentation of handwriting and machine printed text (expired)

    Publication No.: US5181255A

    Publication date: 1993-01-19

    Application No.: US627284

    Filing date: 1990-12-13

    Applicant: Dan S. Bloomberg

    Inventor: Dan S. Bloomberg

    CPC classification: G06K9/00456 G06K9/6835

    Abstract: A method and apparatus for differentiating and extracting handwritten annotations and machine printed text in an image. The method provides for the use of morphological operations, preferably at reduced scale, to eliminate, for example, the handwritten annotations from an image. A separation mask is produced that, for example, covers all the image pixels corresponding to machine printed text, and none of the image pixels corresponding to handwritten or handprinted annotations. The separation mask is used in conjunction with the original image to produce separate handwritten annotations and machine printed text images.
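
The last step of this abstract, combining the separation mask with the original image, can be sketched as follows. This is a minimal illustration and not the patented procedure: it assumes binary images stored as lists of 0/1 rows, assumes the mask has already been computed, and the function name `split_by_mask` is hypothetical.

```python
def split_by_mask(image, mask):
    """Split a binary image into two images using a separation mask.

    Assumption: `image` and `mask` are equally sized lists of rows of 0/1
    values, and the mask is 1 over machine printed text pixels only.
    """
    # Pixels under the mask are taken to be machine printed text.
    printed = [[p & m for p, m in zip(img_row, mask_row)]
               for img_row, mask_row in zip(image, mask)]
    # Pixels outside the mask are taken to be handwritten annotations.
    handwritten = [[p & (1 - m) for p, m in zip(img_row, mask_row)]
                   for img_row, mask_row in zip(image, mask)]
    return printed, handwritten
```

Every foreground pixel of the original ends up in exactly one of the two outputs, so OR-ing them together reproduces the input image.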

    Comparing text pages using image features based on word positions
    12.
    Granted invention patent
    Comparing text pages using image features based on word positions (in force)

    Publication No.: US08910037B1

    Publication date: 2014-12-09

    Application No.: US13407149

    Filing date: 2012-02-28

    IPC classification: G06F17/00

    Abstract: A signature for a page of text is generated. The signature serves as an identifier of the text page. Positions of words in a text page are determined. Positions of multiple second words in the text page are determined relative to the position of a first word in the text page. A signature value is generated that describes the second word positions relative to the first word position. The signature value is stored. Additional signatures for the text page can be generated, each signature describing positions of other words in the text page relative to a word in the text page for which the signature is being generated. The signatures can be used to compare the text page to another text page and generate a measure of similarity that describes the result of the comparison.
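
One way to picture the signature idea in this abstract is the sketch below. It is an illustration under stated assumptions, not the method claimed in the patent: word positions are taken to be (x, y) points, each signature records the quantized offsets of the `k` nearest other words, and similarity is measured as Jaccard overlap of the signature sets. The names `page_signatures` and `similarity`, and the parameters `k` and `cell`, are hypothetical.

```python
from math import hypot

def page_signatures(words, k=3, cell=10):
    """Build one signature per word from the offsets of its k nearest words.

    Assumption: `words` is a list of (x, y) word positions; offsets are
    quantized to a grid of `cell` units so small jitter is tolerated.
    """
    signatures = set()
    for i, (x0, y0) in enumerate(words):
        # Positions of the other ("second") words relative to this "first" word.
        offsets = [(x - x0, y - y0) for j, (x, y) in enumerate(words) if j != i]
        offsets.sort(key=lambda d: (hypot(d[0], d[1]), d))
        signatures.add(tuple((round(dx / cell), round(dy / cell))
                             for dx, dy in offsets[:k]))
    return signatures

def similarity(page_a, page_b):
    """Jaccard overlap of the two pages' signature sets (assumed measure)."""
    sig_a, sig_b = page_signatures(page_a), page_signatures(page_b)
    if not sig_a and not sig_b:
        return 1.0
    return len(sig_a & sig_b) / len(sig_a | sig_b)
```

Because only relative positions enter a signature, a page that is shifted as a whole produces the same signature set as the original.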

    Methods for generating anti-aliased text and line graphics in compressed document images
    13.
    Granted invention patent
    Methods for generating anti-aliased text and line graphics in compressed document images (in force)

    Publication No.: US07489830B2

    Publication date: 2009-02-10

    Application No.: US11878082

    Filing date: 2007-07-20

    IPC classification: G06K9/40 G06K9/36

    CPC classification: H04N1/46 H04N1/41

    Abstract: A method and system for storing and generating anti-aliased text and line art data from compressed document image files, using an MRC model that represents the image as an ordered set of mask/image pairs at resolutions appropriate to the content of each layer. The method and system provide the ability to generate anti-aliased text data to improve appearance at both high and low resolution, and to avoid baseline jitter of compressed tokens.

    Performing document image management tasks using an iconic image having embedded encoded information
    15.
    Granted invention patent
    Performing document image management tasks using an iconic image having embedded encoded information (expired)

    Publication No.: US5765176A

    Publication date: 1998-06-09

    Application No.: US709055

    Filing date: 1996-09-06

    Applicant: Dan S. Bloomberg

    Inventor: Dan S. Bloomberg

    IPC classification: G06T11/00 G06T3/00

    CPC classification: G06T11/00

    Abstract: Encoded data embedded in an iconic, or reduced size, version of an original text image is decoded and used in a variety of document image management applications to provide input to, or to control the functionality of, an application. The iconic image may be printed in a suitable place (e.g., the margin or other background region) in the original text image so that a text image so annotated will then always carry the embedded data in subsequent copies made from the annotated original. The iconic image may also be used as part of a graphical user interface as a surrogate for the original text image. An encoding operation encodes the data unobtrusively in the form of rectangular blocks that have a foreground color and size dimensions proportional to the iconic image so that when placed in the iconic image in horizontal lines, the blocks appear to a viewer to be representative of the text portion of the original image that they replace. Several embodiments are illustrated, including using the iconic image as a document surrogate for the original text image for database retrieval operations. The iconic image may also be used in conjunction with the original text image for purposes of authenticating the original document using a digital signature encoded in the iconic image, or for purposes of controlling the authorized distribution of the document. The iconic image may also carry data about the original image that may be used to enhance the performance and accuracy of a subsequent character recognition operation.

    Detection of highlighted regions
    16.
    Granted invention patent
    Detection of highlighted regions (expired)

    Publication No.: US5619592A

    Publication date: 1997-04-08

    Application No.: US477358

    Filing date: 1995-06-07

    Abstract: A method and apparatus for detection of highlighted regions of a document. A document containing highlighted regions is scanned using a gray scale scanner. Morphology and threshold reduction techniques are used to separate highlighted and non-highlighted portions of the document. Having separated the highlighted and non-highlighted portions, optical character recognition (OCR) techniques can then be used to extract text from the highlighted regions.

    Use of fast textured reduction for discrimination of document image components
    18.
    Granted invention patent
    Use of fast textured reduction for discrimination of document image components (expired)

    Publication No.: US5434953A

    Publication date: 1995-07-18

    Application No.: US854156

    Filing date: 1992-03-20

    Applicant: Dan S. Bloomberg

    Inventor: Dan S. Bloomberg

    CPC classification: G06K9/00456 H04N1/40068

    Abstract: A technique for reducing images that provides useful information about the image and allows fast computation. Using threshold values near the extreme possible values for the convolution window size and using large subsampling tiles nevertheless allows extraction of the information about the typical textures that exist in the document image: text words, text lines, rules, and halftones. In a particular embodiment, 16×16 tiles are used for subsampling, 16×1 and 1×16 windows are used for the convolution, and threshold values of 1 and 16 are used. If the horizontal windows in tiles are aligned with 16-bit boundaries in the computer, the implementation is particularly efficient. For the 16×1 horizontal window, a threshold convolution with T=1 can be done on any of the sixteen 16-bit words in the tile by checking whether the word is zero or non-zero. For a 1×16 vertical window, a threshold convolution with T=1 can be done on any of the sixteen 16-bit columns in the tile by ORing the sixteen appropriately masked words.
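
The word-level tests described in this abstract can be sketched as below. The T=1 cases follow the abstract directly (a zero test on a row word, an OR of masked words for a column). The T=16 cases are an assumed counterpart that the abstract does not spell out. The tile layout (sixteen 16-bit integers, bit 15 as the leftmost pixel) and the names `row_hit` and `col_hit` are assumptions made for illustration.

```python
FULL_WORD = 0xFFFF  # all sixteen pixels of a row set

def row_hit(tile, row, threshold=1):
    """Threshold test for the 16x1 horizontal window on one row word."""
    if threshold == 1:
        return tile[row] != 0          # at least one pixel set
    if threshold == 16:
        return tile[row] == FULL_WORD  # assumed T=16 case: every pixel set
    raise ValueError("only thresholds 1 and 16 are sketched")

def col_hit(tile, col, threshold=1):
    """Threshold test for the 1x16 vertical window on one column."""
    mask = 1 << (15 - col)  # assumption: bit 15 is the leftmost pixel
    if threshold == 1:
        acc = 0
        for word in tile:
            acc |= word & mask         # OR of the sixteen masked words
        return acc != 0
    if threshold == 16:
        acc = mask
        for word in tile:
            acc &= word                # assumed T=16 case: AND instead of OR
        return acc != 0
    raise ValueError("only thresholds 1 and 16 are sketched")
```

With this layout each row test costs a single comparison, which is the efficiency the abstract attributes to aligning windows with 16-bit word boundaries.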

    Segmentation of text and graphics
    19.
    Granted invention patent
    Segmentation of text and graphics (expired)

    Publication No.: US5202933A

    Publication date: 1993-04-13

    Application No.: US449626

    Filing date: 1989-12-08

    Applicant: Dan S. Bloomberg

    Inventor: Dan S. Bloomberg

    IPC classification: G06K9/20

    CPC classification: G06K9/00456

    Abstract: A method and apparatus for differentiating and extracting text and line graphics in an image. The method provides for the use of morphological operations, preferably at reduced scale, to eliminate vertical rules and lines from an image, followed by the elimination of horizontal rules and lines. The remaining text regions are then solidified to produce a separation mask. The mask is used in conjunction with the original image to produce separate text and graphics images.
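
The line-elimination step can be illustrated with a morphological opening by a long, thin structuring element. In one dimension, opening with a segment of length `k` keeps exactly the runs of at least `k` consecutive foreground pixels, which is what the sketch below relies on. It is a simplified illustration and not the patented sequence: it works at full scale, removes vertical and horizontal lines in one pass rather than one after the other, and omits the later solidifying step. The helper names and the parameter `k` are hypothetical.

```python
def open_rows(img, k):
    """Horizontal opening: keep only runs of at least k set pixels per row."""
    out = [[0] * len(row) for row in img]
    for y, row in enumerate(img):
        x = 0
        while x < len(row):
            if row[x]:
                start = x
                while x < len(row) and row[x]:
                    x += 1
                if x - start >= k:  # run is long enough to count as a line
                    for i in range(start, x):
                        out[y][i] = 1
            else:
                x += 1
    return out

def transpose(img):
    return [list(col) for col in zip(*img)]

def remove_lines(img, k):
    """Clear pixels that belong to horizontal or vertical runs of length >= k."""
    horizontal = open_rows(img, k)
    vertical = transpose(open_rows(transpose(img), k))
    return [[p & (1 - (h | v)) for p, h, v in zip(pr, hr, vr)]
            for pr, hr, vr in zip(img, horizontal, vertical)]
```

Choosing `k` longer than the widest character stroke keeps text intact while rules and long lines are removed.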

    High speed halftone detection technique
    20.
    Granted invention patent
    High speed halftone detection technique (expired)

    Publication No.: US5193122A

    Publication date: 1993-03-09

    Application No.: US621478

    Filing date: 1990-12-03

    IPC classification: G06T7/00 H04N1/40

    CPC classification: H04N1/40062

    Abstract: A simple technique for determining and indicating, in real time as an image is scanned, the presence of halftones within a page. In brief, the technique contemplates monitoring a pixel stream, typically on a line basis, determining the proportion of pixel transitions (relative to the overall number of pixel intervals), and controlling the process based on this information. In one embodiment, a numerical value representing such a proportion is compared to a threshold, and a value in excess of the threshold is taken to signify the presence of halftone regions. Based on this, special processing for halftones is enabled or special processing for non-halftone regions is disabled. In a specific hardware embodiment, the pixel monitoring circuitry includes a transition detector (50), an up/down activity counter (52), a threshold selector (55), and a counter controller (57).
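
The transition-proportion test in this abstract can be sketched in software as follows. This is an illustration only: it processes one scan line given as a list of 0/1 pixels, does not model the up/down counter hardware, and the threshold value of 0.3 is an arbitrary assumption rather than a figure from the patent. The function names are hypothetical.

```python
def transition_fraction(line):
    """Fraction of adjacent pixel pairs in a scan line whose values differ."""
    if len(line) < 2:
        return 0.0
    transitions = sum(1 for a, b in zip(line, line[1:]) if a != b)
    return transitions / (len(line) - 1)  # relative to the number of intervals

def looks_like_halftone(line, threshold=0.3):
    """Flag a line as halftone when its transition fraction exceeds the threshold."""
    return transition_fraction(line) > threshold
```

A line crossing solid text strokes changes value only a few times, while a line crossing a halftone dot pattern changes value at most pixel intervals, so the two cases fall on opposite sides of a reasonable threshold.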
