Abstract:
A method and apparatus for differentiating and extracting handwritten annotations and machine printed text in an image. The method provides for the use of morphological operations, preferably at reduced scale, to eliminate, for example, the handwritten annotations from an image. A separation mask is produced that, for example, covers all the image pixels corresponding to machine printed text, and none of the image pixels corresponding to handwritten or handprinted annotations. The separation mask is used in conjunction with the original image to produce separate handwritten annotations and machine printed text images.
Abstract:
A signature for a page of text is generated. The signature serves as an identifier of the text page. Positions of words in a text page are determined. Positions of multiple second words in the text page are determined relative to the position of a first word in the text page. A signature value is generated that describes the second word positions relative to the first word position. The signature value is stored. Additional signatures for the text page can be generated, each signature describing positions of other words in the text page relative to a word in the text page for which the signature is being generated. The signatures can be used to compare the text page to another text page and generate a measure of similarity that describes the result of the comparison.
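
As a minimal sketch of the idea above, the following Python assumes word positions are already available as (x, y) pixel coordinates. The function names, the choice of the k nearest words as the "second words", the grid quantization, and the Jaccard-style similarity measure are all illustrative assumptions, not details taken from the abstract.

```python
def word_signature(anchor, others, k=3, cell=10):
    """Describe the k nearest words by their quantized offsets from the anchor word."""
    ax, ay = anchor
    # Assumption: "second words" are the k nearest neighbours of the first word.
    nearest = sorted(others, key=lambda p: ((p[0] - ax) ** 2 + (p[1] - ay) ** 2, p))[:k]
    # Offsets are relative to the anchor, so the value does not depend on page translation.
    return tuple(sorted(((x - ax) // cell, (y - ay) // cell) for x, y in nearest))


def page_signatures(positions, k=3, cell=10):
    """Generate one signature per word, each relative to that word."""
    sigs = set()
    for i, anchor in enumerate(positions):
        others = positions[:i] + positions[i + 1:]
        sigs.add(word_signature(anchor, others, k, cell))
    return sigs


def similarity(page_a, page_b):
    """Hypothetical similarity measure: fraction of signatures the two pages share."""
    sa, sb = page_signatures(page_a), page_signatures(page_b)
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)
```

Because every signature stores offsets rather than absolute coordinates, a copy of the same page shifted by a constant amount yields the same signature set under these assumptions.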
Abstract:
A method and system for storing and generating anti-aliased text and lineart data from compressed document image files, using an MRC model that represents the image as an ordered set of mask/image pairs at resolutions appropriate to the content of each layer. The method and system provide the ability to generate anti-aliased text data to improve appearance at both high and low resolution, and to avoid baseline jitter of compressed tokens.
Abstract:
The glyphs of self-clocking glyph codes are written on regular hexagonal or pseudo-hexagonal lattice-like patterns of centers to reduce the risk of interglyph interference during the read process while also enabling the glyphs to be packed more densely while maintaining a given center-to-center spacing between them.
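
The packing claim can be checked with a short geometric sketch. The Python below generates centers on a regular hexagonal lattice and compares its density with a square lattice at the same center-to-center spacing. The function names and the row-offset layout are illustrative assumptions, and the pseudo-hexagonal variant is not modeled.

```python
import math


def hex_centers(rows, cols, spacing):
    """Centers on a regular hexagonal lattice: odd rows shifted by half the spacing."""
    dy = spacing * math.sqrt(3) / 2  # row pitch that keeps all neighbours `spacing` apart
    return [
        (c * spacing + (spacing / 2 if r % 2 else 0.0), r * dy)
        for r in range(rows)
        for c in range(cols)
    ]


def min_center_distance(centers):
    """Smallest distance between any two centers."""
    return min(
        math.dist(a, b)
        for i, a in enumerate(centers)
        for b in centers[i + 1:]
    )


def density_gain():
    """Centers per unit area, hexagonal versus square, at equal spacing."""
    return 2 / math.sqrt(3)
```

With equal center-to-center spacing, the hexagonal arrangement holds about 15% more centers per unit area than a square grid, which is the geometric basis for the denser packing described above.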
Abstract:
Encoded data embedded in an iconic, or reduced size, version of an original text image is decoded and used in a variety of document image management applications to provide input to, or to control the functionality of, an application. The iconic image may be printed in a suitable place (e.g., the margin or other background region) in the original text image so that a text image so annotated will then always carry the embedded data in subsequent copies made from the annotated original. The iconic image may also be used as part of a graphical user interface as a surrogate for the original text image. An encoding operation encodes the data unobtrusively in the form of rectangular blocks that have a foreground color and size dimensions proportional to the iconic image so that when placed in the iconic image in horizontal lines, the blocks appear to a viewer to be representative of the text portion of the original image that they replace. Several embodiments are illustrated, including using the iconic image as a document surrogate for the original text image for data base retrieval operations. The iconic image may also be used in conjunction with the original text image for purposes of authenticating the original document using a digital signature encoded in the iconic image, or for purposes of controlling the authorized distribution of the document. The iconic image may also carry data about the original image that may be used to enhance the performance and accuracy of a subsequent character recognition operation.
Abstract:
A method and apparatus for detection of highlighted regions of a document. A document containing highlighted regions is scanned using a gray scale scanner. Morphology and threshold reduction techniques are used to separate highlighted and non-highlighted portions of the document. Having separated the highlighted and non-highlighted portions, optical character recognition (OCR) techniques can then be used to extract text from the highlighted regions.
Abstract:
Machine readable electronic domain definitions of part or all of the electronic domain descriptions of hardcopy documents and/or of part or all of the transforms that are performed to produce and reproduce such hardcopy documents are encoded in codes that are printed on such documents, thereby permitting the electronic domain descriptions of such documents and/or such transforms to be recovered more robustly and reliably when the information carried by such documents is transformed from the hardcopy domain to the electronic domain.
Abstract:
A technique for reducing images that provides useful information about the image and allows fast computation. Using threshold values near the extreme possible values for the convolution window size and using large subsampling tiles nevertheless allows extraction of the information about the typical textures that exist in the document image: text words, text lines, rules, and halftones. In a particular embodiment, 16×16 tiles are used for subsampling, 16×1 and 1×16 windows are used for the convolution, and threshold values of 1 and 16 are used. If the horizontal windows in tiles are aligned with 16-bit boundaries in the computer, the implementation is particularly efficient. For the 16×1 horizontal window, a threshold convolution with T=1 can be done on any of the sixteen 16-bit words in the tile by checking whether the word is zero or non-zero. For a 1×16 vertical window, a threshold convolution with T=1 can be done on any of the sixteen 16-bit columns in the tile by ORing the sixteen appropriately masked words.
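
The word-level tests described above can be sketched in a few lines of Python. A tile is assumed here to be a list of sixteen integers, one 16-bit word per row, with bit 15 as the leftmost pixel. The function names are hypothetical, and the T=16 branches (comparing against an all-ones word, and ANDing the rows) are a natural counterpart assumed for illustration, since the abstract spells out only the T=1 case.

```python
FULL = 0xFFFF  # a 16-bit word with every pixel ON


def horizontal_hits(tile, threshold):
    """16x1 window: one result per row word of the tile."""
    if threshold == 1:
        # At least one ON pixel in the row: the word is non-zero.
        return [word != 0 for word in tile]
    if threshold == 16:
        # Assumed counterpart: all sixteen pixels ON.
        return [word == FULL for word in tile]
    raise ValueError("this sketch supports only T=1 and T=16")


def vertical_hits(tile, threshold):
    """1x16 window: one result per pixel column of the tile."""
    if threshold == 1:
        acc = 0
        for word in tile:
            acc |= word  # a column bit survives if any row has it ON
    elif threshold == 16:
        acc = FULL
        for word in tile:
            acc &= word  # assumed counterpart: a column bit survives only if every row has it ON
    else:
        raise ValueError("this sketch supports only T=1 and T=16")
    # Mask out each column in turn; bit 15 is taken as the leftmost column.
    return [bool(acc & (1 << (15 - col))) for col in range(16)]
```

Each test touches whole machine words rather than individual pixels, which is why alignment of the tile rows with 16-bit boundaries makes the operation cheap.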
Abstract:
A method and apparatus for differentiating and extracting text and line graphics in an image. The method provides for the use of morphological operations, preferably at reduced scale, to eliminate vertical rules and lines from an image, followed by the elimination of horizontal rules and lines. The remaining text regions are then solidified to produce a separation mask. The mask is used in conjunction with the original image to produce separate text and graphics images.
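
The sequence of steps above can be sketched in plain Python on a small binary image stored as a list of rows of 0/1 values. This is only an illustration under stated assumptions: it works at full scale rather than reduced scale, uses a simple run-length opening to find rules, and uses a square dilation as the "solidify" step. The function names and the `min_line` and `radius` parameters are hypothetical.

```python
def keep_long_runs(img, n):
    """Horizontal opening with a 1 x n element: keep only ON runs at least n pixels long."""
    out = [[0] * len(row) for row in img]
    for r, row in enumerate(img):
        for c in range(len(row) - n + 1):
            if all(row[c:c + n]):
                out[r][c:c + n] = [1] * n
    return out


def transpose(img):
    return [list(col) for col in zip(*img)]


def subtract(img, other):
    """Pixels that are ON in img and OFF in other."""
    return [[p & (1 - q) for p, q in zip(pr, qr)] for pr, qr in zip(img, other)]


def dilate(img, radius):
    """Square dilation, used here to solidify the surviving text pixels into a mask."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            if img[r][c]:
                for rr in range(max(0, r - radius), min(h, r + radius + 1)):
                    for cc in range(max(0, c - radius), min(w, c + radius + 1)):
                        out[rr][cc] = 1
    return out


def separate(img, min_line=8, radius=1):
    """Return (text, graphics) images split by a separation mask."""
    # Step 1: remove vertical rules (long vertical runs).
    vertical = transpose(keep_long_runs(transpose(img), min_line))
    no_vertical = subtract(img, vertical)
    # Step 2: remove horizontal rules (long horizontal runs).
    residue = subtract(no_vertical, keep_long_runs(no_vertical, min_line))
    # Step 3: solidify what remains into the separation mask.
    mask = dilate(residue, radius)
    # Step 4: combine the mask with the original image.
    text = [[p & m for p, m in zip(pr, mr)] for pr, mr in zip(img, mask)]
    graphics = subtract(img, mask)
    return text, graphics
```

In this simplified form, text that touches a rule can pull a few rule pixels into the text image, because the dilated mask overlaps the rule there.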
Abstract:
A simple technique for determining and indicating, in real time as an image is scanned, the presence of halftones within a page. In brief, the technique contemplates monitoring a pixel stream, typically on a line basis, determining the proportion of pixel transitions (relative to the overall number of pixel intervals), and controlling the process based on this information. In one embodiment, a numerical value representing such a proportion is compared to a threshold, and a value in excess of the threshold is taken to signify the presence of halftone regions. Based on this, special processing for halftones is enabled or special processing for non-halftone regions is disabled. In a specific hardware embodiment, the pixel monitoring circuitry includes a transition detector (50), an up/down activity counter (52), a threshold selector (55), and a counter controller (57).
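
A software sketch of the per-line measurement might look like the following Python. A scan line is assumed to be a sequence of 0/1 pixel values. The function names and the default threshold of 0.25 are illustrative assumptions, and the up/down counter of the hardware embodiment is not modeled.

```python
def transition_fraction(line):
    """Proportion of adjacent pixel pairs whose values differ."""
    intervals = len(line) - 1
    if intervals < 1:
        return 0.0
    transitions = sum(1 for a, b in zip(line, line[1:]) if a != b)
    return transitions / intervals


def looks_like_halftone(line, threshold=0.25):
    """Flag the line when the transition proportion exceeds the (assumed) threshold."""
    return transition_fraction(line) > threshold
```

A finely alternating line such as `[0, 1] * 8` has a transition at every interval and is flagged, while a line with a single long run of ON pixels has very few transitions and is not.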