摘要:
A method and apparatus for processing a document image, using a programmed general or special purpose computer, includes forming the image into image units, and at least one image unit classifier of at least one of the image units is determined, without decoding the content of the at least one of the image units. The classifier of the at least one of the image units is then compared with a classifier of another image unit. The classifier may be image unit length, width, location in the document, font, typeface, cross-section, the number of ascenders, the number of descenders, the average pixel density, the length of the top line contour, the length of the base contour, the location of image units with respect to neighboring image units, vertical position, horizontal inter-image unit spacing, and so forth. The classifier comparison can be a comparison with classifiers of image units of words in a reference table, or with classifiers of other image units in the document. Equivalent classes of image units can be generated, from which word frequency and significance can be determined. The image units can be determined by creating bounding boxes about identifiable segments or extractable units of the image, and can contain a word, a phrase, a letter, a number, a character, a glyph or the like.
摘要:
A method and apparatus for identifying and correcting for document skew. Lines of a bitmap are scanned and a variance in the number of ON pixels as a function of skew angle is calculated. Skew of the original document occurs when the variance is a maximum.
摘要:
A method of apparatus for automatic page orientation of a scanned image which compares the number of character ascending pixels to the number of character descending pixels in the image to determine if the image is properly aligned or is 90.degree. or 180.degree. out of orientation. The method and apparatus includes morphologically processing the bitmap of the scanned image using structuring elements for isolating the character ascenders and descenders. When page orientation is improper, the bitmap image of the scanned image is rotated to correct the misalignment.
摘要:
A method and apparatus for identifying and correcting for document skew. Lines of a bitmap are scanned and a variance in the number of ON pixels as a function of skew angle is calculated. Skew of the original document occurs when the variance is a maximum. Once the skew has been identified, the document is deskewed accordingly.
摘要:
Binary image processing techniques are provided for decoding bitmap image space representations of self-clocking glyph shape codes of various types (e.g., codes presented as original or degraded images, with one or a plurality of bits encoded in each glyph, while preserving the discriminability of glyphs that encode different bit values) and for tracking the number and locations of the ambiquities (sometimes referred to herein as "errors") that are encountered during the decoding of such codes. A substantial portion of the image processing that is performed in the illustrated embodiment of this invention is carried out through the use of morphological filtering operations because of the parallelism that is offered by such operations. Moreover, the error detection that is performed in accordance with this invention may be linked to or compared against the error statistics from one or more alternative decoding process, such as the convolution filtering process that is disclosed herein, to increase the reliability of the decoding that is obtained.
摘要:
A method for creating a mask for separating halftone regions in a binary image from other regions comprises: constructing a seed image that includes pixels only in halftone regions and at least one pixel in every halftone region (67); constructing a clipping mask that covers in a connected manner all ON pixels in halftone regions (70); and filling the seed while clipping to the mask (72). Thresholded reductions and morphological operations are preferred.
摘要:
Weighted and unweighted convolution filtering processes are provided for decoding bitmap image space representations of self-clocking glyph shape codes and for tracking the number and locations of the ambiquities or "errors" that are encountered during the decoding. This error detection may be linked to or compared against the error statistics from an alternative decoding process, such as the binary image processing techniques that are described herein to increase the reliability of the decoding that is obtained.
摘要:
A method and apparatus for detection of highlighted regions of a document. A document containing highlighted regions is scanned using a gray scale scanner. Morphology and threshold reduction techniques are used to separate highlighted and non-highlighted portions of the docment. Having separated the highlighted and non-highlighted portions, optical character recognition (OCR) techniques can then be used to extract text from the highlighted regions.