摘要:
A method and apparatus for detecting proper page orientation of a scanned document image. The document is subdivided into a series of word boxes. The number of ascending and descending text characters within the word can then be compared with the number expected for a properly oriented document to verify page orientation.
摘要:
In a character recognition system, a method and apparatus for segmenting a document image into areas containing text and non-text. Document segmentation in the present invention is comprised generally of the steps of: providing a bit-mapped representation of the document image, extracting run lengths for each scanline from the bit-mapped representation of the document image; constructing rectangles from the run lengths; initially classifying each of the rectangles as either text or non-text; correcting for the skew in the rectangles; merging associated text into one or more text blocks; and logically ordering the text blocks.
摘要:
In a character recognition system, a method and apparatus for correcting the skew of a document image. Skew correction is typically performed during segmentation of the document image into text and non-text parts. Skew correction generally involves skew angle determination and correction of the document image based on the skew angle. A skew angle is determined through the steps of: providing a set of associated rectangles representing the document image, identifying a column edge associated with the set of associated rectangles, comparing rectangles from the set of associated rectangles to identify those that are in the same column and suitably far apart, calculating a tangential angle between the rectangles identified and identifying the most common tangential angle as the skew angle. Once the skew angle is determined, correction of the document image is made by constructing real skewed rectangles from corresponding extracted rectangles and rotating each of the real skewed rectangles around an origin coordinate for a distance based on the skew angle.
摘要:
In a character recognition system, a method and apparatus for segmenting a document image into areas containing text and non-text. Document segmentation in the present invention is comprised generally of the steps of: providing a bit-mapped representation of the document image, extracting run lengths for each scanline from the bit-mapped representation of the document image; constructing rectangles from the run lengths; initially classifying each of the rectangles as either text or non-text; correcting for the skew in the rectangles; merging associated text into one or more text blocks; and logically ordering the text blocks.
摘要:
In a character recognition system, a method and apparatus for correcting the skew of a document image. Skew correction is typically performed during segmentation of the document image into text and non-text parts. Skew correction generally involves skew angle determination and correction of the document image based on the skew angle. A skew angle is determined through the steps of: providing a set of associated rectangles representing the document image, identifying a column edge associated with the set of associated rectangles, comparing rectangles from the set of associated rectangles to identify those that are in the same column and suitably far apart, calculating a tangential angle between the rectangles identified and identifying the most common tangential angle as the skew angle. Once the skew angle is determined, correction of the document image is made by constructing real skewed rectangles from corresponding extracted rectangles and rotating each of the real skewed rectangles around an origin coordinate for a distance based on the skew angle.
摘要:
A method and apparatus for expanding whitespace between lines of text in a document image. The method and apparatus preserves the text size and relative spatial orientation within text areas.
摘要:
A data sheet is composed of an upper part and a lower part. The upper part is used as a user interface including a reduced image of contents of a document. The lower part is an interface for a reading device such as a copy machine, including a code obtained by encoding the document. By use of the data sheet, the user can easily distribute or carry an electronic document data with the user. In addition, the user can recognize contents of the electronic document data by looking at the reduced image printed on the data sheet.
摘要:
A method and apparatus for detecting the skew angle of a document image. Skew angle determination is performed by the steps of determining a set of sampling points from an input document image and processing X and Y coordinates of the sampling points in order to calculate a regression coefficient of the sampling points. The skew angle of the document is determined using the regression coefficient. To evaluate a calculated skew angle which corresponds to the regression coefficient, a correlation coefficient is calculated and evaluated. As coordinates of sampling points are obtained for a plurality of sets of data corresponding to different ruled lines or lines of characters, a histogram may be used to determine the most probable skew angle.
摘要:
A method applicable to a character recognition system is disclosed which assigns direction codes to a number of boundary picture elements contained in a two-level character pattern. The direction of connectivity of a boundary picture element observed is fractionized to minimize the error due to the quantization of the direction. The direction codes are converted into those which correspond to connectivity directions which should be finally grasped. Such direction codes allow strokes to be extracted quite faithfully to the original character pattern.
摘要:
A device includes a display monitor configured to display on a screen of the display monitor at least one of a current image and a preceding image taken by an optical unit, a storage unit configured to store the preceding image, and a control unit configured to display a part of the preceding image on the screen when the current image is displayed on the screen, the displayed part of the preceding image indicating a positional relationship between the current image and the preceding image. The current image is taken when the displayed part of the preceding image substantially overlaps a corresponding portion of the current image such that the current image and the preceding image can be processed to form a panoramic image.