摘要:
Upon receiving, for example, document data including a character string from outside, a character recognition device detects a line from a line-touching character-string image in which at least one character (such as number, alphabet letter, kana character, and Chinese character) touches (or overlaps) a line in the document data, tentatively removes the line, and estimates a character region. The character recognition device extracts a line-touching character image from the line-touching character-string image (original image) based on the estimated character region. The character recognition device creates a line-added reference character image by adding a quasi-line to a reference character image stored in advance.
摘要:
In an apparatus for analyzing a layout of a document, a character candidate element generator generates character candidate elements from black pixel linkage components of a document image. A horizontally oriented line rectangle generator sets a plurality of character candidate elements as a line candidate rectangle, among character candidate elements aligned in horizontal line orientation, when each amount of displacement of the set character candidate elements in a vertical orientation with respect to the horizontal line orientation, is smaller than or equal to a threshold value. A horizontally oriented paragraph-box generator sets a plurality of line candidate elements having approximately the same length as each other in the vertical orientation, as a paragraph candidate element.
摘要:
A three-dimensional curved-surface model can be estimated by using both two-dimensional outlines obtained from a piece of image photographed from the top and a restriction that the paper is rectangular. Then, only the three-dimensional distortion in the image can be corrected based on the obtained three-dimensional curved-surface model.
摘要:
An image distortion correcting apparatus is provided with an image input section to input an image of a flat rectangular paper surface imaged by an imaging section, as an input image, an imaging position estimating section to estimate a relative imaging position of the imaging section with respect to the paper surface from four vertexes of the rectangular paper surface within the input image, a rectangular paper surface estimating section to estimate four vertexes of the rectangular paper surface within a three-dimensional space based on the imaging position, and an image correcting section to correct a perspective transformation distortion in the paper surface within the input image based on the imaging position and the four vertexes within the three-dimensional space, so as to output an output image.
摘要:
A document layout analysis program capable of extracting an appropriate set of text blocks from a given document image even in the case where the document layout is so complicated that conventional extraction methods with a single extraction condition would not work well. A plurality of different extraction conditions are stored in an extraction condition memory for use in extracting text blocks from a given document image. In accordance with those extraction conditions, a text block extractor extracts a plurality of sets of text blocks from the document image. A text block consolidator produces a consolidated set of text blocks by performing character recognition on each extracted text block, evaluating validity of each text block based on a result of the character recognition, and selecting most valid text blocks from among the plurality of sets of text blocks.
摘要:
It is judged for each pixel in an inputted multilevel image whether the pixel is a background pixel, and the pixel is locally binarized if it is judged not to be a background pixel. Then, it is judged whether the pixel belongs to a background or a stroke, such as of a character, ruled line, etc., and a binary image is generated.
摘要:
An information processing apparatus extracts a plurality of strokes from a multilevel image, and generates a stroke binary image. Next, the image processing apparatus extracts feature amounts indicating the thickness and the smoothed graylevel of a stroke in a neighboring region of a target pixel by using each pixel belonging to each of the strokes as the target pixel. Then, the apparatus generates a target stroke binary image from the stroke binary image based on the distribution of the extracted feature amounts.
摘要:
An information processing apparatus extracts a plurality of strokes from a multilevel image, and generates a stroke binary image. Next, the image processing apparatus extracts feature amounts indicating the thickness and the smoothed graylevel of a stroke in a neighboring region of a target pixel by using each pixel belonging to each of the strokes as the target pixel. Then, the apparatus generates a target stroke binary image from the stroke binary image based on the distribution of the extracted feature amounts.
摘要:
Upon receiving, for example, document data including a character string from outside, a character recognition device detects a line from a line-touching character-string image in which at least one character (such as number, alphabet letter, kana character, and Chinese character) touches (or overlaps) a line in the document data, tentatively removes the line, and estimates a character region. The character recognition device extracts a line-touching character image from the line-touching character-string image (original image) based on the estimated character region. The character recognition device creates a line-added reference character image by adding a quasi-line to a reference character image stored in advance.
摘要:
A rather expanded binary image and a rather blurry binary image are generated from a multiple-valued image. A ruled line candidate area is extracted from the rather expanded binary image, and the extracted ruled line candidate area is verified using the rather blurry binary image.