摘要:
A set of straight lines that associate a top parallel geodesic projection positioned at an upper end with a bottom parallel geodesic projection positioned at a lower end, among sets of parallel geodesic projections, is extracted as a set of ruled-line candidate projections as a search target of a set of ruled line projections. A deviation of neighborhood, which is a distance between a cross ratio vector of the ruled-line candidate projection and a cross ratio vector of a neighboring line obtained by shifting the ruled-line candidate projection by a predetermined interval, is calculated for each ruled-line candidate projection. A set of straight lines having the smallest sum total of deviations of neighborhood, in the set of straight lines, which do not intersect with each other, among the sets of ruled-line projection candidates is extracted as a set of ruled line projections by continuous dynamic programming.
摘要:
A projection set of geodesic lines which are parallel with each other on a curved surface of a paper face is extracted from an image in which a paper face has been imaged by an image-pickup device, using the paper face contents as a clue; and also a projection set of ruling lines which form a ruled surface corresponding to the curved surface of the paper face is extracted from the projection set of geodesic lines. Then, the curved surface of the paper face is estimated from the projection set of the geodesic lines and ruling lines, and distortion of the image is corrected based on this curved surface of the paper face. If this is done, correspondence with various types of diverse distortions becomes possible, and distortion correction can be performed even when only one part of the paper face appears in the image.
摘要:
A set of straight lines that associate a top parallel geodesic projection positioned at an upper end with a bottom parallel geodesic projection positioned at a lower end, among sets of parallel geodesic projections, is extracted as a set of ruled-line candidate projections as a search target of a set of ruled line projections. A deviation of neighborhood, which is a distance between a cross ratio vector of the ruled-line candidate projection and a cross ratio vector of a neighboring line obtained by shifting the ruled-line candidate projection by a predetermined interval, is calculated for each ruled-line candidate projection. A set of straight lines having the smallest sum total of deviations of neighborhood, in the set of straight lines, which do not intersect with each other, among the sets of ruled-line projection candidates is extracted as a set of ruled line projections by continuous dynamic programming.
摘要:
A projection set of geodesic lines which are parallel with each other on a curved surface of a paper face is extracted from an image in which a paper face has been imaged by an image-pickup device, using the paper face contents as a clue; and also a projection set of ruling lines which form a ruled surface corresponding to the curved surface of the paper face is extracted from the projection set of geodesic lines. Then, the curved surface of the paper face is estimated from the projection set of the geodesic lines and ruling lines, and distortion of the image is corrected based on this curved surface of the paper face. If this is done, correspondence with various types of diverse distortions becomes possible, and distortion correction can be performed even when only one part of the paper face appears in the image.
摘要:
According to an aspect of an embodiment, a method of character recognition out of an image having a frame and a plurality of characters in an area, comprises the steps of: dividing the area into a plurality of partial areas having a plurality of partial images, respectively; providing a template image having a reference frame image; calculating differences between the partial images and the reference frame image of the template image, respectively; calculating misalignment of the image from the template image based on the average of the differences of the partial images and the reference frame image; and recognizing the characters out of the image upon correction of the misalignment.
摘要:
A key word is first and automatically extracted from a character string group to be recognized, and entered. Then, a character is recognized by segmenting an individual character from a character string image to be recognized, and a character string corresponding to the extracted/entered key word id extracted. Then, a word area delimited by a key word is extracted from the character string image, and a word is recognized. Furthermore, a word recognition result is verified, and a final character string recognition result is output.
摘要:
A document information input apparatus detects a position and an attribute of an area of a real document to be input designated by a user with high accuracy. Based on the detected position and attribute, the document information input apparatus recognizes an image of the area as text information by performing recognition processes suitable for the detected attribute such as character recognition, table recognition and a figure process. Then, the document information input apparatus pastes the resulting information to a pertinent position of an electronic document on a display. As a result, it is possible to input information such as a character sequence, a table and a figure from a real document to an electronic document at high speed and with high accuracy.
摘要:
A document layout analysis program capable of extracting an appropriate set of text blocks from a given document image even in the case where the document layout is so complicated that conventional extraction methods with a single extraction condition would not work well. A plurality of different extraction conditions are stored in an extraction condition memory for use in extracting text blocks from a given document image. In accordance with those extraction conditions, a text block extractor extracts a plurality of sets of text blocks from the document image. A text block consolidator produces a consolidated set of text blocks by performing character recognition on each extracted text block, evaluating validity of each text block based on a result of the character recognition, and selecting most valid text blocks from among the plurality of sets of text blocks.
摘要:
In an apparatus for analyzing a layout of a document, a character candidate element generator generates character candidate elements from black pixel linkage components of a document image. A horizontally oriented line rectangle generator sets a plurality of character candidate elements as a line candidate rectangle, among character candidate elements aligned in horizontal line orientation, when each amount of displacement of the set character candidate elements in a vertical orientation with respect to the horizontal line orientation, is smaller than or equal to a threshold value. A horizontally oriented paragraph-box generator sets a plurality of line candidate elements having approximately the same length as each other in the vertical orientation, as a paragraph candidate element.
摘要:
This invention provides a correcting device and a correcting method for perspective transformation of document images. The correcting device comprises a horizontal vanishing point determining unit, for detecting a horizontal vanishing point of the perspective transformed document image; a vertical vanishing point determining unit, for detecting a vertical vanishing point of the perspective transformed document image; and a perspective transformation correcting and converting unit, for correcting the perspective transformed document image; wherein the horizontal vanishing point determining unit comprises a direct horizontal line segment detecting unit, an indirect horizontal line segment detecting unit and a horizontal vanishing point detecting unit, and wherein the horizontal vanishing point detecting unit detects a horizontal vanishing point in accordance with a direct horizontal line segment detected by the direct horizontal line segment detecting unit and an indirect horizontal line segment detected by the indirect horizontal line segment detecting unit.