摘要:
A projection set of geodesic lines which are parallel with each other on a curved surface of a paper face is extracted from an image in which a paper face has been imaged by an image-pickup device, using the paper face contents as a clue; and also a projection set of ruling lines which form a ruled surface corresponding to the curved surface of the paper face is extracted from the projection set of geodesic lines. Then, the curved surface of the paper face is estimated from the projection set of the geodesic lines and ruling lines, and distortion of the image is corrected based on this curved surface of the paper face. If this is done, correspondence with various types of diverse distortions becomes possible, and distortion correction can be performed even when only one part of the paper face appears in the image.
摘要:
A set of straight lines that associate a top parallel geodesic projection positioned at an upper end with a bottom parallel geodesic projection positioned at a lower end, among sets of parallel geodesic projections, is extracted as a set of ruled-line candidate projections as a search target of a set of ruled line projections. A deviation of neighborhood, which is a distance between a cross ratio vector of the ruled-line candidate projection and a cross ratio vector of a neighboring line obtained by shifting the ruled-line candidate projection by a predetermined interval, is calculated for each ruled-line candidate projection. A set of straight lines having the smallest sum total of deviations of neighborhood, in the set of straight lines, which do not intersect with each other, among the sets of ruled-line projection candidates is extracted as a set of ruled line projections by continuous dynamic programming.
摘要:
A set of straight lines that associate a top parallel geodesic projection positioned at an upper end with a bottom parallel geodesic projection positioned at a lower end, among sets of parallel geodesic projections, is extracted as a set of ruled-line candidate projections as a search target of a set of ruled line projections. A deviation of neighborhood, which is a distance between a cross ratio vector of the ruled-line candidate projection and a cross ratio vector of a neighboring line obtained by shifting the ruled-line candidate projection by a predetermined interval, is calculated for each ruled-line candidate projection. A set of straight lines having the smallest sum total of deviations of neighborhood, in the set of straight lines, which do not intersect with each other, among the sets of ruled-line projection candidates is extracted as a set of ruled line projections by continuous dynamic programming.
摘要:
A projection set of geodesic lines which are parallel with each other on a curved surface of a paper face is extracted from an image in which a paper face has been imaged by an image-pickup device, using the paper face contents as a clue; and also a projection set of ruling lines which form a ruled surface corresponding to the curved surface of the paper face is extracted from the projection set of geodesic lines. Then, the curved surface of the paper face is estimated from the projection set of the geodesic lines and ruling lines, and distortion of the image is corrected based on this curved surface of the paper face. If this is done, correspondence with various types of diverse distortions becomes possible, and distortion correction can be performed even when only one part of the paper face appears in the image.
摘要:
In an apparatus for analyzing a layout of a document, a character candidate element generator generates character candidate elements from black pixel linkage components of a document image. A horizontally oriented line rectangle generator sets a plurality of character candidate elements as a line candidate rectangle, among character candidate elements aligned in horizontal line orientation, when each amount of displacement of the set character candidate elements in a vertical orientation with respect to the horizontal line orientation, is smaller than or equal to a threshold value. A horizontally oriented paragraph-box generator sets a plurality of line candidate elements having approximately the same length as each other in the vertical orientation, as a paragraph candidate element.
摘要:
A document layout analysis program capable of extracting an appropriate set of text blocks from a given document image even in the case where the document layout is so complicated that conventional extraction methods with a single extraction condition would not work well. A plurality of different extraction conditions are stored in an extraction condition memory for use in extracting text blocks from a given document image. In accordance with those extraction conditions, a text block extractor extracts a plurality of sets of text blocks from the document image. A text block consolidator produces a consolidated set of text blocks by performing character recognition on each extracted text block, evaluating validity of each text block based on a result of the character recognition, and selecting most valid text blocks from among the plurality of sets of text blocks.
摘要:
This invention provides a correcting device and a correcting method for perspective transformation of document images. The correcting device comprises a horizontal vanishing point determining unit, for detecting a horizontal vanishing point of the perspective transformed document image; a vertical vanishing point determining unit, for detecting a vertical vanishing point of the perspective transformed document image; and a perspective transformation correcting and converting unit, for correcting the perspective transformed document image; wherein the horizontal vanishing point determining unit comprises a direct horizontal line segment detecting unit, an indirect horizontal line segment detecting unit and a horizontal vanishing point detecting unit, and wherein the horizontal vanishing point detecting unit detects a horizontal vanishing point in accordance with a direct horizontal line segment detected by the direct horizontal line segment detecting unit and an indirect horizontal line segment detected by the indirect horizontal line segment detecting unit.
摘要:
A document layout analysis program capable of extracting an appropriate set of text blocks from a given document image even in the case where the document layout is so complicated that conventional extraction methods with a single extraction condition would not work well. A plurality of different extraction conditions are stored in an extraction condition memory for use in extracting text blocks from a given document image. In accordance with those extraction conditions, a text block extractor extracts a plurality of sets of text blocks from the document image. A text block consolidator produces a consolidated set of text blocks by performing character recognition on each extracted text block, evaluating validity of each text block based on a result of the character recognition, and selecting most valid text blocks from among the plurality of sets of text blocks.
摘要:
This invention provides a correcting device and a correcting method for perspective transformation of document images. The correcting device comprises a horizontal vanishing point determining unit, for detecting a horizontal vanishing point of the perspective transformed document image; a vertical vanishing point determining unit, for detecting a vertical vanishing point of the perspective transformed document image; and a perspective transformation correcting and converting unit, for correcting the perspective transformed document image; wherein the horizontal vanishing point determining unit comprises a direct horizontal line segment detecting unit, an indirect horizontal line segment detecting unit and a horizontal vanishing point detecting unit, and wherein the horizontal vanishing point detecting unit detects a horizontal vanishing point in accordance with a direct horizontal line segment detected by the direct horizontal line segment detecting unit and an indirect horizontal line segment detected by the indirect horizontal line segment detecting unit.
摘要:
A program causes a computer to function as a document recognition apparatus, having an extraction unit for extracting connected components of pixels from an input image, a generation unit for generating a reference element that is connected components of pixels extracted by the extraction unit and combined elements obtained by combining the reference element and connected components of pixels adjacent to the reference element as an element to be estimated, a calculation unit for calculating a degree of certainty that indicates how much the element to be estimated generated by the generation unit seems to be a character, and a determination unit for identifying elements that seem to be characters among the elements to be estimated based on the degree of certainty calculated by the calculation unit.