摘要:
A projection set of geodesic lines which are parallel with each other on a curved surface of a paper face is extracted from an image in which a paper face has been imaged by an image-pickup device, using the paper face contents as a clue; and also a projection set of ruling lines which form a ruled surface corresponding to the curved surface of the paper face is extracted from the projection set of geodesic lines. Then, the curved surface of the paper face is estimated from the projection set of the geodesic lines and ruling lines, and distortion of the image is corrected based on this curved surface of the paper face. If this is done, correspondence with various types of diverse distortions becomes possible, and distortion correction can be performed even when only one part of the paper face appears in the image.
摘要:
A program causes a computer to function as a document recognition apparatus, having an extraction unit for extracting connected components of pixels from an input image, a generation unit for generating a reference element that is connected components of pixels extracted by the extraction unit and combined elements obtained by combining the reference element and connected components of pixels adjacent to the reference element as an element to be estimated, a calculation unit for calculating a degree of certainty that indicates how much the element to be estimated generated by the generation unit seems to be a character, and a determination unit for identifying elements that seem to be characters among the elements to be estimated based on the degree of certainty calculated by the calculation unit.
摘要:
An area extraction method including obtaining a character lattice showing a connection relation between unit areas, which are obtained by separating a character string pattern in an image into patterns each recognized as corresponding to a single character, judging whether or not all combinations of each of the unit areas in the obtained character lattice and each of the unit areas in a regular lattice defining a regular connection relation between the unit areas are likely to be established, generating a path coupling between nodes corresponding to the combination of the unit areas which is determined as likely to be established, determining an optimum path from the generated paths based on a degree of coincidence with the regular lattice or the character lattice, and extracting from an image the unit areas in the character lattice corresponding to the determined optimum path.
摘要:
A set of straight lines that associate a top parallel geodesic projection positioned at an upper end with a bottom parallel geodesic projection positioned at a lower end, among sets of parallel geodesic projections, is extracted as a set of ruled-line candidate projections as a search target of a set of ruled line projections. A deviation of neighborhood, which is a distance between a cross ratio vector of the ruled-line candidate projection and a cross ratio vector of a neighboring line obtained by shifting the ruled-line candidate projection by a predetermined interval, is calculated for each ruled-line candidate projection. A set of straight lines having the smallest sum total of deviations of neighborhood, in the set of straight lines, which do not intersect with each other, among the sets of ruled-line projection candidates is extracted as a set of ruled line projections by continuous dynamic programming.
摘要:
A layout analysis program, a layout analysis apparatus, layout analysis method and a medium can highly accurately extract a text block from an image if the image is a color image. The layout analysis program causes a computer to execute a divided region extracting step that extracts a region partitioned by a pattern according to a binary image so as to use the outcome of extraction as divided region, a set of character elements extracting step that extracts a set of the character elements extracted by a first binary image layout analysis process for each extracted divided region so as to use the outcome of extraction as set of character elements, a text block extracting step that extracts a region including the extracted set of character elements in each divided region so as to avoid overlapping the non-character elements extracted by a second binary image layout analysis process and use the outcome of extraction as text block and a layout information generating step that generates layout information according to the text block and the non-character elements extracted by the second binary image layout analysis process.
摘要:
A document layout analysis program capable of extracting an appropriate set of text blocks from a given document image even in the case where the document layout is so complicated that conventional extraction methods with a single extraction condition would not work well. A plurality of different extraction conditions are stored in an extraction condition memory for use in extracting text blocks from a given document image. In accordance with those extraction conditions, a text block extractor extracts a plurality of sets of text blocks from the document image. A text block consolidator produces a consolidated set of text blocks by performing character recognition on each extracted text block, evaluating validity of each text block based on a result of the character recognition, and selecting most valid text blocks from among the plurality of sets of text blocks.
摘要:
According to an aspect of an embodiment, an apparatus for analyzing and determining correlation of information contained in a given form containing blocks, at least one of the blocks containing data indicative of a header, the rest of the blocks containing data in association with header information, comprising: a memory for storing templates having nodes, character data associated with said nodes respectively, and relative position information between said nodes; and a processor for analyzing and determining correlation of the information according to a process comprising: obtaining data contained in said blocks in the given form, determining relative position of said blocks to produce relative position information, analyzing the data obtained from the blocks and the relative position information of the blocks in comparison with the character data and the relative position information of said nodes of said templates, and determining correlation of the data contained in said blocks.
摘要:
This invention provides a correcting device and a correcting method for perspective transformation of document images. The correcting device comprises a horizontal vanishing point determining unit, for detecting a horizontal vanishing point of the perspective transformed document image; a vertical vanishing point determining unit, for detecting a vertical vanishing point of the perspective transformed document image; and a perspective transformation correcting and converting unit, for correcting the perspective transformed document image; wherein the horizontal vanishing point determining unit comprises a direct horizontal line segment detecting unit, an indirect horizontal line segment detecting unit and a horizontal vanishing point detecting unit, and wherein the horizontal vanishing point detecting unit detects a horizontal vanishing point in accordance with a direct horizontal line segment detected by the direct horizontal line segment detecting unit and an indirect horizontal line segment detected by the indirect horizontal line segment detecting unit.
摘要:
A ruled line extracting apparatus, a ruled line extracting program and a ruled line extracting method re-extract a ruled line by changing the predetermined requirements to be met by ruled line s when a ruled line candidate extracted according to the requirements shows a low reliability. A ruled line extracting program that causes a computer to extract a ruled line in an image of a document comprises an extraction step that extracts a ruled line candidate from the image of a document according to the first requirement predefined to be met by the figures of the elements of the ruled lines, a judgment step that judges if the ruled line candidate is stable or unstable according to the structural stability of the ruled line candidate extracted in the extraction step, a requirement determination step that determines the second requirement to be met by the figures of the elements of the ruled line different from the first requirement according to the ruled line candidate judged as stable in the judgment step and the first requirement and a re-extraction step that re-extracts a ruled line candidate according to the second requirement determined in the requirement determination step.
摘要:
A layout analysis program, a layout analysis apparatus, layout analysis method and a medium can highly accurately extract a text block from an image if the image is a color image. The layout analysis program causes a computer to execute a divided region extracting step that extracts a region partitioned by a pattern according to a binary image so as to use the outcome of extraction as divided region, a set of character elements extracting step that extracts a set of the character elements extracted by a first binary image layout analysis process for each extracted divided region so as to use the outcome of extraction as set of character elements, a text block extracting step that extracts a region including the extracted set of character elements in each divided region so as to avoid overlapping the non-character elements extracted by a second binary image layout analysis process and use the outcome of extraction as text block and a layout information generating step that generates layout information according to the text block and the non-character elements extracted by the second binary image layout analysis process.