摘要:
Collation can be carried out at a high speed without reducing the accuracy thereof, whereby processing of identifying documents is accurately performed at a high speed. To this end, characteristic data of a predetermined pattern is registered in advance. Characteristic data in a first area greater than an area of the predetermined pattern registered in advance in an image to be identified is compared and collated with characteristic data of the predetermined pattern. A second area smaller than the first area is cut out from the first area based on the result of comparison, so that characteristic data of an image in the second area is compared and collated with the characteristic data of the predetermined pattern, thus making it possible to identify the predetermined pattern contained in the image based on the result of comparison.
摘要:
A method for identifying payment forms accurately identifies types of forms without adding special form identification data. A payment form discrimination method for discriminating payment forms which state a payee account number and a payment amount includes a step of acquiring an image of the form, a step of making a search for the payee account number in the image in accordance with an account number searching rule, and a step of discriminating the type of form based on the searched payee account number. Types of forms can be identified accurately and fast without adding special form identification data, because the form identification is performed using easily searchable account numbers.
摘要:
A data sheet identification device of the invention includes: a character/graphics extracting section, an identical shape deciding section, a graphics collating section, an identification code/data sheet ID identifying section for collating characters that have been decided to have the same shape with an identification code/data sheet ID database in which a plurality of characters showing features of a plurality of data sheets respectively have been registered, and an identifying section for uniquely identifying the data sheet based on a result of the collation by the graphics collating section and a result of the collation by the identification code/data sheet ID identifying section.
摘要:
Disclosed is a format recognition method, apparatus and its storage medium for automatically recognizing the format of a form, whereby the format is automatically determined by examining the arrangement of the smallest rectangles. According to the present invention, the smallest rectangles are extracted from a form, and the positional relationship of these rectangles is obtained. The attribute of the smallest rectangle is determined from the positional relationship. In accordance with the attribute, the smallest rectangles are sorted into a headline portion and a data portion, and a character string in the data portion is recognized.
摘要:
Disclosed are a method of and an apparatus for extracting a dotted line from an binary image of a document, and a storage medium thereof. The isolated points are extracted from the binary image. The isolated points configuring a candidate of the dotted line are extracted based on a positional relationship between the extracted isolated points. A validity of the isolated points configuring the candidate of the dotted line is checked. The dotted line from a positional relationship between groups of the extracted isolated points of the candidate of the dotted line. The dotted line can be thereby precisely extracted even if some isolated points are lost due to an. under-density of the image etc.
摘要:
One system to which the present invention is applied obtains the digitized form image of a form, recognizes a character string existing in the obtained form image, extracts a headline wording being a predetermined character string from the recognized character strings, determines a table structure existing in the form image, on the basis of the extracted headline wording and the arrangement of headline wordings in the form image and specifies a correspondence relationship between a headline wording and a character string other than the headline wording that is recognized, using the determination result.
摘要:
The present invention firstly roughly classifies an analysis range specified by the operator in the color image data of a form into background, a character frame and a character, precisely specifies a character frame on the basis of the classification result, eliminates the character from the color image data from which the background is eliminated and recognizes the remaining character.
摘要:
One system to which the present invention is applied obtains the digitized form image of a form, recognizes a character string existing in the obtained form image, extracts a headline wording being a predetermined character string from the recognized character strings, determines a table structure existing in the form image, on the basis of the extracted headline wording and the arrangement of headline wordings in the form image and specifies a correspondence relationship between a headline wording and a character string other than the headline wording that is recognized, using the determination result.
摘要:
The present invention firstly roughly classifies an analysis range specified by the operator in the color image data of a form into background, a character frame and a character, precisely specifies a character frame on the basis of the classification result, eliminates the character from the color image data from which the background is eliminated and recognizes the remaining character.
摘要:
An image processing device includes a feature emphasis unit for extracting first image frequency information from image data for each first unit area, a boundary provisional determination unit for defining a value obtained by adding a predetermined weight to the first image frequency information as representative feature information, and provisionally determining as a boundary a first unit area whose variance from the representative feature information of the adjacent area is at or higher than a predetermined level, and a boundary determination unit for extracting the second image frequency information for each second unit area smaller than the first unit area in the range of a provisionally determined position and the vicinity, and generating boundary information using as the boundary a second unit area whose value based on the variance from the second image frequency information of the adjacent area is at or higher than a predetermined level.