摘要:
The form data extracting apparatus, even input form does not have a logical structure stored in the generic logical structure DB, by using logical elements in the existing logical structure and a registered form obtained on the basis of (a) the logical structure, (b) pieces of position information of the logical elements, and (c) a relation between the logical elements. A logical element and a logical structure are extracted from the input form, and the extracted logical structure can be defined as a new registered form or a new logical structure.
摘要:
A method for assisting in the creation of a logical structure model, which stores, from an image in which character strings associated respectively with a plurality of logical elements constituting a logical structure are described, the logical elements, character strings associated with the logical elements, and the logical structure, wherein character strings in an input image and the logical structure among the character strings in the input image are extracted, a logical element is selected among the plurality of logical elements according to the degrees of similarity between the extracted character strings and the character string associated respectively with the plurality of logical elements stored in the logical structure model, a character string associated with the selected logical element and a character string in the input image associated with the logical element based on the logical structure among the extracted character strings in the input image are extracted.
摘要:
A form processing program which is capable of automatically extracting keywords. When the image of a scanned form is entered, a layout recognizer extracts a readout region of the form image, a character recognizer recognizes characters within the readout region. A form logical definition database stores form logical definitions defining strings as keywords according to logical structures which are common to forms of same type. A possible string extractor extracts as possible strings combinations of recognized characters each of which satisfies defined relationships of a string. A linking unit links the possible strings according to positional relationships, and determines a combination of possible strings as keywords.
摘要:
A form processing apparatus extracts layout information and character information from a form document. A candidate extracting unit extracts word candidates from the character information. A frequency digitizing unit calculates emission probability of a word candidate from each element. A relation digitizing unit calculates transition probability that relationship between word candidates is established. An evaluating unit calculates an evaluation value indicative of a probability of appearance of word candidates in respective logical elements. A determining unit determines the element and a word candidate thereof as the element and a character string thereof in the form document, based on the evaluation value.
摘要:
A document type identifying apparatus includes in advance a database storing therein keywords used as keys that identify document types in association with each document type. The document type identifying apparatus aligns word strings written on a document and generates partial keyword strings for each keyword by using the keywords stored in the database. The partial keyword strings are to be checked for matching with the word strings written on the document. Then, the document type identifying apparatus checks matching of the grouped and aligned word strings with the partial keyword strings and obtains, for each keyword, each number of matched words with the highest matching rates between the grouped word strings that are successfully matched and the partial keyword strings. Then, each number of matched words is used to calculate each evaluation value to determine the document type.
摘要:
The form data extracting apparatus, even input form does not have a logical structure stored in the generic logical structure DB, by using logical elements in the existing logical structure and a registered form obtained on the basis of (a) the logical structure, (b) pieces of position information of the logical elements, and (c) a relation between the logical elements. A logical element and a logical structure are extracted from the input form, and the extracted logical structure can be defined as a new registered form or a new logical structure.
摘要:
An image recognition apparatus recognizes the correspondence between character strings and logical elements composing a logical structure in an image in which the character strings are described as the logical elements to recognize each logical element. The image recognition apparatus includes outputting means for outputting the recognized logical elements when the correspondence is recognized or re-recognized; first determining means for determining a certain logical element to be correct when input of a determination request to determine the logical element is received from a user; second determining means for determining the correctness of all the logical elements output before the logical element determined by the first determining means and is positioned according to confirmation by the user; and re-recognizing means for re-recognizing the correspondence between logical elements that have not been determined to be correct and the character strings on the basis of the determination content for each logical element.
摘要:
A character recognition apparatus includes a hash table registering unit and a recognition processing unit. The hash table registering unit creates a hash table indicating a characteristic of each of partial character images as an area of each character. The recognition processing unit divides an input image into partial input images, and calculates a characteristic of each partial input image. The recognition processing unit searches the hash table for a partial character image having a characteristic similar to that of each partial input image. The recognition processing unit compares a positional relationship of the partial input images with that of the partial character images to determine whether they match, and recognizes a character in each area of the input image.
摘要:
According to an aspect of an embodiment, an apparatus for analyzing and determining correlation of information contained in a given form containing blocks, at least one of the blocks containing data indicative of a header, the rest of the blocks containing data in association with header information, comprising: a memory for storing templates having nodes, character data associated with said nodes respectively, and relative position information between said nodes; and a processor for analyzing and determining correlation of the information according to a process comprising: obtaining data contained in said blocks in the given form, determining relative position of said blocks to produce relative position information, analyzing the data obtained from the blocks and the relative position information of the blocks in comparison with the character data and the relative position information of said nodes of said templates, and determining correlation of the data contained in said blocks.
摘要:
A three-dimensional curved-surface model can be estimated by using both two-dimensional outlines obtained from a piece of image photographed from the top and a restriction that the paper is rectangular. Then, only the three-dimensional distortion in the image can be corrected based on the obtained three-dimensional curved-surface model.