摘要:
Forms such as business forms used in banks and post offices are automatically classified using a form search apparatus and method. The method of classifying forms comprises extracting features from the image data of the input form and comparing the extracted features with stored features of a set of template forms corresponding to a set of known classifications of forms. The comparing step compares extracted features which comprise attributes of tables contained in the template forms and the input form respectively. The attributes of tables may be the number of tables in the form, or the number of cells comprising the tables. An approximate matching step is used to reduce the number of candidate template forms.
摘要:
Forms such as business forms used in banks and post offices are automatically classified using a form search apparatus and method. The method of classifying forms comprises extracting features from the image data of the input form and comparing the extracted features with stored features of a set of template forms corresponding to a set of known classifications of forms. The comparing step compares extracted features which comprise attributes of tables contained in the template forms and the input form respectively. The attributes of tables may be the number of tables in the form, or the number of cells comprising the tables. An approximate matching step is used to reduce the number of candidate template forms.
摘要:
Forms such as business forms used in banks and post offices are automatically classified using a form search apparatus and method. The method of classifying forms comprises extracting features from the image data of the input form and comparing the extracted features with stored features of a set of template forms corresponding to a set of known classifications of forms. The comparing step compares extracted features which comprise attributes of tables contained in the template forms and the input form respectively. The attributes of tables may be the number of tables in the form, or the number of cells comprising the tables. An approximate matching step is used to reduce the number of candidate template forms.
摘要:
Character code data and vector drawing data are both listed and provided in a re-editable manner. Electronic data is generated in which information obtained by vectorizing character areas in an image and information obtained by recognizing characters in the image are stored in respective storage locations. As for the electronic data generated in this manner, because character code data and vector drawing data generated from the input image are both presented by a display and edit program, a user can immediately utilize the both data.
摘要:
This invention provides the following environment. That is, an original document file corresponding to a document to be copied is specified from image data of that document to be copied, and a print process is made based on the specified file so as to prevent deterioration of image quality. Also, when a document to be copied is not registered, a registration process is executed to suppress deterioration of image quality in an early stage. Furthermore, since the document is converted into vector data, re-use of such document is facilitated, and deterioration of image quality can be suppressed even when an image process such as enlargement or the like is made. To this end, when an original digital file cannot be specified, an apparatus of this embodiment executes a vectorization process (S54), converts the obtained vector data into a data format that can be re-used by an application (S55), and registers the converted file in a file server (S56). With this registration process, since the location of the file is settled, that location information is composited on an image to be scanned using an identifier such as a two-dimensional barcode or the like (S48), and the composite image can be printed (S49). Even when the printed document is scanned again, a registered digital file can be easily specified.
摘要:
This invention generates a digital document by applying character recognition to character images in a document image, and rendering the character recognition result on the document image in a transparent color. This digital document allows to specify a part corresponding to a search keyword on the document image upon conducting a search. When this digital document is generated, it includes a description required to use glyph data (font data) of a simple character shape commonly to a plurality of character types as font data used upon rendering the character recognition result. Therefore, even when the digital document needs to save font data, an increase in file size can be minimized. Also, by rendering using a simple character shape, the data size of the font data itself can be reduced.
摘要:
Character code data and vector drawing data are both listed and provided in a re-editable manner. Electronic data is generated in which information obtained by vectorizing character areas in an image and information obtained by recognizing characters in the image are stored in respective storage locations. As for the electronic data generated in this manner, because character code data and vector drawing data generated from the input image are both presented by a display and edit program, a user can immediately utilize the both data.
摘要:
The invention significantly improves operability by automatically discriminating a plurality of image orientations, which are not assured of always being fed in common orientations, and reduces possible burdens to operators by eliminating efforts required to arrange the images in a common orientation before feeding or to correct each orientations into a common orientation after feeding. The invention improves the operability also by enabling modes in which orientation discrimination as well as tilt corrections can be performed before operator's instructions, if the Auto mode has been specified for the orientation recognition function. The invention also improves accuracy of processing by determining whether orientations or tilt recognition is proper and providing the result to the operators.
摘要:
Stored digital data is searched for on the basis of an input image, difference information is extracted by comparing the retrieved digital data and the input image, and the difference information is composited to the digital data. Digital data generated by composition is stored. When no digital data is retrieved, the input image is converted into vector data, and the image that has been converted into the vector data is stored as digital data. Obtained region segmentation information and an input image are composited, the composite image is displayed on an operation screen of an MFP, and a rectangular block to be vectorized is designated as a specific region from the displayed region segmentation information. A user designates the specific region by designating rectangular blocks in an image using a pointing device.
摘要:
This invention generates a digital document by applying character recognition to character images in a document image, and rendering the character recognition result on the document image in a transparent color. This digital document allows to specify a part corresponding to a search keyword on the document image upon conducting a search. When this digital document is generated, it includes a description required to use glyph data (font data) of a simple character shape commonly to a plurality of character types as font data used upon rendering the character recognition result. Therefore, even when the digital document needs to save font data, an increase in file size can be minimized. Also, by rendering using a simple character shape, the data size of the font data itself can be reduced.