摘要:
When the content of a paper document is aiming to be recognized in an apparatus that has a scanner, an image of the paper document is subjected to region segmentation processing immediately after the paper document is read, and a recognition operation to be performed on each segmented region is specified by an operator on the basis of the result of the region segmentation processing. Therefore, a recognition method to be performed on each recognition target item can be selected from among recognition by using a recognition service server, recognition by downloading a program module from a program server, and recognition by using a locally-stored program module. As a result, recognition processing can be performed more efficiently.
摘要:
When a region obtained by predetermined region division processing is a character image, the image is converted at one of predetermined compression ratios (S505). Data in a certain information amount that the region has as pixel data after image conversion is subjected to character recognition, and the degree of similarity (score) to character types registered in advance is calculated (S506). If the calculated score is equal to or smaller than a predetermined threshold value, the information amount is smaller than a minimum information amount necessary for reconstruction of the character image contained in the region. The character image is converted at a compression ratio lower than that in S505 by one step (S507-S509).
摘要:
According to the present invention, it is possible to create electronic document data capable of highlighting an object detected through a search so that a user can easily recognize it. An image processing apparatus extracts an object from an input image and extracts metadata related to the object. The image processing apparatus, when determines to describe with a shape in accordance with the shape of the object, creates a vector path description of frame described with a shape in accordance with the shape of the object. Then, the image processing apparatus creates an electronic document including data of the input image and the vector path description of frame with which the metadata is associated. When a keyword search is performed on the created electronic document, highlight display is performed in accordance with the vector path description of frame with which metadata that matches the keyword is associated.
摘要:
An image processing apparatus extracts an object area (e.g., character, picture, line drawing, and table) from an input image and acquires a metadata to be associated with the object. The image processing apparatus generates a transparent graphics description for an object area having an attribute that requires generation of the transparent graphics description, and generates an electronic document while associating the transparent graphics description with the metadata. As transparent graphics description, an arbitrary shape of graphics can be used. Accordingly, the image processing apparatus can generate electronic document data suitable for a highlight expression, which is easy for users to recognize in a search operation using a keyword to search an object included in an electronic document.
摘要:
An apparatus comprises: unit configured to divide input document data into a body region, a caption region, and an object region; unit configured to acquire text information included in each of the body region and the caption region; unit configured to search the text information in the body region for an anchor term, to extract an anchor term from the text information in the caption region, and to generate a bi-directional link between a portion corresponding to the anchor term in the body region and a portion of the object region to which the caption region is appended; and unit configured to convert the input document data into digital document data in which the portion corresponding to the anchor term in the body region and the portion corresponding to the object region to which the caption region is appended are bi-directionally linked based on the link.
摘要:
An image processing apparatus segments an image into a plurality of regions in accordance with attributes of a plurality of types, and acquires feature amount data from image information of a region of a first attribute (an image region) from among the plurality of regions. The apparatus then applies compression processing to the image and acquires compressed data. The apparatus outputs the acquired feature amount data and compressed data as output data of the image.
摘要:
Even when captions of a plurality of objects use an identical anchor expression, the present invention can associate an appropriately explanatory text in a body text as metadata with the objects.
摘要:
An image processing apparatus includes a character recognition unit configured to perform character recognition on a plurality of character images in a document image to acquire a character code corresponding to each character image, and a generation unit configured to generate an electronic document, wherein the electronic document includes the document image, a plurality of character codes acquired by the character recognition unit, a plurality of glyphs, and data which indicates the glyphs to be used to render each of the character codes, wherein each of the plurality of glyphs is shared and used by different character codes based on the data when rendering characters that correspond to the plurality of character codes acquired by the recognition unit.
摘要:
An object of the present invention is to achieve both of high compressibility and high image quality property of an electronic file to improve user friendliness of the electronic file.
摘要:
In retrieval of a registered image that resembles an input image, retrieval is performed accurately in a short period of time irrespective of orientation of the input image. Specifically, there is disclosed an information processing method for retrieving image data, which has a high degree of similarity to input image data, from registered image data, the method includes an area identification step (S402) of identifying a text area and a non-text area in the input image data; a direction identification step (S404) of recognizing text in the identified text area and identifying orientation of the input image data based upon orientation of the text recognized; a rotation step (S406) of rotating the identified input image data to a prescribed orientation based upon the orientation identified; and a retrieval step (S409) of retrieving image data, which has a high degree of similarity to the input image data after the rotation thereof, from the registered image data.