摘要:
An image processing method and apparatus which efficiently detect an image input direction, from an image with much differential information to an image with little differential information. The direction of an image including a character area, inputted into a computer, is detected. First, a binary image of the input image is generated, and a tile image is generated by adding a predetermined value to respective tiles, each tile corresponding to a predetermined size area in the binary image. Next, an area of the binary image, corresponding to an area within a circumscribed rectangle surrounding connected pixels having the same value in the tile image, is extracted as a character area. Then, the direction of characters included in the character area is recognized and thereby the direction of the input image is detected.
摘要:
Character code data and vector drawing data are both listed and provided in a re-editable manner. Electronic data is generated in which information obtained by vectorizing character areas in an image and information obtained by recognizing characters in the image are stored in respective storage locations. As for the electronic data generated in this manner, because character code data and vector drawing data generated from the input image are both presented by a display and edit program, a user can immediately utilize the both data.
摘要:
In an electronic document of drawing descriptions of a page image and a character, it is desired that although a font data necessary for drawing the character is held in the electronic document, the size of the electronic document is minimized. Furthermore, it is desired to ensure visibility at the time of highlighting of search. There is generated an electronic document in which a document image, a plurality of character codes obtained by executing a character recognition processing with respect to the document image, and a plurality of kinds of glyph data to be utilized in common with respect to the plurality of character codes when drawing characters corresponding to the plurality of character codes are stored. The plurality of kinds of glyph data are selectively used when characters corresponding to the character codes are drawn. It is desirable that the glyph data be the one in a simple form.
摘要:
Even when captions of a plurality of objects use an identical anchor expression, the present invention can associate an appropriately explanatory text in a body text as metadata with the objects.
摘要:
An image processing apparatus successively designates each page of an input page image as a processing target, detects an anchor expression constituted by a specific character string, and associates a highlight position corresponding to the anchor expression with a link identifier. When the anchor expression and the link identifier are registered in a link configuration management table, if the same anchor expression is already registered in the table, the apparatus updates the table in such a way as to mutually associate the link identifiers of the same anchor expression. The apparatus generates page data of an electronic document based on a link identifier relating to a processing target page image and its highlight position and transmits the generated page data. The apparatus generates information usable to link the relevant link identifiers based on the link configuration management table, after completing the processing for all pages, and transmits the generated information.
摘要:
An image processing apparatus segments an image into a plurality of regions in accordance with attributes of a plurality of types, and acquires feature amount data from image information of a region of a first attribute (an image region) from among the plurality of regions. The apparatus then applies compression processing to the image and acquires compressed data. The apparatus outputs the acquired feature amount data and compressed data as output data of the image.
摘要:
This invention provides the following environment. That is, an original document file corresponding to a document to be copied is specified from image data of that document to be copied, and a print process is made based on the specified file so as to prevent deterioration of image quality. Also, when a document to be copied is not registered, a registration process is executed to suppress deterioration of image quality in an early stage. Furthermore, since the document is converted into vector data, re-use of such document is facilitated, and deterioration of image quality can be suppressed even when an image process such as enlargement or the like is made. To this end, when an original digital file cannot be specified, an apparatus of this embodiment executes a vectorization process (S54), converts the obtained vector data into a data format that can be re-used by an application (S55), and registers the converted file in a file server (S56). With this registration process, since the location of the file is settled, that location information is composited on an image to be scanned using an identifier such as a two-dimensional barcode or the like (S48), and the composite image can be printed (S49). Even when the printed document is scanned again, a registered digital file can be easily specified.
摘要:
The area and position of each cell in a table are analyzed on the basis of the layout state of ruled lines and character strings contained in the image of the table to obtain a table structure. The obtained table structure is displayed. When a user instructs to correct the area of a cell in the displayed table structure, the area and position of the cell are corrected on the basis of the correction instruction to obtain a corrected table structure. After correction of the table structure, characters in each cell of the corrected table structure are recognized, and table format data is generated and output on the basis of the recognition result and table structure.
摘要:
This invention generates a digital document by applying character recognition to character images in a document image, and rendering the character recognition result on the document image in a transparent color. This digital document allows to specify a part corresponding to a search keyword on the document image upon conducting a search. When this digital document is generated, it includes a description required to use glyph data (font data) of a simple character shape commonly to a plurality of character types as font data used upon rendering the character recognition result. Therefore, even when the digital document needs to save font data, an increase in file size can be minimized. Also, by rendering using a simple character shape, the data size of the font data itself can be reduced.
摘要:
A binary image is generated by binarizing a multilevel image. An edge image is generated by extracting an edge component in the multilevel image. The binary image is segmented into a plurality of regions with different attributes. An outline candidate of a halftone region is extracted from the edge image. A second region segmentation result is output on the basis of the information of the outline candidate and information of the region segmentation result.