Abstract:
Systems and methods are described that facilitate determining an original document format for a scanned document by analyzing a bitmap thereof. Text objects are extracted from the document, binarized, and segmented to identify text. Page orientation and text size are used to distinguish between a slideshow-type document and a word processing or spreadsheet-type document. To further distinguish between the word processing and spreadsheet types, text column structure and count are analyzed.
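The two-stage decision described above can be sketched as a small rule-based classifier. This is only an illustration: the function name, the 18-point text-size threshold, and the three-column limit are assumptions, and the abstract does not say which orientation or text size indicates which format.

```python
def classify_format(page_width, page_height, median_text_pt, column_count,
                    large_text_pt=18.0, max_prose_columns=3):
    """Guess the original document type from features measured on the bitmap.

    All thresholds are hypothetical defaults chosen for illustration.
    """
    # Stage 1: orientation and text size separate slideshows from the rest.
    # Assumption: slides are landscape pages set in large type.
    landscape = page_width > page_height
    if landscape and median_text_pt >= large_text_pt:
        return "slideshow"

    # Stage 2: column count separates spreadsheets from word processing.
    # Assumption: more text columns than typical prose layouts use
    # suggests a grid of spreadsheet cells.
    if column_count > max_prose_columns:
        return "spreadsheet"
    return "word_processing"
```

For example, `classify_format(11, 8.5, 24, 1)` falls into the first branch, while a portrait page with eight narrow text columns falls into the second.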
Abstract:
In preparation for rendering respective portions of a document via a respective plurality of engines, objects within the document are identified and characterized. A determination is made as to whether gamut variations between the engines might result in objectionable variations in the appearance of rendered versions of identified objects having similar characteristics. For those objects within the document for which the determination is made that variations might be objectionable, a target gamut is selected to be an intersection gamut of the engines to be used to render the document. For those objects within the document for which the determination is made that variations would be unobjectionable, the target gamut is selected to be that of selected individual engines. A system for selecting target gamuts for objects within a document can include an object identifier, a characteristic identifier and a gamut selector.
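The gamut selection rule can be illustrated with a minimal sketch in which each engine's gamut is modeled as a set of reproducible color labels. That representation, the function name, and the engine names are assumptions made for brevity; a real gamut would be a volume in a color space.

```python
def select_target_gamut(variation_objectionable, engine_gamuts, engine_name):
    """Pick the target gamut for one object.

    engine_gamuts: dict mapping engine name -> set of reproducible colors
    (a simplified, hypothetical stand-in for a real gamut description).
    """
    if variation_objectionable:
        # Use only colors every engine can render, so similar objects
        # look the same no matter which engine prints them.
        return set.intersection(*map(set, engine_gamuts.values()))
    # Otherwise the object may use the full gamut of its own engine.
    return set(engine_gamuts[engine_name])
```

The intersection gamut trades some color range for consistency, which is why the abstract reserves it for objects where a mismatch would be noticeable.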
Abstract:
A method for enhancing color fidelity in multi-reproduction includes scanning an image to be reproduced, wherein the image contains an invisible digital watermark including color information; decoding the color information contained in the watermark; comparing the decoded color information with the scanned image; generating a correction table from the differences between the decoded color information and the scanned image; and performing color correction on the scanned image using the correction table. This method confines the color error to one generation, even when copies go through multiple generations of reproduction.
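One way to picture the correction table is as a per-channel lookup table built from pairs of scanned and watermark-decoded values. The sketch below is an assumption-laden illustration: the abstract does not specify the table's form, and the piecewise-linear interpolation, the function names, and the single 8-bit channel are all choices made here.

```python
def build_correction_table(scanned, decoded, levels=256):
    """Build a lookup table mapping scanned values to intended values.

    scanned: values measured in the scan at reference locations.
    decoded: intended values recovered from the watermark.
    Assumes the scanned values are distinct (hypothetical input format).
    """
    pairs = sorted(zip(scanned, decoded))
    table = []
    for v in range(levels):
        if v <= pairs[0][0]:
            table.append(pairs[0][1])        # clamp below the first sample
        elif v >= pairs[-1][0]:
            table.append(pairs[-1][1])       # clamp above the last sample
        else:
            # Interpolate linearly between the two surrounding samples.
            for (x0, y0), (x1, y1) in zip(pairs, pairs[1:]):
                if x0 <= v <= x1:
                    t = (v - x0) / (x1 - x0)
                    table.append(round(y0 + t * (y1 - y0)))
                    break
    return table


def apply_correction(pixels, table):
    """Correct one channel of scanned pixel values with the table."""
    return [table[p] for p in pixels]
```

Because the table is rebuilt from the watermark at every scan, each copy is corrected back toward the original values rather than toward the previous copy.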
Abstract:
A method and system are provided for generating a variable data differential line pattern font. A periodic line pattern suitable for tessellation disposition within a printed document is formed, and a portion of the periodic line pattern is selectively distorted in a predetermined manner, wherein the distorting generates a distinguishable font corresponding to the distortion. A plurality of different distinguishable fonts are formed by a corresponding plurality of distorted line patterns, respectively.
Abstract:
A method for removal of punched hole artifacts in digital images includes, for a scanned document page, deriving an original digital image that defines the page in terms of a plurality of input pixels. A reduced resolution bitonal image is generated from the original image. The method further includes identifying candidate punched hole artifacts in the reduced resolution bitonal image and testing the candidate punched hole artifacts for at least one of shape, size, and location. Where a candidate punched hole artifact meets the at least one test, the method includes generating a modified image, which includes erasing the candidate punched hole artifact from the original digital image.
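The candidate tests and the erase step might look like the sketch below. It assumes candidates have already been found as connected components in the reduced-resolution bitonal image, each with a bounding box and a pixel area. The function names, the specific thresholds, and the choice to require all three tests are assumptions, since the abstract requires only at least one of them.

```python
import math


def looks_like_punch_hole(x, y, w, h, area, page_w, page_h,
                          min_d=8, max_d=20, margin_frac=0.12):
    """Test one candidate (bounding box x, y, w, h and pixel area).

    Coordinates are in reduced-resolution pixels; thresholds are
    hypothetical and would need tuning for a real scanner.
    """
    # Size: diameter within the expected range for a punched hole.
    if not (min_d <= w <= max_d and min_d <= h <= max_d):
        return False
    # Shape: roughly square box filled like a disc (area ratio near pi/4).
    aspect_ok = 0.8 <= w / h <= 1.25
    round_ok = abs(area / (w * h) - math.pi / 4) < 0.1
    # Location: entirely inside a margin strip along some page edge.
    mx, my = margin_frac * page_w, margin_frac * page_h
    near_edge = (x + w <= mx or x >= page_w - mx or
                 y + h <= my or y >= page_h - my)
    return aspect_ok and round_ok and near_edge


def erase_candidate(image, x, y, w, h, scale, background=255):
    """Paint the candidate's box with background in the full-size image.

    image: list of rows of pixel values; scale: original / reduced resolution.
    """
    for r in range(y * scale, (y + h) * scale):
        for c in range(x * scale, (x + w) * scale):
            image[r][c] = background
```

Testing on the reduced image keeps the search cheap, while the erase is applied to the original so the output keeps its full resolution.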
Abstract:
A method for run-time streak removal from a scanned image includes providing a scan line of image data from the scanned image; detecting corrupted data within the scan line; evaluating image data located in a neighborhood before and after the corrupted data on the scan line; if the evaluated image data in the neighborhood is smooth, replacing the corrupted data with image data determined by a linear interpolation process; and otherwise, if the evaluated image data in the neighborhood is not smooth, replacing the corrupted data with image data determined by a linear prediction process. Various techniques can be used to evaluate the image data located in the surrounding neighborhood. For example, a filter selection step may be used based on prediction discrepancies.
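The branch between interpolation and prediction can be sketched for a single scan line as follows. The smoothness test (adjacent differences under a threshold) and the fixed two-tap predictor are assumptions chosen for illustration. The abstract does not define either, and it mentions a filter selection step based on prediction discrepancies that is not modeled here.

```python
def repair_scan_line(line, start, end, window=3, smooth_thresh=8):
    """Replace corrupted pixels line[start:end] on one 8-bit scan line.

    window and smooth_thresh are hypothetical tuning parameters.
    """
    left = line[max(0, start - window):start]
    right = line[end:end + window]

    def is_smooth(seg):
        # Assumed smoothness test: neighbors differ by a small amount.
        return all(abs(a - b) <= smooth_thresh for a, b in zip(seg, seg[1:]))

    out = list(line)
    if left and right and is_smooth(left) and is_smooth(right):
        # Smooth neighborhood: interpolate linearly across the gap.
        a, b = left[-1], right[0]
        n = end - start + 1
        for i in range(start, end):
            t = (i - start + 1) / n
            out[i] = round(a + t * (b - a))
    else:
        # Non-smooth neighborhood: predict each pixel from the two
        # preceding ones (a fixed linear predictor, used as a stand-in).
        for i in range(start, end):
            p1 = out[i - 1] if i >= 1 else 0
            p2 = out[i - 2] if i >= 2 else p1
            out[i] = max(0, min(255, 2 * p1 - p2))
    return out
```

Interpolation uses data on both sides of the streak, so it suits flat regions. Prediction relies only on what precedes the streak, which avoids blending across an edge or texture.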
Abstract:
A method for editing image data includes segmenting input image data into a plurality of discrete objects, wherein each of the objects is defined by a plurality of input pixels that are spatially grouped and that relate to a common content type and feature of the input image data so as to define an objectized input image from the input image data. The objectized input image and a holding area image are generated and simultaneously displayed. Editing input is received from a user by user selection of an object of the objectized input image that the user desires to be moved from the objectized input image to the holding area image based upon the user's visual inspection of the objectized input image. The objectized input image and the holding area image are updated based upon the received editing input so that the selected object is deleted from an original location in the objectized input image and inserted into the holding area image as a temporary object at an insertion location that spatially corresponds to the original location of the objectized input image. The method further includes receiving replacement input data from the user that indicates a selected replacement object in a replacement object database to be inserted into the original location of the objectized input image. The objectized input image is updated to include the selected replacement object in the original location to define an objectized output image.
Abstract:
The present invention is a method for image segmentation to produce a mixed raster content (MRC) image with constant foreground layers. The invention extracts uniform text and other uniform color objects that carry detail information. The method includes four primary steps. First, the objects are extracted from the image. Next, the objects are tested for color consistency and other features to decide if they should be chosen for coding to the MRC foreground layers. The objects that are chosen are then clustered in color space. The image is finally segmented such that each foreground layer codes the objects from the same color cluster.
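The last three of the four steps (test, cluster, assign to layers) can be sketched as below, assuming object extraction has already produced a mapping from object names to their RGB pixels. The per-channel spread test, the greedy clustering against each layer's first color, and all thresholds are assumptions, and the other features the abstract mentions for the selection test are not modeled.

```python
import math


def color_consistent(pixels, max_spread=20):
    """Assumed consistency test: each channel varies by at most max_spread."""
    return all(max(ch) - min(ch) <= max_spread for ch in zip(*pixels))


def mean_color(pixels):
    n = len(pixels)
    return tuple(sum(ch) / n for ch in zip(*pixels))


def assign_foreground_layers(objects, max_dist=40):
    """Group uniform-color objects into constant-color foreground layers.

    objects: dict mapping object name -> list of (r, g, b) pixels
    (a hypothetical output format for the extraction step).
    """
    layers = []
    for name, pixels in objects.items():
        if not color_consistent(pixels):
            continue  # not uniform; left for the other MRC layers
        color = mean_color(pixels)
        for layer in layers:
            # Greedy clustering: join the first layer whose color is close.
            if math.dist(color, layer["color"]) <= max_dist:
                layer["objects"].append(name)
                break
        else:
            layers.append({"color": color, "objects": [name]})
    return layers
```

Each resulting layer has a single color, so it can be coded as a binary mask plus one color value.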
Abstract:
A method of indexing images contained in scanned documents, wherein said scanned documents are stored in a repository, includes: for each document to be stored in the repository, dividing the document into a plurality of sections; scanning the plurality of sections; segmenting each scanned section according to a predetermined coding model into image segments and non-image segments; associating each of the image segments with the document; and generating an index correlating the image segments with the document. The method may further include, at the time of image recall, displaying the index of image segments in a user interface; and responsive to selection of an image segment from the index, displaying the document information associated with the image segment in the user interface.
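The index itself can be pictured as a flat list of entries that tie each image segment back to its source document. In this sketch the segmentation step is assumed to have already labeled every segment as "image" or "text", and the repository layout, field names, and function names are hypothetical.

```python
def build_image_index(repository):
    """Build an index correlating image segments with their documents.

    repository: dict mapping document id -> list of sections, where each
    section is a list of (kind, content) segments (assumed input format).
    """
    index = []
    for doc_id, sections in repository.items():
        for section_no, segments in enumerate(sections):
            for kind, content in segments:
                if kind == "image":
                    index.append({"image": content,
                                  "document": doc_id,
                                  "section": section_no})
    return index


def recall_document(index, position):
    """Return the document associated with the selected index entry."""
    return index[position]["document"]
```

A user interface would display the entries of `index` and call `recall_document` with the position of the entry the user selects.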
Abstract:
In accordance with one embodiment, apparatus are provided, which include a digital continuous-tone two-dimensional authentic image and an image processor. Tamper message data is provided which represents a tamper message when viewed. Two halftoning screens are provided to be applied to at least a portion of the continuous-tone two-dimensional authentic image, to embed the tamper message data within a portion of the authentic image, in a manner so as to be substantially not visible in a printed or displayed version of the authentic image absent image processing or tampering of the authentic image. The screens include a first screen to apply first elements arranged in a first way, and a second screen to apply second elements arranged in a second way. A halftoner applies the two halftoning screens to visibly portray desired information of the continuous-tone authentic image. The first screen is applied in a limited area of the authentic image and in a form defined by the tamper message data. The second screen is applied in an area abutting the limited area.