Abstract:
The present disclosure discloses methods and systems for adding one or more annotations from a printed version of a document to a digital version of the document. The methods and systems include receiving the printed document with one or more annotations, which represent review comments of a reviewer. The printed document including one or more annotations is scanned to obtain a scanned document. Thereafter, the scanned document is compared with the original digital version of the document to identify the one or more annotations. The identified one or more annotations are then extracted and added to the digital version of the document to obtain a new digital version, which can be used for changes by the user or any other user.
Abstract:
The present disclosure discloses methods and systems for adjusting text colors in scanned documents. The method includes receiving a document for scanning from a user. Then, the document is scanned to generate scanned data. The scanned data is segmented into an image layer and one or more text layers, wherein the one or more text layers include textual content. Thereafter, the text color of the textual content in each text layer is identified. Then, the identified text color of the textual content in the text layer is compared with one or more pre-defined colors. Based on comparison, the text color of the textual content in each text layer is adjusted to match with the one or more pre-defined colors thereby generates a modified text layer. Finally, the modified text layer and the image layer are combined to create a final scanned document.
Abstract:
A computer-implemented method and system for processing a document page is based on text and non-text detections in an image path. An object block is generated for each image content identified within an input page. the blocks are extracted from the input document for downstream processing. Then, the object blocks are classified based on whether the content is text or non-text. The extracted non-text objects—which are specified by their bounding boxes—are processed separately from the rest of the content. The processing is performed on the text content blocks in parallel. However, when error diffusion is employed, the error diffusion is performed sequentially on the non-text content blocks to avoid stitching artifacts between individually error diffused patches in non-text areas.
Abstract:
Methods and systems for generating a mixed raster content (MRC) format file are disclosed. The method includes receiving an input document. The input document is segmented into an image layer and a text layer with a color mask information. A determination is made whether a text group in the text layer has a color different than the color mask information. Upon an affirmative determination, an ascertainment is made whether the color mask is above a threshold value. An average of the color mask is calculated and based on the calculation, a new color is applied to the text group to modify the text layer. Then, the modified text layer is compressed using a first compression scheme, the image layer is compressed using a second compression scheme, the compressed the modified text layer and the image layer are integrated to generate an image data in an MRC file format.
Abstract:
Systems, methods, and devices are configured to print and reuse customized sample sets while printing documents. They include receiving instructions to retrieve an electronic document and processing the electronic document in a sample print mode. The electronic document are arranged in a plurality of page with each page containing readable information. They further include identifying a subset of pages of the electronic document to print in a first phase of the sample print mode and printing the subset of pages on a physical readable media in the sample print mode. They additionally include displaying a message on a user interface after completion of the first phase of the sample print mode and printing remaining pages of the electronic document on a physical readable media in response to user input based on the displayed message.
Abstract:
A document printing system may include one or more print devices, each is capable of detecting a portable memory device, and also detecting a document file and associated configuration file stored in the portable memory device, where the configuration file may contain information about an interruption of a print job associated with the document file. The interruption may have occurred previously on any of the print devices. Each print device may extract and use the information from the configuration file, to automatically resume the previously interrupted print job. Each print device may also be capable of detecting an interruption of a current print job and recording information about the interruption in the configuration file stored in the portable memory device. In some embodiments, the information about the interruption of the print job may include a page number, cause of the interruption and/or job attributes of the interrupted print job.
Abstract:
An electronic image in a first color space, used by image acquisition devices, is obtained and converted into a second color space with a larger gamut, to produce a second color space image. Corrections are made to the second color space image to produce a corrected image, without performing background suppression. The corrected image is converted into a third color space, used by printing devices, to produce a third color space image, and this process of converting the corrected image into the third color space includes performing background suppression. This process of converting the corrected image into the third color space samples nodes of the second color space values, classifies the nodes as background nodes or non-background nodes, multiplies the second color space values of the background nodes by a percentage less than 100%, and after multiplying, interpolates values between the nodes.
Abstract:
Embodiments of the present disclosure disclose methods and systems for halftone frequency detection in a halftone image. The method includes receiving a first window, a second window, and the halftone image having a predetermined resolution; assigning an active value to a max variable and a min variable associated with each pixel within a first pixel neighborhood defined by the first window based on a pixel intensity value being compared with a three-way threshold set determined based on values a plurality of predetermined statistical parameters; estimating a first frequency estimate and the second frequency estimate based on a number of pixels having active values in the max variable and the min variable respectively within a second pixel neighborhood defined by the second window; and determining the halftone frequency based on the first frequency estimate value, the second frequency estimate value, and the predetermined resolution.
Abstract:
In systems and methods, an original image was previously converted to a lower bit count per pixel. The resulting image included pixel error based on the lower bit count per pixel. The conversion used an error weighting matrix (which diffuses the pixel error of each pixel to adjacent pixels). Methods and systems produce an output image from the lower bit count per pixel image using the error weighting matrix to determine, for each pixel, the amount of error added because of the adjacent pixels during the conversion process. Then, such methods and systems remove, from each pixel, the amount of error that was previously added based on the adjacent pixels. After removing the amount of error added based on the adjacent pixels from all pixels in the input image, an output image is reconstructed using a filter, such as a low pass filter.
Abstract:
Methods and systems for classifying a region of interest in an electronic document. A first region from the electronic document is extracted. The first region includes at least the region of interest. A first projection profile of pixels is determined in the region of interest. The first projection profile of pixels is indicative of a presence of a content in the region of interest. The region of interest is classified in at least one category based on a comparison of the first projection profile with a predetermined projection profile associated with the region of interest.