摘要:
A system and method for processing documents is described. The system and method provide for executing a command as part of the execution of an application program, where execution of the command causes the transfer of the document between a processing device in a computer system and a peripheral device. The present invention also provides for transferring the document data between the processing device and the peripheral device in response to the command. The present invention further provides for archiving the document data in a memory in the computer system in response to the command and transparently to the execution of the application program.
摘要:
An electronic document management system that takes advantage of advanced document analysis techniques. The electronic document management system may provide automatic archiving of documents and retrieval without the need to navigate through a directory structure or specify a filename. Document comparison is facilitated by automatic retrieval of a previous version of a document. A digital copier alerts a user when a document to be copies already exists electronically within a database.
摘要:
A system for restoring images with undefined pixel values at known locations is described. The threshold value and a neighborhood configuration are defined and are used to restore the image. The neighborhood configuration defines a geometric region, typically a fixed number of pixels, surrounding the target pixel. The threshold value specifies a number of pixels in the neighborhood configuration for which pixel values are known. In our system, for each pixel in one of the unknown regions an analysis is performed over the entire area defined by the neighborhood configuration. If the threshold number of pixels within that region is known, then the value of the unknown pixel is calculated. If the threshold value is not achieved, then analysis proceeds to the next pixel location. By continuing the process and reducing the threshold value when necessary or desirable, the complete image can be restored.
摘要:
A method and apparatus for document matching using structural information. The present invention provides a method and apparatus for identifying documents based on the visual structure of the document. Structural information describing a document is generated and used to search for matching stored documents. In one embodiment, images are converted to a point set and the point sets are compared. For example, an image of a document that is sought is converted to a point set and the point set is compared to point sets corresponding to stored documents. When point sets match within a predetermined tolerance, the documents match. In one embodiment, the Hausdorff measure is used to compare point sets.
摘要:
An example page taken from each document in a document database is processed by a page processor to yield an iconic representation for the example page. To form the iconic representation, the example page is segmented into text regions, line art regions, photograph regions, etc., and each region is reduced in a manner appropriate for that image type. Text is replaced with a block font and reduced, while graphics are reduced in level and/or spatial resolution. The reduced regions of the example page are then reassembled into the icon. When multiple icons are printed on a guide page, a user can visually identify the icon for an example page of a target document and supply the icon, or a label for the icon, to a document retrieval system, which selects candidate matching documents from the document database. For simplified processing characters can be blocked and words formed into solid line segments with lengths proportional to word lengths. For regular spacing type languages, such as Japanese, character density is used instead of word lengths to generate feature descriptors.
摘要:
A system for manipulating image fragments so that image processing devices such as copiers, fax machines and scanners may efficiently process oversize images. The system provides a user interface so that when an oversize image is scanned in multiple parts to produce multiple image fragments the user can manipulate the image fragments by performing drag, drop and merge operations on the image fragments. Embodiments of the invention include the use of a touch screen or mouse to allow the user to perform the operations. Other operations are selectable such as rotating an image fragment, zooming in or out on portions of the displayed fragments, merging the fragments into an integrated image and storing or printing the image fragments.
摘要:
A method and apparatus for improving a text image represented as a bitmap in a digital system. Character instances in the bitmap image are recognized and categorized according to their character type. The instances are used to derive a prototype character for each character type. The prototype character is an average of the instances, thus providing for cancellation of extraneous marks in the bitmap image. The prototype character is substituted for each character instance of its type in the bitmap image thus providing for uniformity of characters in a regenerated version of the original bitmap.
摘要:
A method for retrieving user-supplied information from a scanned version of a completed document is described. The method includes the steps of obtaining a first image of the document having information printed thereon in its blank format before other information has been added to it by the user. A second image of the document is obtained after information has been added to it by the user. The two images are aligned, and for each pixel in the first image which corresponds to information on the document, those pixels are deleted from the second image to create an image which corresponds to subtraction of the first image from the second image. Finally, a step is performed to electronically restore the information added by the user which was deleted during the subtraction operation.
摘要:
A method and apparatus for formatting a document and creating a best document layout from an input list of picture and text objects is disclosed. The method includes calculating multiple document layouts while maintaining the correct reading order of the picture and text objects at all times. The method positions each picture and text object at multiple anchor points to create multiple document layouts, and then selects a best document layout which is the layout using the least number of pages to display the entire list of objects. If more than one layout uses the least number of pages, the layout positioning the least number of objects on the last page is the best layout.
摘要:
A method and apparatus for placing digital data on plain paper. One embodiment of the present invention allows for the digital data to undergo encryption before being placed on the plain paper. In one embodiment, a photocopier is used for transferring digital encrypted data to and from a plain piece of paper. The photocopier allows digital data to be stored onto plain paper after encryption, such that the digital data is secure. The photocopier also includes a device to recognize the encrypted digitized pixels on the page such that they may be decrypted and the original image reproduced.