摘要:
A method for recognizing a table structure from delineated table region in an electronic document using hierarchical clustering of data strings. The cluster groupings are segregated effectively using the distances from a positional vector associated with words and groups of words rather than a minimum number of blank spaces between words. Once a data tree of the hierarchical clusterings is constructed, the tree is scanned downward from the root to find appropriate column boundaries using a columnization algorithm. Then using successive heuristic algorithms, determine column and row headers and row boundaries.
摘要:
A method of managing storage in a document image database using document analysis to partition documents into logical regions and modified by reducing storage size of the regions using different reduction means according to various storage preference rules. Storage preference rules are intended to maintain high quality representations of important document information while reducing storage requirements at the expense of lesser important aspects of the document. In particular, the different reduction means applied to stored document images includes reducing sampling depth, reducing sampling resolution based on minimum font size, utilizing lossy and lossless compression schemes and discarding unimportant regions of document image. Over time, document analysis and modification can be repeated to further reduce the storage size of previously stored data files.
摘要:
The method and apparatus enables any user to search for logos in document images stored in a bitmap format. The search efficiently compares bitmap or image data by extracting a series of connected components. These connected components are grouped according to region where each region may be a potential logo. Shape and density parameters of a region are determined and compared to the parameters of the stored logo image. If a region is successfully matched then that region is aligned and scaled to the corresponding stored logo image. Thereafter, a bitwise comparison is then performed between the scaled and aligned region and the logo image. A match score is assigned to each region along with other pertinent information about the region, and is stored in a ranked logo list database. The ranked logo list database represents a list of logos found in the document image.
摘要:
A system and method for storage and retrieval of pictographic images as computer file names. Upon generation of an input image, the computer saves the image and extracted spatial and temporal information about the generation of the image. The extracted information can be accessed for comparison, ordering and retrieving of stored pictographic file names and subsequent display to the user. Final selection of the desired name is performed visually by the user while viewing a display of the ordered file names via a document browser.
摘要:
Apparatus and a method for comparing an electronic handwritten pattern to a stored string are provided. The string includes a group of portions, each having at least one stroke. Movement of a stylus forms the pattern, and a sequence of strokes is generated. Each stroke represents a stylus movement within a predetermined alphabet. The sequence of strokes has a plurality of portions. A linear systolic array processor determines an edit distance between the string and the pattern. The processor compares a first portion of the string to a first portion of the pattern. A plurality of edit distance components are generated based on the comparison. Each component corresponds to a different set of operations that transforms the first portion of the stored string into the first portion of the pattern. The components are calculated based on a further comparison between additional portions of the stored string and the pattern. The component which has a minimum value is selected. The comparison is performed between each respective portion of the pattern and the corresponding portion of the stored string. The total edit distance is based on the component selected during a last comparison between a last portion of the stored string and a last portion of the pattern.
摘要:
Inventive two-dimensional barcodes, each having encoded digital information in a bitmap representing preferably randomized encoded data bits, are printed onto a printed medium. Preferably, error correction codes are added to the digital information to ensure that the decoding process accurately reproduce the digital information. In one embodiment, the bitmap may further include “anchor” bits in each corner, which are used as part of the skew estimation and deskewing processes during decoding. In a second embodiment, no “anchor” bits are required. The encoded digital information is mapped into the two-dimensional barcode in such a way as to minimize the errors caused by damage to particular rows and/or columns, for example, row damage caused by faxing the printed barcode. To extract the encoded digital information from the printed medium, the printed medium is scanned, then the bitmap is located within the printed medium. The skew of the bitmap, if any, is determined, and the bitmap is deskewed if necessary. The bitmap is then cropped, and the randomized digital information is read from the bitmap. The digital information is derandomized and any error correction codes are removed, in the process correcting and/or recording any errors discovered, thereby reproducing the original encoded digital information.
摘要:
The user communicates through a digitizing writing surface with the audio/video control apparatus. An on-screen display is generated, providing the user with a user environment in which a wide range of different tasks and functions can be performed. The digitizing writing surface can be incorporated into a hand-held remote control unit and the audio/video control apparatus may likewise be incorporated into existing home entertainment or computer equipment. By tapping on the writing surface a command bar is presented on the screen, allowing the user to select among various functions. Included in these functions is an on-screen programming feature, allowing the user to select programs for viewing or recording by entry of user-drawn annotations or commands via the writing surface.
摘要:
A certificate, calculated on the text of a document to be reproduced, is included in the document in a form which is machine-readable. Upon optical character recognition scanning, or other digitizing reproduction, the certificate is also scanned and stored for comparison. A new certificate is calculated on the contents of the reproduced document and compared to the scanned certificate. If the two certificates are different, it is known that at least one error exists in the reproduced document. More than one certificate can be provided for a document in order to facilitate error location on the document and error correction therein.
摘要:
Inventive two-dimensional barcodes, each having encoded digital information in a bitmap representing preferably randomized encoded data bits, are printed onto a printed medium. The bitmap may further include a plurality of block identifiers, spaced a predetermined number of encoded data bits apart, which are used to make corrections for missing or added data bits when the barcode is decoded. Upon decoding a barcode printed on the printed media, the digital information is scanned and the number of horizontal and vertical edges in each respective column and row of the barcode are determined. An edge is determined by selecting a pixel and determining if the two pixels adjacent to the selected pixel are of different colors. After all of the edges are counted, selected groups of columns and rows are analyzed to determine local minimas in the number of counted horizontal and vertical edges in each selected group which provide the column and row center line for each selected group. Thereafter, the binary data located at the intersection of each column and row center line are read out to produce a signal representative of the encoded digital information in the printed barcode.
摘要:
A document marker, including first values dependent upon the layout and the contents of the document and assigned by generating or preprocessing software, is provided in machine-readable symbology on the face of a printed version of the document. The marker may include encoded document layout information and values assigned on sequences of the original text, including text-dependent decimation sequences, error correction codes or check-sums. Upon optical character recognition scanning, or other digitizing reproduction, the marker is also scanned. The scanning computer, having corresponding software, assigns second values dependent upon the layout and contents of the reproduced document. Upon comparison of the first and second decimation sequences, line and character errors can be detected and some errors corrected, thereby generating re-aligned candidate sequences. Optional error correction codes can provide further correcting capabilities, as applied to the re-aligned reproduced document sequences; and, an optional check-sum comparison can be utilized to verify the accuracy of the reproduced sequences are correct.