Abstract:
A method of generating a self-authenticating printed document and authenticating the printed document. The back side of the printed document contains 2d barcode which encode extracted features of the document content. The features are hashed into a hash code, converted to a barcode stamp element, and transformed into a hierarchical barcode stamp by repeating the stamp element. The hierarchical barcode stamp is printed as a gray background pattern on the front side of the same sheet of printed document. To authenticate the printed document, the barcodes on the back side are read to extract the document features. The features are hashed into a hash code and compared to the hash code extracted from the hierarchical barcode stamp on the front side of the document to detect any alterations of the back side barcodes. Further, the document features extracted from the front and back sides of the document are compared.
Abstract:
A document alteration detection method compares a target image with an original image by comparing character shape features without actually recognizing the characters. Bounding boxes for the characters are generated for both images, each enclosing one or more connected groups of pixels of one character. The bounding boxes in the original and target images are matched into pairs. Addition and deletion of text is detected if a bounding box in one image does not have a matching one in the other image. Each pair of bounding boxes is processed to compare their shape features. The shape features include the Euler numbers of the characters, the aspect ratio of the bounding boxes, the pixel density of the bounding boxes, and the Hausdorff distance between the two characters. The two characters are determined to be the same or different based on the shape feature comparisons.
Abstract:
A document authenticating method is disclosed by which numerous small-sized two-dimensional barcode stamps are generated and placed in a distributed manner on a printed document. The small-sized barcode stamps collectively encode the content of the document to be used for document authentication. In one example, the stamp size is about ¼ by ¼ inches or less and the tile size for the stamps is 4 by 4 pixels at a resolution of 400 dpi. The document is segmented into segments each containing a paragraph or a line of text. For each segment, a set of barcode stamps encoding the authentication data for the segment is placed in the vicinity of the segment. They may be placed in the empty space in the last line of each paragraph, in the empty space between adjacent paragraphs, or at the beginning or end of each line.
Abstract translation:公开了一种文件认证方法,其中生成大量的小尺寸二维条形码邮票并以分布的方式放置在打印文档上。 小尺寸条形码邮票集中编码要用于文件认证的文档的内容。 在一个示例中,邮票大小约为1/4英寸或更小,邮票的尺寸为400 dpi的分辨率为4 x 4像素。 文档被分割成段,每个段包含一段或一行文本。 对于每个段,将编码该段的认证数据的一组条形码标记放置在段的附近。 它们可以放置在每段最后一行的空白处,相邻段落之间的空白空间,或每行开头或末尾。
Abstract:
A document authentication method uses a watermark added in a printed document to detection possible alterations made to the document after it was printed. First, a visible watermark in the form of a dot pattern is overlapped with an original digital image. The watermarked image is printed out as a halftone image at a first resolution. The watermark in the printed document appears as a light gray shade. Later, the printed document is scanned back using a grayscale scan at a resolution higher than the first resolution. In the scanned image, altered areas would appear flat (lacking intensity variation) whereas unaltered areas will have relatively large density variations due to the watermark dots and the fact that the image was halftone printed at a lower resolution. Alternations are detected by identifying flat areas within the image using a combination of flat block detection and a multiple thresholds method.
Abstract:
A document authenticating method is disclosed by which a plurality of two-dimensional barcode stamps are generated and printed on a back side of the document forming a pre-defined special pattern. The barcode stamps collectively encode the content of the document to be used for document authentication. Each barcode stamp encodes linking information indicating the position of the next barcode stamp. An index barcode may be printed on the front side of the document which encodes the position information for all of the barcode stamps on the back side.
Abstract:
A document authenticating method is disclosed by which numerous small-sized two-dimensional barcode stamps are generated and placed in a distributed manner on a printed document. The small-sized barcode stamps collectively encode the content of the document to be used for document authentication. In one example, the stamp size is 1 cm by 1 cm and the tile size for the stamps is 4 by 4 pixels at a resolution of 400 dpi. A total of approximately 80 such small barcode stamps are placed on the page to form a pattern that resembles a border on one or more sides of the page. In one particular example, the stamps are placed on all four sides of the page to form a frame. Such a stamp layout minimizes undesirable visual effect and can create a decorative effect. Small-sized stamps also allow reduced tile sizes and increased information density.
Abstract:
An apparatus and method for deskewing a scanned printed document is described. The original printed document is scanned with the scanner to obtain a first digital image, which contains skew (rotation) with respect to the original printed document. The first digital image is digitally flipped around a centerline to obtain a second digital image, which is printed to generate a second printed document. The second printed document is scanned using the same scanner to obtain a third digital image, which contains skew errors with respect to the second printed document. The third digital image is digitally flipped around the centerline to obtain a final digital image, which is substantially free of the skew introduced by the scanner.
Abstract:
A content-based accounting method is implemented in a management section for a copier, scanner, printer or multifunction device (referred to as MFP), or on a networked server accessible by the copier, scanner, printer or MFP. When copying, scanning or printing a document, the management section automatically extracts content information from the documents being copied, scanned or printed, groups the documents based on the content, and updates an accounting database. The accounting database contains user accounts that store usage information according to content groups. For copied and scanned documents, textual content is extracted from the document image using OCR techniques. For printed documents, textual information is extracted from the digital data used to print the document.
Abstract:
A color barcode can record much more amount of information than black white barcode. However, the problem of the color barcode is that it is easy to lose integrity or authenticity in printing and copy generation. This invention provides an apparatus and a method to keep the integrity or authenticity of the color barcode. Such is accomplished by the color information portion of the color barcode representing the color information about what colors are used for color tiles of data portion of the color barcode and an apparatus and a method for producing and reproducing such color barcode.
Abstract:
A method for decoding digital data in a color barcode having a plurality of data cells, including the steps of: scanning the color barcode of the hardcopy document, separating color image of the color barcode into print primary color planes, computing peaks of each print primary color plane, projecting, for at least one of the print primary color planes, the data cells along a horizontal direction and a vertical direction at the peaks of the at least one of the primary color planes in each direction which represent data cell center locations respectively, and creating a grid where each of its intersection is a respective data cell center location, assigning a color to each grid intersection which corresponds to a respective data cell by examining values of the print primary color planes at such location, and decoding digital data from the data cells based on the respective color assigned to each data cell.