Abstract:
Computer-readable media having computer-executable instructions, and apparatuses, detect a structured handwritten object in a handwritten document. A primitive structure, which is associated with a set of component objects, is formed. A candidate handwritten object is identified by detecting a corresponding logical structure. A classifier for the candidate handwritten object is determined. The candidate handwritten object is then validated using the classifier.
Abstract:
A strategy is described for producing an animated scene from multiple high resolution still images. The strategy involves: creating a graph based on an analysis of similarity among the plural still images; performing partial temporal order recovery to define a partial ordering among the plural still images; and extracting an output sequence from the plural still images using second-order Markov Chain analysis, using the partial ordering as a reference. The strategy can perform the above-described analysis with respect to multiple independent animated regions (IARs) within the still images. Further, the strategy can decompose any IAR with a significant amount of motion into multiple semi-independent animated regions (SIARs). The SIARs are defined to be weakly interdependent.
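As a rough illustration of what "second-order" means here, the sketch below samples a frame sequence in which the next still depends on the two preceding stills rather than only the last one. The transition weights, the `build_transitions` and `sample_sequence` helpers, and the rule that reversals are merely down-weighted are all assumptions made for illustration. The abstract does not specify them, and the sketch does not use the similarity graph or the partial ordering.

```python
import random

def build_transitions(n_frames, reverse_weight=0.1):
    """Hypothetical second-order table: (prev2, prev1) -> {next: weight}.

    Repeating the current frame is disallowed, and jumping straight back
    to prev2 (a reversal of motion) gets a small weight.
    """
    table = {}
    for a in range(n_frames):
        for b in range(n_frames):
            if a == b:
                continue
            table[(a, b)] = {
                c: (reverse_weight if c == a else 1.0)
                for c in range(n_frames)
                if c != b
            }
    return table

def sample_sequence(transitions, start, length, seed=0):
    """Sample frame indices; each step conditions on the last two frames."""
    rng = random.Random(seed)
    seq = list(start)
    while len(seq) < length:
        choices = transitions[(seq[-2], seq[-1])]
        frames = list(choices)
        weights = [choices[f] for f in frames]
        seq.append(rng.choices(frames, weights=weights, k=1)[0])
    return seq

sequence = sample_sequence(build_transitions(4), start=(0, 1), length=20)
```

Conditioning on a pair of frames lets the sampler distinguish "moving forward" from "moving backward" through the same still, which a first-order chain cannot do.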
Abstract:
A method and system for cleaning up handwriting, either for redisplay or for improved recognition accuracy, are provided. The cleanup system receives handwriting that has been digitized. The cleanup system then analyzes the handwriting to identify strokes that satisfy a cleanup criterion. When a stroke has been identified as satisfying a cleanup criterion, the cleanup system cleans up the handwriting based on that criterion. In this way, the cleanup system generates handwriting that may have a more visually pleasing appearance to the reader.
Abstract:
In accordance with embodiments of the invention, global metadata, such as a document identifier, which may be a globally unique identifier, is embedded into an embedded interactive code document by combining a first m-array and a plurality of copies of the first m-array to generate a combined m-array with encoded global metadata such that respective start positions (x_d, y_d)_i of the plurality of copies of the first m-array in the combined m-array are each shifted, by respective amounts that are based on respective portions of the global metadata, relative to a start position of the first m-array in the combined m-array. Global metadata may be decoded from the combined m-array by determining the respective amounts by which the plurality of copies of the first m-array are shifted, relative to the first m-array, in the combined m-array and by combining the respective amounts to produce a decoded value of the global metadata.
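The shift-based encoding can be sketched on a toy scale. The example below is a simplified illustration, not the patented scheme: it assumes a 3x5 m-array folded from a period-15 m-sequence, maps each metadata portion `p` to a cyclic shift `(p % W, p // W)`, and combines the layers by storing one bit per layer in each cell. All function names and these choices are hypothetical.

```python
W, H = 5, 3  # coprime sides, W * H == 15 (assumed toy dimensions)

def m_sequence():
    """Period-15 m-sequence from the primitive polynomial x^4 + x + 1."""
    s = [1, 0, 0, 0]
    while len(s) < W * H:
        s.append(s[-3] ^ s[-4])
    return s

def fold(seq):
    """Fold the sequence diagonally into an H x W array."""
    grid = [[0] * W for _ in range(H)]
    for i, bit in enumerate(seq):
        grid[i % H][i % W] = bit
    return grid

def shift(grid, dx, dy):
    """Cyclically shift the array by dx columns and dy rows."""
    return [[grid[(y - dy) % H][(x - dx) % W] for x in range(W)]
            for y in range(H)]

def split(value, count):
    """Split an integer into `count` base-(W*H) portions, low digit first."""
    return [(value // (W * H) ** i) % (W * H) for i in range(count)]

def join(portions):
    """Recombine portions into the original integer."""
    return sum(p * (W * H) ** i for i, p in enumerate(portions))

def encode(portions):
    """Combine the base array with one shifted copy per metadata portion."""
    base = fold(m_sequence())
    layers = [base] + [shift(base, p % W, p // W) for p in portions]
    return [[tuple(layer[y][x] for layer in layers) for x in range(W)]
            for y in range(H)]

def decode(combined):
    """Recover each portion by finding the shift of each copy vs. the base."""
    n_layers = len(combined[0][0])
    layers = [[[combined[y][x][k] for x in range(W)] for y in range(H)]
              for k in range(n_layers)]
    base = layers[0]
    portions = []
    for copy in layers[1:]:
        dx, dy = next((dx, dy) for dy in range(H) for dx in range(W)
                      if shift(base, dx, dy) == copy)
        portions.append(dy * W + dx)
    return portions

document_id = 2024  # hypothetical metadata value
recovered = join(decode(encode(split(document_id, 3))))
```

Decoding by brute-force search works here because every cyclic shift of the folded m-sequence is distinct, so each copy matches exactly one `(dx, dy)`.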
Abstract:
This disclosure describes an integrated framework for class-unsupervised object segmentation. The segmentation integrates top-down constraints and bottom-up constraints on object shapes within a single algorithm. The algorithm describes a relationship among object parts and superpixels. Working in conjunction with the constraints, the algorithm forms object shapes from object parts and oversegments pixel images into the superpixels. This disclosure describes computing a mask map from a hybrid graph, segmenting the image into a foreground object and a background, and displaying the foreground object apart from the background.
Abstract:
Embodiments of the invention determine whether an image has been altered. Sets of patches are selected in the image, and corresponding inverse response functions are provided to a support vector machine (SVM). The support vector machine is trained with exemplary normal and abnormal inverse response functions. Once trained, the support vector machine analyzes inverse response functions corresponding to a suspected image. The support vector machine determines if the inverse response functions are normal or abnormal by analyzing a set of features. In one embodiment, features include measures for monotonic characteristics, fluctuation characteristics, and divergence characteristics of the red, green, and blue components of a tuple. Each tuple of inverse response functions is associated with a set of patches selected in the image.
Abstract:
Global metadata, such as a document identifier, which may be a globally unique identifier, is embedded into an embedded interactive code document by combining a first m-array and a plurality of copies of the first m-array to generate a combined m-array with encoded global metadata such that respective start positions (x_d, y_d)_i of the plurality of copies of the first m-array in the combined m-array are each shifted, by respective amounts that are based on respective portions of the global metadata, relative to a start position of the first m-array in the combined m-array. Global metadata may be decoded from the combined m-array by determining the respective amounts by which the plurality of copies of the first m-array are shifted, relative to the first m-array, in the combined m-array and by combining the respective amounts to produce a decoded value of the global metadata.
Abstract:
Systems and methods for detecting doctored JPEG images are described. In one aspect, a JPEG image is evaluated to determine if the JPEG image comprises double quantization effects of double quantized Discrete Cosine Transform coefficients. In response to results of these evaluation operations, the systems and methods determine whether the JPEG image has been doctored and identify any doctored portion.
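The double quantization effect mentioned above can be illustrated with a small numeric sketch. It only demonstrates the effect on a synthetic ramp of coefficient values and is not the detection method of the abstract. The step sizes 5 and 2, the helper names, and the use of non-negative integers in place of real DCT coefficients are assumptions for illustration.

```python
from collections import Counter

def quantize(values, step):
    """Round non-negative integers to the nearest multiple index (half up)."""
    return [(2 * v + step) // (2 * step) for v in values]

def dequantize(levels, step):
    """Map quantization levels back to coefficient values."""
    return [level * step for level in levels]

def histogram_gaps(levels):
    """Return the empty bins between the smallest and largest level."""
    counts = Counter(levels)
    return [b for b in range(min(levels), max(levels) + 1) if counts[b] == 0]

# Synthetic stand-in for DCT coefficient magnitudes (assumed data).
coeffs = list(range(200))

# Quantized once with step 2: every histogram bin is populated.
single = quantize(coeffs, 2)

# Quantized with step 5, dequantized, then quantized again with step 2:
# only some bins can be reached, leaving periodic gaps.
double = quantize(dequantize(quantize(coeffs, 5), 5), 2)

single_gaps = histogram_gaps(single)
double_gaps = histogram_gaps(double)
```

In this toy setting `single_gaps` is empty while `double_gaps` is not. Periodic empty or depleted histogram bins of this kind are the sort of artifact a detector could look for in regions that were compressed twice.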
Abstract:
Ink-parser-parameter optimization may be performed via parallel processing to accelerate searching for a set of optimal ink-parser parameters. Evaluators may parse pages of ink notes with different groups of parameters and may compute corresponding values for evaluation functions. Separate evaluation functions may be defined for the following types of ink-parser parsing engines: writing parser, writing/drawing classification, table detection, and list detection. A searcher may perform a grid-searching algorithm or a genetic algorithm to generate groups of parameters and may then pass the parameters to available evaluators for evaluation until evaluation-function values for a group of parameters satisfy a convergence condition.
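The searcher/evaluator split can be sketched as a parallel grid search. This is a minimal sketch under stated assumptions: `evaluate` is a hypothetical stand-in for parsing ink pages and scoring the result, the two integer parameters and the convergence test (`score >= target`) are invented for illustration, and a thread pool stands in for whatever parallel evaluators a real system would use.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import product

def evaluate(params):
    """Hypothetical evaluation function: higher is better, best at (3, 7)."""
    a, b = params
    return -((a - 3) ** 2 + (b - 7) ** 2)

def grid_search(grid_a, grid_b, target, workers=4):
    """Score parameter groups in parallel until one meets the target."""
    candidates = list(product(grid_a, grid_b))
    best_params, best_score = None, float("-inf")
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map yields scores in candidate order while workers run ahead.
        for params, score in zip(candidates, pool.map(evaluate, candidates)):
            if score > best_score:
                best_params, best_score = params, score
            if best_score >= target:  # assumed convergence condition
                break
    return best_params, best_score

best_params, best_score = grid_search(range(11), range(11), target=0)
```

A genetic algorithm could replace the fixed grid by generating each new batch of candidates from the best-scoring groups of the previous batch, while the evaluator pool stays unchanged.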