摘要:
Systems and methods for detecting doctored JPEG images are described. In one aspect, a JPEG image is evaluated to determine if the JPEG image comprises double quantization effects of double quantized Discrete Cosine Transform coefficients. In response to results of these evaluation operations, the systems and methods determine whether the JPEG image has been doctored and identify any doctored portion.
摘要:
Ink-parser-parameter optimization may be performed via parallel processing to accelerate searching for a set of optimal ink-parser parameters. Evaluators may parse pages of ink notes with different groups of parameters and may compute corresponding values for evaluation functions. Separate evaluation functions may be defined for the following types of ink-parker parsing engines: writing parser, writing/drawing classification, table detection, and list detection. A searcher may perform a grid-searching algorithm or a genetic algorithm to generate groups of parameters and may then pass the parameters to available evaluators for evaluation until evaluation-function values for a group of parameters satisfy a convergence condition.
摘要:
A method and system for cleaning handwriting for redisplay of the handwriting or for improved recognition accuracy is provided. The cleanup system receives handwriting that has been digitized. The cleanup system then analyzes the handwriting to identify strokes that satisfy a cleanup criterion. When a stroke has been identified as satisfying some cleanup criteria, the cleanup system cleans up the handwriting based on the detected criteria. In this way, the cleanup system generates handwriting that may have a more visually pleasing appearance to the reader.
摘要:
A process and apparatus for determining the location of a captured array from a larger image is described. A non-repeating sequence may be folded into a non-repeating array in which the array is unique for every neighboring window of a given size. A portion of the array of the neighboring window may be captured and a subset of extracted bits corresponding to the captured array is decoded to identify error bits. The location of the captured array is determined within the non-repeating array by further processing the decoded bits.
摘要:
Systems and methods for detecting annotation digital ink strokes and further associating annotation digital ink strokes with word digital ink strokes are presented. Ink strokes are captured on a writing surface and then classified as words or annotations. Annotations are then anchored to corresponding words. When words are relocated or edited on the writing surface, the anchored annotations are also relocated and may even be reshaped according to the changes in the anchored words.
摘要:
A “Text Rectifier” provides various techniques for processing selected regions of an image containing text or characters by treating those images as matrices of low-rank textures and using a rank minimization technique that recovers and removes image deformations (e.g., affine and projective transforms as well as general classes of nonlinear transforms) while rectifying the text or characters in the image region. Once distortions have been removed and the text or characters rectified, the resulting text is made available for a variety of uses or further processing such as optical character recognition (OCR). In various embodiments, binarization and/or inversion techniques are applied to the selected image regions during the rank minimization process to both improve text rectification and to present the resulting images of text to an OCR engine in a form that enhances the accuracy of the OCR results.
摘要:
A “Text Rectifier” provides various techniques for processing selected regions of an image containing text or characters by treating those images as matrices of low-rank textures and using a rank minimization technique that recovers and removes image deformations (e.g., affine and projective transforms as well as general classes of nonlinear transforms) while rectifying the text or characters in the image region. Once distortions have been removed and the text or characters rectified, the resulting text is made available for a variety of uses or further processing such as optical character recognition (OCR). In various embodiments, binarization and/or inversion techniques are applied to the selected image regions during the rank minimization process to both improve text rectification and to present the resulting images of text to an OCR engine in a form that enhances the accuracy of the OCR results.
摘要:
Systems and methods for detecting annotation digital ink strokes and further associating annotation digital ink strokes with word digital ink strokes are presented. Ink strokes are captured on a writing surface and then classified as words or annotations. Annotations are then anchored to corresponding words. When words are relocated or edited on the writing surface, the anchored annotations are also relocated and may even be reshaped according to the changes in the anchored words.
摘要:
A “Scene Re-Lighter” provides various techniques for using an automatically reconstructed light transport matrix derived from a sparse sampling of images to provide various combinations of complex light transport effects in images, including caustics, complex occlusions, inter-reflections, subsurface scattering, etc. More specifically, the Scene Re-Lighter reconstructs the light transport matrix from a relatively small number of acquired images using a “Kernel Nyström” based technique adapted for low rank matrices constructed from sparsely sampled images. A “light transport kernel” is incorporated into the Nyström method to exploit nonlinear coherence in the light transport matrix. Further, an adaptive process is used to efficiently capture the sparsely sampled images from a scene. The Scene Re-Lighter is capable of achieving good reconstruction of the light transport matrix with only few hundred images to produce high quality relighting results. Further, the Scene Re-Lighter is also effective for modeling scenes with complex lighting effects and occlusions.
摘要:
Tensor linear Laplacian discrimination for feature extraction is disclosed. One embodiment comprises generating a contextual distance based sample weight and class weight, calculating a within-class scatter using the at least one sample weight and a between-class scatter for multiple classes of data samples in a sample set using the class weight, performing a mode-k matrix unfolding on scatters and generating at least one orthogonal projection matrix.