摘要:
A method and system for cleaning handwriting for redisplay of the handwriting or for improved recognition accuracy is provided. The cleanup system receives handwriting that has been digitized. The cleanup system then analyzes the handwriting to identify strokes that satisfy a cleanup criterion. When a stroke has been identified as satisfying some cleanup criteria, the cleanup system cleans up the handwriting based on the detected criteria. In this way, the cleanup system generates handwriting that may have a more visually pleasing appearance to the reader.
摘要:
A process and apparatus for determining the location of a captured array from a larger image is described. A non-repeating sequence may be folded into a non-repeating array in which the array is unique for every neighboring window of a given size. A portion of the array of the neighboring window may be captured and a subset of extracted bits corresponding to the captured array is decoded to identify error bits. The location of the captured array is determined within the non-repeating array by further processing the decoded bits.
摘要:
Systems and methods for detecting annotation digital ink strokes and further associating annotation digital ink strokes with word digital ink strokes are presented. Ink strokes are captured on a writing surface and then classified as words or annotations. Annotations are then anchored to corresponding words. When words are relocated or edited on the writing surface, the anchored annotations are also relocated and may even be reshaped according to the changes in the anchored words.
摘要:
A “Text Rectifier” provides various techniques for processing selected regions of an image containing text or characters by treating those images as matrices of low-rank textures and using a rank minimization technique that recovers and removes image deformations (e.g., affine and projective transforms as well as general classes of nonlinear transforms) while rectifying the text or characters in the image region. Once distortions have been removed and the text or characters rectified, the resulting text is made available for a variety of uses or further processing such as optical character recognition (OCR). In various embodiments, binarization and/or inversion techniques are applied to the selected image regions during the rank minimization process to both improve text rectification and to present the resulting images of text to an OCR engine in a form that enhances the accuracy of the OCR results.
摘要:
A “Text Rectifier” provides various techniques for processing selected regions of an image containing text or characters by treating those images as matrices of low-rank textures and using a rank minimization technique that recovers and removes image deformations (e.g., affine and projective transforms as well as general classes of nonlinear transforms) while rectifying the text or characters in the image region. Once distortions have been removed and the text or characters rectified, the resulting text is made available for a variety of uses or further processing such as optical character recognition (OCR). In various embodiments, binarization and/or inversion techniques are applied to the selected image regions during the rank minimization process to both improve text rectification and to present the resulting images of text to an OCR engine in a form that enhances the accuracy of the OCR results.
摘要:
Systems and methods for detecting annotation digital ink strokes and further associating annotation digital ink strokes with word digital ink strokes are presented. Ink strokes are captured on a writing surface and then classified as words or annotations. Annotations are then anchored to corresponding words. When words are relocated or edited on the writing surface, the anchored annotations are also relocated and may even be reshaped according to the changes in the anchored words.
摘要:
A “Scene Re-Lighter” provides various techniques for using an automatically reconstructed light transport matrix derived from a sparse sampling of images to provide various combinations of complex light transport effects in images, including caustics, complex occlusions, inter-reflections, subsurface scattering, etc. More specifically, the Scene Re-Lighter reconstructs the light transport matrix from a relatively small number of acquired images using a “Kernel Nyström” based technique adapted for low rank matrices constructed from sparsely sampled images. A “light transport kernel” is incorporated into the Nyström method to exploit nonlinear coherence in the light transport matrix. Further, an adaptive process is used to efficiently capture the sparsely sampled images from a scene. The Scene Re-Lighter is capable of achieving good reconstruction of the light transport matrix with only few hundred images to produce high quality relighting results. Further, the Scene Re-Lighter is also effective for modeling scenes with complex lighting effects and occlusions.
摘要:
Tensor linear Laplacian discrimination for feature extraction is disclosed. One embodiment comprises generating a contextual distance based sample weight and class weight, calculating a within-class scatter using the at least one sample weight and a between-class scatter for multiple classes of data samples in a sample set using the class weight, performing a mode-k matrix unfolding on scatters and generating at least one orthogonal projection matrix.
摘要:
This disclosure describes an integrated framework for class-unsupervised object segmentation. The class-unsupervised object segmentation occurs by integrating top-down constraints and bottom-up constraints on object shapes using an algorithm in an integrated manner. The algorithm describes a relationship among object parts and superpixels. This process forms object shapes with object parts and oversegments pixel images into the superpixels, with the algorithm in conjunction with the constraints. This disclosure describes computing a mask map from a hybrid graph, segmenting the image into a foreground object and a background, and displaying the foreground object from the background.
摘要:
A method for modeling data affinities and data structures. In one implementation, a contextual distance may be calculated between a selected data point in a data sample and a data point in a contextual set of the selected data point. The contextual set may include the selected data point and one or more data points in the neighborhood of the selected data point. The contextual distance may be the difference between the selected data point's contribution to the integrity of the geometric structure of the contextual set and the data point's contribution to the integrity of the geometric structure of the contextual set. The process may be repeated for each data point in the contextual set of the selected data point. The process may be repeated for each selected data point in the data sample. A digraph may be created using a plurality of contextual distances generated by the process.