摘要:
One embodiment of the present invention provides a system for tracking and distributing annotations for a video stream. During operation, the system receives, at an annotation server, the video stream originating from a remote field computer, extracts a number of features from the received video stream, and identifies a group of features that matches a known feature group, which is associated with an annotation. The system further associates the identified group of features with the annotation, and forwards the identified group of features and the annotation to the remote field computer, thereby facilitating the remote field computer to associate the annotation with a group of locally extracted features and display the video stream with the annotation placed in a location based at least on locations of the locally extracted features.
摘要:
Visual fingerprinting is used to provide a robust and highly effective method of finding similar content in a large document collection of rich document content composed of multiple text, line-art, and photo image objects. The visual fingerprints capture unique two-dimensional localized aspects of document appearance. The visual fingerprints are highly distinctive; fast for lookup; compact for storage requirements; and scalable to large document collections.
摘要:
A method for electronically magnifying a target object with an imaging device. The method includes obtaining a full view image of the target object, where the full view image is focused. The method further includes moving the imaging device in proximity to a portion of the target object and obtaining a key image of the portion of the target object. The method further includes matching the key image to a corresponding portion of the full view image. The method further includes magnifying the corresponding portion of the full view image and displaying the magnified portion of the full view image.
摘要:
A method and system is provided for finding stable keypoints in a picture image using localized scale properties. An integral image of an input image is calculated. Then a scale space pyramid layer representation of the input image is constructed at mulitple scales, wherein at each scale, a set of specific filters are applied to the input image to produce an approximation of at least a portion of the input image. Outputs from filters are combined together to form a single function of scale and space. Stable keypoint locations are identified in each scale at pixel locations at which the single function attains a local peak value. The stable keypoint locations which have been identified are then stored in a memory storage.
摘要:
A method and system generates fine-grained fingerprints for identifying content in a rendered document. It includes applying image-based techniques to identify patterns in a document rendered by an electronic document rendering system, irrespective of a file format in which the rendered document was electronically created. The applying of the image-based technique includes identifying candidate keypoints at locations in a local image neighborhood of the document, and combining the locations of the candidate keypoints to form a fine-grained fingerprint identifying patterns representing content in the document.
摘要:
A method and system generates fine-grained fingerprints for identifying content in a rendered document. It includes applying image-based techniques to identify patterns in a document rendered by an electronic document rendering system, irrespective of a file format in which the rendered document was electronically created. The applying of the image-based technique includes identifying candidate keypoints at locations in a local image neighborhood of the document, and combining the locations of the candidate keypoints to form a fine-grained fingerprint identifying patterns representing content in the document.
摘要:
A method for improving mixed raster compression segmentation utilizes a second stage of a process to generate the MRC Selector plane by operating on a multibit selector (GraySel) signal which is produced by a first stage process. The first stage methods used to generated the GraySel can be PDL or scan oriented. The binary Selector signal produced by the second stage minimizes the compression noise evident in the reconstructed image. This second stage processing relies on knowing the size of the JPEG minimum coded unit (MCU) which will be used to compress the segmented Foreground and Background planes. The idea is to move false (soft) edges away from the real (hard) edges by as large a distance as possible up to the point where they fall off the boundary of the MCU block. Thus a soft edge which occurs between two hard edges is either eliminated or repositioned to the midpoint and a soft edge between a hard edge and the MCU boundary is moved to the boundary. Since JPEG MCU blocks are compressed independently, transitions at the boundary cause no ‘ringing’ in the decompressed output.
摘要:
Reduced size images may be generated by separating dimensions that span the original image into a plurality of groups so that data in each group of one or more dimensions may be processed differently from data of other groups. For example, for two-dimensional images, pixels of selected lines may be completely discarded or skipped while pixels of remaining lines may be processed along the x-dimension to obtain data of the reduced size image. The main advantages include very fast and efficient implementation, minimal storage requirements, and good image quality.
摘要:
A method and a system for computing the function value of an input node based on function values of known nodes. A database of known nodes and their corresponding function values is formed. The known nodes are located such that a distance between any two adjacent known nodes is an integer power-of-two number. The database is searched for a first node such that the input node is located between the first node and an adjacent second node. The difference δ between the input node and the first node is computed. Then δ is shifted to the right by k positions, k being the logarithm in base 2 of the distance between the first and second nodes. The input node function value is computed by combining the first node function value with the product of the shifted δ and the difference between the function values of the second and first nodes. When used in MRC context, the above method and system is applied to each of the color MRC planes individually, instead of the merged output. The resulting performance is approved since most if not all of the color MRC planes are sub-sampled.
摘要:
A method and a system for controlling de-screening of documents. The control system is included in a de-screening system including a bank of filters. The filter bank receives an image signal and produces a set of filter output signals. The control system comprises a contrast module, a halftone estimate module and a pixel control module. The contrast module receives a filter output signal from a filter having a large filter span and produces a contrast signal. The halftone estimate module receives the image signal and a filter output signal from a filter having a small filter span and produces an estimated frequency signal and a halftone weight signal. Based on the contrast signal, the estimated frequency signal and the halftone weight signal, the pixel control module produces a control signal which includes information regarding which of the filter output signals are to be blended and the proportion of blending.