摘要:
Described is a linear structure from motion technique that is scalable, parallelizable, treats images equally, and is robust to outliers, without requiring intermediate bundle adjustment. Camera rotations for images are estimated using feature point correspondence and vanishing points matched across the images. The camera rotation data is fed into a linear system for structure and translation estimation that removes outliers and provides output data corresponding to structure from motion parameters. The data may be used in further optimization e.g. with a final non-linear optimization stage referred to as bundle adjustment to provide final refined structure from motion parameters.
摘要:
Described is a linear structure from motion technique that is scalable, parallelizable, treats images equally, and is robust to outliers, without requiring intermediate bundle adjustment. Camera rotations for images are estimated using feature point correspondence and vanishing points matched across the images. The camera rotation data is fed into a linear system for structure and translation estimation that removes outliers and provides output data corresponding to structure from motion parameters. The data may be used in further optimization e.g. with a final non-linear optimization stage referred to as bundle adjustment to provide final refined structure from motion parameters.
摘要:
Images are analyzed within a 3D environment that is generated based on spatial relationships of the images and that allows users to experience the images in the 3D environment. Image analysis may include ranking images based on user viewing information, such as the number of users who have viewed an image and how long an image was viewed. Image analysis may further include analyzing the spatial density of images within a 3D environment to determine points of user interest.
摘要:
The present invention is embodied in a system and method for extracting structure from multiple images of a scene by representing the scene as a group of image layers, including reflection and transparency layers. In general, the present invention performs layer extraction from multiple images containing reflections and transparencies. The present invention includes an optimal approach for recovering layer images and their associated motions from an arbitrary number of composite images. The present invention includes image formation equations, the constrained least squares technique used to recover the component images, a novel method to estimate upper and lower bounds on the solution using min- and max-composites, and a motion refinement method.
摘要:
An “Image Denoiser” provides a probabilistic process for denoising color images by segmenting an input image into regions, estimating statistics within each region, and then estimating a clean (or denoised) image using a probabilistic model of image formation. In one embodiment, estimated blur between each region is used to reduce artificial sharpening of region boundaries resulting from denoising the input image. In further embodiments, the estimated blur is used for additional purposes, including sharpening edges between one or more regions, and selectively blurring or sharpening one or more specific regions of the image (i.e., “selective focus”) while maintaining the original blurring between the various regions.
摘要:
A system and process for generating a panoramic video. Essentially, the panoramic video is created by first acquiring multiple videos of the scene being depicted. Preferably, these videos collectively depict a full 360 degree view of the surrounding scene and are captured using a multiple camera rig. The acquisition phase also includes a calibration procedure that provides information about the camera rig used to capture the videos that is used in the next phase for creating the panoramic video. This next phase, which is referred to as the authoring phase, involves mosaicing or stitching individual frames of the videos, which were captured at approximately the same moment in time, to form each frame of the panoramic video. A series of texture maps are then constructed for each frame of the panoramic video. Each texture map coincides with a portion of a prescribed environment model of the scene. The texture map representations of each frame of the panoramic video are encoded so as to facilitate their transfer and viewing. This can include compressing the panoramic video frames. Such a procedure is useful in applications where the panoramic video is to be transferred over a network, such as the Internet.
摘要:
A system and method for manipulating a set of images of a static scene captured at different exposures (i.e., “bracketed” images) to yield a composite image with improved uniformity in exposure and tone. In general, the aforementioned goal can be achieved by analyzing a set of bracketed images using a multi-dimensional histogram and merging the images via an approach that projects pixels onto a curve that fits the data. However, it has been found that the desired composite image can be produced in a simpler manner by summing the pixel brightness levels across the multiple images, followed by an equalization process. One possible equalization process involves simply averaging the summed pixel brightness values by dividing the summed value of each pixel set (i.e., groups of corresponding pixels from the bracketed images) by the number of bracketed images. An even better result can be achieved using a histogram equalization process. In essence, this histogram equalization involves creating a count of the number of pixels sets having the same summed brightness level. From this count, a cumulative distribution function is computed and normalized to a maximum value corresponding to the maximum summed brightness level. The cumulative distribution function is then used to determine new pixel brightness levels for use in generating the composite image.
摘要:
A computer-based method and system for digital 3-dimensional imaging of an object which allows for viewing images of the object from arbitrary vantage points. The system, referred to as the Lumigraph system, collects a complete appearance of either a synthetic or real object (or a scene), stores a representation of the appearance, and uses the representation to render images of the object from any vantage point. The appearance of an object is a collection of light rays that emanate from the object in all directions. The system stores the representation of the appearance as a set of coefficients of a 4-dimensional function, referred to as the Lumigraph function. From the Lumigraph function with these coefficients, the Lumigraph system can generate 2-dimensional images of the object from any vantage point. The Lumigraph system generates an image by evaluating the Lumigraph function to identify the intensity values of light rays that would emanate from the object to form the image. The Lumigraph system then combines these intensity values to form the image.
摘要:
The described implementations relate to deblurring images. One system includes an imaging device configured to capture an image, a linear motion detector and a rotational motion detector. This system also includes a controller configured to receive a signal from the imaging device relating to capture of the image and to responsively cause the linear motion detector and the rotational motion detector to detect motion-related information. Finally, this particular system includes a motion calculator configured to recover camera motion associated with the image based upon the detected motion-related information and to infer imaging device motion induced blur of the image and an image deblurring component configured to reduce imaging device induced blur from the image utilizing the inferred camera motion induced blur.
摘要:
A computer input device and computer system are provided that determine if the input device is at an edge of a pattern on a working surface based on an image of the working surface captured by the input device. An audio control message is generated based on the input device being positioned on the edge of the pattern and the audio control message is used to cause a speaker to generate an audio signal.