摘要:
A system and process for generating a panoramic video. Essentially, the panoramic video is created by first acquiring multiple videos of the scene being depicted. Preferably, these videos collectively depict a full 360 degree view of the surrounding scene and are captured using a multiple camera rig. The acquisition phase also includes a calibration procedure that provides information about the camera rig used to capture the videos that is used in the next phase for creating the panoramic video. This next phase, which is referred to as the authoring phase, involves mosaicing or stitching individual frames of the videos, which were captured at approximately the same moment in time, to form each frame of the panoramic video. A series of texture maps are then constructed for each frame of the panoramic video. Each texture map coincides with a portion of a prescribed environment model of the scene. The texture map representations of each frame of the panoramic video are encoded so as to facilitate their transfer and viewing. This can include compressing the panoramic video frames. Such a procedure is useful in applications where the panoramic video is to be transferred over a network, such as the Internet.
摘要:
An automatic digital image grouping system and method for automatically generating groupings of related images based on criteria that includes image metadata and spatial information. The system and method takes an unordered and unorganized set of digital images and organizes and groups related images into image subsets. The criteria for defining an image subset varies and can be customized depending on the needs of the user. Metadata (such as EXIF tags) already embedded inside the images is used to extract likely image subsets. This metadata may include the temporal proximity of images, focal length, color overlap, and geographical location. The first component of the automatic image grouping system and method is a subset image stage that analyzes the metadata and generates potential image subsets containing related images. The second component is an overlap detection stage, where potential image subset is analyzed and verified by examining pixels of the related images.
摘要:
A system and process for deghosting mosaiced images created by stitching together images of a scene captured from different viewpoints is presented. When images are mosaiced, which were captured by different cameras at different viewpoints, the possibility of localized double images of objects exists. Essentially, this double imaging or ghosting will occur if an object in the scene is close in to the cameras capturing the images. However, this localized ghosting can be compensated for by estimating the amount of local mis-registration and then locally warping each image in the mosaiced image to reduce any ghosting.
摘要:
A Bayesian two-color image demosaicer and method for processing a digital color image to demosaic the image in such a way as to reduce image artifacts. The method and system are an improvement on and an enhancement to previous demosaicing techniques. A preliminary demosaicing pass is performed on the image to assign each pixel a fully specified RGB triple color value. The final color value of pixel in the processed image is restricted to be a linear combination of two colors. Fully-specified RGB triple color values for each pixel in an image used to find two clusters represented favored two colors. The amount of contribution from these favored two colors on the final color value then is determined. The method and system also can process multiple images to improve the demosaicing results. When using multiple images, sampling can be performed at a finer resolution, known as super resolution.
摘要:
A panoramic high-dynamic range (HDR) image method and system of combining multiple images having different exposures and at least partial spatial overlap wherein each of the images may have scene motion, camera motion, or both. The major part of the panoramic HDR image method and system is a two-pass optimization-based approach that first defines the position of the objects in a scene and then fills in the dynamic range when possible and consistent. Data costs are created to encourage radiance values that are both consistent with object placement (defined by the first pass) and of a higher signal-to-noise ratio. Seam costs are used to ensure that transitions occur in regions of consistent radiances. The result is a high-quality panoramic HDR image having the full available spatial extent of the scene along with the full available exposure range.
摘要:
A system and method for deghosting mosaics provides a novel multiperspective plane sweep approach for generating an image mosaic from a sequence of still images, video images, scanned photographic images, computer generated images, etc. This multiperspective plane sweep approach uses virtual camera positions to compute depth maps for columns of overlapping pixels in adjacent images. Object distortions and ghosting caused by image parallax when generating the image mosaics are then minimized by blending pixel colors, or grey values, for each computed depth to create a common composite area for each of the overlapping images. Further, the multiperspective plane sweep approach described herein is both computationally efficient, and applicable to both the case of limited overlap between the images used for creating the image mosaics, and to the case of extensive or increased image overlap.
摘要:
A system and process that segments the video frames, thereby allowing selective decoding and possible decompression of just those specific regions that are to be viewed. Each frame is segmented into a plurality of regions such that the segmented regions correspond from one frame to the next. Each segmented region is then encoded separately. Once the frames have been segmented and encoded (which may include compressing them), they can be transferred to the viewer. One way to transfer the files involves an interactive approach and network connection. A viewing system identifies what portions of the scene the user wants to view and informs a server of the portions of the frame of the video that are needed to render the desired view of the scene to the user. The server then transfers only the requested portions of the next frame to the viewer. This process is repeated for each frame of the video. If an interactive approach is not possible, all the files associated with each segmented region of a panoramic frame are sent to the viewer. Once received, the viewer selectively processes and decompresses (if necessary) only those segments required to display the desired portion of the scene. The segmented and encoded frames can also be stored on a storage medium that is accessible by the viewer. In this case, the viewer reads only those segments needed to render the desired view. Thus, here too the viewer selectively processes only those segments required to display the desired portion of the scene.
摘要:
A system and method for adjusting exposure in a mosaiced or stitched image. A stitched composite image is typically represented by a set of images and a set of associated transformations. Each transformation corresponds to one image in the input image sequence and represents the mapping between image pixels in each image and a three-dimensional coordinate system. Every triplet of images in the mosaiced image, having a first, center and third image, is input into the system and method according to the present invention. Before exposure adjustment can be performed, the regions of overlap between the input images in the mosaiced image are calculated. Once the areas of overlap associated with the first and third images are found, the areas of overlap associated with these images are warped to the coordinate frame of the center image. This allows the corresponding pixel locations in the first and center images, and the corresponding pixel location in the third and center images to be determined for the overlapping areas. Additionally, the luminance of each pixel in these areas is identified. The exposure correction factors for each overlapping area are then calculated. Once the exposure correction factors for each overlapping area are calculated, the luminance of all pixels in the center image are adjusted using these factors to yield a mosaiced image that appears to be consistent in exposure. This process is repeated until all sets of triplets have been considered.
摘要:
The primary components of the panoramic video viewer include a decoder module. The purpose of the decoder module is to input incoming encoded panoramic video data and to output a decoded version thereof. The incoming data may be provided over a network and originate from a server, or it may simply be read from a storage media, such as a hard drive, CD or DVD. Once decoded, the data associated with each video frame is preferably stored in a storage module and made available to a 3D rendering module. The 3D rendering module is essentially a texture mapper that takes the frame data and maps the desired views onto a prescribed environment model. The output of the 3D rendering module is provided to a display module where the panoramic video is viewed by a user of the system. Typically, the user will be viewing just a portion of the scene depicted in the panoramic video at any one time, and will be able to control what portion is viewed. Preferably, the panoramic video viewer will allow the user to pan through the scene to the left, right, up or down. In addition, the user would preferably be able to zoom in or out within the portion of the scene being viewed. The user could also be allowed to select what video should be played, choose when to play or pause the video, and to specify what temporal part of the video should be played.
摘要:
Images are analyzed within a 3D environment that is generated based on spatial relationships of the images and that allows users to experience the images in the 3D environment. Image analysis may include ranking images based on user viewing information, such as the number of users who have viewed an image and how long an image was viewed. Image analysis may further include analyzing the spatial density of images within a 3D environment to determine points of user interest.