Abstract:
A method up-samples images in a reduced-resolution video, wherein each image I(x, y) stores depths d at pixel locations (x, y). Each depth image is scaled up to produce a corresponding up-scaled image. Then, image dilation, a median filter, image erosion, and a min-max filter are applied in order to produce a corresponding up-sampled image.
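The filter chain in this abstract can be illustrated with a minimal pure-Python sketch. The abstract does not specify the scaling method, window sizes, or the exact form of the min-max filter, so the following are assumptions made only for illustration: nearest-neighbor scaling, 3x3 windows clipped at the image border, and a min-max filter that snaps each depth to whichever of the window minimum or maximum is closer. All function names are hypothetical.

```python
def upscale_nearest(img, factor):
    """Scale a depth image up by an integer factor (nearest neighbor, assumed)."""
    return [[img[y // factor][x // factor] for x in range(len(img[0]) * factor)]
            for y in range(len(img) * factor)]

def window(img, y, x, radius=1):
    """Depths in the square window around (y, x), clipped at the image border."""
    h, w = len(img), len(img[0])
    return [img[j][i]
            for j in range(max(0, y - radius), min(h, y + radius + 1))
            for i in range(max(0, x - radius), min(w, x + radius + 1))]

def apply_filter(img, pick):
    """Replace every pixel by pick(center_depth, window_depths)."""
    return [[pick(img[y][x], window(img, y, x)) for x in range(len(img[0]))]
            for y in range(len(img))]

def min_max_pick(center, win):
    """Assumed min-max rule: snap to the closer of the window min and max."""
    lo, hi = min(win), max(win)
    return lo if center - lo < hi - center else hi

def upsample_depth(img, factor):
    up = upscale_nearest(img, factor)
    up = apply_filter(up, lambda c, win: max(win))                     # dilation
    up = apply_filter(up, lambda c, win: sorted(win)[len(win) // 2])   # median (upper median for even counts)
    up = apply_filter(up, lambda c, win: min(win))                     # erosion
    return apply_filter(up, min_max_pick)                              # min-max filter

# Hypothetical low-resolution depth image with one sharp depth edge.
low = [[10, 10, 200, 200],
       [10, 10, 200, 200]]
up = upsample_depth(low, 2)
```

In this toy input the up-sampled image keeps only the two original depth levels, with the edge at the same relative position, which is the behavior one would want from a filter chain meant to avoid blurred depth boundaries.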
Abstract:
A method and system acquire and display light fields. A continuous light field is reconstructed from input samples of an input light field of a 3D scene acquired by cameras according to an acquisition parameterization. The continuous light field is reparameterized according to a display parameterization and then prefiltered and sampled to produce output samples having the display parameterization. The output samples are displayed as an output light field using a 3D display device. The reconstruction can be performed by interpolating the input samples having different views.
Abstract:
A method filters pixels in a sequence of images. Each image in the sequence is partitioned into blocks of pixels, and the images are processed sequentially. The energy is determined for each block of pixels in each image. The energy of each block is based on variances of intensities of the pixels in the sequence of images. A 3D fuzzy filter is applied to each current pixel in each current block during the sequential processing. The 3D fuzzy filter considers the energy of the block, and the intensities of pixels spatially adjacent and temporally adjacent to the current pixel to remove blocking and ringing artifacts.
Abstract:
A method filters a depth image, wherein the depth image includes an array of pixels at locations (x, y), and wherein each pixel has a depth. A moving window is applied to the pixels in the depth image, wherein a size of the window covers a set of pixels centered at each pixel. A single representative depth from the set of pixels in the window is assigned to the pixel to produce a processed depth image. Then, each pixel in the processed depth image is filtered to correct outlier depths without blurring depth discontinuities to produce a filtered depth image.
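The moving-window step can be sketched as follows. The abstract does not say how the representative depth is chosen, so this sketch assumes the most frequent depth in a 3x3 window, with ties broken in favor of the depth closest to the current pixel. The second, outlier-correcting filter is not shown. The function names are hypothetical.

```python
from collections import Counter

def representative_depth(window_values, center):
    """Assumed rule: most frequent depth in the window; ties go to the depth closest to center."""
    counts = Counter(window_values)
    best = max(counts.values())
    candidates = [d for d, c in counts.items() if c == best]
    return min(candidates, key=lambda d: (abs(d - center), d))

def filter_depth(depth, radius=1):
    """Assign a representative depth to every pixel, reading only from the input image."""
    h, w = len(depth), len(depth[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            # Window centered at (y, x), clipped at the image border.
            win = [depth[j][i]
                   for j in range(max(0, y - radius), min(h, y + radius + 1))
                   for i in range(max(0, x - radius), min(w, x + radius + 1))]
            row.append(representative_depth(win, depth[y][x]))
        out.append(row)
    return out

# Hypothetical depth image: a depth edge plus one slightly wrong depth (53).
depth = [
    [50, 50, 50, 120, 120],
    [50, 53, 50, 120, 120],
    [50, 50, 50, 120, 120],
]
processed = filter_depth(depth)
```

Because the output depth is always one of the depths already present in the window, the erroneous value is replaced without introducing intermediate depths across the edge.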
Abstract:
Disclosed are an encoding/decoding method and apparatus using a skip mode. The image decoding method comprises: generating a warping prediction depth image unit, and then decoding skip information regarding an image unit to be decoded; and decoding the image unit to be decoded using a skip mode based on the skip information. The skip information may be determined based on depth information of the warping prediction depth image unit, depth information of the image unit to be decoded, or edge information of a texture picture image unit corresponding to the image unit to be decoded. Thus, an unnecessary prediction process need not be performed during the encoding and decoding of the image unit, thereby improving image encoding and decoding efficiency.
Abstract:
Embodiments of the invention disclose a system and a method for determining a disparity search range for a current stereo image of a scene based on a set of stereo images of the scene, comprising the steps of: selecting a subset of stereo images from the set of stereo images, wherein the subset includes the current stereo image and at least one neighboring stereo image that is temporally neighboring to the current stereo image; determining a disparity histogram for each stereo image in the subset of stereo images to form a set of disparity histograms; determining a weighted disparity histogram as a weighted sum of the disparity histograms in the set of disparity histograms; and determining the disparity search range from the weighted disparity histogram.
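The steps listed in this abstract can be sketched in a few lines of Python. The abstract does not state how the per-image disparities are obtained, which weights are used, or how the range is read off the weighted histogram. This sketch assumes integer per-pixel disparity estimates are already available, uses hypothetical weights that favor the current image, and takes the range as the smallest and largest bins whose weighted count exceeds a threshold.

```python
def disparity_histogram(disparities, num_bins):
    """Count how often each integer disparity in 0..num_bins-1 occurs."""
    hist = [0] * num_bins
    for d in disparities:
        if 0 <= d < num_bins:
            hist[d] += 1
    return hist

def weighted_histogram(histograms, weights):
    """Weighted sum of equally sized histograms, bin by bin."""
    num_bins = len(histograms[0])
    return [sum(w * h[b] for h, w in zip(histograms, weights))
            for b in range(num_bins)]

def search_range(hist, threshold):
    """Assumed rule: first and last bins whose weighted count exceeds the threshold."""
    active = [b for b, count in enumerate(hist) if count > threshold]
    if not active:
        return None
    return active[0], active[-1]

# Hypothetical disparities for the previous, current, and next stereo images.
previous = [3, 4, 4, 5, 5, 5]
current = [4, 5, 5, 6, 6, 12]   # 12 is an isolated estimate
following = [5, 6, 6, 6, 7, 7]

hists = [disparity_histogram(d, 16) for d in (previous, current, following)]
combined = weighted_histogram(hists, [0.25, 0.5, 0.25])
disparity_range = search_range(combined, threshold=0.6)
```

With these made-up numbers the isolated disparity 12 in the current image falls below the threshold once the temporally neighboring histograms are mixed in, so it does not widen the search range.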
Abstract:
A method synthesizes virtual images from a sequence of texture images and a sequence of corresponding depth images, wherein each depth image stores depths d at pixel locations I(x, y). Each depth image is preprocessed to produce a corresponding preprocessed depth image. A first reference image and a second reference image are selected from the sequence of texture images. Then, depth-based 3D warping, depth-based histogram matching, base plus assistant image blending, and depth-based in-painting are applied in order to synthesize a virtual image.
Abstract:
A method processes multiview videos of a scene, in which each video is acquired by a corresponding camera arranged at a particular pose, and in which a view of each camera overlaps with the view of at least one other camera. Side information for synthesizing a particular view of the multiview videos is obtained in either an encoder or a decoder. A synthesized multiview video is synthesized from the multiview videos and the side information. A reference picture list is maintained for each current frame of each of the multiview videos; the reference picture list indexes temporal reference pictures and spatial reference pictures of the acquired multiview videos and synthesized reference pictures of the synthesized multiview video. Each current frame of the multiview videos is predicted according to reference pictures indexed by the associated reference picture list.
Abstract:
A method randomly accesses multiview videos. Multiview videos of a scene are acquired with corresponding cameras arranged at poses such that there is view overlap between any pair of cameras. V-frames are generated from the multiview videos. The V-frames are encoded using only spatial prediction. Then, the V-frames are inserted periodically in an encoded bit stream to provide random temporal access to the multiview videos. Additional view dependency information enables the decoding of a reduced number of frames prior to randomly accessing a target frame for a specified view and time, and decoding the target frame.