摘要:
Embodiments of the invention disclose a system and a method for determining a disparity search range for a current stereo image of a scene based on a set of stereo images of the scene, comprising steps of: selecting a subset of stereo images from the set of stereo images, the subset includes the current stereo image and at least one neighboring stereo image, wherein the neighboring stereo image is temporally-neighboring to the current stereo image; determining a disparity histogram for each stereo image in the subset of stereo images to form a set of disparity histograms; determining a weighted disparity histogram as a weighted sum of the disparity histograms in the set of disparity histograms; and determining the disparity search range from the weighted disparity histogram.
摘要:
Multiview videos are acquired by overlapping cameras. Side information is used to synthesize multiview videos. A reference picture list is maintained for current frames of the multiview videos, the reference picture indexes temporal reference pictures and spatial reference pictures of the acquired multiview videos and the synthesized reference pictures of the synthesized multiview video. Each current frame of the multiview videos is predicted according to reference pictures indexed by the associated reference picture list with a skip mode and a direct mode, whereby the side information is inferred from the synthesized reference picture. Alternatively, the depth images corresponding to the multiview videos of the input data, and this data are encoded as part of the bitstream depending on a SKIP type.
摘要:
A method represents a correlated set of images. The correlation can be spatial or temporal. A lossy operation is applied to each image in the correlated set to generate a coarse image. The coarse image is encoded losslessly to yield an encoded coarse image. Each image is also represented by syndrome bits. The combination of the encoded coarse images and the syndrome bits represent the correlated set of images.
摘要:
A method for verifying a similarity between a first signal and a second signal is described. The first and the second signals are encrypted homomorphically using a key. First, we acquire a set of error patterns determined by a similarity constraint. Then, each error pattern is homomorphically encrypted using the key and presented to a verifier in the setup phase. The verifier declares the first signal similar to the second signal, if any error pattern in the set of error patterns satisfies a homomorphic relationship between the first encrypted signal and the second encrypted signal.
摘要:
A method synthesizes virtual images from a sequence of texture images and a sequence of corresponding depth images, wherein each depth images stores depths d at pixel locations I(x, y). Each depth image, is preprocessed to produce a corresponding preprocessed depth image. A first reference image and a second reference image are from the sequence of texture images. Then, depth-based 3D warping, depth-based histogram matching, base plus assistant image blending, and depth-based in-painting are applied in order to synthesize a virtual image.
摘要:
A method processes a multiview videos of a scene, in which each video is acquired by a corresponding camera arranged at a particular pose, and in which a view of each camera overlaps with the view of at least one other camera. Side information for synthesizing a particular view of the multiview video is obtained in either an encoder or decoder. A synthesized multiview video is synthesized from the of multiview videos and the side information. A reference picture list is maintained for each current frame of each of the multiview videos, the reference picture indexes temporal reference pictures and spatial reference pictures of the acquired multiview videos and the synthesized reference pictures of the synthesized multiview video. Each current frame of the multiview videos is predicted according to reference pictures indexed by the associated reference picture list.
摘要:
A method and apparatus transcode an image. An encoded input bitstream of the image is analyzed to obtain a structure of the encoded input bitstream, in which the image includes a region-of-interest and a background region, and in which the encoded input bitstream is a stream of packets. A first quality value for the region-of-interest and a second quality value for the background region are determined. An encoded output bitstream is composed from a subset of the packets selected from the encoded input bitstream according to the structure, in which the subset of packets includes a first set of packets corresponding to the region-of-interest having the first quality value, a second set of packets corresponding to the background region and having second quality value, and empty packets.
摘要:
First biometric parameters are acquired from a user. Input data are encrypted according to the biometric parameters to produce ciphertext. The biometric parameters are encoded using a syndrome encoder to produce a syndrome code. The ciphertext and the syndrome code are associated with each other and stored in a computer readable media so that only the same user can subsequently decrypt the cipher text.
摘要:
A method acquires compressed videos. Intra-, or inter-frames of each compressed video are acquired at a fixed sampling rate. Joint analysis is applied concurrently and in parallel to the compressed videos to determine a variable and non-uniform temporal sampling rate for each compressed video so that a combined distortion is minimized and a combined frame rate constraint is satisfied. Each compressed video is then sampled at the associated variable and non-uniform temporal sampling rate to produce output compressed videos having variable temporal resolutions.
摘要:
A method performs inverse tone mapping of an image in a decoder. For each block of each color channel of the image a scaling factor is determined by adding a predicted scaling factor for the current block to a difference between the predicted scaling factor and the scaling factor of an adjacent block. An offset value for the current block is determined by adding a predicted offset for the current block to a difference between the predicted offset value and the offset value of the adjacent block. The scaling factor and the offset value are applied to pixel intensity values of the current block to produce a mapped block. The inverse tone mapping can also be applied to blocks of different sizes.