摘要:
Embodiments of the invention describe a method for transcoding an input video in a first encoded format to an output video in a second encoded format, wherein the videos include a set of segments and each segment includes frames. First, the method is determining a set of downsample resilient segments in the input video and a set of full-resolution segments in the input video. Next, the method is downsampling the set of downsample resilient segments to produce a set of downsampled segments and transcoding the input video using the set of full-resolution segments and the set of downsampled segments to produce the output video including at least two segments with different resolutions.
摘要:
A method represents a correlated set of images. The correlation can be spatial or temporal. A lossy operation is applied to each image in the correlated set to generate a coarse image. The coarse image is encoded losslessly to yield an encoded coarse image. Each image is also represented by syndrome bits. The combination of the encoded coarse images and the syndrome bits represent the correlated set of images.
摘要:
A method for verifying a similarity between a first signal and a second signal is described. The first and the second signals are encrypted homomorphically using a key. First, we acquire a set of error patterns determined by a similarity constraint. Then, each error pattern is homomorphically encrypted using the key and presented to a verifier in the setup phase. The verifier declares the first signal similar to the second signal, if any error pattern in the set of error patterns satisfies a homomorphic relationship between the first encrypted signal and the second encrypted signal.
摘要:
A method synthesizes virtual images from a sequence of texture images and a sequence of corresponding depth images, wherein each depth images stores depths d at pixel locations I(x, y). Each depth image, is preprocessed to produce a corresponding preprocessed depth image. A first reference image and a second reference image are from the sequence of texture images. Then, depth-based 3D warping, depth-based histogram matching, base plus assistant image blending, and depth-based in-painting are applied in order to synthesize a virtual image.
摘要:
A method processes a multiview videos of a scene, in which each video is acquired by a corresponding camera arranged at a particular pose, and in which a view of each camera overlaps with the view of at least one other camera. Side information for synthesizing a particular view of the multiview video is obtained in either an encoder or decoder. A synthesized multiview video is synthesized from the of multiview videos and the side information. A reference picture list is maintained for each current frame of each of the multiview videos, the reference picture indexes temporal reference pictures and spatial reference pictures of the acquired multiview videos and the synthesized reference pictures of the synthesized multiview video. Each current frame of the multiview videos is predicted according to reference pictures indexed by the associated reference picture list.
摘要:
A method and apparatus transcode an image. An encoded input bitstream of the image is analyzed to obtain a structure of the encoded input bitstream, in which the image includes a region-of-interest and a background region, and in which the encoded input bitstream is a stream of packets. A first quality value for the region-of-interest and a second quality value for the background region are determined. An encoded output bitstream is composed from a subset of the packets selected from the encoded input bitstream according to the structure, in which the subset of packets includes a first set of packets corresponding to the region-of-interest having the first quality value, a second set of packets corresponding to the background region and having second quality value, and empty packets.
摘要:
First biometric parameters are acquired from a user. Input data are encrypted according to the biometric parameters to produce ciphertext. The biometric parameters are encoded using a syndrome encoder to produce a syndrome code. The ciphertext and the syndrome code are associated with each other and stored in a computer readable media so that only the same user can subsequently decrypt the cipher text.
摘要:
A method performs inverse tone mapping of an image in a decoder. For each block of each color channel of the image a scaling factor is determined by adding a predicted scaling factor for the current block to a difference between the predicted scaling factor and the scaling factor of an adjacent block. An offset value for the current block is determined by adding a predicted offset for the current block to a difference between the predicted offset value and the offset value of the adjacent block. The scaling factor and the offset value are applied to pixel intensity values of the current block to produce a mapped block. The inverse tone mapping can also be applied to blocks of different sizes.
摘要:
A adaptive complexity control algorithm is proposed to reduce the complexity of H.264 motion estimation. The main idea is to limit the complexity of motion estimation based on the expected RD coding gain loss. In order to efficiently reduce the complexity to desired level, ACC is designed to provide complexity scalability in motion estimation so as to provide flexible tradeoff between video quality and computational complexity. With the proposed algorithm, we demonstrate that complexity of motion estimation can be reduced by ¾ without significant RD performance degradation.
摘要:
A method processes a multiview videos of a scene, in which each video is acquired by a corresponding camera arranged at a particular pose, and in which a view of each camera overlaps with the view of at least one other camera. Side information for synthesizing a particular view of the multiview video is obtained in either an encoder or decoder. A synthesized multiview video is synthesized from the of multiview videos and the side information. A reference picture list is maintained for each current frame of each of the multiview videos, the reference picture indexes temporal reference pictures and spatial reference pictures of the acquired multiview videos and the synthesized reference pictures of the synthesized multiview video. Each current frame of the multiview videos is predicted according to reference pictures indexed by the associated reference picture list.