摘要:
A method determines distortion in a video by measuring a spatial distortion in coded frames, and by measuring a temporal distortion and spatial distortion in uncoded frames. The spatial distortion of the coded frames is combined with the temporal distortion and the spatial distortion of the uncoded frames to determine a total average distortion in the video.
摘要:
A method estimates rate and distortion characteristics of a video object. First and second object shape features are respectively extracted at a first and second resolution of the video object. First and second rate distortion characteristics of the video object are respectively determined from the extracted first and second object shape features according to first and second modeling parameters. The extracted object shape features can be discrete, such as states of binary shape patterns of the video object, or the object shape features can be continuous such as a set of statistical moments representing a probability density function of the video object.
摘要:
A method encodes a video as video objects. For each candidate object, a quantizer parameter and a skip parameter that jointly minimizes an average total distortion in the video are determined while satisfying predetermined constraints. The average total distortion includes spatial distortion of coded objects and spatial and temporal distortion of uncoded objects. Then, the candidate objects is encoded as the coded objects with the quantizer parameter and the skip parameter, and the candidate objects is skipped as the uncoded objects with the skip parameter.
摘要:
A compressed bitstream is scaled down to a reduced rate bitstream by first demultiplexing a compressed input bitstream to extract video objects as elementary input bitstreams having a first bit rate. A transcoder converts each elementary input bitstream to an elementary output bitstream having a second bit rate. The first bit rate is less than the second bit rate. A transcoding control unit, coupled to the transcoder, supplies control information for the transcoder. A multiplexer composes the elementary output bitstreams into a compressed output bitstream having the second bit rate.
摘要:
Multiview videos are acquired of a scene with corresponding cameras arranged at poses, such that there is view overlap between any pair of cameras. V-frames are generated from the multiview videos. The V-frames are encoded using only spatial prediction. Then, the V-frames are inserted periodically in an encoded bit stream to provide random temporal access to the multiview videos. Additional view dependency information enables the decoding of a reduced number of frames prior to accessing randomly a target frame for a specified view and time, and decoding the target frame. The method also decodes multiview videos by maintaining a reference picture list for a current frame of a plurality of multiview videos, and predicting each current frame of the plurality of multiview videos according to reference pictures indexed by the associated reference picture list.
摘要:
A method randomly accesses multiview videos. Multiview videos are acquired of a scene with corresponding cameras arranged at poses, such that there is view overlap between any pair of cameras. V-frames are generated from the multiview videos. The V-frames are encoded using only spatial prediction. Then, the V-frames are inserted periodically in an encoded bitstream to provide random temporal access to the multiview videos.
摘要:
A model stored in a memory accessible by a video transcoder includes a first rate-distortion function modeling a requantization of an input video. A second-rate distortion function models a resynchronization marker insertion rate for the transcoded video, and a third rate-distortion function models an intra-block insertion rate for the transcoded video.
摘要:
A method classifies pixels in an image by first partitioning the image into blocks. A variance of an intensity is determined for each pixel, and for each block the pixel with the maximum variance is identified. Then, the blocks are classified into classes according to the maximum variance.
摘要:
A method encodes an inter-frame of a compressed video, the inter-frame including multiple macroblocks in a predetermined order. Each macroblock has an associated motion vector. For each current macroblock in the predetermined order, a set of near macroblocks are identified. An index is assigned to each near macroblock. A difference between the motion vector of the current macroblock and the motion vector of each near macroblocks is determined. The indices of the near macroblocks are then sorted in order of the differences and appended to the inter-frame.
摘要:
A method randomly accesses multiview videos. Multiview videos are acquired of a scene with corresponding cameras arranged at poses, such that there is view overlap between any pair of cameras. V-frames are generated from the multiview videos. The V-frames are encoded using only spatial prediction. Then, the V-frames are inserted periodically in an encoded bitstream to provide random temporal access to the multiview videos.