摘要:
Multiview videos are acquired of a scene with corresponding cameras arranged at poses, such that there is view overlap between any pair of cameras. V-frames are generated from the multiview videos. The V-frames are encoded using only spatial prediction. Then, the V-frames are inserted periodically in an encoded bit stream to provide random temporal access to the multiview videos. Additional view dependency information enables the decoding of a reduced number of frames prior to accessing randomly a target frame for a specified view and time, and decoding the target frame. The method also decodes multiview videos by maintaining a reference picture list for a current frame of a plurality of multiview videos, and predicting each current frame of the plurality of multiview videos according to reference pictures indexed by the associated reference picture list.
摘要:
A method randomly accesses multiview videos. Multiview videos are acquired of a scene with corresponding cameras arranged at poses, such that there is view overlap between any pair of cameras. V-frames are generated from the multiview videos. The V-frames are encoded using only spatial prediction. Then, the V-frames are inserted periodically in an encoded bitstream to provide random temporal access to the multiview videos.
摘要:
A model stored in a memory accessible by a video transcoder includes a first rate-distortion function modeling a requantization of an input video. A second-rate distortion function models a resynchronization marker insertion rate for the transcoded video, and a third rate-distortion function models an intra-block insertion rate for the transcoded video.
摘要:
A method classifies pixels in an image by first partitioning the image into blocks. A variance of an intensity is determined for each pixel, and for each block the pixel with the maximum variance is identified. Then, the blocks are classified into classes according to the maximum variance.
摘要:
A method encodes an inter-frame of a compressed video, the inter-frame including multiple macroblocks in a predetermined order. Each macroblock has an associated motion vector. For each current macroblock in the predetermined order, a set of near macroblocks are identified. An index is assigned to each near macroblock. A difference between the motion vector of the current macroblock and the motion vector of each near macroblocks is determined. The indices of the near macroblocks are then sorted in order of the differences and appended to the inter-frame.
摘要:
A method randomly accesses multiview videos. Multiview videos are acquired of a scene with corresponding cameras arranged at poses, such that there is view overlap between any pair of cameras. V-frames are generated from the multiview videos. The V-frames are encoded using only spatial prediction. Then, the V-frames are inserted periodically in an encoded bitstream to provide random temporal access to the multiview videos.
摘要:
A system and method synthesizes multiview videos. Multiview videos are acquired of a scene with corresponding cameras arranged at a poses such that there is view overlap between any pair of cameras. A synthesized multiview video is generated from the acquired multiview videos for a virtual camera. A reference picture list is maintained for each current frame of each of the multiview videos and the synthesized video. The reference picture list indexes temporal reference pictures and spatial reference pictures of the acquired multiview videos and the synthesized reference pictures of the synthesized multiview video. Then, each current frame of the multiview videos is predicted according to reference pictures indexed by the associated reference picture list during encoding and decoding.
摘要:
A method codes an output bitstream of an input video so the decoded output bitstream has a constant perceived quality. A base layer bitstream having a constant bit-rate is generated from the input video, and an input enhancement layer bitstream is generated from a difference between the input video and the base layer bitstream. Rate and distortion characteristics are extracted from the base layer bitstream and the input enhancement layer bitstream, and an output enhancement layer bitstream having a variable bit-rate is generated from the input enhancement layer bitstream according to the rate and distortion characteristics. The base layer bitstream and the output enhancement layer bitstream, in combination, form an output bitstream having a variable bit-rate and a constant distortion.
摘要:
A method acquires compressed videos. Intra-, or inter-frames of each compressed video are acquired at a fixed sampling rate. Joint analysis is applied concurrently and in parallel to the compressed videos to determine a variable and non-uniform temporal sampling rate for each compressed video so that a combined distortion is minimized and a combined frame rate constraint is satisfied. Each compressed video is then sampled at the associated variable and non-uniform temporal sampling rate to produce output compressed videos having variable temporal resolutions.
摘要:
A method transcodes groups of macroblocks of a partially decoded input bitstream. The groups of macroblocks include intra-mode and inter-mode macroblocks. Each macroblock includes DCT coefficients, and at least one motion vector. The modes of each group of macroblocks are mapped to be identical only if there is an inter-mode block and an intra-mode macroblock in the group. If any of the macroblocks in the group are mapped, then the DCT coefficients and the motion vector for such mapped macroblocks are modified in accordance with the mapping to generate reduced-resolution macroblock for an output compressed bitstream to compensate for drift.