摘要:
A compressed bit-stream represents a corresponding sequence having intra-coded frames and inter-coded frames. The compressed bit-stream includes bits associated with each of the inter-coded frames representing a displacement from the associated inter-coded frame to a closest matching of the intra-coded frames. A magnitude of the displacement of a first of the inter-coded frames is determined based on the bits in the compressed bit-stream associated with that inter-coded frame. The inter-coded frame is then identified based on the determined displacement magnitude. The inter-coded frame includes macro-blocks. Each macro-block is associated with a respective portion of the inter-coded frame bits which represent the displacement from that macro-block to the closest matching intra-coded frame. The displacement magnitude is an average of the displacement magnitudes of all the macro-blocks associated with the inter-coded frame. The displacement magnitudes of those macro-blocks which are less than the average displacement magnitude are set to zero. The number of run lengths of the zero magnitude macro-blocks is determined and also used to identify the first inter-coded frame.
摘要:
A method detects a boundary in a sequence of two-dimensional images where each image has multiple intensity value points. Filtering and motion analysis is applied on each image to produce motion enhanced images. Initial search parameters are determined from a dynamic snake model applied to the motion enhanced images. Each motion enhanced image is searched for a potential boundary using the search parameters. The potential boundary is projected into the motion enhanced image of a previous, current, and next image, and the search parameters of the previous, current, and next images are updated. The searching, projecting, and updating repeat until a predetermined level of convergence is reached.
摘要:
A video signal compression system includes motion compensated predictive compression apparatus for compressing respective frames of video signal according to either intraframe processing or interframe processing on a block by block basis to generate blocks of compressed data and associated motion vectors. A compressed signal formatter arranges the blocks of compressed data and the associated motion vectors according to a desired signal protocol wherein motion vectors of interframe processed frames are associated with corresponding blocks of compressed data and motion vectors of intraframe processed frames are associated with blocks substantially adjacent to corresponding blocks of compressed data. The motion vectors are included with intraframe compressed data to facilitate error concealment at respective receiver apparatus.
摘要:
Error concealment apparatus for correcting errors in signals representing video images includes means for detecting image gradients in an area surrounding a lost block of image data. Circuitry responsive to these image gradients generates a plurality of blocks of directionally interpolated pixel values. The pixel values in the respective blocks of directionally interpolated pixel values are sorted according to amplitude, and then pixel values from mutually exclusive positions in the respective blocks are selected to form a block of pixel values for error concealment.
摘要:
Image reproduction is improved in an MPEG-like television receiver by inclusion of post-processing adaptive error concealment. Compressed video signal is examined to determine blocks of video signal containing errors, and error tokens are generated for identifying corresponding blocks of decompressed pixel values. Pixel values adjacent the decompressed blocks of pixel values containing errors are examined to generate estimates of the relative image motion and image detail in the area of such blocks. The block of pixel values is replaced with temporally displaced co-located blocks of pixel values or interpolated data depending upon whether the estimate of image motion is lesser or greater than the estimate of image detail.
摘要:
A bitstream includes coded pictures, and split-flags for generating a transform tree. The bit stream is a partitioning of coding units (CUs) into Prediction Units (PUs). The transform tree is generated according to the split-flags. Nodes in the transform tree represent transform units (TU) associated with the CUs. The generation splits each TU only if the corresponding split-flag is set. For each PU that includes multiple TUs, the multiple TUs are merged into a larger TU, and the transform tree is modified according to the splitting and merging. Then, data contained in each PU can be decoded using the TUs associated with the PU according to the transform tree.
摘要:
A method randomly accesses multiview videos. Multiview videos are acquired of a scene with corresponding cameras arranged at poses, such that there is view overlap between any pair of cameras. V-frames are generated from the multiview videos. The V-frames are encoded using only spatial prediction. Then, the V-frames are inserted periodically in an encoded bit stream to provide random temporal access to the multiview videos. Additional view dependency information enables the decoding of a reduced number of frames prior to accessing randomly a target frame for a specified view and time, and decoding the target frame.
摘要:
A system and method synthesizes multiview videos. Multiview videos are acquired of a scene with corresponding cameras arranged at a poses such that there is view overlap between any pair of cameras. A synthesized multiview video is generated from the acquired multiview videos for a virtual camera. A reference picture list is maintained for each current frame of each of the multiview videos and the synthesized video. The reference picture list indexes temporal reference pictures and spatial reference pictures of the acquired multiview videos and the synthesized reference pictures of the synthesized multiview video. Then, each current frame of the multiview videos is predicted according to reference pictures indexed by the associated reference picture list during encoding and decoding.
摘要:
A system and method manages multiview videos. A reference picture list is maintained for each current frame of multiple multiview videos. The reference picture list indexes temporal reference pictures, spatial reference pictures and synthesized reference pictures of the multiview videos. Then, each current frame of the multiview videos is predicted according to reference pictures indexed by the associated reference picture list during encoding and decoding.
摘要:
A method and system processes a compressed input video. The compressed input video is processed to produce an interlaced picture, and macroblock coding information of the input video. The interlaced picture has a first spatial resolution, and a top-field and a bottom-field. The top-field and the bottom-field of the interlaced picture are filtered adaptively according to the macroblock coding information to produce a progressive picture with a second spatial resolution less than the first spatial resolution.