摘要:
Image reproduction is improved in an MPEG-like television receiver by inclusion of post-processing adaptive error concealment. Compressed video signal is examined to determine blocks of video signal containing errors, and error tokens are generated for identifying corresponding blocks of decompressed pixel values. Pixel values adjacent the decompressed blocks of pixel values containing errors are examined to generate estimates of the relative image motion and image detail in the area of such blocks. The block of pixel values is replaced with temporally displaced co-located blocks of pixel values or interpolated data depending upon whether the estimate of image motion is lesser or greater than the estimate of image detail.
摘要:
A bitstream includes coded pictures, and split-flags for generating a transform tree. The bit stream is a partitioning of coding units (CUs) into Prediction Units (PUs). The transform tree is generated according to the split-flags. Nodes in the transform tree represent transform units (TU) associated with the CUs. The generation splits each TU only if the corresponding split-flag is set. For each PU that includes multiple TUs, the multiple TUs are merged into a larger TU, and the transform tree is modified according to the splitting and merging. Then, data contained in each PU can be decoded using the TUs associated with the PU according to the transform tree.
摘要:
A method randomly accesses multiview videos. Multiview videos are acquired of a scene with corresponding cameras arranged at poses, such that there is view overlap between any pair of cameras. V-frames are generated from the multiview videos. The V-frames are encoded using only spatial prediction. Then, the V-frames are inserted periodically in an encoded bit stream to provide random temporal access to the multiview videos. Additional view dependency information enables the decoding of a reduced number of frames prior to accessing randomly a target frame for a specified view and time, and decoding the target frame.
摘要:
A system and method synthesizes multiview videos. Multiview videos are acquired of a scene with corresponding cameras arranged at a poses such that there is view overlap between any pair of cameras. A synthesized multiview video is generated from the acquired multiview videos for a virtual camera. A reference picture list is maintained for each current frame of each of the multiview videos and the synthesized video. The reference picture list indexes temporal reference pictures and spatial reference pictures of the acquired multiview videos and the synthesized reference pictures of the synthesized multiview video. Then, each current frame of the multiview videos is predicted according to reference pictures indexed by the associated reference picture list during encoding and decoding.
摘要:
A system and method manages multiview videos. A reference picture list is maintained for each current frame of multiple multiview videos. The reference picture list indexes temporal reference pictures, spatial reference pictures and synthesized reference pictures of the multiview videos. Then, each current frame of the multiview videos is predicted according to reference pictures indexed by the associated reference picture list during encoding and decoding.
摘要:
A method and system processes a compressed input video. The compressed input video is processed to produce an interlaced picture, and macroblock coding information of the input video. The interlaced picture has a first spatial resolution, and a top-field and a bottom-field. The top-field and the bottom-field of the interlaced picture are filtered adaptively according to the macroblock coding information to produce a progressive picture with a second spatial resolution less than the first spatial resolution.
摘要:
A method filters pixels in an image, by first partitioning the image into blocks. Edge block are identified. A variance of an intensity for each pixel in each edge block is determined. Then, each pixel in each edge block is filtered with a filter that is dependant on the variance of the pixel.
摘要:
A method transcodes a compressed multi-layer video bitstream that includes a base layer bitstream and an enhancement layer bitstream. The base and enhancement layers are first partially decoded, and then the partially decoded signals are combined with a motion compensated signal yielding a combined signal. The combined signal is quantized into an output signal according to a quantization parameter, and the output signal is variable length encoded as a single layer bitstream. In a preprocessing step, the enhancement layer can be truncated according to rate control constraint, and the same constraints can also be used during the quantization.
摘要:
A method for generating a representation of multimedia content by first segmenting the multimedia content spatially and temporally to extract objects. Feature extraction is applied to the objects to produce semantic and syntactic attributes, relations, and a containment set of content entities. The content entities are coded to produce directed acyclic graphs of the content entities, where each directed acyclic graph represents a particular interpretation of the multimedia content. Attributes of each content entity are measured and the measured attributes are assigned to each corresponding content entity in the directed acyclic graphs to rank order the multimedia content.
摘要:
A method encodes a video by first measuring a variance of pixel intensities in a current frame. A number of bits to encode the current frame is assigned according to rate and buffer fullness constraints. A multiplier value is determined directly as a function of only the variance and the number of bits assigned to the current frame. Motion vectors between a reference frame and the current frame are estimated, and a sum of absolute difference (SAD) is based on a motion compensated residual between the reference frame and the current frame. An encoding mode is determined for each macro block in the current frame based on the sum of absolute difference, the motion vectors and the multiplier value. Then, the motion compensated residual is encoded based on the encoding mode, multiplier value and the number of allocated bits.