Abstract:
A bitstream includes coded pictures and split-flags for generating a transform tree. The bitstream also specifies a partitioning of coding units (CUs) into prediction units (PUs). The transform tree is generated according to the split-flags. Nodes in the transform tree represent transform units (TUs) associated with the CUs. The generation splits each TU only if the corresponding split-flag is set. For each PU that includes multiple TUs, the multiple TUs are merged into a larger TU, and the transform tree is modified according to the splitting and merging. Then, data contained in each PU can be decoded using the TUs associated with the PU according to the transform tree.
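The split-then-merge procedure can be sketched as follows. This is only an illustration under stated assumptions, not the patented method: TUs and PUs are modeled as `(x, y, width, height)` rectangles inside one square CU, split-flags are consumed depth-first, and TUs are merged into a single PU-sized TU only when they tile the PU exactly. The function names and the `min_size` parameter are hypothetical.

```python
def split_tus(x, y, size, flags, min_size=4):
    """Consume split-flags depth-first; return leaf TUs as (x, y, w, h)."""
    # Assumption: no flag is read once the minimum TU size is reached.
    if size > min_size and next(flags, 0):
        half = size // 2
        leaves = []
        for dy in (0, half):
            for dx in (0, half):
                leaves += split_tus(x + dx, y + dy, half, flags, min_size)
        return leaves
    return [(x, y, size, size)]


def inside(tu, pu):
    """True if rectangle tu lies entirely within rectangle pu."""
    tx, ty, tw, th = tu
    px, py, pw, ph = pu
    return px <= tx and py <= ty and tx + tw <= px + pw and ty + th <= py + ph


def merge_tus(tus, pus):
    """Replace the TUs of each PU that holds several TUs with one larger TU."""
    remaining = list(tus)
    merged = []
    for pu in pus:
        covered = [t for t in remaining if inside(t, pu)]
        area = sum(w * h for _, _, w, h in covered)
        # Assumption: merge only when the covered TUs tile the PU exactly.
        if len(covered) > 1 and area == pu[2] * pu[3]:
            remaining = [t for t in remaining if t not in covered]
            merged.append(pu)
    return merged + remaining


def build_transform_units(cu_size, split_flags, pus, min_size=4):
    """Split according to the flags, then merge per PU; returns sorted TUs."""
    leaves = split_tus(0, 0, cu_size, iter(split_flags), min_size)
    return sorted(merge_tus(leaves, pus))
```

For a 16x16 CU split once into four 8x8 TUs and partitioned into two 16x8 PUs, each PU contains two TUs, so the sketch ends with two 16x8 TUs.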
Abstract:
A method conceals errors in an intra-frame of compressed video. The intra-frame is decoded to multiple macroblocks. Each macroblock includes pixels arranged in a rectangular array. A lost macroblock is identified during the decoding. Pixels along an outer boundary of the lost macroblock are concealed from nearest candidate pixels along outer boundaries of macroblocks immediately adjacent to the lost macroblock. All other pixels in the lost macroblock are concealed from nearest candidate pixels selected from previously concealed pixels in the lost macroblock.
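One way to picture the outside-in concealment order is the sketch below. It assumes a square lost macroblock, a frame stored as a list of rows, at least one available neighboring pixel, and Euclidean distance as the meaning of "nearest". All names are hypothetical, and copying the candidate value is just one possible concealment rule.

```python
def ring(bx, by, n, k):
    """Coordinates of the square ring at inset k of an n x n block at (bx, by).

    k = 0 is the block's outer boundary; k = -1 is the ring just outside it.
    """
    lo_x, lo_y = bx + k, by + k
    hi_x, hi_y = bx + n - 1 - k, by + n - 1 - k
    coords = []
    for y in range(lo_y, hi_y + 1):
        for x in range(lo_x, hi_x + 1):
            if x in (lo_x, hi_x) or y in (lo_y, hi_y):
                coords.append((x, y))
    return coords


def conceal(frame, bx, by, n):
    """Fill the lost n x n block in place, ring by ring from the outside in."""
    h, w = len(frame), len(frame[0])
    for k in range((n + 1) // 2):
        # Candidates: neighbor boundary pixels for k = 0, otherwise the ring
        # concealed in the previous iteration.
        candidates = [(x, y) for x, y in ring(bx, by, n, k - 1)
                      if 0 <= x < w and 0 <= y < h]
        for x, y in ring(bx, by, n, k):
            # Ties are broken by candidate order (row-major).
            cx, cy = min(candidates,
                         key=lambda c: (c[0] - x) ** 2 + (c[1] - y) ** 2)
            frame[y][x] = frame[cy][cx]
    return frame
```

Because each ring reads only from the ring outside it, interior pixels are always derived from pixels that were concealed earlier, as the abstract describes.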
Abstract:
A method and system transcode an input video to a lower spatial resolution. The input video is first decoded into pictures. Each picture includes a set of macroblocks. Each picture is sub-sampled to a downscaled picture having a lower spatial resolution. A quantization scale is selected for each macroblock in the downscaled picture. A set of motion vectors is generated for each macroblock in the downscaled picture. A multiplier value based on the quantization scale is determined for each macroblock in the downscaled picture. One of a plurality of encoding modes is selected for each macroblock in the downscaled picture according to the quantization scale, the motion vectors, and the multiplier value. Then, each macroblock in each downscaled picture is encoded according to the quantization scale, the selected encoding mode, and the set of motion vectors to produce an output video having a lower spatial resolution than the input video.
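The per-macroblock mode decision can be illustrated with a Lagrangian cost, sketched below. The abstract does not give the cost function or the rule that maps the quantization scale to the multiplier, so the quadratic rule, the constant `c`, and all names here are assumptions. Each candidate mode is described by a distortion and a bit count, where the bit count is assumed to already include the bits for that mode's motion vectors.

```python
def lagrange_multiplier(qscale, c=0.85):
    """Hypothetical rule: the multiplier grows with the square of the scale."""
    return c * qscale * qscale


def select_mode(candidates, qscale):
    """Pick the mode minimizing distortion + multiplier * rate.

    candidates maps a mode name to (distortion, rate_in_bits).
    """
    lam = lagrange_multiplier(qscale)
    return min(candidates,
               key=lambda m: candidates[m][0] + lam * candidates[m][1])


# Illustrative numbers only, not measurements.
modes = {"intra": (100.0, 300), "inter": (400.0, 40), "skip": (2000.0, 1)}
fine = select_mode(modes, qscale=2)     # small multiplier: distortion matters
coarse = select_mode(modes, qscale=10)  # large multiplier: bits matter
```

With these numbers a fine quantization scale favors the inter mode and a coarse one favors skip, which shows how the multiplier shifts the decision toward cheaper modes.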
Abstract:
A system and method manage multiview videos. A reference picture list is maintained for each current frame of multiple multiview videos. The reference picture list indexes temporal reference pictures, spatial reference pictures, and synthesized reference pictures of the multiview videos. Then, each current frame of the multiview videos is predicted according to reference pictures indexed by the associated reference picture list during encoding and decoding.
Abstract:
A method decodes a video encoded as a compressed bitstream including frames constructed as blocks. The bitstream is variable-length decoded and inverse quantized to determine motion vectors and DCT coefficients of the blocks of the bitstream. The DCT coefficients are then inverse discrete cosine transformed. A low-resolution block is generated for each reconstructed block, and the reconstructed blocks are motion-compensated with the low-resolution blocks to decode the video.
Abstract:
A system and method temporally process an input video including input frames. Each input frame has an associated input frame play time, and the input video has a total input video play time that is a sum of the input frame play times of all of the input frames. Each of the input frames is classified according to a content characteristic of the frame. An output frame play time, based on the classified content characteristic, is allocated to each of the input frames to generate a plurality of output frames that form an output video.
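As a rough sketch of the allocation step, the code below classifies each frame by a single activity score and shortens the play time of low-activity frames. The activity score, the two classes, the threshold, and the speed-up factors are all hypothetical; the abstract does not say which content characteristic is used or how play time is derived from it.

```python
# Hypothetical speed-up per class: low-activity frames play four times faster.
SPEEDUP = {"low": 4.0, "high": 1.0}


def classify(activity, threshold=0.5):
    """Classify a frame by an assumed activity score in [0, 1]."""
    return "high" if activity >= threshold else "low"


def allocate_play_times(frames):
    """frames: list of (input_play_time_ms, activity); returns output times."""
    return [play_time / SPEEDUP[classify(activity)]
            for play_time, activity in frames]


def total_play_time(play_times):
    """Total play time is the sum of the per-frame play times."""
    return sum(play_times)
```

For example, two 40 ms input frames with activity scores 0.1 and 0.9 would receive 10 ms and 40 ms, so the output video is shorter than the input while the high-activity frame keeps its original duration.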
Abstract:
A method transcodes an input video to an output video, which can have a lower bit rate than the input video. Sets of rate values and corresponding sets of distortion values are determined for each component of the output video. There is one set of rate values and one corresponding set of distortion values for each component in the output video. The components include a requantization of the input video, inserted resynchronization markers, and inserted intra-blocks. Then, bits are allocated to each of the components in the output video according to the associated set of rate values and the associated corresponding set of distortion values.
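The allocation across components can be illustrated by an exhaustive search over rate-distortion operating points, as sketched below. This assumes that each component offers a small set of `(rate, distortion)` pairs, that distortions add across components, and that the goal is minimum total distortion within a bit budget. None of these choices comes from the abstract, and the numbers are made up.

```python
from itertools import product


def allocate_bits(components, budget):
    """Pick one (rate, distortion) point per component within the budget.

    components maps a component name to a list of (rate, distortion) pairs.
    Returns a dict of chosen points, or None if no combination fits.
    """
    names = list(components)
    best = None
    for choice in product(*(components[n] for n in names)):
        rate = sum(r for r, _ in choice)
        dist = sum(d for _, d in choice)
        if rate <= budget and (best is None or dist < best[0]):
            best = (dist, dict(zip(names, choice)))
    return best[1] if best else None


# Illustrative operating points only.
points = {
    "requantization": [(100, 9.0), (200, 4.0)],
    "resync_markers": [(0, 5.0), (20, 3.0)],
    "intra_blocks": [(0, 6.0), (50, 2.0)],
}
allocation = allocate_bits(points, budget=250)
```

Exhaustive search is practical only for a handful of operating points per component; a real transcoder would more likely use a Lagrangian or greedy search over the same sets.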
Abstract:
A method conceals errors in an intra-frame of compressed video. The intra-frame is decoded to multiple macroblocks. Each macroblock includes pixels arranged in a rectangular array. A lost macroblock is identified during the decoding. Pixels along an outer boundary of the lost macroblock are concealed from nearest candidate pixels along outer boundaries of macroblocks immediately adjacent to the lost macroblock. All other pixels in the lost macroblock are concealed from nearest candidate pixels selected from previously concealed pixels in the lost macroblock.
Abstract:
A method classifies pixels in an image. The image can be a decompressed image that was compressed using a block-based compression process. A filter is applied to each pixel in the image to determine a mean intensity value of the pixel. The mean is used to determine a mean-square intensity for each pixel, which in turn is used to determine a variance of the intensity for each pixel. The mean-square represents an average power of a DC component in the image, and the variance represents an average power of AC frequency components in the image. The pixels are then classified according to the variance as being either smooth, edge, or texture pixels. Blocks in the image can then be classified according to the classified pixels, and blocking artifacts and ringing artifacts in the blocks can then be filtered according to the block classification.
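The variance-based classification can be sketched in plain Python as follows. The 3x3 window, the two thresholds, and the mapping of low, middle, and high variance to smooth, texture, and edge are assumptions chosen for illustration. The variance is computed with the standard identity, as the local mean of squared intensities minus the square of the local mean.

```python
def local_variance(image, radius=1):
    """Per-pixel intensity variance over a square window clipped at borders."""
    h, w = len(image), len(image[0])
    var = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            vals = [image[j][i]
                    for j in range(max(0, y - radius), min(h, y + radius + 1))
                    for i in range(max(0, x - radius), min(w, x + radius + 1))]
            mean = sum(vals) / len(vals)
            mean_sq = sum(v * v for v in vals) / len(vals)
            # Guard against tiny negative values from rounding.
            var[y][x] = max(0.0, mean_sq - mean * mean)
    return var


def classify_pixels(image, smooth_max=10.0, edge_min=400.0):
    """Label each pixel 'smooth', 'texture', or 'edge' (hypothetical limits)."""
    labels = []
    for row in local_variance(image):
        labels.append(["smooth" if v < smooth_max
                       else "edge" if v >= edge_min
                       else "texture"
                       for v in row])
    return labels
```

A block-level classifier could then count the labels inside each block to decide which blocks need deblocking or deringing filters.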
Abstract:
A method transmits a video over a network as a bit stream of packets. Real-time feedback information on conditions of the network is received while transmitting the packets. A probability of packet loss is determined from the real-time feedback. Then, redundant packets are generated for selected packets of the bit stream if the probability of packet loss is greater than a predetermined threshold.
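The threshold test on the loss probability can be sketched as below. The feedback format (counts of sent and lost packets), the `important` flag used to select packets, the threshold value, and the use of plain duplication as the redundancy scheme are all assumptions; the abstract does not specify them.

```python
def estimate_loss_probability(reports):
    """Estimate loss probability from feedback reports of sent/lost counts."""
    sent = sum(r["sent"] for r in reports)
    lost = sum(r["lost"] for r in reports)
    return lost / sent if sent else 0.0


def add_redundancy(packets, loss_prob, threshold=0.05):
    """Duplicate selected packets when the loss probability is too high."""
    out = []
    for packet in packets:
        out.append(packet)
        # Assumption: packets flagged 'important' are the selected ones.
        if loss_prob > threshold and packet.get("important"):
            out.append(dict(packet, redundant=True))
    return out
```

A practical system would more likely use forward error correction than duplication, but the control flow stays the same: estimate the loss probability from feedback, compare it with the threshold, and protect only the selected packets.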