Abstract:
A bitstream includes coded pictures and split-flags for generating a transform tree. The bitstream also specifies a partitioning of coding units (CUs) into prediction units (PUs). The transform tree is generated according to the split-flags. Nodes in the transform tree represent transform units (TUs) associated with the CUs. The generation splits each TU only if the corresponding split-flag is set. For each PU that includes multiple TUs, the multiple TUs are merged into a larger TU, and the transform tree is modified according to the splitting and merging. Then, data contained in each PU can be decoded using the TUs associated with the PU according to the transform tree.
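The split-then-merge step can be illustrated with a minimal sketch. It assumes a square CU, split-flags consumed in depth-first order, and merging into the bounding rectangle of the TUs inside each PU; the function names, the `min_size` limit, and the tuple layout `(x, y, width, height)` are hypothetical and not taken from the abstract.

```python
def build_tus(x, y, size, flags, min_size=4):
    """Depth-first quadtree: read one split-flag per node and split only if it is set."""
    if size > min_size and next(flags, 0):
        half = size // 2
        tus = []
        for dy in (0, half):
            for dx in (0, half):
                tus += build_tus(x + dx, y + dy, half, flags, min_size)
        return tus
    return [(x, y, size, size)]  # leaf node: one transform unit


def inside(tu, pu):
    """True if rectangle tu lies completely within rectangle pu."""
    tx, ty, tw, th = tu
    px, py, pw, ph = pu
    return px <= tx and py <= ty and tx + tw <= px + pw and ty + th <= py + ph


def merge_per_pu(tus, pus):
    """For each PU holding several TUs, replace them with one larger TU (assumed rule)."""
    merged, remaining = [], list(tus)
    for pu in pus:
        group = [t for t in remaining if inside(t, pu)]
        if len(group) > 1:
            x0 = min(t[0] for t in group)
            y0 = min(t[1] for t in group)
            x1 = max(t[0] + t[2] for t in group)
            y1 = max(t[1] + t[3] for t in group)
            merged.append((x0, y0, x1 - x0, y1 - y0))
        else:
            merged.extend(group)
        remaining = [t for t in remaining if t not in group]
    return merged + remaining


# A 16x16 CU split once into four 8x8 TUs, with two 16x8 PUs stacked vertically.
tus = build_tus(0, 0, 16, iter([1, 0, 0, 0, 0]))
pus = [(0, 0, 16, 8), (0, 8, 16, 8)]
merged_tus = merge_per_pu(tus, pus)
```

In this example each PU initially covers two 8x8 TUs, so each pair is merged into a single 16x8 TU.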
Abstract:
A method conceals errors in an intra-frame of compressed video. The intra-frame is decoded to multiple macroblocks. Each macroblock includes pixels arranged in a rectangular array. A lost macroblock is identified during the decoding. Pixels along an outer boundary of the lost macroblock are concealed from nearest candidate pixels along outer boundaries of macroblocks immediately adjacent to the lost macroblock. All other pixels in the lost macroblock are concealed from nearest candidate pixels selected from previously concealed pixels in the lost macroblock.
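The outside-in concealment order can be sketched as follows. This is an illustration under stated assumptions, not the patented procedure: the lost macroblock is square and not on the frame border, its neighbours decoded correctly, and "nearest candidate" is simplified to the first 4-connected neighbour that lies one ring further out. The name `conceal_macroblock` is hypothetical.

```python
def conceal_macroblock(frame, top, left, size):
    """Fill a lost size x size macroblock in place, ring by ring, from the outside in."""
    def ring_of(r, c):
        # Distance to the nearest edge of the lost block; -1 for pixels just outside it.
        return min(r - top, c - left, top + size - 1 - r, left + size - 1 - c)

    for ring in range((size + 1) // 2):
        for r in range(top, top + size):
            for c in range(left, left + size):
                if ring_of(r, c) != ring:
                    continue
                # Ring 0 copies from adjacent macroblocks; inner rings copy from
                # pixels of the lost macroblock that were concealed in the previous ring.
                for nr, nc in ((r - 1, c), (r, c - 1), (r + 1, c), (r, c + 1)):
                    if ring_of(nr, nc) == ring - 1:
                        frame[r][c] = frame[nr][nc]
                        break
    return frame


# Demo: 6x6 frame whose interior 4x4 macroblock was lost (None marks lost pixels).
frame = [[10 * r + c for c in range(6)] for r in range(6)]
for r in range(1, 5):
    for c in range(1, 5):
        frame[r][c] = None
conceal_macroblock(frame, top=1, left=1, size=4)
```

Processing a whole ring before moving inward guarantees that every inner pixel has an already concealed neighbour to copy from.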
Abstract:
A method and system transcode an input video to a lower spatial resolution. The input video is first decoded into pictures. Each picture includes a set of macroblocks. Each picture is sub-sampled to a downscaled picture having a lower spatial resolution. A quantization scale is selected for each macroblock in the downscaled picture. A set of motion vectors is generated for each macroblock in the downscaled picture. A multiplier value based on the quantization scale is determined for each macroblock in the downscaled picture. One of a plurality of encoding modes is selected for each macroblock in the downscaled picture according to the quantization scale, the motion vectors, and the multiplier value. Then, each macroblock in each downscaled picture is encoded according to the quantization scale, the selected encoding mode, and the set of motion vectors to produce an output video having a lower spatial resolution than the input video.
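The sub-sampling and mode-selection steps can be illustrated with a short sketch. Everything specific here is an assumption: 2x2 averaging as the sub-sampling filter, a Lagrangian cost of distortion plus multiplier times rate, and a multiplier of `k * qscale ** 2`. The abstract only states that the multiplier depends on the quantization scale. Distortion and rate for each candidate mode are assumed to have been measured beforehand using the macroblock's motion vectors.

```python
def downscale_2x(picture):
    """Halve both dimensions by averaging each 2x2 block (one possible sub-sampling)."""
    return [
        [(picture[r][c] + picture[r][c + 1] + picture[r + 1][c] + picture[r + 1][c + 1]) / 4
         for c in range(0, len(picture[0]) - 1, 2)]
        for r in range(0, len(picture) - 1, 2)
    ]


def select_mode(candidates, qscale, k=0.85):
    """candidates maps mode name -> (distortion, rate_bits); returns the cheapest mode."""
    multiplier = k * qscale ** 2  # hypothetical rule tying the multiplier to the quantizer
    return min(candidates, key=lambda m: candidates[m][0] + multiplier * candidates[m][1])


small = downscale_2x([[1, 3], [5, 7]])
candidates = {"intra": (100, 50), "inter": (150, 10)}
fine_mode = select_mode(candidates, qscale=1)
coarse_mode = select_mode(candidates, qscale=2)
```

With these made-up numbers, a fine quantizer favours the low-distortion intra mode, while a coarser quantizer raises the weight on rate and tips the choice to the cheaper inter mode.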
Abstract:
A method transcodes an input video to an output video, which can have a lower bit rate than the input video. Sets of rate values and corresponding sets of distortion values are determined for each component of the output video. There is one set of rate values and one corresponding set of distortion values for each component in the output video. The components include a requantization of the input video, inserted resynchronization markers, and inserted intra-blocks. Then, bits are allocated to each of the components in the output video according to the associated set of rate values and the associated corresponding set of distortion values.
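One way to picture the allocation is as a search over one rate-distortion operating point per component. The sketch below assumes that distortions add across components and that a total bit budget is given; the exhaustive search, the function name `allocate_bits`, and all numbers are illustrative only.

```python
from itertools import product


def allocate_bits(rd_sets, budget):
    """Pick one (rate, distortion) point per component, minimizing total distortion
    subject to total rate <= budget. Returns (distortion, choice) or None."""
    names = list(rd_sets)
    best = None
    for combo in product(*(rd_sets[n] for n in names)):
        rate = sum(r for r, _ in combo)
        dist = sum(d for _, d in combo)
        if rate <= budget and (best is None or dist < best[0]):
            best = (dist, dict(zip(names, combo)))
    return best


# Hypothetical (rate, distortion) sets for the three components named in the abstract.
rd_sets = {
    "requantization": [(100, 40), (200, 20), (300, 10)],
    "resync_markers": [(0, 30), (20, 15), (40, 10)],
    "intra_blocks": [(0, 25), (50, 12), (100, 8)],
}
result = allocate_bits(rd_sets, budget=300)
```

Exhaustive search is practical only for small sets; with many operating points a Lagrangian or dynamic-programming search would be the usual substitute.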
Abstract:
A method conceals errors in an intra-frame of compressed video. The intra-frame is decoded to multiple macroblocks. Each macroblock includes pixels arranged in a rectangular array. A lost macroblock is identified during the decoding. Pixels along an outer boundary of the lost macroblock are concealed from nearest candidate pixels along outer boundaries of macroblocks immediately adjacent to the lost macroblock. All other pixels in the lost macroblock are concealed from nearest candidate pixels selected from previously concealed pixels in the lost macroblock.
Abstract:
A method classifies pixels in an image. The image can be a decompressed image that was compressed using a block-based compression process. A filter is applied to each pixel in the image to determine a mean intensity value of the pixel. The mean is used to determine a mean-square intensity for each pixel, which in turn is used to determine a variance of the intensity for each pixel. The mean-square represents an average power of a DC component in the image, and the variance represents an average power of AC frequency components in the image. The pixels are then classified according to the variance as being either smooth, edge, or texture pixels. Blocks in the image can then be classified according to the classified pixels, and blocking artifacts and ringing artifacts in the blocks can then be filtered according to the block classification.
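The variance-based classification can be sketched in a few lines. The sketch assumes a 3x3 averaging window clipped at the image borders and uses the identity variance = mean-square minus squared mean; the two thresholds and the ordering of the classes (lowest variance smooth, highest variance edge) are hypothetical choices, not values from the abstract.

```python
def local_variance(image, r, c):
    """Variance of the 3x3 window centred on (r, c), clipped at the image borders."""
    window = [image[i][j]
              for i in range(max(r - 1, 0), min(r + 2, len(image)))
              for j in range(max(c - 1, 0), min(c + 2, len(image[0])))]
    mean = sum(window) / len(window)                        # DC component
    mean_square = sum(v * v for v in window) / len(window)  # average total power
    return mean_square - mean * mean                        # average AC power


def classify_pixels(image, smooth_max=10.0, edge_min=400.0):
    """Label each pixel 'smooth', 'texture', or 'edge' from its local variance."""
    labels = []
    for r in range(len(image)):
        row = []
        for c in range(len(image[0])):
            var = local_variance(image, r, c)
            row.append("smooth" if var < smooth_max
                       else "edge" if var >= edge_min else "texture")
        labels.append(row)
    return labels


# A vertical step edge between a dark and a bright region.
image = [[0, 0, 100, 100] for _ in range(4)]
labels = classify_pixels(image)
```

Pixels whose window straddles the step get a large variance and are labelled as edge pixels, while pixels whose window stays inside one flat region are labelled smooth.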
Abstract:
A method transmits a video over a network as a bit stream of packets. Real-time feedback information on conditions of the network is received while transmitting the packets. A probability of packet loss is determined from the real-time feedback. Then, redundant packets are generated for selected packets of the bit stream if the probability of packet loss is greater than a predetermined threshold.
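The threshold test can be illustrated with a minimal sketch. It assumes that feedback arrives as (packets sent, packets lost) reports, that the loss probability is the pooled loss ratio, that "selected" packets are those flagged important, and that redundancy means a duplicate copy. All of these, including the names `loss_probability` and `protect`, are hypothetical simplifications.

```python
def loss_probability(feedback):
    """feedback: list of (packets_sent, packets_lost) reports; returns the pooled loss ratio."""
    sent = sum(s for s, _ in feedback)
    lost = sum(l for _, l in feedback)
    return lost / sent if sent else 0.0


def protect(packets, feedback, threshold=0.05, is_selected=lambda p: p["important"]):
    """Return the packet stream, with duplicates of selected packets added
    only when the estimated loss probability exceeds the threshold."""
    if loss_probability(feedback) <= threshold:
        return list(packets)
    out = []
    for p in packets:
        out.append(p)
        if is_selected(p):
            out.append(dict(p, redundant=True))  # redundant copy of a selected packet
    return out


packets = [{"seq": 0, "important": True}, {"seq": 1, "important": False}]
lossy = protect(packets, feedback=[(100, 10)])  # 10% loss, above the threshold
clean = protect(packets, feedback=[(100, 1)])   # 1% loss, below the threshold
```

A real system would more likely use forward error correction than plain duplication; the sketch only shows where the threshold decision sits.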
Abstract:
A method detects a boundary in a sequence of two-dimensional images where each image has multiple intensity value points. Filtering and motion analysis are applied to each image to produce motion enhanced images. Initial search parameters are determined from a dynamic snake model applied to the motion enhanced images. Each motion enhanced image is searched for a potential boundary using the search parameters. The potential boundary is projected into the motion enhanced image of a previous, current, and next image, and the search parameters of the previous, current, and next images are updated. The searching, projecting, and updating repeat until a predetermined level of convergence is reached.
Abstract:
A method decodes a picture in the form of a bit-stream. The picture is encoded and represented by vectors of coefficients. Each coefficient is in a quantized form. A specific coefficient is selected in each vector based on a scan order of the vector. Then, a set of modes is inferred based on characteristics of the specific coefficient. Subsequently, the bit-stream is decoded according to the set of modes.
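The idea of inferring modes from a coefficient, rather than signalling them explicitly, can be illustrated with a purely hypothetical rule. The sketch assumes the specific coefficient is the last non-zero one in scan order and that its parity selects between two placeholder transform modes "A" and "B"; the abstract specifies neither the selection rule nor the characteristic used.

```python
def select_coefficient(vector):
    """Return (index, value) of the last non-zero coefficient in scan order, or None."""
    for i in range(len(vector) - 1, -1, -1):
        if vector[i] != 0:
            return i, vector[i]
    return None


def infer_modes(vector):
    """Infer a set of modes from the selected coefficient (hypothetical parity rule)."""
    picked = select_coefficient(vector)
    if picked is None:
        return {"transform": "default"}  # nothing to infer from an all-zero vector
    _, value = picked
    return {"transform": "A" if abs(value) % 2 else "B"}


odd_modes = infer_modes([7, -2, 0, 3, 0, 0])   # last non-zero coefficient is 3 (odd)
even_modes = infer_modes([7, -2, 4, 0, 0, 0])  # last non-zero coefficient is 4 (even)
```

Because the decoder derives the mode from data it has already parsed, no extra bits are spent on signalling it.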
Abstract:
An encoded video in the form of a bitstream includes a sequence of frames, and each frame is partitioned into encoded blocks. A context for decoding is selected for each encoded block. The bitstream is entropy decoded based on the context to obtain a transform indicator difference. The transform index, which indicates a transform type and a transform direction, is based on the transform indicator difference and a predicted transform indicator. Transform coefficients are obtained from the bitstream, and inverse transformed according to the transform index to produce a decoded video.
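The reconstruction of the transform index can be sketched as predictor plus decoded difference. The sketch assumes the entropy decoder has already produced the difference, that the sum wraps around the table size, and that the index looks up a (type, direction) pair in a small table; the table contents and the name `reconstruct_index` are hypothetical.

```python
# Hypothetical table mapping a transform index to (transform type, transform direction).
TRANSFORMS = [
    ("DCT", "horizontal"),
    ("DCT", "vertical"),
    ("DST", "horizontal"),
    ("DST", "vertical"),
]


def reconstruct_index(predicted, difference, table_size=len(TRANSFORMS)):
    """Transform index = predicted indicator + decoded difference (wrap-around assumed)."""
    return (predicted + difference) % table_size


index = reconstruct_index(predicted=1, difference=2)
transform_type, direction = TRANSFORMS[index]
```

Coding only the difference is cheap when the prediction is usually right, since a difference of zero can be given a short codeword under a well-chosen context.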