Abstract:
A hybrid intra-inter bi-predictive (or multi-predictive) coding mode allows both intraframe (intra) and interframe (inter) predictions to be combined for hybrid encoding of a current macroblock or subblock. Bi-prediction may also be used in I-pictures, combining two intra predictions that use two different intra prediction directions. A video encoder processes data representing a two-dimensional video image which has been produced by a conventional commercially available video camera. The video encoder is adapted to select, for coding a current macroblock, between an intra encoding mode, a P-frame inter encoding mode, a B-frame bi-predictive inter mode, and a hybrid intra-inter bi-predictive encoding mode. A video decoder receives and decodes a data stream that may contain a block/macroblock encoded in accordance with the hybrid intra-inter bi-predictive encoding mode.
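The combination of an intra and an inter prediction described above can be illustrated with a minimal sketch. The function name `hybrid_predict`, the integer weights, and the rounded-average rule are assumptions made for illustration; the abstract does not specify how the two predictions are weighted.

```python
def hybrid_predict(intra_pred, inter_pred, w_intra=1, w_inter=1):
    """Combine an intra and an inter prediction block sample by sample.

    The weights are hypothetical; equal weights give a rounded average
    of the two predictions.
    """
    total = w_intra + w_inter
    return [
        [(w_intra * a + w_inter * b + total // 2) // total
         for a, b in zip(row_intra, row_inter)]
        for row_intra, row_inter in zip(intra_pred, inter_pred)
    ]


# Two 2x2 prediction blocks for the same current block (made-up sample values).
intra = [[100, 102], [104, 106]]
inter = [[110, 100], [100, 110]]
combined = hybrid_predict(intra, inter)
```

Under these assumptions the encoder would form the residual against `combined` instead of against either prediction alone. The same function could combine two intra predictions with different directions, as the abstract mentions for I-pictures.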
Abstract:
A method for minimizing and optimizing a list of active reference pictures to be used for (forward or backward) inter coding of a current picture, or of a pixel region (e.g., video object) within the current picture. Setting the number of active reference pictures in a list equal to one based upon one or more decisions related to the pixel data and movement between the current picture and one or more reference pictures. Otherwise, minimizing the number of active reference pictures in the list based upon one or more other decisions related to the pixel data and movement between the current picture and one or more reference pictures. Minimizing the number of active reference pictures in a second reference picture list based upon the utilization of reference pictures in the first reference picture list. A video encoder processes data representing a two-dimensional video image which has been produced by a conventional commercially available video camera. The video encoder is adapted to determine, select, and minimize the number of active reference pictures to be used to inter code a current picture. The bit stream transmitted by the encoder can be decoded by decoders of the related art.
Abstract:
Coding method complexity types are assigned to video for combinations of coding decision levels associated with a sequence of pictures, a group of pictures, a scene, a region of the scene or picture, a block or macroblock. Multiple coding units are encoded as a function of complexity and determined from the coding decision levels. A first coding decision level is associated with determining a first complexity and a first coding method type for the sequence of pictures. A second coding decision level is associated with determining a second complexity and a second coding method type for the pictures or scene. The picture or sequence includes the coding units. A third coding decision level is associated with determining a third complexity and a third coding method type for the coding units.
Abstract:
Video compression is performed under multiple distortion constraints. Video coding includes determining a first set of Lagrangian cost values for multiple coding modes using a first distortion metric, determining a second set of Lagrangian cost values for the multiple coding modes using a second distortion metric, and selecting one of the coding modes based on the first set of Lagrangian cost values and the second set of Lagrangian cost values to encode a pixel block using the selected coding mode. The distortion metrics can include information associated with video display characteristics such as screen display size, video processing performance, a distortion characteristic, a temporal characteristic, or a spatial characteristic. The distortion metrics can also include characteristics of multiple video displays, such as parameters for usage, importance, design, and the technology type of the video displays.
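A minimal sketch of mode selection from two sets of Lagrangian costs, assuming the usual cost form J = D + λR and a weighted sum of the two costs as the joint criterion. The weighted-sum rule, the function names, and all numeric values are hypothetical; the abstract only states that the selection is based on both sets.

```python
def lagrangian_cost(distortion, rate, lam):
    """Standard Lagrangian rate-distortion cost J = D + lambda * R."""
    return distortion + lam * rate


def select_mode(modes, lam1, lam2, weight=0.5):
    """Pick the mode with the lowest joint cost.

    modes maps a mode name to (d1, d2, rate), where d1 and d2 are the
    distortions under the first and second metric. The joint criterion
    (a weighted sum of the two costs) is an assumption for illustration.
    """
    best_mode, best_cost = None, float("inf")
    for name, (d1, d2, rate) in modes.items():
        j1 = lagrangian_cost(d1, rate, lam1)  # cost under the first metric
        j2 = lagrangian_cost(d2, rate, lam2)  # cost under the second metric
        joint = weight * j1 + (1 - weight) * j2
        if joint < best_cost:
            best_mode, best_cost = name, joint
    return best_mode, best_cost


# Made-up distortion and rate figures for three candidate modes.
candidates = {
    "intra16x16": (400.0, 30.0, 20),
    "inter16x16": (250.0, 60.0, 35),
    "skip": (900.0, 20.0, 1),
}
mode, cost = select_mode(candidates, lam1=4.0, lam2=1.0)
```

With equal weighting, the inter mode has the lowest joint cost in this example. Setting `weight=0.0` makes the second metric decide alone, and the choice shifts to the skip mode, which shows how a second distortion constraint can change the selected mode.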
Abstract:
Improved video coding is described to encode video data within a sequence of video frames. To this end, at least a portion of a reference frame is encoded to include motion information associated with the portion of the reference frame. At least a portion of a predictable frame that includes video data predictively correlated to said portion of said reference frame is defined based on the motion information. At least said portion of the predictable frame is encoded without including corresponding motion information and including mode identifying data. The mode identifying data indicate that the encoded portion of the predictable frame can be directly derived using at least the motion information associated with the portion of the reference frame.
Abstract:
A method and apparatus are provided for motion estimation using combined reference bi-prediction. The apparatus includes an encoder (200) for encoding a multi-prediction picture from a combination of two or more reference pictures by respectively predicting a motion vector for a particular one of the two or more reference pictures in a motion estimation process while initializing motion vectors for remaining ones of the two or more reference pictures to a predefined value for use by the motion estimation process.
Abstract:
Methods and apparatuses are provided relating to the encoding of data, such as, e.g., images, video, etc. For example, certain methods include processing at least a portion of data using a plurality of different quantization functions to produce a plurality of corresponding quantized portions of data, and selectively outputting one of the quantized portions of data based on at least one threshold value. The method may also include dividing initial data into a plurality of portions and classifying the portion of data based on at least one classification characteristic. Here, for example, there may be a threshold value that is associated with the classification characteristic. Additional syntax may be adopted to enable considerably higher compression efficiency by allowing several alternative motion prediction cases. A high efficiency time stamp independent Direct Mode is also provided which considers spatial motion vector prediction along with stationary temporal predictors.
Abstract:
Individual pixels in an image block undergo blending with film grain from a film grain block randomly selected from among a pool of previously established film grain blocks in accordance with a luma characteristic of the image block. Prior to blending, the selected film grain block undergoes deblocking by a deblocking filter (28). Following blending, a clipper clips the individual pixels prior to display. The pool of film grain blocks is created by scaling a set of film grain patterns in accordance with at least one parameter of a film grain information message that accompanies the image block.
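The select, blend, and clip steps described above can be sketched as follows. The pool layout (luma intervals mapped to lists of grain blocks), the use of average luma as the luma characteristic, and all names are assumptions for illustration. The deblocking of the selected grain block and the scaling that creates the pool are omitted from this sketch.

```python
import random


def average_luma(block):
    """Mean luma of a block, used here as the luma characteristic."""
    samples = [s for row in block for s in row]
    return sum(samples) // len(samples)


def blend_film_grain(image_block, grain_pool, rng, bit_depth=8):
    """Add a randomly chosen grain block to an image block, then clip.

    grain_pool maps an inclusive (low, high) luma interval to a list of
    grain blocks; this layout is hypothetical.
    """
    max_val = (1 << bit_depth) - 1
    luma = average_luma(image_block)
    for (low, high), blocks in grain_pool.items():
        if low <= luma <= high:
            grain = rng.choice(blocks)  # random pick within the luma class
            break
    else:
        # No interval matches: return the block unchanged.
        return [row[:] for row in image_block]
    return [
        [min(max(p + g, 0), max_val) for p, g in zip(pixel_row, grain_row)]
        for pixel_row, grain_row in zip(image_block, grain)
    ]


# Made-up 2x2 example with one grain block per luma interval.
pool = {
    (0, 127): [[[-5, 5], [3, -3]]],
    (128, 255): [[[10, -10], [20, -20]]],
}
image = [[250, 5], [240, 200]]
blended = blend_film_grain(image, pool, random.Random(0))
```

Clipping after the addition keeps every sample inside the valid range for the bit depth, so bright pixels saturate at 255 and dark pixels at 0 in this 8-bit example.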
Abstract:
A video decoder, and corresponding method are described for enabling fast channel change of compressed video, where a video decoder for receiving compressed stream data and providing decompressed video output includes a demultiplexor for receiving the compressed stream data and separating the normal stream and the channel change stream, a normal decoding portion in signal communication with the demultiplexor for selectably receiving the compressed normal and channel change streams, and providing decompressed video output, and a normal frame store in signal communication with the normal decoding portion for storing reference pictures.
Abstract:
A method for minimizing and optimizing a list of active reference pictures to be used for (forward or backward) inter coding of a current picture, or of a pixel region (e.g., video object) within the current picture. Setting the number of active reference pictures in a list equal to one based upon one or more decisions related to the pixel data and movement between the current picture and one or more reference pictures. Otherwise, minimizing the number of active reference pictures in the list based upon one or more other decisions related to the pixel data and movement between the current picture and one or more reference pictures. Minimizing the number of active reference pictures in a second reference picture list based upon the utilization of reference pictures in the first reference picture list. A video encoder processes data representing a two-dimensional video image which has been produced by a conventional commercially available video camera. The video encoder is adapted to determine, select, and minimize the number of active reference pictures to be used to inter code a current picture. The bit stream transmitted by the encoder can be decoded by decoders of the related art.