摘要:
Often hierarchical bi-directionally predicted frame structures are used for encoding a video picture sequence. The frames may consist of interlacing fields. A method for encoding interlaced video, wherein inter-prediction of fields is used and reference lists are assigned to the fields for indicating reference frames or fields, comprises that, if within such reference list a reference to another frame is included, then references to both fields of the other frame are included separately in direct sequence. Further, a temporal level is assigned to each of the frames according to their display order, and for the frames of all except one temporal level the quantization parameter is higher for one type of fields than for the other type of fields.
摘要:
A method for coding a picture portion of a picture of a video sequence is disclosed, at least one picture of the video sequence being a key picture. The method comprises the following steps: a) calculate a saliency map of the key picture, b) estimate for the picture portion to be coded, at least one motion vector pointing towards a portion of the key picture, c) motion compensate at least one portion of the saliency map using an estimated motion vector to obtain, for said picture portion, at least one portion of the predicted saliency map, and d) code the picture portion according to the saliency level of the portion of the predicted saliency map. The invention also relates to a method for decoding a binary stream, a coder able to code pictures of a video sequence and a decoder of a binary stream.
摘要:
The invention relates to spatially scalable encoding and decoding processes using a method for deriving coding information. More particularly, it relates to a method for deriving coding information for high resolution pictures from the coding information of low resolution pictures. The method mainly comprises the following steps: Computing geometrical parameters characterizing the position of said high layer macroblock relatively to the corresponding base layer macroblocks and deriving from these parameters a macroblock class; Deriving a partition and possibly sub-partitions for each partition of said high layer macroblock from corresponding base layer macroblocks partition and sub-partitions on the basis of the geometrical parameters and HL MB class; and Deriving motion information for said high layer macroblock from motion information of corresponding base layer macroblocks.
摘要:
The invention relates to spatially scalable encoding and decoding processes using a method for deriving coding information. More particularly, it relates to a method for deriving coding information used to encode high resolution images from coding information used to encode low resolution images when the ratio between high resolution and low resolution images dimensions is a multiple of 3/2. The method mainly comprises the following steps: deriving a block coding mode for each 8×8 blocks of a prediction macroblock MBi_pred from the macroblock coding mode of the associated base layer macroblocks on the basis of the macroblock class of MBi and an the basis of the position of the 8×8 block within MBi_pred; deriving a macroblock coding mode for MBi_pred from the coding modes of the associated base layer macroblocks; and deriving motion information for each macroblock MBi_pred from the motion information of the associated base layer macroblocks.
摘要:
The method realizes a motion compensated temporal filtering (MCTF), the temporal filtering being replaced by an intra mode coding to obtain at least one low (L) or high (H) frequency picture if the current picture has a level of correlation with a lower previous picture at a threshold, the low frequency pictures obtained (L) being thus scaled to be adapted, at the energy level, to the pictures obtained by motion compensated temporal filtering, and comprises, at the end of analysis: a selection of the pictures obtained by intra coding of a picture of a low decomposition level with the additional condition, for the high frequency pictures, that this picture is derived itself from an intra coding. a calibration of the picture selected by carrying out at least one reverse step of the scaling step. The applications relate to video compression with temporal prediction.
摘要:
The method is characterized in that the pre-analysis phase performs correlation level calculations of the even and odd field blocks of the current picture with the even and odd blocks of the current picture with the even and odd field blocks of the reference picture based on motion vectors calculated during this phase and corresponding to the blocks to impose, during the coding mode decision stage and among the inter coding modes, the inter coding between fields of the same parity or of opposing parity or the inter coding between frames, according to the correlation levels.
摘要:
Motion vectors of a first reference frame are permitted to point to a plurality of further reference frames. A method of storing the motion vectors comprises, when a block of the first reference frame has two motion vectors initially, selecting one of the two motion vectors, the non-selected motion vector not being stored. The selected motion vector may be scaled. This can reduce the motion vector memory size.
摘要:
Reference prediction mode values, also referred to as most probable modes, usable for encoding or decoding of a prediction mode related to a current coding unit, are derived. First and second reference prediction mode values are derived (S402) from respective prediction modes of at least two neighboring coding units of the current coding unit. The first and second reference prediction modes are different. A third reference prediction mode value is derived (S403) from the first and second reference prediction mode values. The third reference prediction mode is different from each of said first and second reference prediction mode values.By deriving three MPMs instead of two for comparison with the prediction mode of the current coding block the coding efficiency is improved. This is due to the increase in the probability that the prediction mode of the current coding block corresponds to one of the derived most probable modes.
摘要:
The invention relates to a method for deriving motion data for a macroblock divided in elementary blocks of a high resolution picture, called high layer macroblock, from motion data of macroblocks of a low resolution picture, called base layer macroblock. The method comprises the following steps: computing, for each elementary block, an intermediate position within the low resolution picture from the elementary block position depending on the coding modes of the high layer macroblock and of the high and low resolution pictures; identifying the base layer macroblock, called base_MB, comprising the pixel located at the intermediate position; computing a final position within the low resolution picture from the virtual base layer position depending on the coding modes of the base_MB, of the high layer macroblock; identifying the base layer macroblock, called real_base_MB, comprising the pixel located at the final position; and deriving motion data, for the high layer macroblock, from motion data of the identified real_base_MB.
摘要:
Fully scalable encoder and decoder for interlaced video. A method for encoding an interlaced sequence of digital video data decomposes the interlaced video sequence into first and second fields, performs digital filtering to get lower frequency and higher frequency component signals of the first fields, and uses spatio-temporal filtering and motion estimation for generating base layer signals being suitable for reconstruction of a progressive mode video sequence in a receiver. Advantageously, both the spatio-temporal filter at the encoder, and the inverse process at the receiver, can perform scaling in spatial and temporal dimension. The second fields are used to generate enhancement signals, which enable a receiver to reproduce an interlaced video sequence of the full, or scaled, spatial and/or temporal resolution.