Abstract:
A method is disclosed for reconstructing a picture sequence coded in accordance with a coding method that specifies a set of coding tools and/or their associated coding parameters, the pictures being divided into coding entities. The method comprises the following steps for each coding entity coded in INTER mode: determining at least one reference picture for the coding entity, and reconstructing the coding entity from the at least one reference picture with coding tools configured by their associated coding parameters. Advantageously, the coding tools and/or the associated coding parameters depend on the reference picture.
Abstract:
The invention concerns a device and a method for creating a saliency map of an image. The method comprises the steps of: projection of said image according to the luminance component and, if said image is a color image, according to the luminance component and the chrominance components; perceptual sub-band decomposition of said components according to the visibility threshold of the human eye; extraction of the salient elements of the sub-bands related to the luminance component; contour enhancement of said salient elements in each sub-band related to the luminance component; calculation of a saliency map from the contour enhancement, for each sub-band related to the luminance component; and creation of the saliency map as a function of the saliency maps obtained for each sub-band.
Abstract:
The process for coding the current block comprises a step of selecting the predicted block from among candidate blocks. The selection depends on a difference DBdrx,dry between the co-located block of the current image block lying in an image of type B and a block of a reference image. The latter block is designated by the motion vector with components drx, dry, which is allocated to the co-located block and which is colinear with the motion vector Vp allocated to the current block and designating the candidate block. The process relates to data compression and to the transmission of digital images using a video coding standard comprising a direct prediction mode, for example the h263, MPEG4 or h261 standard.
Abstract:
A method for coding a picture portion of a picture of a video sequence is disclosed, at least one picture of the video sequence being a key picture. The method comprises the following steps: a) calculating a saliency map of the key picture, b) estimating, for the picture portion to be coded, at least one motion vector pointing towards a portion of the key picture, c) motion compensating at least one portion of the saliency map using an estimated motion vector to obtain, for said picture portion, at least one portion of a predicted saliency map, and d) coding the picture portion according to the saliency level of the portion of the predicted saliency map. The invention also relates to a method for decoding a binary stream, a coder able to code pictures of a video sequence and a decoder of a binary stream.
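Steps c) and d) can be illustrated with a minimal sketch. It assumes a saliency map stored as a list of rows with values in [0, 1], integer motion vectors, and border clamping. The functions `motion_compensate` and `choose_qp`, and the rule mapping saliency to a quantization parameter, are hypothetical illustrations and are not taken from the abstract.

```python
def motion_compensate(saliency, top, left, size, mv):
    """Fetch the size x size saliency block that motion vector (dy, dx)
    points to in the key picture, clamping coordinates at the borders."""
    h, w = len(saliency), len(saliency[0])
    dy, dx = mv
    return [
        [saliency[min(max(top + dy + y, 0), h - 1)][min(max(left + dx + x, 0), w - 1)]
         for x in range(size)]
        for y in range(size)
    ]

def choose_qp(pred_block, base_qp=30, max_offset=6):
    """Hypothetical rule: a higher mean saliency gives a lower QP (finer coding)."""
    flat = [v for row in pred_block for v in row]
    mean = sum(flat) / len(flat)
    return base_qp + round(max_offset * (1 - 2 * mean))

# Toy key-picture saliency map: only the bottom-right quadrant is salient.
saliency_map = [
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 1.0],
    [0.0, 0.0, 1.0, 1.0],
]
salient_pred = motion_compensate(saliency_map, 0, 0, 2, (2, 2))
flat_pred = motion_compensate(saliency_map, 0, 0, 2, (0, 0))
qp_salient = choose_qp(salient_pred)
qp_flat = choose_qp(flat_pred)
```

In this toy case the block whose motion vector lands on the salient quadrant receives a lower QP than the block predicted from a non-salient area.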
Abstract:
This high dynamic picture transmission system comprises a coding unit able to generate: a standard bitstream coding the pictures in which the luminance of each pixel is coded with a standard dynamic, and at least a second bitstream containing the information necessary to reconstruct the luminance of high dynamic pictures from the coded luminance with a standard dynamic contained in the standard bitstream.
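The abstract does not say what the second bitstream contains. The sketch below shows one possible two-layer split under loudly stated assumptions: the standard layer is an 8-bit gamma-mapped luminance, and the second layer carries the residual between the original luminance and its prediction from the standard layer. The names `encode` and `decode` and the parameters `peak` and `gamma` are hypothetical.

```python
def encode(hdr_luma, peak=4000.0, gamma=2.2):
    """Split HDR luminance samples into an 8-bit base layer and a residual layer.
    Assumption: a simple gamma curve stands in for the standard-dynamic coding."""
    base = [round(255 * (min(v, peak) / peak) ** (1 / gamma)) for v in hdr_luma]
    predicted = [peak * (b / 255) ** gamma for b in base]
    residual = [v - p for v, p in zip(hdr_luma, predicted)]
    return base, residual

def decode(base, residual, peak=4000.0, gamma=2.2):
    """Rebuild HDR luminance from the base layer plus the residual layer."""
    return [peak * (b / 255) ** gamma + r for b, r in zip(base, residual)]

hdr = [0.0, 100.0, 4000.0]
base_layer, second_layer = encode(hdr)
rebuilt = decode(base_layer, second_layer)
```

A decoder that only understands the standard bitstream can display `base_layer` directly, while a high dynamic decoder combines both layers.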
Abstract:
The invention relates to a method for determining, for a high layer macroblock that uses inter-layer prediction, a partitioning of the macroblock into partitions. It comprises the following steps: dividing the high layer macroblock into non-overlapping high layer blocks of a predefined size; determining a corresponding base layer pixel for one pixel, called the reference pixel, of each high layer block; identifying, for each reference pixel, the base layer macroblock to which the corresponding base layer pixel belongs, the base layer partition to which it belongs in the identified base layer macroblock, and, if one exists, the base layer sub-partition to which it belongs in the identified base layer partition; deriving, for each of the high layer blocks, a single value, called the Part Info value; and determining a partitioning of the high layer macroblock into macroblock partitions by comparing the Part Info values associated with the high layer blocks with one another.
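The final comparison step can be sketched as follows. This assumes a 16x16 macroblock divided into four 8x8 blocks and a Part Info value represented as a tuple (macroblock, partition, sub-partition). The block size, the tuple representation, the function names and the decision rules are all illustrative assumptions, not details given in the abstract.

```python
def part_info(mb_index, partition_index, sub_partition_index=None):
    """Hypothetical Part Info value: one comparable tuple per high layer block."""
    return (mb_index, partition_index, sub_partition_index)

def derive_partitioning(values):
    """values: 2x2 grid of Part Info values, laid out as
    [[top_left, top_right], [bottom_left, bottom_right]]."""
    (tl, tr), (bl, br) = values
    if tl == tr == bl == br:
        return "16x16"  # all four blocks map to the same base layer unit
    if tl == tr and bl == br:
        return "16x8"   # top half and bottom half differ
    if tl == bl and tr == br:
        return "8x16"   # left half and right half differ
    return "8x8"        # no two-way merge applies

a = part_info(0, 0)
b = part_info(0, 1)
c = part_info(1, 0, 2)
whole = derive_partitioning([[a, a], [a, a]])
halves_h = derive_partitioning([[a, a], [b, b]])
halves_v = derive_partitioning([[a, b], [a, b]])
quarters = derive_partitioning([[a, b], [c, a]])
```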
Abstract:
Fully scalable encoder and decoder for interlaced video. A method for encoding an interlaced sequence of digital video data decomposes the interlaced video sequence into first and second fields, performs digital filtering to get lower frequency and higher frequency component signals of the first fields, and uses spatio-temporal filtering and motion estimation for generating base layer signals being suitable for reconstruction of a progressive mode video sequence in a receiver. Advantageously, both the spatio-temporal filter at the encoder, and the inverse process at the receiver, can perform scaling in spatial and temporal dimension. The second fields are used to generate enhancement signals, which enable a receiver to reproduce an interlaced video sequence of the full, or scaled, spatial and/or temporal resolution.
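The field decomposition and the low/high frequency split can be illustrated with a small sketch. It assumes frames stored as lists of rows, with the first field holding the even-numbered rows, and uses an unnormalized Haar pair (average and difference) as a stand-in filter. Motion estimation and the spatial part of the filtering are omitted, and the function names are hypothetical.

```python
def split_fields(frame):
    """Split an interlaced frame into (first_field, second_field).
    Assumption: the first field holds the even-numbered rows."""
    return frame[0::2], frame[1::2]

def haar_pair(field_a, field_b):
    """Low (average) and high (difference) components of two same-sized fields."""
    low = [[(a + b) / 2 for a, b in zip(ra, rb)] for ra, rb in zip(field_a, field_b)]
    high = [[b - a for a, b in zip(ra, rb)] for ra, rb in zip(field_a, field_b)]
    return low, high

# Two toy 4x2 interlaced frames (hypothetical pixel values).
frame0 = [[1, 2], [9, 9], [5, 6], [9, 9]]
frame1 = [[3, 4], [8, 8], [7, 8], [8, 8]]
first0, second0 = split_fields(frame0)
first1, second1 = split_fields(frame1)
low, high = haar_pair(first0, first1)
```

Here `low` and `high` are derived from the first fields only, matching the base layer path of the abstract; the second fields would feed the enhancement signals.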
Abstract:
In video encoding, the video frames are spatio-temporally filtered to reduce spatial and temporal redundancy before they are entropy encoded. Known filtering schemes consider temporally successive frames and are static. It is likely, but not certain, that encoding the frames in their temporal order is the most efficient choice. Therefore, several or all possible frame order permutations are considered for a group of frames (GOP) and evaluated based on a global criterion, which is the sum of local criterion values computed over successive subsets of permuted frames. The local criterion value is deduced from motion estimation performed on each considered set of frames. The best ordering is chosen as the one that minimizes the global criterion value.
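The ordering search can be sketched as an exhaustive evaluation of permutations. This minimal version assumes that each successive subset is a pair of adjacent frames and replaces the motion-estimation-based local criterion with a plain sum of absolute differences. Both choices, and the names `local_cost` and `best_frame_order`, are assumptions made for illustration.

```python
from itertools import permutations

def local_cost(frame_a, frame_b):
    """Stand-in local criterion: sum of absolute pixel differences.
    The abstract derives this value from motion estimation instead."""
    return sum(abs(a - b) for a, b in zip(frame_a, frame_b))

def best_frame_order(frames):
    """Return (order, cost) minimizing the sum of local costs over
    successive frame pairs, searching all permutations of the GOP."""
    best_order, best_cost = None, float("inf")
    for order in permutations(range(len(frames))):
        cost = sum(local_cost(frames[i], frames[j])
                   for i, j in zip(order, order[1:]))
        if cost < best_cost:
            best_order, best_cost = order, cost
    return best_order, best_cost

# Toy GOP: four "frames" of two pixels each (hypothetical data).
gop = [[0, 0], [10, 10], [1, 1], [11, 11]]
order, cost = best_frame_order(gop)
```

The search is factorial in the GOP size, so it only suits small groups; the abstract's option of considering only some of the permutations addresses that cost.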
Abstract:
The method performs motion compensated temporal filtering (MCTF), the temporal filtering being replaced by intra mode coding to obtain at least one low (L) or high (H) frequency picture if the current picture has a level of correlation with a previous picture lower than a threshold, the low frequency pictures (L) obtained then being scaled to adapt them, at the energy level, to the pictures obtained by motion compensated temporal filtering. At the end of the analysis, the method comprises: a selection of the pictures obtained by intra coding of a picture of a lower decomposition level, with the additional condition, for the high frequency pictures, that this picture is itself derived from an intra coding; and a calibration of the selected picture by carrying out at least one reverse step of the scaling step. The applications relate to video compression with temporal prediction.