Abstract:
Based on the prediction mode used for the current block, a decoder may determine whether to parse an illumination compensation indication for the current block. The illumination compensation indication may indicate whether to enable an illumination compensation process for the current block. If the prediction mode is indicative of the continuous motion changes between the current block and one or more of the reference blocks, the decoder may bypass parsing the illumination compensation indication. The decoder may disable the illumination compensation process on the current block based on the determination to bypass parsing the illumination compensation indication for the current block.
Abstract:
There are disclosed various methods, apparatuses and computer program products for video encoding and decoding. In some embodiments for encoding.In some embodiments for decoding or encoding a block of a current picture a first motion vector predictor candidate pointing to a first reference block in a first reference picture and a second motion vector predictor candidate pointing to a second reference block in a second reference picture are obtained. A difference vector is added to the first motion vector predictor candidate to obtain a modified first motion vector predictor candidate and subtracted from the second motion vector predictor candidate to obtain a modified second motion vector predictor candidate. A cost estimate is determined for bi-directional motion prediction on the basis of the modified first motion vector predictor candidate and the modified second motion vector predictor candidate. The addition, subtraction and cost estimate determination may be repeated to obtain a set of modified first motion vector predictor candidates, a set of modified second motion vector predictor candidates and a set of cost estimates. A first motion vector predictor is selected from the set of modified first motion vector predictor candidates and a second motion vector predictor is selected from the set of modified second motion vector predictor candidates on the basis of the set of cost estimates.
Abstract:
Video telephony (VT) call management techniques are described. The techniques enable a device to cache intra-frame data at a pre-decoder-initialization stage. An example device includes a memory configured to store video data associated with a VT call, a video decoder configured to render a portion of the stored video data; and one or more processors. The processor(s) are configured to determine whether the received video frame data comprises i-frame data, to determine whether the video decoder is in a pre-initialized state or an initialized state, and when the received video frame data comprises the i-frame data and the video decoder is in the pre-initialized state, to store the i-frame data to the memory.
Abstract:
Motion vectors used for bi-prediction of a block of a current picture are predicted and reconstructed by determining candidate motion vector predictors for each of two motion vector predictor lists. A reconstruction order for the motion vector predictor lists is selected based on the candidate motion vector predictors. In video coding, motion vector deltas are generated for the motion vector predictor lists according to the selected reconstruction order and in video decoding, reconstructed motion vectors are generated for the motion vector predictor lists according to the selected reconstruction order. A candidate motion vector predictor of the second motion vector predictor list is replaced by the actual or reconstructed motion vector for the first motion vector predictor list or a scaled version thereof. This selection of reconstruction order enables determining better candidate motion vector predictors and thereby resulting in smaller motion vector deltas.
Abstract:
A method for encoding a plenoptic image divided into blocks is disclosed. The method comprises: - encoding (S120) a block in a central region of the plenoptic image, called central block, without spatial prediction; and - predictively encoding (S130) blocks immediately surrounding said central block by spatially interleaving spatial unidirectional prediction and spatial bi-directional prediction.
Abstract:
A video coding device includes a memory configured to store video data and processor(s) configured to process at least a portion of the stored video data. The processor(s) are configured to identify a coefficient group (CG) that includes a current transform coefficient of the video data, the CG representing a subset of transform coefficients within a transform unit. The processor(s) are further configured to determine a size of the CG based on a combination of a transform size and one or both of (i) a coding mode associated with the transform unit, or (ii) a transform matrix associated with the transform unit.
Abstract:
A video coding device includes processor(s) configured to determine, for each of a plurality of bins of a value for a syntax element of a current transform coefficient, contexts using respective corresponding bins of values for the syntax element of previously coded transform coefficients. The processor(s) are configured to determine a context for an i th bin of the value for the syntax element of the current transform coefficient using a corresponding i th bin of a value for the syntax element of a previously coded transform coefficient. To use the corresponding i th bin of the value for the syntax element of the previously coded transform coefficient, the processor(s) are configured to use only the i th bin, and no other bins, of the value for the syntax element of the previously coded transform coefficient. 'i' represents a non-negative integer.
Abstract:
Video streaming concepts are presented. According to a first aspect, the video stream is formed as a multi-layered data stream with forming a set of one or more layers of the multi-layered data stream by copying from the coded version of the video content, while a composition of the at least one video is synthesized in at least a portion of pictures of a predetermined layer of the multi-layer data stream by means of inter-layer prediction. According to a second aspect, inter-layer prediction is used to either substitute otherwise missing referenced pictures of a newly encompassed video by inserting replacement pictures, or portions of the newly encompassed video referencing, by motion-compensated prediction, pictures which are missing are replaced by inter-layer prediction. According to a third aspect, output pictures inserted into the composed video stream so as to synthesize the composition of the video content by copying from a no-output portion of the composed data stream by temporal prediction, are inserted into the composed data stream so that output pictures are arranged in the data stream in the presentation time order rather than the coded picture order.