摘要:
A picture prediction method and a related apparatus are disclosed. A picture prediction method includes: determining K1 pixel samples in a picture block x, and determining a candidate motion information unit set corresponding to each pixel sample in the K1 pixel samples, where the candidate motion information unit set corresponding to each pixel sample includes at least one candidate motion information unit; determining a merged motion information unit set i including K1 motion information units, where each motion information unit in the merged motion information unit set i is selected from at least a part of motion information units in candidate motion information unit sets corresponding to different pixel samples in the K1 pixel samples; and predicting a pixel value of the picture block x by using a non-translational motion model and the merged motion information unit set i. This application helps reduce computational complexity of picture prediction performed based on the non-translational motion model.
摘要:
A method and apparatus for deriving a sub-block motion vector for the current sub-block based on a motion-model function depending on the current sub-block location are disclosed. The derived sub-block motion vector is then used for encoding or decoding the sub-block. The motion-model function may correspond to an affine motion-model function or a bilinear motion-model function. In one embodiment, a new Merge mode can be used to apply prediction of a current block by applying prediction on the sub-block basis using the sub-block motion vector derived from the motion-model function. In another embodiment, an additional inter prediction mode can be used to apply prediction of a current block by applying prediction on the sub-block basis using the sub-block motion vector derived from the motion-model function.
摘要:
The invention relates to a method for generating a motion field between a current frame and a reference frame belonging to a video sequence from an input set of motion fields. An motion field is associated to an ordered pair of frames comprises for a group of pixels belonging to a first frame of the ordered pair of frames, a motion vector computed from a location of the pixel in the first frame to an endpoint in a second frame of the ordered pair of frames. The method comprises a step for determining a plurality of motion paths from a current frame to a reference frame wherein a motion path comprises a sequence of N ordered pairs of frames associated to the input set of motion fields and wherein a first frame of an ordered pair corresponds to a second frame of the previous ordered pair in the sequence; the first image of the first ordered pair is the current frame; the second frame of the last ordered pair is the reference frame; and N is an integer. The method then comprises a step for determining, for the group of pixels belonging to the current frame, a plurality of candidate motion vectors from the current frame to the reference frame wherein a candidate motion vector is the result of a sum of motion vectors; each motion vector belonging to a motion field associated to an ordered pair of frames according to a determined motion path. And the method then comprises a step for selecting, for the group of pixels belonging to the current frame, a candidate motion vector among the plurality of candidate motion vectors.
摘要:
The present invention relates to, for example, an encoding apparatus and an encoding method, a decoding apparatus and a decoding method, a recording medium, and a program suitable for encoding image signals with a higher compression ratio for transmission or accumulation. In an arithmetic coding section (5 8), from among the syntax elements of input image compression information, the frame/field flag is first encoded by a frame/field flag context model (91). When the macroblock to be processed is subjected to frame-based encoding, a frame-based context model (92), specified in the current H.26L standard, is applied. On the other hand, when the macroblock to be processed is subjected to field-based encoding, a field-based context model (94) is applied for the syntax elements described below. The present invention is applied to an encoder for encoding image information and a decoder for decoding image information.
摘要:
The present invention relates to, for example, an encoding apparatus and an encoding method, a decoding apparatus and a decoding method, a recording medium, and a program suitable for encoding image signals with a higher compression ratio for transmission or accumulation. In an arithmetic coding section (58), from among the syntax elements of input image compression information, the frame/field flag is first encoded by a frame/field flag context model (91). When the macroblock to be processed is subjected to frame-based encoding, a frame-based context model (92), specified in the current H.26L standard, is applied. On the other hand, when the macroblock to be processed is subjected to field-based encoding, a field-based context model (94) is applied for the syntax elements described below. The present invention is applied to an encoder for encoding image information and a decoder for decoding image information.
摘要:
The invention refers to a method for coding a sequence of digital images (S), each image having the same image format and comprising a number of pixels with assigned pixel values, wherein motion parameters (MP) between first and second images (I1, 12) are determined, where based on said motion parameters (MP) a motion compensation (MC) is performed for coding the sequence of images (S), where said motion parameters (MP) are included in the coded sequence of images (CS). The motion parameters (MP) between a first image (I1) and a second image (12) comprise a scalar field ( p̂ ) having scalar values for a plurality of image positions in the image format, wherein the scalar field ( p̂ ) is determined such that gradient vectors (MV) derived from the scalar field ( p̂ ) correspond to motion vectors for the motion compensation (MC).
摘要:
The present invention enables an efficient encoding of video data in which a prediction signal can be determined without using a motion vector. A video encoding device 100 comprises a region division section 101 for dividing a frame image constituting video data into a plurality of regions as encoding target regions, an encoding section 104 for encoding an image of each region, an inverse transformation section 105 and an addition section 106 for generating reproduced image of the encoded image, a storage section 107 for storing reproduced images, a prediction generation section 108 for searching a region which is highly correlated to a reproduced image of a template region, which is adjacent to the region of the encoding target image in a predetermined positional relationship and is a part of the reproduced image, from the reproduced image, and determining a prediction signal based on the searched region and the above-mentioned positional relationship, and a subtraction section 102 for generating a difference signal between the prediction signal and the encoding target image as a signal for encoding.