摘要:
A method for deriving motion vector is provided, this method includes: obtaining a space domain motion vector prediction and a time domain motion vector prediction of adjacent blocks of a coding unit in a predetermined direction; performing a filtering operation on the space domain motion vector and the time domain motion vector prediction to obtain the space domain motion vector prediction and the time domain motion vector prediction of the filtered adjacent blocks; determining, according to a predetermined inter-frame prediction mode, reference motion vectors of a current block in four side directions by using the space domain motion vector prediction and the time domain motion vector prediction of the filtered adjacent blocks and a coordinate position of the current block in the coding unit; and deriving motion vectors of the current block according to the reference motion vectors and the coordinate position of the current block in the coding unit.
摘要:
The present specification discloses a method, apparatus, and device for video frame interpolation. The method of embodiment of the present specification comprises: acquiring a video frame training sample, wherein the video frame training sample includes an even number of consecutive video frames and a first key frame, and the first key frame is an intermediate frame of the even number of consecutive video frames; constructing a pyramid deep learning model, wherein each level of the pyramid deep learning model being used to generate intermediate frames of different resolutions has a plurality of convolutional neural network layers; inputting the even number of consecutive video frames to the pyramid deep learning model to generate a second key frame; modifying the pyramid deep learning model according to the second key frame and the first key frame to generate a modified pyramid deep learning model; inputting a plurality of video frames to be processed into the modified pyramid deep learning model to generate an intermediate frame of the plurality of video frames. The invention fully exploits the spatio-temporal domain information between multi-frame video frames, and adopts a pyramid refinement strategy to effectively estimate the motion information and the occlusion region, thereby greatly improving the quality of the intermediate frame.
摘要:
Disclosed are a panoramic video asymmetrical mapping method and a corresponding inverse mapping method: by means of the mapping methods, mapping a spherical surface corresponding to a panoramic image or video A onto a two-dimensional image or video B; projecting the spherical surface onto an isosceles quadrangular pyramid with a square bottom plane, and further projecting the isosceles quadrangular pyramid onto a planar surface; using isometric projection on a main viewpoint region in the projection and using a relatively high sampling density to ensure that the video quality of the region of the main viewpoint is high, while using a relatively low sample density for non-main viewpoint regions so as to reduce bit rate. While ensuring that the video quality of the region of the main viewpoint remains unchanged, the present panoramic video asymmetrical mapping technique greatly reduces the resolution of the remaining regions in the video, effectively reducing the bit rate required for encoding a virtual reality video. The panoramic video asymmetrical inverse mapping technique provides a method for mapping from a planar surface to a spherical surface, and by means of said method, a planar surface video may be mapped back to a spherical surface for rendering and viewing.
摘要:
Disclosed are a describing method and a coding method of panoramic video ROIs based on multiple layers of spherical circumferences. The describing method comprises: first setting a center of the panoramic video ROIs; then setting the number of layers of ROIs as N; obtaining the size Rn of the current layer ROI based on a radius or angle; obtaining the sizes of all of the N layers of ROIs, and writing information such as the center of the ROIs, the number of layers, and the size of each layer into a sequence header of a code stream. The coding method comprises adjusting or filtering an initial QP based on a QP adjusted value and then coding an image. By flexibly assigning code rates to multiple layers of panoramic video ROIs, while guaranteeing a relatively high image quality of ROIs, the code rate needed for coding and transmission is greatly reduced.
摘要:
A coding method based on multi-hypothesis motion compensation for a P-frame, including: a) using neighboring coded image blocks as reference image blocks, adopting a motion vector of each reference image block as a first motion vector which points to a first prediction block; b) adopting the first prediction block corresponding to each reference image block as a reference value, and performing joint motion estimation on the current image block to acquire a second motion vector which points to a second prediction block; c) weighted averaging the first prediction block and the second prediction corresponding to each reference image block to acquire a third prediction block of the current image block, respectively; and d) calculating a coding cost corresponding to each reference image block to determine a final first motion vector, a final second motion vector, and a final prediction block of the current image block.
摘要:
A method and a device for encoding or decoding based on an inter-frame prediction. The method includes steps of: determining a temporal motion vector prediction value of a to-be-processed coding unit, where the temporal motion vector prediction value is a temporal motion vector prediction value of a sub-block, a temporal motion vector of which is obtainable through prediction, in sub-blocks adjacent to the to-be-processed coding unit and/or sub-blocks in the to-be-processed coding unit; determining a motion vector residual prediction value of the to-be-processed coding unit according to the temporal motion vector prediction value; determining a motion vector of a sub-block in the to-be-processed coding unit according to the temporal motion vector prediction value and the motion vector residual prediction value and performing a motion compensation according to the motion vector of the sub-block in the to-be-processed coding unit to determine a prediction block of the to-be-processed coding unit.
摘要:
The application discloses a method, system, device and computer-readable storage medium for inverse quantization, wherein, in some embodiments, determining an initial weighted inverse quantization matrix, wherein, the initial weighted inverse quantization matrix is the same as the quantized block in size; setting some matrix elements in the initial weighted inverse quantization matrix to zero to obtain a weighted inverse quantization matrix, wherein, determining the matrix elements that need to be zeroed according to the size of the quantized block; weighted inverse quantizing the quantized coefficients in the quantized block to generate corresponding inverse transform coefficients, wherein, the value of the matrix element corresponding to the position of the quantized coefficient in the weighted inverse quantization matrix is used as a weight coefficient of the weighted inverse quantization.
摘要:
An intra-frame and inter-frame combined prediction method for P frames or B frames. The method comprises: self-adaptively selecting by means of a rate-distortion optimization (RDO) decision whether to use the intra-frame and inter-frame combined prediction or not; using a method for weighting an intra prediction block and an inter prediction block in the intra-frame and inter-frame combined prediction to obtain a final prediction block; and obtaining the weighting coefficient of the intra prediction block and the inter prediction block according to prediction distortion statistics of the prediction method. Therefore, prediction precision can be improved, and coding and decoding efficiency of the prediction blocks are improved. The advantages of intra prediction and inter prediction are fully utilized in the present invention; and the optimal prediction parts of the two methods are selected to be combined, so that to a certain extent, areas with excessive distortion can be removed out of the intra prediction block and the inter prediction block, thus obtaining a better prediction effect and achieving excellent practicality and robustness.
摘要:
Disclosed is a virtual viewpoint synthesis method based on image local segmentation, which relates to the digital image processing technology. By mapping the input left and right images to the virtual viewpoints so as to be fused to obtain a synthesized image, smoothing and denoising the rough and noisy depth maps based on the object segmentation information of the scene, the method as disclosed solves the occlusion issue through local area segmentation during the process of viewpoint synthesis, which may guarantee that the subjective quality of viewpoint synthesis will not be significantly deteriorated when the depth map has a relatively large flaw, and maintain geometric information of the scene to the utmost extent so as to generate a real immersive sense, thereby ameliorating the drawback of significant deterioration of synthesis quality in conventional methods when the depth information of the scene has errors and noises, and offering a relatively strong robustness to the errors in the depth map information of the scene. The disclosed method may be applied to a video surveillance system and image processing software, etc.
摘要:
The present disclosure provides an encoding method, a decoding method, an encoder, and a decoder, the encoding method comprises: performing interframe prediction to each interframe coded block to obtain corresponding interframe predicted blocks; writing information of each of the interframe predicted blocks into a code stream; if the interframe coded block exists at an adjacent position to the right or beneath or to the lower right of the intraframe coded block, performing intraframe prediction to the intraframe coded block based on at least one reconstructed coded blocks at adjacent positions to the left and/or above and/or to the upper left of the intraframe coded block and at least one of the interframe coded blocks at adjacent positions to the right and/or beneath and/or to the lower right of the intraframe coded block to obtain intraframe predicted blocks; writing information of each of the intraframe predicted blocks into the code stream.