Abstract:
A method for improving the performance of the BLSkip mode in scalable video coding (SVC) includes the steps of upsampling the motion field of the base layer, interpolating motion vectors for the intra macroblocks (MBs), interpolating the 8×8 block motion field to a 4×4 block motion field, and generating a motion vector (MV) predictor for a 4×4 block in BLSkip mode using neighbor candidates.
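The four steps can be illustrated with a minimal Python sketch. The abstract does not specify the interpolation filters or which neighbor candidates are used, so everything here is an assumption made for illustration: a dyadic (2x) resolution ratio, a motion field stored as a grid of `(mvx, mvy)` tuples with `None` marking intra blocks, neighbor averaging for intra blocks, plain replication in place of true interpolation, and a component-wise median as the predictor. The names `fill_intra`, `expand`, and `median_predictor` are hypothetical.

```python
from statistics import median

def fill_intra(field):
    """Give each intra block (None) the mean vector of its available 4-neighbours."""
    rows, cols = len(field), len(field[0])
    out = [row[:] for row in field]
    for r in range(rows):
        for c in range(cols):
            if field[r][c] is not None:
                continue
            nbrs = [field[rr][cc]
                    for rr, cc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1))
                    if 0 <= rr < rows and 0 <= cc < cols
                    and field[rr][cc] is not None]
            if nbrs:
                out[r][c] = (round(sum(v[0] for v in nbrs) / len(nbrs)),
                             round(sum(v[1] for v in nbrs) / len(nbrs)))
            else:
                out[r][c] = (0, 0)  # assumption: zero motion if no neighbour has a vector
    return out

def expand(field, repeat=2, scale=1):
    """Repeat every vector over a repeat x repeat group of blocks, scaling it by `scale`."""
    out = []
    for row in field:
        wide = [(mv[0] * scale, mv[1] * scale) for mv in row for _ in range(repeat)]
        out.extend(list(wide) for _ in range(repeat))
    return out

def median_predictor(candidates):
    """Component-wise median of the neighbour candidate vectors (an assumed rule)."""
    return (median(v[0] for v in candidates), median(v[1] for v in candidates))

base = [[(1, 2), None], [(3, 4), (5, 6)]]       # base-layer field, one intra block
filled = fill_intra(base)                        # interpolate vectors for intra blocks
enh_8x8 = expand(filled, repeat=2, scale=2)      # upsample to the enhancement layer
enh_4x4 = expand(enh_8x8, repeat=2, scale=1)     # refine the 8x8 field to a 4x4 field
predictor = median_predictor([enh_4x4[0][0], enh_4x4[0][4], enh_4x4[4][4]])
```

Here the three candidates passed to `median_predictor` are picked arbitrarily; a real encoder would choose them from blocks adjacent to the 4×4 block being predicted.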
Abstract:
Scalable video coding based on H.264/AVC uses motion compensation, which results in motion vectors and residual pictures. To optimize the encoding of the residual pictures of the enhancement layer, they can be predicted from the base layer. This requires complex encoders and decoders. Simplified residual prediction techniques are provided that focus on reducing the encoding and decoding complexity of the spatially scalable enhancement layer with optimized coding efficiency. The method for encoding video data containing high-pass frames and low-pass frames comprises the steps of encoding the low-pass frames, wherein residual prediction may be used, splitting the high-pass frames into two interleaving frame groups, encoding the frames of the first of said frame groups, wherein residual prediction may be used, and encoding the frames of the second of the frame groups using an encoding method without residual prediction.
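The frame-group split can be sketched in a few lines of Python. The abstract says only that the high-pass frames are split into "two interleaving frame groups"; taking the even-indexed frames as the first group and the odd-indexed frames as the second is an assumption, and the function names and the boolean flag are hypothetical.

```python
def split_interleaved(high_pass_frames):
    """Split frames into two interleaving groups (assumed: even vs. odd index)."""
    return high_pass_frames[0::2], high_pass_frames[1::2]

def plan_encoding(low_pass_frames, high_pass_frames):
    """Return (frame, residual_prediction_allowed) pairs in encoding order."""
    first, second = split_interleaved(high_pass_frames)
    plan = [(f, True) for f in low_pass_frames]   # low-pass: prediction may be used
    plan += [(f, True) for f in first]            # first group: prediction may be used
    plan += [(f, False) for f in second]          # second group: no residual prediction
    return plan

plan = plan_encoding(["L0"], ["H0", "H1", "H2", "H3"])
```

With this split, roughly half of the high-pass frames skip residual prediction entirely, which is where the complexity saving described above would come from.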
Abstract:
The H.264 standard has introduced a new coding method for intra-coded pictures, which is named "intra prediction". The basic idea of intra prediction is to predict a block using reconstructed neighbour pixels. Thus, only the residual information needs to be encoded. Scalable video coding adds scalable extensions to H.264 in terms of spatial, temporal and SNR scalability. An improved encoding method for scalable video comprises generating from a video picture a first picture and a second picture, the second picture having higher resolution than the first, wherein each macroblock in the first picture has a plurality of corresponding macroblocks in the second picture, intra-coding the first picture on macroblock level, wherein for each macroblock a prediction direction is determined, generating a first residual picture, intra-coding the second picture on macroblock level, wherein for each macroblock the prediction direction of the corresponding macroblock of the first picture is reused, and generating a second residual picture.
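The reuse of prediction directions can be sketched as a simple lookup. This example assumes a 2:1 resolution ratio in each dimension, so that each macroblock of the first picture corresponds to a 2×2 group of macroblocks in the second; the abstract does not state the ratio. The direction labels and the name `reuse_directions` are hypothetical.

```python
def reuse_directions(base_directions, ratio=2):
    """Map each higher-resolution macroblock to the direction of its
    corresponding lower-resolution macroblock (assumed ratio: 2 per dimension)."""
    rows, cols = len(base_directions), len(base_directions[0])
    return [[base_directions[r // ratio][c // ratio]
             for c in range(cols * ratio)]
            for r in range(rows * ratio)]

# Directions determined while intra-coding the first (lower-resolution) picture.
first_picture = [["vertical", "horizontal"],
                 ["dc", "vertical"]]
second_picture = reuse_directions(first_picture)
```

Because the second picture looks its directions up instead of searching for them, no direction decision is needed when coding it.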
Abstract:
A significant data rate reduction in video coding is achieved by quantizing the transformed frequency coefficients or components of a pixel block, so that fewer amplitude levels need to be encoded and some of the quantized amplitude values become zero and need not be encoded at all. Many transform-based video coding standards use a default quantization matrix to achieve better subjective video coding/decoding quality. A quantization matrix assigns smaller scaling values to some frequency components of the block if the related horizontal and/or vertical frequencies are believed to be less important with respect to the resulting subjective picture quality. The inventive quantization matrix generation starts from default quantization matrices and derives from them a perceptually optimum quantization matrix for a given picture sequence. In a first pass, the candidate quantization matrix for a given picture sequence is iteratively constructed by simultaneously increasing scaling values for some coefficient positions and decreasing scaling values for others. In a second pass, the generated quantization matrix is applied for re-encoding the picture sequence.
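A small Python sketch can make the role of the scaling values concrete. It assumes a convention in which each coefficient is multiplied by its scaling value and divided by a fixed step before truncation, so that a smaller scaling value gives coarser quantization; the abstract does not define the exact formula. The functions `quantize_block` and `adjust_matrix`, the `step` parameter, and the choice of positions to adjust are all hypothetical, and the perceptual criterion that drives the real iteration is not modeled.

```python
def quantize_block(coeffs, scaling, step=16):
    """Quantize each coefficient as trunc(coeff * scaling / step) (assumed convention)."""
    return [[int(c * s / step) for c, s in zip(c_row, s_row)]
            for c_row, s_row in zip(coeffs, scaling)]

def adjust_matrix(scaling, raise_positions, lower_positions, delta=1):
    """One iteration step: raise some scaling values and lower others at the same time."""
    out = [row[:] for row in scaling]
    for r, c in raise_positions:
        out[r][c] += delta
    for r, c in lower_positions:
        out[r][c] = max(1, out[r][c] - delta)  # keep scaling values positive
    return out

coeffs = [[64, 20], [20, 3]]           # toy 2x2 block of transform coefficients
default_matrix = [[16, 8], [8, 4]]     # smaller values for higher frequencies
levels = quantize_block(coeffs, default_matrix)
candidate = adjust_matrix(default_matrix, [(0, 0)], [(1, 1)])
```

In this toy block the highest-frequency coefficient quantizes to zero, which is the effect the text describes as the source of the data rate reduction.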
Abstract:
A tone mapping graphical user interface (GUI) is provided that allows a video engineer to process a video using a set of tools for converting high dynamic range (HDR) data into lower dynamic range data. The tone mapping GUI includes a video player region that includes a current video output section for a region-based method of performing HDR conversion and a current video output section for a reference method that performs HDR conversion using, for example, a global process.
Abstract:
In a method of tone mapping high dynamic range images for display on low dynamic range displays, a high dynamic range image is first accessed. The high dynamic range image is segmented into different regions such that each region is represented by a matrix, where each element of the matrix is a weight or probability of a pixel. An exposure of each region is determined or calculated, and the exposure values are applied to the regions responsive to the weight or probability. The different regions are then fused together to obtain a final tone mapped image.
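The segment, expose, and fuse sequence can be sketched in plain Python on a flattened luminance list. The abstract does not say how the exposure is computed or how regions are fused, so this sketch assumes that each region's exposure is a gain bringing its weighted mean luminance to a target value, that display mapping is a simple clip to [0, 1], and that fusion is a per-pixel weighted average. The names `region_exposure` and `fuse` and the `target` parameter are hypothetical, and the weight maps are taken as given.

```python
def region_exposure(lum, weights, target=0.5):
    """Gain that maps the region's weighted mean luminance to `target` (assumed rule)."""
    mean = sum(l * w for l, w in zip(lum, weights)) / sum(weights)
    return target / mean

def fuse(lum, weight_maps, target=0.5):
    """Apply each region's exposure and blend the results per pixel by weight."""
    gains = [region_exposure(lum, w, target) for w in weight_maps]
    out = []
    for i, l in enumerate(lum):
        total = sum(w[i] for w in weight_maps)
        blended = sum(w[i] * min(1.0, l * g) for w, g in zip(weight_maps, gains))
        out.append(blended / total)
    return out

hdr = [0.1, 0.1, 10.0, 10.0]        # two dark pixels and two bright pixels
dark_region = [1.0, 1.0, 0.0, 0.0]  # per-pixel weights for each region
bright_region = [0.0, 0.0, 1.0, 1.0]
ldr = fuse(hdr, [dark_region, bright_region])
```

The sketch assumes every pixel has a nonzero total weight and every region a nonzero mean luminance; a more careful version would guard both divisions.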
Abstract:
A depth acquisition system utilizes at least three sensors, with at least one sensor in a non-collinear configuration, to increase depth information. This configuration allows vertical and horizontal depth information to be combined to enhance image quality, especially in three-dimensional image gathering. Vertical sensor pairs aid in determining disparities for horizontal edges and make depth estimations for horizontal edges more accurate.
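As a rough illustration of combining the two sensor pairs, the sketch below uses the standard pinhole stereo relation, depth = focal length × baseline / disparity, for each pair and then blends the two estimates. The abstract does not describe how the estimates are combined; the confidence-weighted average, the confidence values, and the names `depth_from_disparity` and `combine_depths` are assumptions.

```python
def depth_from_disparity(disparity_px, focal_px, baseline_m):
    """Standard stereo relation: depth = focal length * baseline / disparity."""
    return focal_px * baseline_m / disparity_px

def combine_depths(depth_h, conf_h, depth_v, conf_v):
    """Confidence-weighted average of horizontal-pair and vertical-pair depths."""
    return (depth_h * conf_h + depth_v * conf_v) / (conf_h + conf_v)

# Hypothetical pixel on a horizontal edge: the vertical pair is trusted more.
depth_h = depth_from_disparity(25.0, focal_px=1000.0, baseline_m=0.1)
depth_v = depth_from_disparity(20.0, focal_px=1000.0, baseline_m=0.1)
depth = combine_depths(depth_h, 1.0, depth_v, 3.0)
```

In practice the confidences might come from local edge orientation, since a horizontal sensor pair cannot easily measure disparity along a horizontal edge.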