Abstract:
A device for reducing the number of motion estimation operations in performing motion compensated prediction includes a motion estimator, a motion compensated prediction device and a processing element. The motion estimator is configured to extract a motion vector from a macroblock of a video frame. The macroblock includes inter modes which are block sizes. The motion compensated prediction device is configured to generate a prediction macroblock based on the motion vector by analyzing a corresponding macroblock in a reference frame. The processing element communicates with the motion estimator and the motion compensated prediction device. The processing element also compares a distortion value to a first predetermined threshold and selects a first encoding mode among first and second encoding modes without evaluating the second encoding mode based upon the comparison of the distortion value to the first predetermined threshold.
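The early-termination idea can be sketched as follows. This is a minimal illustration, not the claimed device: the use of the sum of absolute differences (SAD) as the distortion value, the "below threshold" direction of the comparison, and the names `sad` and `choose_mode` are all assumptions made for the example.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized 2-D blocks."""
    return sum(
        abs(a - b)
        for row_a, row_b in zip(block_a, block_b)
        for a, b in zip(row_a, row_b)
    )


def choose_mode(current, pred_first, make_pred_second, threshold):
    """Pick an encoding mode, skipping the second mode when possible.

    make_pred_second is a callable so that the (expensive) second
    prediction is only built when the threshold test fails.
    Returns (mode_name, distortion).
    """
    d_first = sad(current, pred_first)
    if d_first < threshold:
        # Assumed rule: distortion is already small enough, so the
        # second mode is never evaluated.
        return "first", d_first
    d_second = sad(current, make_pred_second())
    if d_second < d_first:
        return "second", d_second
    return "first", d_first
```

Passing the second prediction as a callable keeps the saving explicit: when the threshold test succeeds, no motion estimation work for the second mode is performed at all.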
Abstract:
A method and apparatus are provided for encoding and decoding image information. The encoding comprises receiving a block of pixels; creating a set of motion vector prediction candidates for the block of pixels; and examining the set to determine if a motion vector prediction candidate is a temporal motion vector prediction, or a spatial motion vector prediction. If the motion vector prediction candidate is a temporal motion vector prediction, the motion vector prediction candidate is kept in the set. If the motion vector prediction candidate is a spatial motion vector prediction, it is examined whether the set comprises a motion vector prediction candidate corresponding with the spatial motion vector prediction; and if so, the motion vector prediction candidate is removed from the set. Once the set is created, one of the candidates from the set is selected to represent a motion vector prediction for the block of pixels.
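The candidate-set construction described above might look like the sketch below. It assumes that a candidate "corresponding with" a spatial prediction means one with an identical motion vector that is already in the set, and that the final selection picks the candidate closest to the actual motion vector; the `Candidate` type and both function names are hypothetical.

```python
from typing import List, NamedTuple, Tuple


class Candidate(NamedTuple):
    mv: Tuple[int, int]  # motion vector (dx, dy)
    kind: str            # "spatial" or "temporal"


def prune_candidates(candidates: List[Candidate]) -> List[Candidate]:
    """Keep every temporal candidate; drop spatial duplicates."""
    kept: List[Candidate] = []
    for cand in candidates:
        if cand.kind == "temporal":
            kept.append(cand)  # temporal predictions are always kept
        elif not any(k.mv == cand.mv for k in kept):
            kept.append(cand)  # spatial prediction with a new vector
        # otherwise: redundant spatial prediction, removed from the set
    return kept


def select_predictor(kept: List[Candidate], actual_mv: Tuple[int, int]) -> Candidate:
    """Assumed selection rule: smallest L1 distance to the actual vector."""
    return min(
        kept,
        key=lambda c: abs(c.mv[0] - actual_mv[0]) + abs(c.mv[1] - actual_mv[1]),
    )
```

Removing redundant spatial candidates shortens the list, so the index of the selected predictor can be signaled with fewer bits.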
Abstract:
Apparatuses, methods and computer programs for utilizing motion prediction in video coding are disclosed. The coding type of a block of pixels of a video representation encoded in a bitstream is read, where the values of said pixels have a first precision. If the block is of a type predicted by using two or more reference blocks, a first reference pixel location in a first reference block and a second reference pixel location in a second reference block are determined. The first reference pixel location is used to obtain a first prediction having a second precision, which is higher than the first precision. The second reference pixel location is used to obtain a second prediction, also having the second precision. The first prediction and the second prediction are combined to obtain a combined prediction having the first precision.
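The benefit of keeping both predictions at the higher precision until they are combined can be illustrated with a small sketch. The 8-bit first precision, the 6 extra fractional bits and the function name `combine_bipred` are assumptions chosen for the example, not values taken from the abstract.

```python
def combine_bipred(pred0, pred1, extra_bits=6, bit_depth=8):
    """Average two high-precision predictions and round once.

    pred0 and pred1 hold samples scaled by 2**extra_bits (the assumed
    "second precision"). The result is clipped to bit_depth bits (the
    assumed "first precision").
    """
    shift = extra_bits + 1          # +1 performs the division by two
    offset = 1 << (shift - 1)       # rounding offset
    max_val = (1 << bit_depth) - 1
    return [
        min(max_val, max(0, (a + b + offset) >> shift))
        for a, b in zip(pred0, pred1)
    ]


# Interpolated values 100.5 and 101.5, stored with 6 fractional bits.
high0, high1 = [6432], [6496]
combined = combine_bipred(high0, high1)

# For comparison: rounding each prediction to 8 bits before averaging.
early0 = (high0[0] + 32) >> 6
early1 = (high1[0] + 32) >> 6
early_rounded = (early0 + early1 + 1) >> 1
```

Here `combined` is `[101]`, the exact average, while `early_rounded` is `102`, because rounding each prediction first and then rounding the average accumulates error.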
Abstract:
An apparatus for providing motion estimation for video encoding includes a selection element and a processing element. The selection element is configured to select a subset, including fewer than all of the candidate pixel locations, from among a plurality of candidate pixel locations used for motion vector determination, based on a relationship between a best candidate pixel location of a first level of accuracy and a best candidate pixel location of a second level of accuracy. The processing element is configured to process an input video sequence to determine a motion vector at the first level of accuracy, to refine the motion vector at the second level of accuracy, and to determine the motion vector at a third level of accuracy using only the subset of candidate pixel locations.
Abstract:
A system and method for enabling parallel decoder implementation for different views, even when there are existing dependencies between views. In various embodiments of the present invention, information is signaled to a decoder that slices are coded using certain constraints, so that parallel decoding of slices is possible. This signaling can be performed at a sequence parameter set level, or it could be performed at picture parameter set level, slice header level or macroblock header level. Additionally, the delay between various views is also signaled to the decoder, at the same location as the constraint information in various embodiments. Various algorithms can also be used to improve the coding efficiency of the system.
Abstract:
Transform coding is not restricted to normal block boundaries but is adapted to the characteristics of the prediction error. This makes it possible to improve coding efficiency by selecting and coding the best portion of the prediction error in terms of the rate-distortion tradeoff.
Abstract:
A system and method for implementing an adaptive interpolation filter structure that achieves high coding efficiency with significantly less complexity than more conventional systems. In various embodiments, a set of integer pixels is defined that is used in the interpolation process to obtain each sub-pixel sample at different locations. Samples at each sub-pixel position are generated with independent pixel-aligned one-dimensional (1D) adaptive interpolation filters. The filter coefficients are transmitted to a decoder or stored in a bit stream. At the decoder end, the received filter coefficients may be used in an interpolation process to create a motion-compensated prediction.
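A one-dimensional interpolation step of this kind can be sketched as below. The function name `interpolate_1d`, the integer coefficient format and the normalization shift are assumptions. Since the adaptive coefficients themselves are not given in the abstract, the example substitutes the fixed 6-tap half-sample luma filter of H.264, (1, -5, 20, 20, -5, 1) / 32, purely as stand-in values.

```python
def interpolate_1d(pixels, coeffs, shift=5, bit_depth=8):
    """Compute one sub-pixel sample from a line of integer pixels.

    pixels: integer-position samples aligned with the filter taps.
    coeffs: integer filter taps whose sum is 2**shift (assumed format).
    """
    if len(pixels) != len(coeffs):
        raise ValueError("one coefficient is required per pixel")
    acc = sum(p * c for p, c in zip(pixels, coeffs))
    value = (acc + (1 << (shift - 1))) >> shift  # round and normalize
    return min((1 << bit_depth) - 1, max(0, value))


# Stand-in coefficients; an adaptive scheme would receive its own
# taps from the bit stream for each sub-pixel position.
HALF_PEL_TAPS = [1, -5, 20, 20, -5, 1]

half_sample = interpolate_1d([10, 20, 30, 40, 50, 60], HALF_PEL_TAPS)
```

On this linear ramp, `half_sample` is `35`, the midpoint between the two central pixels 30 and 40.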
Abstract:
Systems and methods of signaling different filter structures for each pixel or sub-pixel position in motion compensation prediction video coding are provided. An encoder signals to a decoder one filter structure among a plurality of pre-defined candidates that is used for a respective pixel or sub-pixel position. In accordance with one embodiment, filter structures signaled to the decoder from the encoder “switch” between directional filter and radial filter structures during interpolation at the sub-pixel level. In accordance with another embodiment, filter structures that are signaled may switch between a directional filter structure and a separable filter structure at the sub-pixel level. Thus, not only can an encoder switch between different filter structures during interpolation, but a filter structure pair is provided that the encoder can utilize to interpolate a wide range of signals without increasing tap-length.
Abstract:
A system and method are provided for improving efficiency when entropy coding a block of quantized transform coefficients in video coding. Quantized coefficients are coded in two separate coding modes, namely, a run mode and a level mode. “Rules” for switching between these two modes are provided, and various embodiments are realized by allowing an entropy coder to adaptively decide when to switch between the two coding modes based on context information, the rules and/or by explicitly signaling the position of switching (e.g., whether or not it should switch coding modes).
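One possible switching rule is sketched below to make the two modes concrete. Everything specific here is an assumption: the rule (switch to level mode once a coefficient magnitude exceeds `switch_level`), the symbol tuples, and the function name `code_coefficients`. The abstract does not state which rules or context information are actually used.

```python
def code_coefficients(coeffs, switch_level=1):
    """Turn scanned quantized coefficients into mode-tagged symbols.

    Run mode emits ("run", zeros_before, value) for each nonzero value.
    Level mode emits ("level", value) for every coefficient, zero or not.
    An ("eob",) symbol marks the end of the block.
    """
    symbols = []
    mode = "run"
    zeros = 0
    for c in coeffs:
        if mode == "run":
            if c == 0:
                zeros += 1
                continue
            symbols.append(("run", zeros, c))
            zeros = 0
            if abs(c) > switch_level:
                mode = "level"  # assumed rule: large values follow
        else:
            symbols.append(("level", c))
    symbols.append(("eob",))
    return symbols


symbols = code_coefficients([0, 0, 1, 0, 3, 0, 2])
```

With the default threshold, the first two nonzero coefficients are coded in run mode, and the value 3 triggers the switch, so the remaining coefficients 0 and 2 are coded one by one in level mode.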
Abstract:
In accordance with an embodiment of the present invention, a method is disclosed comprising: representing coefficients of an adaptive image filter in binary words, wherein the coefficients are restricted and the binary words have a maximum number of bits defined at least in part by a precision index; computing at least one of a convolution sum and multiple partial convolution sums of the adaptive image filter, wherein the at least one of a convolution sum and multiple partial convolution sums has a constrained value range and a binary representation with a fixed maximum number of bits; and generating the output of the adaptive image filter based at least in part on the at least one of a convolution sum and multiple partial convolution sums.
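The relationship between a precision index, the coefficient word size and the width of the convolution sum can be illustrated with a fixed-point sketch. The scaling by 2**precision_index, the clamping rule, the 8-bit pixel range and all function names are assumptions made for illustration, not details from the abstract.

```python
def quantize_coeffs(coeffs, precision_index, word_bits):
    """Scale real-valued taps by 2**precision_index and clamp them
    to signed words of word_bits bits (the assumed restriction)."""
    scale = 1 << precision_index
    lo = -(1 << (word_bits - 1))
    hi = (1 << (word_bits - 1)) - 1
    return [min(hi, max(lo, round(c * scale))) for c in coeffs]


def accumulator_bits(q_coeffs, pixel_bits=8):
    """Signed bit width that can hold the worst-case convolution sum."""
    worst = ((1 << pixel_bits) - 1) * sum(abs(q) for q in q_coeffs)
    return worst.bit_length() + 1  # +1 for the sign bit


def filter_sample(pixels, q_coeffs, precision_index, pixel_bits=8):
    """Integer convolution sum, then round back to pixel precision.
    Requires precision_index >= 1 for the rounding offset."""
    acc = sum(p * q for p, q in zip(pixels, q_coeffs))
    value = (acc + (1 << (precision_index - 1))) >> precision_index
    return min((1 << pixel_bits) - 1, max(0, value))


q = quantize_coeffs([0.25, 0.5, 0.25], precision_index=6, word_bits=8)
bits_needed = accumulator_bits(q)
sample = filter_sample([10, 20, 30], q, precision_index=6)
```

For these taps, `q` is `[16, 32, 16]`, `bits_needed` is `15`, so the sum fits a 16-bit accumulator, and `sample` is `20`. Bounding the coefficients in advance is what allows the accumulator width to be fixed before any pixel data is seen.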