摘要:
In one embodiment, a method includes receiving a prediction unit (PU) for a coding unit (CU) of the video content. The method analyzes the prediction unit to determine a size of prediction unit. A size of a transform unit is determined based on the size of the prediction unit based on a set of rules. The set of rules specify the size of the transform unit is linked to the size of prediction unit and not a size of the coding unit. The method then outputs the size of the transform unit for use in a transform operation.
摘要:
A method and apparatus for selecting a coding mode for a block of a current picture are disclosed. For example, the method selects a coding mode in accordance with a cost function, for coding the block, wherein the cost function comprises a coding distortion parameter and a number of coding bits parameter, wherein said coding distortion parameter is measured in accordance with at least one of: between a prediction residual and a reconstructed prediction residual, or between a transformed prediction residual and a dequantized transformed prediction residual, wherein the number of coding bits parameter is computed in accordance with at least one of: from a true number of compressed bits resulting from coding said block, directly from a plurality of bins, or directly from a plurality of quantized transform coefficients.
摘要:
In one embodiment, a method for encoding or decoding video content is provided. The method includes determining a set of interpolation filters for use in interpolating sub-pel pixel values and a mapping between interpolation filters in the set of interpolation filters and different sizes of prediction units (PUs) of video content. A PU of video content is received and a size of the received PU is determined The method determines an interpolation filter in the set of interpolation filters based on a mapping between the interpolation filter and the size of the received PU to interpolate a sub-pel pixel value for use in a temporal prediction process for the PU.
摘要:
In one embodiment, a method receives a unit of video content. The unit of video content is coded in a bi-prediction mode. A motion vector predictor candidate set is determined for a first motion vector for the unit. The method then determines a first motion vector predictor from the motion vector prediction candidate set for the first motion vector and calculates a second motion vector predictor for a second motion vector for the unit of video content. The second motion vector predictor is calculated based on the first motion vector or the first motion vector predictor.
摘要:
A method for processing a block of transform coefficients during inter coding includes receiving, during inter coding, an N×M block of transform coefficients, wherein N is a row width of the block and M is a column height of the block. The method further includes partitioning the N×M block into a plurality of sub-blocks each comprising a plurality of the transform coefficients; and processing the plurality of sub-blocks, one at a time, in a coding order along a first diagonal scan coding pattern to generate a bit sequence corresponding to the N×M block. The processing comprises, for the sub-blocks containing at least one non-zero transform coefficient, coding at least the non-zero transform coefficients in a transform coefficient sequence along a second diagonal scan coding pattern.
摘要:
In various embodiments, a significance map of a matrix of video data coefficients is encoded or decoded using context-based adaptive binary arithmetic coding (CABAC). The significance map scanned line-by-line along a scanning pattern. Each line may be a vertical, horizontal, or diagonal section of the scanning pattern. Context models for each element processed in a particular line are chosen based on values of neighboring elements that are not in the line. Avoiding reliance on neighbors that are in the same line facilitates parallel processing.
摘要:
There is a coding. The coding may include preparing video compression data based on source pictures. The preparing may include partitioning the source pictures into coding units and/or generating a transform unit having a transform array. The preparing may also include processing the generated transform unit. The processing may include generating a significance map, having a significance map array with y-x locations corresponding to the y-x locations of the transform array. The processing may also include determining, utilizing a scanning pattern, a context model for coding a significance map element of the plurality of significance map elements based on a value associated with at least one coded neighbor significance map element of the plurality of significance map elements in the significance map array. There is also a decoding including processing video compression data which is generated in the coding.
摘要:
A method and system of encoding and decoding digital video content. The digital video content comprises a stream of pictures which can each be intra, predicted, or bi-predicted pictures. Each of the pictures comprises macroblocks that can be further divided into smaller blocks. The method entails encoding and decoding each picture in said stream of pictures in either frame mode or in field mode.
摘要:
In various embodiments, a significance map of a matrix of video data coefficients is encoded or decoded using context-based adaptive binary arithmetic coding (CABAC). The significance map scanned line-by-line along a scanning pattern. Each line may be a vertical, horizontal, or diagonal section of the scanning pattern. Context models for each element processed in a particular line are chosen based on values of neighboring elements that are not in the line. Avoiding reliance on neighbors that are in the same line facilitates parallel processing.
摘要:
A method and system of encoding and decoding digital video content. The digital video content comprises a stream of pictures which can each be intra, predicted, or bi-predicted pictures. Each of the pictures comprises macroblocks that can be further divided into smaller blocks. The method entails encoding and decoding each picture in said stream of pictures in either frame mode or in field mode.