摘要:
A plurality of programs are statistically multiplexed using a plurality of two-pass encoders. Pictures in the plurality of programs are encoded in a first encoding pass using a plurality of first encoding modules. RW—passOne, which is a total number of bits used to encode all the pictures in a sequence in the plurality of programs in the first encoding pass, is determined. RW—passTwo, which is a total target number of bits for all pictures in a same sequence in the plurality of programs in a second encoding pass, is also determined. Rtwo,Ff,picType(i,l), which is a target bit allocation for each picture in each program l of the plurality of programs, is determined based on RW—passOne, RW—passTwo, and Rone,Ff,picType(i,l). Rone,Ff,picType(i,l) is a number of bits consumed for a corresponding picture in the first encoding pass. The plurality of programs are encoded using a plurality of second encoding modules and Rtwo,Ff,picType(i,l) in the second encoding pass to form a plurality of variable bit rate (VBR) compressed bit streams. Thereafter, the plurality of VBR compressed bit streams are multiplexed to form a single constant bit rate (CBR) bit stream.
摘要:
A video coding system includes a prediction module configured to perform multi-reference picture sub-pixel motion estimation. Reference pictures are selected for the sub-pixel motion estimation based on full-pixel motion estimation. A cost function is used to select the reference pictures from one or more groups of reference pictures for a block in a macroblock. The cost is based on full-pixel motion estimation errors and a predicted motion vector for the block. The selected reference pictures are used for sub-pixel motion estimation to get the final motion vector for each block.
摘要:
In one embodiment, a method includes receives a prediction unit (PU) for a coding unit (CU) of video content. The PU is partitionable into a plurality of PU partition types. The method determines a PU partition type for the PU and a residual tree structure based on the PU partition type for partitioning of the CU into transform units (TUs). The residual tree includes a binary partition of a node into two. A TU partition for the PU partition type is determined based on the residual tree structure and a desired level of partitioning in the residual tree structure. The method then uses the TU partition in a transform operation.
摘要:
Devices and methods that allow for applying a wavefront scan to rectangular transform blocks are described herein. Such devices and methods may allow greater efficiencies for entropy coding by enabling parallel processing of transform coefficients. In some embodiments, a method for coding a digital video sequence having a plurality of pictures includes dividing at least one of the plurality of pictures into blocks, performing a rectangular transform on at least one of said blocks to produce one or more transform coefficients, performing quantization on the one or more transform coefficients, and encoding the one or more transform coefficients, one at a time, along a coding scan order, to generate a compressed bitstream. The coding scan order may include a forward wavefront scan order or a reverse wavefront scan order and the quantization may result in producing quantized transform coefficients.
摘要:
A method for processing a block of transform coefficients during inter coding includes receiving, during inter coding, an N×M block of transform coefficients, wherein N is a row width of the block and M is a column height of the block. The method further includes partitioning the N×M block into a plurality of sub-blocks each comprising a plurality of the transform coefficients; and processing the plurality of sub-blocks, one at a time, in a coding order along a first diagonal scan coding pattern to generate a bit sequence corresponding to the N×M block. The processing comprises, for the sub-blocks containing at least one non-zero transform coefficient, coding at least the non-zero transform coefficients in a transform coefficient sequence along a second diagonal scan coding pattern.
摘要:
In one embodiment, a method receives a current picture of video content. The method then determines a set of reference pictures for the current picture and a temporal distance from the current picture for each of the set of reference pictures. A combined list of reference pictures in the set of reference pictures is determined where an order of pictures in the combined list is based on the temporal distance for each of the set of reference pictures to the current picture. The method then uses the combined list to perform temporal prediction for the current picture.
摘要:
A plurality of programs are statistically multiplexed using a statistical multiplexing (stat mux) system. Rchannel, a constant total bit rate for the plurality of programs, is determined. Cl,Ff,picTYpe, a complexity for each picture in the plurality of programs, is determined. Tl,Ff,picType, a bit allocation for each picture in the plurality of programs, is determined based on Cl,Ff,picTYpe and Rchannel. The plurality of programs are encoded using a plurality of encoders, a combined encoder buffer, and Tl,Ff,picType to form a plurality of variable bit rate (VBR) compressed bit streams. The plurality of VBR compressed bit streams are multiplexed to form a single constant bit rate (CBR) bit stream.
摘要:
Embodiments for high efficiency low complexity interpolation filters for High Efficiency Video Coding are disclosed herein, specifically novel techniques for a video compression system. In order to estimate and compensate sub-pel displacements, the image signal on these sub-pel positions is generated by an interpolation process. In HEVC, sub-pel pixel interpolation is performed using filters. Generally, the filter may have 8 taps to determine the sub-pel pixel values for sub-pel pixel positions, such as half-pel and quarter-pel positions. The taps of an interpolation filter weight the integer pixels with coefficient values to generate the sub-pel signals. Different coefficients may produce different compression performance in signal distortion and noise.
摘要:
In various embodiments, a significance map of a matrix of video data coefficients is encoded or decoded using context-based adaptive binary arithmetic coding (CABAC). The significance map scanned line-by-line along a scanning pattern. Each line may be a vertical, horizontal, or diagonal section of the scanning pattern. Context models for each element processed in a particular line are chosen based on values of neighboring elements that are not in the line. Avoiding reliance on neighbors that are in the same line facilitates parallel processing.
摘要:
There is a coding. The coding may include preparing video compression data based on source pictures. The preparing may include partitioning the source pictures into coding units and/or generating a transform unit having a transform array. The preparing may also include processing the generated transform unit. The processing may include generating a significance map, having a significance map array with y-x locations corresponding to the y-x locations of the transform array. The processing may also include determining, utilizing a scanning pattern, a context model for coding a significance map element of the plurality of significance map elements based on a value associated with at least one coded neighbor significance map element of the plurality of significance map elements in the significance map array. There is also a decoding including processing video compression data which is generated in the coding.