摘要:
In a video distribution system, a divider to segment an input video stream into partitions for each of a plurality of channels of the video stream is provided. A channel analyzer is coupled to the divider wherein the channel analyzer decomposes the partitions. An encoder is coupled to the channel analyzer to encode the decomposed partitions into an encoded bitstream wherein the encoder receives coding information from at least one of the plurality of channels to be used in encoding the decomposed partitions into the encoded bitstream. A decoder receives the coded bitstream to decode the received bitstream and to reconstruct the input video stream. The decoder uses the coding information to decode the bitstream.
摘要:
In various embodiments, a significance map of a matrix of video data coefficients is encoded or decoded using context-based adaptive binary arithmetic coding (CABAC). The significance map scanned line-by-line along a scanning pattern. Each line may be a vertical, horizontal, or diagonal section of the scanning pattern. Context models for each element processed in a particular line are chosen based on values of neighboring elements that are not in the line. Avoiding reliance on neighbors that are in the same line facilitates parallel processing.
摘要:
Embodiments of the invention generally provide a method and apparatus for complexity-scalable video coding. One embodiment of a method for video coding includes receiving a sequence of one or more video frames, obtaining a budget for the one or more video frames, the budget specifying a maximum number of computations that may be used in performing motion estimation for the one or more video frames, allocating the maximum number of computations among individual ones of the one or more video frames, performing motion estimation in accordance with the allocating, and outputting a motion estimate for the sequence.
摘要:
A method and system of encoding and decoding digital video content. The digital video content comprises a stream of pictures which can each be intra, predicted, or bi-predicted pictures. Each of the pictures comprises macroblocks that can be further divided into smaller blocks. The method entails encoding and decoding each picture in said stream of pictures in either frame mode or in field mode.
摘要:
A method for processing a block of transform coefficients during intra coding includes receiving an N×M block of transform coefficients, wherein N is a row width of the block and M is a column height of the block. A first scan coding pattern is determined from a set of scan coding patterns comprising a diagonal scan coding pattern, a horizontal scan coding pattern, and a vertical scan coding pattern. The method further includes partitioning the N×M block into a plurality of sub-blocks each comprising a plurality of the transform coefficients; and processing the plurality of sub-blocks, one at a time, in a coding order along the first scan coding pattern to generate a bit sequence. The processing further comprises, for the sub-blocks containing at least one non-zero transform coefficient, coding at least the non-zero transform coefficients in a transform coefficient sequence along a second scan coding pattern.
摘要:
In one embodiment, a method for encoding or decoding video content is provided. The method includes determining a set of interpolation filters for use in interpolating sub-pel pixel values and a mapping between interpolation filters in the set of interpolation filters and different sizes of prediction units (PUs) of video content. A PU of video content is received and a size of the received PU is determined The method determines an interpolation filter in the set of interpolation filters based on a mapping between the interpolation filter and the size of the received PU to interpolate a sub-pel pixel value for use in a temporal prediction process for the PU.
摘要:
A method and system for interpolating video pixels is described, in which the values of a first quarter pixel, a half pixel and a second quarter pixel are calculated based on certain interpolation filter coefficients.
摘要:
A process performs a search on a base reference frame in a video signal to determine a first motion vector for each of the possible partitions of a current macroblock. The first motion vector for a partition provides the most optimal match for the partition in the base reference frame. An optimal partition for the current macroblock, which gives a minimum motion estimation cost among all the possible partitions in the base reference frame, is determined. The process performs the search on the optimal partition over a non-base reference frame in the video signal to determine a second motion vector that provides the most optimal match for the corresponding partition in the non-base reference frame. A simplified search is performed on each partition other than the optimal partition over the non-base reference frame to determine the second motion vector for the corresponding partition.
摘要:
In a video distribution system, a divider (105) to segment an input video stream (302) into partitions for each of a plurality of channels of the video stream is provided. A channel analyzer (306) is coupled to the divider wherein the channel analyzer decomposes the partitions. An encoder (106) is coupled to the channel analyzer to encode the decomposed partitions into an encoded bitstream (208, 210) wherein the encoder receives coding information from at least one of the plurality of channels to be used in encoding the decomposed partitions into the encoded bitstream. A decoder (124) receives the coded bitstream to decode the received bitstream and to reconstruct the input video stream. The decoder uses the coding information to decode the bitstream.
摘要:
There is a coding. The coding may include preparing video compression data based on source pictures utilizing a processor. The preparing may include processing a generated transform unit, including generating a significance map having a significance map array with y-x locations corresponding to the transform array. The generating may include scanning, utilizing a zigzag scanning pattern, a plurality of significance map elements in the significance map array. The generating may also include determining, utilizing the zigzag scanning pattern, a context model for coding a significance map element of the plurality of significance map elements based on a value associated with at least one coded neighbor significance map element of the significance map element in the significance map array. There is also a decoding including processing video compression data which is generated in the coding.