Abstract:
Reduced-complexity inverse discrete cosine transform (IDCT) masks and a method for reducing the number of IDCT calculations in video decoding are provided. The method comprises: accepting an n×m matrix of DCT coefficients; performing (n−y) horizontal IDCT operations, where y is greater than 0; performing y scaling operations; and generating an n×m block of pixel information. Some aspects of the method further comprise performing (m−z) vertical IDCT operations, where z is in the range between 0 and m/2. In some aspects, performing (n−y) horizontal IDCT operations includes performing IDCT operations for the first (n−y) horizontal rows. Then, performing y scaling operations includes: selecting the DC component from the first position of each horizontal row; scaling the selected DC component; and copying the scaled DC component into the remaining positions of each horizontal row.
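The horizontal pass described above can be illustrated with a minimal sketch. It rests on several assumptions that the abstract does not state: the block is stored as n rows of m coefficients, the scaling operations apply to the last y rows, the 1-D IDCT uses orthonormal normalization (so the DC scale factor is 1/sqrt(m)), and the function names are hypothetical. The vertical pass is omitted.

```python
import math

def idct_1d(coeffs):
    """Orthonormal 1-D inverse DCT (DCT-III) of one row of coefficients."""
    size = len(coeffs)
    out = []
    for k in range(size):
        # DC term contributes the same value to every output sample.
        total = coeffs[0] / math.sqrt(size)
        for j in range(1, size):
            angle = math.pi * (2 * k + 1) * j / (2 * size)
            total += math.sqrt(2.0 / size) * coeffs[j] * math.cos(angle)
        out.append(total)
    return out

def reduced_horizontal_pass(block, y):
    """Full IDCT on the first (n - y) rows; DC scale-and-copy on the last y rows.

    Assumption: the y scaled rows are the last y rows of the block.
    """
    n = len(block)
    m = len(block[0])
    result = []
    for r, row in enumerate(block):
        if r < n - y:
            result.append(idct_1d(row))
        else:
            # Select the DC component, scale it, and copy it across the row.
            scaled_dc = row[0] / math.sqrt(m)
            result.append([scaled_dc] * m)
    return result
```

Under these assumptions the shortcut matches the full transform exactly when the scaled rows carry only a DC coefficient; if those rows contain non-zero AC coefficients, the output is an approximation.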
Abstract:
Adjacent blocks are identified in an image. Coding parameters for the adjacent blocks are identified. Deblock filtering between the identified adjacent blocks is skipped if the coding parameters for the identified adjacent blocks are similar and not skipped if the coding parameters for the identified adjacent blocks are substantially different.
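As an illustration only, the skip decision might look like the sketch below. The abstract does not say which coding parameters are compared or how similarity is judged, so the fields (`motion_vector`, `ref_frame`, `has_residual`), the tolerance, and the function name are all hypothetical.

```python
from dataclasses import dataclass
from typing import Tuple

@dataclass(frozen=True)
class BlockParams:
    # Hypothetical coding parameters; the abstract does not enumerate them.
    motion_vector: Tuple[int, int]
    ref_frame: int
    has_residual: bool

def should_skip_deblocking(a: BlockParams, b: BlockParams, mv_tolerance: int = 0) -> bool:
    """Return True when two adjacent blocks are similar enough to skip filtering."""
    # Assumption: coded residual in either block counts as a substantial difference.
    if a.has_residual or b.has_residual:
        return False
    # Assumption: different reference frames count as a substantial difference.
    if a.ref_frame != b.ref_frame:
        return False
    dx = abs(a.motion_vector[0] - b.motion_vector[0])
    dy = abs(a.motion_vector[1] - b.motion_vector[1])
    return dx <= mv_tolerance and dy <= mv_tolerance
```

A decoder loop would call this once per shared block edge and run the deblocking filter only when it returns False.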
Abstract:
Embodiments of the present invention relate to systems and methods for signaling layer characteristics of multi-layer video sequences and for employing these signals in encoding, decoding and translation operations.
Abstract:
Adjacent regions are identified in an image. Coding parameters for the adjacent regions are identified. Selective filtering is performed at the region between the identified adjacent regions.
Abstract:
Embodiments of the present invention comprise systems and methods for picture up-sampling using direct interpolation. Some embodiments of the present invention provide an up-sampling procedure designed for the Scalable Video Coding extension of H.264/MPEG-4 AVC.
Abstract:
A temporally scalable video coding method is provided to interleave pictures from all layers of a video sequence, including video sub-sequences organized using enhancement layers, following a set of rules: (1) pictures in each layer are to be coded sequentially within the layer; (2) a picture from an upper layer should be coded when its temporally closest neighboring pictures among all lower layers (in both forward and backward directions, if available) have already been coded; in other words, coding of an upper-layer picture requires that the temporally closest neighboring pictures among all lower layers (in both forward and backward directions, if available) be coded beforehand. To ensure reasonable coding efficiency, the qualified reference pictures of each picture may be reordered so that they are ordered by their relative temporal distance from the current picture instead of by the default picture coding order.
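Rule (2) and the reference reordering can be sketched as follows. This is a simplified illustration: pictures are represented only by integer display times, and the function names are hypothetical rather than taken from the source.

```python
def closest_lower_neighbors(time, lower_layer_times):
    """Nearest earlier and later pictures (if any) among all lower layers."""
    earlier = [t for t in lower_layer_times if t < time]
    later = [t for t in lower_layer_times if t > time]
    neighbors = []
    if earlier:
        neighbors.append(max(earlier))
    if later:
        neighbors.append(min(later))
    return neighbors

def can_code(time, lower_layer_times, coded_times):
    """Rule (2): an upper-layer picture is codable once its neighbors are coded."""
    return all(t in coded_times for t in closest_lower_neighbors(time, lower_layer_times))

def reorder_references(current_time, reference_times):
    """Order references by temporal distance from the current picture.

    sorted() is stable, so references at equal distance keep their
    original (coding-order) position.
    """
    return sorted(reference_times, key=lambda t: abs(t - current_time))
```

For example, with lower-layer pictures at times 0, 4, and 8, an upper-layer picture at time 2 becomes codable only after the pictures at times 0 and 4 have been coded.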
Abstract:
A template is analyzed to determine step sizes for searching within a search area. The template is analyzed by first padding the template with data points to increase its size. Cross-correlation between the padded template and the original template leads to identification of an effective step size along multiple axes. Step sizes are derived for each of a horizontal axis, a vertical axis, and a third axis. Third-axis step sizes may correspond to rotation, scaling factor, subsampling factor, linear distance, time, or frequency. Windows of the search area, selected based on the step sizes, are then tested in a fast search by correlating the template to the selected windows to derive correlation coefficients. Any tested window that has a correlation coefficient exceeding a given value is a potential match for the template and is subject to a refined stage of comparison.
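The fast-search stage can be sketched in one dimension as below. This is a deliberately reduced illustration: the source describes multiple axes and derives the step size from the padded-template cross-correlation, whereas here the step size is simply passed in, the correlation coefficient is assumed to be Pearson's, and the function names are hypothetical.

```python
import math

def correlation(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    den = math.sqrt(sum((x - mean_a) ** 2 for x in a) *
                    sum((y - mean_b) ** 2 for y in b))
    # A constant window has zero variance; treat it as uncorrelated.
    return num / den if den else 0.0

def coarse_search(signal, template, step, threshold):
    """Test windows spaced `step` apart; keep those above the threshold."""
    candidates = []
    last_start = len(signal) - len(template)
    for start in range(0, last_start + 1, step):
        window = signal[start:start + len(template)]
        coeff = correlation(template, window)
        if coeff > threshold:
            candidates.append((start, coeff))
    return candidates
```

Each returned candidate would then go to the refined comparison stage; a larger step tests fewer windows at the risk of stepping over a match.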
Abstract:
In certain embodiments, to eliminate DC leakage into surrounding AC values, the scaling stage within a photo overlap transform operator is modified such that the off-diagonal elements of the associated scaling matrix have the value 0. In certain embodiments, the on-diagonal elements of the scaling matrix are given the values (0.5, 2). In some embodiments, the scaling is performed using a combination of reversible modulo arithmetic and lifting steps. In yet other embodiments, the amount of DC leakage is estimated at the encoder, and preprocessing occurs to mitigate the leakage, with the bitstream signaling that preprocessing has occurred. A decoder may then read the signal and use the information to mitigate DC leakage.
Abstract:
Techniques and tools for encoding and decoding a block of frequency coefficients are presented. An encoder selects a scan order from multiple available scan orders and then applies the selected scan order to a two-dimensional matrix of transform coefficients, grouping non-zero values of the frequency coefficients together in a one-dimensional string. The encoder entropy encodes the one-dimensional string of coefficient values according to a multi-level nested set representation. In decoding, a decoder entropy decodes the one-dimensional string of coefficient values from the multi-level nested set representation. The decoder selects the scan order from among multiple available scan orders and then reorders the coefficients back into a two-dimensional matrix using the selected scan order.
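The scan-order selection and the round trip between the two-dimensional matrix and the one-dimensional string can be sketched as below. The two scan orders, the selection criterion (earliest position of the last non-zero coefficient), and all function names are assumptions made for illustration; the multi-level nested set entropy coding is not shown.

```python
def make_scan_orders(n):
    """Two illustrative scan orders over an n x n block, as (row, col) lists."""
    horizontal = [(r, c) for r in range(n) for c in range(n)]
    vertical = [(r, c) for c in range(n) for r in range(n)]
    return {"horizontal": horizontal, "vertical": vertical}

def scan(block, order):
    """Flatten a 2-D block into a 1-D list following the scan order."""
    return [block[r][c] for r, c in order]

def unscan(values, order, n):
    """Rebuild the n x n block from a 1-D list using the same scan order."""
    block = [[0] * n for _ in range(n)]
    for value, (r, c) in zip(values, order):
        block[r][c] = value
    return block

def last_nonzero_index(values):
    """Index of the last non-zero value, or -1 if all values are zero."""
    for i in range(len(values) - 1, -1, -1):
        if values[i] != 0:
            return i
    return -1

def select_scan_order(block, orders):
    """Assumed criterion: pick the order that ends its non-zero run earliest."""
    return min(orders, key=lambda name: last_nonzero_index(scan(block, orders[name])))
```

In this sketch a block whose non-zero coefficients sit in the first column selects the vertical order, which packs them at the front of the string; the decoder applies `unscan` with the same order to recover the matrix.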