Abstract:
A method and system for compressing and decompressing video image data in real time employs thresholding and facsimile-based encoding to eliminate the need for computationally intensive two-dimensional transform-based compression techniques. The method operates first by forming a difference frame which contains only information pertaining to the difference between a current video image frame and a computed approximation of the video image frame. The difference frame is fed to a thresholder which categorizes each pixel in the frame as being either in a first set having intensities above or at a preset threshold, or a second set having intensities below a preset threshold. A facsimile-based compression algorithm is then employed to encode the first set of above or at threshold pixel locations. To compress the intensity data for each above or at threshold pixel, a quantizer and lossless encoder are preferably employed, with the quantizer serving to categorize the intensities by groups, and the lossless encoder using conventional coding, such as Huffman coding, to compress the intensity data further. Various techniques may be employed with the embodiments of the invention to adjust the actual amount of compressed data generated by the method and system to accommodate communication lines with different data rate capabilities.
Abstract:
Two software-only prefix encoding techniques employ encoding look-up tables to produce contributions to the encoded bit stream that are incremented in integral numbers of bytes to facilitate accelerated encoding rates at the expense of an acceptable trade-off in increased memory size requirements. The first technique, referred to as offset-based encoding, employs encoding tables which eliminate most of the bit-based operations that need to be performed by a prefix encoder without inordinately expanding memory requirements. In offset-based encoding, a Huffman table is employed which contains information for each number of bits by which the length of a Huffman word is offset from an integral number of bytes. The encoding method generates bytes of encoded data, even though the Huffman code has variable length code words for each symbol to be encoded. The second technique, referred to as byte-based encoding, employs a byte-based Huffman encoding table which operates even faster than the offset-based encoding scheme because it does not employ any bit-based operations at all; however, this is achieved at the expense of a considerable expansion in memory requirements.
Abstract:
To let decoder side motion vector derivation (DMVD) coded blocks be decoded in parallel, decoder side motion estimation (ME) dependency on spatially neighboring reconstructed pixels can be removed. Mirror ME and projective ME are only performed on two reference pictures, and the spatially neighboring reconstructed pixels will not be considered in the measurement metric of the decoder side ME. Also, at a video decoder, motion estimation for a target block in a current picture can be performed by calculating a motion vector for a spatially neighboring DMVD block, using the calculated motion vector to predict motion vectors of neighboring blocks of the DMVD block, and decoding the DMVD block and the target block in parallel. In addition, determining a best motion vector for a target block in a current picture can be performed by searching only candidate motion vectors in a search window, wherein candidate motion vectors are derived from a small range motion search around motion vectors of neighboring blocks.
Abstract:
In a scalable video codec, an adaptive Wiener filter with offset aims to minimize the differences between two input pictures or picture regions, and the filter coefficients need to be transmitted to decoder site.
Abstract:
Techniques related to demosaicing for digital image processing are discussed. Such techniques include correcting defective pixels by detecting hot and warm pixels and correcting such detected hot and warm pixels based on neighboring pixels and angle compensation including detecting dominant angles and compensating for such detected angles during demosaicing.
Abstract:
Adaptive control can use hierarchical motion estimation (HME) and/or multiple reference motion estimation (MRME) for the motion estimation of current encoding blocks. Both HME and MRME are allowed in the motion estimation to achieve a high coding gain. Control consists of slice level control and macro-block (MB) level control. A slice is one or more contiguous macroblocks. In slice level control, it is decided to use only one reference frame or use multiple reference frames to coding current slice based on the motion vectors obtained in coarse level motion estimation. In MB level control, it is decided to perform MRME or perform HME for the MB and its subblocks based on the coarse level motion vectors of the MB.
Abstract:
Systems, apparatus, articles, and methods are described including operations to generate a weighted look-up-table based at least in part on individual pixel input values within an active block region and on a plurality of contrast compensation functions. A second level compensation may be performed for a center pixel block of the active region based at least in part on the weighted look-up-table.
Abstract:
Systems, devices and methods are described including performing scalable video coding using inter-layer residual prediction, inter-layer residual prediction in an enhancement layer coding unit, prediction unit, or transform unit may use residual data obtained from a base layer or from a lower enhancement layer. The residual may be subjected to upsample filtering and/or refinement filtering. The upsample or refinement filter coefficients may be predetermined or may be adoptively determined.
Abstract:
According to one embodiment, a method is disclosed. The method includes performing a local content analysis on video data to classify pixels into singular pixels, motion pixels and static pixels.
Abstract:
Systems, apparatus and methods are described including determining a prediction residual for a channel of video data; and determining, using the first channel's prediction residual, a prediction residual for a second channel of the video data. Further, a prediction residual for a third channel of the video data may be determined using the second channel's prediction residual.