Abstract:
A method and apparatus for receiving and processing digitized video data in a discrete cosine transform (DCT) domain exploits the orthogonality of a convolution function along with the data sparseness present in the DCT domain. In a preferred method, the convolution function is applied to vectors associated with a plurality of input video data frames in consideration of an orthogonal characteristic of the convolution function. More specifically, cross-products of the input vectors which would yield non-zero output vectors are identified and weighting factors for the cross-products are determined in consideration of this orthogonality. An output video data frame is generated from the non-zero output vectors and the weighting factors. A convolution operation based upon the disclosed method and aparatus is suitable for chroma-keying as well as other video or audio blending applications.
Abstract:
Multiplier-free implementation of an approximation of the DCT used in image and video processing. In accordance with the primary aspect of the present invention, image and video processing is done with no multiplications and a fewer number of operations through the application of a modified Arai, Agui, and Nakajima (AAN) scheme for eight-point DCT.
Abstract:
Image processing techniques which involve direct manipulation of the compressed domain representation of an image to achieve the desired spatial domain processing without having to go through a complete decompression and compression process. The techniques include processing approaches for performing the eight operations in D4 (the dihedral group of symmetries of a square) on JPEG images using the discrete cosine transform (DCT) domain representation of the images directly. For a task such as image rotation by 90° (an operation in D4), DCT-domain based methods can yield nearly a five-fold increase in speed over a spatial-domain based technique. These simple compressed-domain based processing techniques are well suited to the imaging tasks that are needed in a JPEG-based digital still-camera system.
Abstract:
A method of compressing color source image data includes forming a quantization table with a "supra-threshold" term. This method includes a step of selecting a set of target images, where each target image includes one or more image elements such as text. These image elements are then analyzed to identify those that are more important for visual quality. These "supra-threshold" terms are then selected that gives a larger weight to the quantization table elements that correspond to important image elements and a smaller weight to the table elements that correspond to less important image elements. This process selectively weights the characteristics of each DCT basis vectors. By giving larger weights to the table elements that correspond to the "up-downness" of the image, i.e., the vertical attributes of the image elements, and the "left-rightness" of the image, i.e., the horizontal attributes of the image elements, and smaller weights to the table elements corresponding to the "criss-crossedness" of the image, i.e., the diagonal attributes of the image elements, the visual quality of an image that includes text can be preserved while significantly increasing the compression ratio.
Abstract:
A method is described for filtering compressed images represented in the discrete-cosine-transform (DCT) domain. The filter includes three sparse, vertical submatrices which are sparse versions of the vertical filter components (VFCs) of a desired filter function that have been combined in such a way as to eliminate many of the non-zero elements. The filter also includes three sparse, horizontal transpose submatrices, which, like the vertical submatrices, are sparse versions of the horizontal filter components of the filter function. The sparseness of these sparse submatrices yields a significant reduction in the number of computations required to filter the image in the DCT domain. To take advantage of this discovery, the input DCT data blocks are "butterflied" to retain the relationship between the input data blocks and the filtered output data blocks as a function of these sparse submatrices. The sparseness of the vertical and horizontal submatrices reduces the number of computations required to filter the image. The sparseness of the DCT data blocks can also be used to further reduce the number of computations required.
Abstract:
In one example, a method includes identifying a first set of pixels in co-located pairs in a corresponding pair of multiview image frames for which the co-located pairs have a disparity between the pixels that is greater than a selected disparity threshold. The method further includes identifying a second set of pixels in at least one of the image frames that are within a selected distance of an intensity transition greater than a selected intensity transition threshold. The method further includes applying crosstalk correction to pixels that are identified as being in at least one of the first set and the second set.
Abstract:
To improve contrast ratio of the image on a backlit display plane such as a liquid crystal display (“LCD”), each area of the image that has separately controllable backlight may be given full backlight until an average or composite brightness of the image in that area is less than a threshold value at which light leakage through the image from full-strength backlight begins to be noticable by a viewer. For image areas with composite brightness less than that threshold, backlight brightness may be reduced in proportion to how much below the threshold the area's composite image brightness is. Backlight brightness may also be adjusted for other image aspects such as (1) the presence of bright pixels in an otherwise relatively dark area, (2) whether the area is adjacent to one or more other areas in which the image information is in motion, and/or (3) time-averaging of image information over several successive frames of such information.
Abstract:
A system including a quality estimation module configured to estimate a visual quality of video content based on data from a decoder module. The system further including a settings database configured to store a plurality of predetermined settings. The settings database outputs at least one of the predetermined settings in response to the visual quality. The system further including a video post-processor module configured to automatically adjust settings of the video post-processor module based on the at least one of the predetermined settings. The video content is processed based on the settings of the video post-processor module that were automatically adjusted.
Abstract:
The techniques described in this disclosure are directed to interpolating pixel values. In some examples, the techniques interpolate a pixel value for an interpolated center pixel based on pixel values of pixel that reside on diagonal lines that are orthogonal to one another. The techniques may determine first order derivative values and, in some examples, second order derivative values to determine which pixels to utilize to interpolate the pixel values for the interpolated center pixel. The techniques may similarly determine pixel values for non-center interpolated pixels using orthogonal vertical and horizontal lines.
Abstract:
The temporal and/or spatial characteristics of a macroblock are analyzed in order to reduce the number of modes for which motion estimation and rate distortion efficiency calculations are to be performed. In one embodiment, macroblock mean and variance characteristics are analyzed to merge sub-blocks together within the macroblock. These merged sub-blocks may be used to identify both inter and intra modes for the macroblock.