Abstract:
A method and associated apparatus for using a trajectory-based technique to detect a moving object in a video sequence at incorporates human interaction through a user interface. The method comprises steps of identifying and evaluating sets of connected components in a video frame, filtering the list of connected components by comparing features of the connected components to predetermined criteria, identifying candidate trajectories across multiple frames, evaluating the candidate trajectories to determine a selected trajectory, eliminating incorrect trajectories through use of the interface and processing images in said video sequence responsive to the evaluating and eliminating steps.
Abstract:
The present invention provides a method, apparatus and system for film grain cache splitting for film grain simulation. In one embodiment of the present invention a method for storing film grain patterns includes storing at least a first portion of film grain patterns in an internal memory and storing at least a second portion of the film grain patterns in an external memory. That is, in the present invention a method for film grain cache splitting for film grain simulation includes splitting the storage of film grain patterns between an internal cache and an external memory. In one embodiment of the present invention, the internal cache is integrated into an integrated circuit chip of a decoder.
Abstract:
Methods and apparatus are provided for reducing vector quantization error through patch shifting. A method generates, from an input video sequence, one of more high resolution replacement patches, the one or more high resolution replacement patches for replacing one or more low resolution patches during a reconstruction of the input video sequence. This generating step generates the one or more high resolution replacement patches using data corresponding to a patch spatial shifting process, the patch spatial shifting process for reducing jittery artifacts caused by a motion-induced vector quantization error in the one or more high resolution replacement patches, the data for at least deriving a patch size of the one or more high resolution replacement patches such that the one or more high resolution replacement patches are generated to have the patch size greater than a patch size of the one or more low resolution patches in order to be suitable for use in the patch spatial shifting process.
Abstract:
A method for inserting a logo into a stereo video image to generate an overlaid stereo image, the method comprising: detecting presence of stereo pictures in the video image and, when stereo pictures are detected, determining the 3D format of said stereo pictures, said 3D format being a stereo spatially multiplexed format; generating a stereo logo comprising stereo spatially multiplexed logo pictures including a representation of the logo, said stereo spatially multiplexed logo pictures being arranged in said 3D format; and combining the stereo logo and the video image to generate the overlaid stereo image in said 3D format.
Abstract:
Methods and systems for delivering three-dimensional content over a network are disclosed. In accordance with the methods and systems, disparity adjustment of three-dimensional video streams can be made by enabling a receiver to dynamically select among several pairs of left & right views in accordance with user-preferences. Further, Multi-view Video Coding methods and systems are disclosed in which the dynamic selection and adjustment is facilitated by configuring dependencies of enhancement views to reference only base views.
Abstract:
A method of object-aware video coding is provided that comprises the steps of: receiving a video sequence having a plurality of frames; selecting at least two frames; determing total area of at least one object of interest in each of the at least two frames; comparing the total area to a threshold area; classifying each of the at least two frames as being a low object weighted frame or a high object weighted frame, low object weighted frames being frames having the total area exceeding the threshold area and high object weighted frames being frame having the total area not exceeding the threshold area; and encoding each low object weighted frame according to one encoding mode and encoding each high object weighted frame according to a different encoding mode.
Abstract:
In an implementation, a pixel is selected from a target digital image. Multiple candidate pixels, from one or more digital images, are evaluated based on values of the multiple candidate pixels. For the selected pixel, a corresponding set of pixels is determined from the multiple candidate pixels based on the evaluations of the multiple candidate pixels and on whether a predetermined threshold number of pixels have been included in the corresponding set. Further for the selected pixel, a substitute value is determined based on the values of the pixels in the corresponding set of pixels. Various implementations described provide adaptive pixel-based spatio-temporal filtering of images or video to reduce film grain or noise. Implementations may achieve an “even” amount of noise reduction at each pixel while preserving as much picture detail as possible by, for example, averaging each pixel with a constant number, N, of temporally and/or spatially correlated pixels.
Abstract:
The present invention concerns a method and associated apparatus for using a trajectory-based technique to detect a moving object in a video sequence, such as the ball in a soccer game. In one embodiment, the method comprises steps of identifying and evaluating sets of connected components in a video frame, filtering the list of connected components by comparing features of the connected components to predetermined criteria, identifying candidate trajectories across multiple frames, evaluating the candidate trajectories to determine a selected trajectory, and processing images in the video sequence based at least in part upon the selected trajectory.
Abstract:
A method of tone mapping high dynamic range images for display on low dynamic range displays wherein a high dynamic range image is first accessed. The high dynamic range image is segmented into different regions such that each region is represented by a matrix, where each element of the matrix is a weight or probability of a pixel. An exposure of each region is determined or calculated and the exposure values are applied to the regions responsive to the weight or probability. The different regions are then fused together to obtain a final tone mapped image.
Abstract:
Simulation of a block of film grain for addition to a block of an image occurs by first establishing at least one parameter at least in part in accordance with an attribute of the image block. At least one at least one block of film grain is simulated from at least one film grain pattern generated in accordance with the at least one parameter. In particular, the film grain pattern is generated using a bit accurate technique.