Abstract:
Methods and apparatuses for data pruning for video compression using example-based super resolution are provided. A method and apparatus for encoding is provided in which patches of video are extracted from input video, grouped together using a clustering method, and representative patches are packed into patch frames. The original video is downsized and sent either along with, or in addition to, the patch frames. At a decoder, the method and apparatus provided extract patches from the patch frames and create a patch library. The regular video frames are upsized and the low resolution patches are replaced by patches from the patch library by searching the library using the patches in the decoded regular frames as keywords. If there are no appropriate patches, no replacement is made. A post processing procedure is used to enhance the spatiotemporal smoothness of the recovered video.
Abstract:
A user interactive video tone mapping method is provided that can be used for the tone reproduction of high dynamic range (HDR) content on low dynamic range (LDR) displays. The method comprises the steps of selecting a frame of a video sequence; providing a user interface for a user to manually apply scribble marks to the frame; manually applying N scribble marks to the frame, N being a whole number of value 1 or greater; and tone mapping the video sequence responsive to the scribble marks.
Abstract:
A method and associated apparatus for using a trajectory-based technique to detect a moving object in a video sequence at incorporates human interaction through a user interface. The method comprises steps of identifying and evaluating sets of connected components in a video frame, filtering the list of connected components by comparing features of the connected components to predetermined criteria, identifying candidate trajectories across multiple frames, evaluating the candidate trajectories to determine a selected trajectory, eliminating incorrect trajectories through use of the interface and processing images in said video sequence responsive to the evaluating and eliminating steps.
Abstract:
The present invention provides a method, apparatus and system for film grain cache splitting for film grain simulation. In one embodiment of the present invention a method for storing film grain patterns includes storing at least a first portion of film grain patterns in an internal memory and storing at least a second portion of the film grain patterns in an external memory. That is, in the present invention a method for film grain cache splitting for film grain simulation includes splitting the storage of film grain patterns between an internal cache and an external memory. In one embodiment of the present invention, the internal cache is integrated into an integrated circuit chip of a decoder.
Abstract:
Methods and apparatus are provided for reducing vector quantization error through patch shifting. A method generates, from an input video sequence, one of more high resolution replacement patches, the one or more high resolution replacement patches for replacing one or more low resolution patches during a reconstruction of the input video sequence. This generating step generates the one or more high resolution replacement patches using data corresponding to a patch spatial shifting process, the patch spatial shifting process for reducing jittery artifacts caused by a motion-induced vector quantization error in the one or more high resolution replacement patches, the data for at least deriving a patch size of the one or more high resolution replacement patches such that the one or more high resolution replacement patches are generated to have the patch size greater than a patch size of the one or more low resolution patches in order to be suitable for use in the patch spatial shifting process.
Abstract:
A method for inserting a logo into a stereo video image to generate an overlaid stereo image, the method comprising: detecting presence of stereo pictures in the video image and, when stereo pictures are detected, determining the 3D format of said stereo pictures, said 3D format being a stereo spatially multiplexed format; generating a stereo logo comprising stereo spatially multiplexed logo pictures including a representation of the logo, said stereo spatially multiplexed logo pictures being arranged in said 3D format; and combining the stereo logo and the video image to generate the overlaid stereo image in said 3D format.
Abstract:
Methods and systems for delivering three-dimensional content over a network are disclosed. In accordance with the methods and systems, disparity adjustment of three-dimensional video streams can be made by enabling a receiver to dynamically select among several pairs of left & right views in accordance with user-preferences. Further, Multi-view Video Coding methods and systems are disclosed in which the dynamic selection and adjustment is facilitated by configuring dependencies of enhancement views to reference only base views.
Abstract:
A method of object-aware video coding is provided that comprises the steps of: receiving a video sequence having a plurality of frames; selecting at least two frames; determing total area of at least one object of interest in each of the at least two frames; comparing the total area to a threshold area; classifying each of the at least two frames as being a low object weighted frame or a high object weighted frame, low object weighted frames being frames having the total area exceeding the threshold area and high object weighted frames being frame having the total area not exceeding the threshold area; and encoding each low object weighted frame according to one encoding mode and encoding each high object weighted frame according to a different encoding mode.
Abstract:
In an implementation, a pixel is selected from a target digital image. Multiple candidate pixels, from one or more digital images, are evaluated based on values of the multiple candidate pixels. For the selected pixel, a corresponding set of pixels is determined from the multiple candidate pixels based on the evaluations of the multiple candidate pixels and on whether a predetermined threshold number of pixels have been included in the corresponding set. Further for the selected pixel, a substitute value is determined based on the values of the pixels in the corresponding set of pixels. Various implementations described provide adaptive pixel-based spatio-temporal filtering of images or video to reduce film grain or noise. Implementations may achieve an “even” amount of noise reduction at each pixel while preserving as much picture detail as possible by, for example, averaging each pixel with a constant number, N, of temporally and/or spatially correlated pixels.
Abstract:
The present invention concerns a method and associated apparatus for using a trajectory-based technique to detect a moving object in a video sequence, such as the ball in a soccer game. In one embodiment, the method comprises steps of identifying and evaluating sets of connected components in a video frame, filtering the list of connected components by comparing features of the connected components to predetermined criteria, identifying candidate trajectories across multiple frames, evaluating the candidate trajectories to determine a selected trajectory, and processing images in the video sequence based at least in part upon the selected trajectory.