Abstract:
A method of object-aware video coding is provided that comprises the steps of: receiving a video sequence having a plurality of frames; selecting at least two frames; determining the total area of at least one object of interest in each of the at least two frames; comparing the total area to a threshold area; classifying each of the at least two frames as a low object weighted frame or a high object weighted frame, low object weighted frames being frames having the total area exceeding the threshold area and high object weighted frames being frames having the total area not exceeding the threshold area; and encoding each low object weighted frame according to one encoding mode and each high object weighted frame according to a different encoding mode.
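The classification step can be illustrated with a minimal sketch. It assumes that each object of interest is given as a bounding box and that area is measured in pixels; the box representation, the function names, and the mode names `mode_a` and `mode_b` are hypothetical and do not come from the abstract. The "low" and "high" labels follow the abstract's wording as written.

```python
from typing import List, Tuple

# Hypothetical representation: each object of interest is a bounding box
# (x, y, width, height). The abstract does not say how area is measured.
Box = Tuple[int, int, int, int]


def total_object_area(boxes: List[Box]) -> int:
    """Sum of box areas. Overlapping boxes are counted twice in this sketch."""
    return sum(w * h for _, _, w, h in boxes)


def classify_frame(boxes: List[Box], threshold_area: int) -> str:
    """Label a frame using the abstract's wording literally:
    total area exceeding the threshold gives 'low', otherwise 'high'."""
    return "low" if total_object_area(boxes) > threshold_area else "high"


def choose_encoding_mode(label: str) -> str:
    """Map each label to a different encoding mode (placeholder names)."""
    return {"low": "mode_a", "high": "mode_b"}[label]


# Two selected frames, each with one object of interest.
frames = [[(0, 0, 10, 10)], [(0, 0, 2, 2)]]
labels = [classify_frame(boxes, threshold_area=50) for boxes in frames]
modes = [choose_encoding_mode(label) for label in labels]
```

The only requirement the sketch enforces is that the two classes map to different modes; what each mode actually does is left open, as in the abstract.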
Abstract:
A method and associated apparatus for using a trajectory-based technique to detect a moving object in a video sequence that incorporates human interaction through a user interface. The method comprises steps of identifying and evaluating sets of connected components in a video frame, filtering the list of connected components by comparing features of the connected components to predetermined criteria, identifying candidate trajectories across multiple frames, evaluating the candidate trajectories to determine a selected trajectory, eliminating incorrect trajectories through use of the interface, and processing images in said video sequence responsive to the evaluating and eliminating steps.
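A minimal sketch of how user feedback might combine with automatic evaluation follows. It assumes that candidate trajectories are stored by name, that the user interface yields a set of rejected names, and that a trajectory is scored by its length; all three are illustrative assumptions, since the abstract does not specify the scoring rule or the interface.

```python
from typing import Dict, List, Optional, Set, Tuple

# Hypothetical trajectory point: (frame_index, x, y).
Point = Tuple[int, int, int]


def trajectory_score(trajectory: List[Point]) -> int:
    """Placeholder evaluation: longer trajectories score higher."""
    return len(trajectory)


def select_trajectory(
    candidates: Dict[str, List[Point]], rejected: Set[str]
) -> Optional[str]:
    """Drop trajectories the user marked as incorrect, then return the
    name of the best remaining one, or None if nothing is left."""
    remaining = {
        name: traj for name, traj in candidates.items() if name not in rejected
    }
    if not remaining:
        return None
    return max(remaining, key=lambda name: trajectory_score(remaining[name]))


candidates = {
    "a": [(0, 1, 1), (1, 2, 2), (2, 3, 3)],
    "b": [(0, 9, 9), (1, 8, 8)],
}
```

With no rejections the sketch picks `"a"`; once the user rejects `"a"`, it falls back to `"b"`.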
Abstract:
The present invention concerns a method and associated apparatus for using a trajectory-based technique to detect a moving object in a video sequence, such as the ball in a soccer game. In one embodiment, the method comprises steps of identifying and evaluating sets of connected components in a video frame, filtering the list of connected components by comparing features of the connected components to predetermined criteria, identifying candidate trajectories across multiple frames, evaluating the candidate trajectories to determine a selected trajectory, and processing images in the video sequence based at least in part upon the selected trajectory.
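The first two steps, identifying connected components and filtering them against predetermined criteria, can be sketched as below. The sketch assumes a binary foreground mask given as nested lists, 4-connectivity, and area and aspect-ratio limits as the filtering criteria; these choices and the function names are assumptions made for illustration, not details taken from the abstract.

```python
from collections import deque
from typing import List, Tuple

Pixel = Tuple[int, int]  # (row, column)


def connected_components(mask: List[List[int]]) -> List[List[Pixel]]:
    """Group nonzero pixels into 4-connected components using BFS."""
    rows = len(mask)
    cols = len(mask[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c] or seen[r][c]:
                continue
            seen[r][c] = True
            queue = deque([(r, c)])
            pixels = []
            while queue:
                y, x = queue.popleft()
                pixels.append((y, x))
                for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                    ny, nx = y + dy, x + dx
                    if (0 <= ny < rows and 0 <= nx < cols
                            and mask[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            components.append(pixels)
    return components


def filter_candidates(components, min_area, max_area, max_aspect):
    """Keep components whose pixel count and bounding-box aspect ratio
    fall within the given (hypothetical) limits."""
    kept = []
    for pixels in components:
        ys = [p[0] for p in pixels]
        xs = [p[1] for p in pixels]
        height = max(ys) - min(ys) + 1
        width = max(xs) - min(xs) + 1
        aspect = max(width, height) / min(width, height)
        if min_area <= len(pixels) <= max_area and aspect <= max_aspect:
            kept.append(pixels)
    return kept


mask = [
    [1, 1, 0, 0, 0, 0, 0],
    [1, 1, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0],
    [0, 0, 1, 1, 1, 1, 1],
]
components = connected_components(mask)
ball_like = filter_candidates(components, min_area=2, max_area=6, max_aspect=2.0)
```

In this example the compact 2x2 blob survives the filter, while the elongated line is discarded because its aspect ratio is too large for a ball-like object.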
Abstract:
Methods and apparatus are provided for sampling-based super resolution video encoding and decoding. The encoding method receives high resolution pictures and generates low resolution pictures and metadata therefrom, the metadata for guiding post-decoding post-processing of the low resolution pictures and the metadata; and then encodes the low resolution pictures and the metadata using at least one encoder. The corresponding decoding method receives a bitstream and decodes low resolution pictures and metadata therefrom using a decoder; and then reconstructs high resolution pictures respectively corresponding to the low resolution pictures using the low resolution pictures and the metadata.
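The idea of pairing low resolution pictures with guiding metadata can be illustrated with a deliberately simplified sketch. It assumes a static scene, a downsampling factor of 2, and metadata consisting only of the sampling phase used for each picture; the actual encoder and decoder stages are omitted. These simplifications and all function names are assumptions, not the method described in the abstract.

```python
from typing import List, Tuple

Picture = List[List[int]]
Phase = Tuple[int, int]  # (row offset, column offset) of the sampling grid


def downsample(picture: Picture, phase: Phase, factor: int = 2) -> Picture:
    """Keep every `factor`-th sample, starting at the given phase."""
    dy, dx = phase
    return [row[dx::factor] for row in picture[dy::factor]]


def make_low_res(high_res_pictures: List[Picture], factor: int = 2):
    """Produce low resolution pictures plus per-picture metadata (the phase)."""
    phases = [(dy, dx) for dy in range(factor) for dx in range(factor)]
    low_res, metadata = [], []
    for index, picture in enumerate(high_res_pictures):
        phase = phases[index % len(phases)]
        low_res.append(downsample(picture, phase, factor))
        metadata.append(phase)
    return low_res, metadata


def reconstruct(low_res, metadata, height, width, factor: int = 2) -> Picture:
    """Place each low resolution sample back on the high resolution grid.
    Positions never sampled stay None."""
    out = [[None] * width for _ in range(height)]
    for picture, (dy, dx) in zip(low_res, metadata):
        for r, row in enumerate(picture):
            for c, value in enumerate(row):
                out[dy + r * factor][dx + c * factor] = value
    return out


# Four identical 4x4 pictures stand in for a static scene.
high_res = [[r * 4 + c for c in range(4)] for r in range(4)]
low_res, metadata = make_low_res([high_res] * 4)
restored = reconstruct(low_res, metadata, height=4, width=4)
```

Because the four sampling phases together cover the whole grid, the static scene is recovered exactly here. With real motion, the metadata would need to carry more than a phase.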
Abstract:
Methods and apparatus are provided for reducing vector quantization error through patch shifting. A method generates, from an input video sequence, one or more high resolution replacement patches, the one or more high resolution replacement patches for replacing one or more low resolution patches during a reconstruction of the input video sequence. This generating step generates the one or more high resolution replacement patches using data corresponding to a patch spatial shifting process, the patch spatial shifting process for reducing jittery artifacts caused by a motion-induced vector quantization error in the one or more high resolution replacement patches, the data for at least deriving a patch size of the one or more high resolution replacement patches such that the one or more high resolution replacement patches are generated to have the patch size greater than a patch size of the one or more low resolution patches in order to be suitable for use in the patch spatial shifting process.
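The reason a replacement patch needs to be larger than the patch it replaces can be shown with a small sketch. It assumes the extra size is a fixed margin on every side equal to the largest shift allowed, and that frames are nested lists; the margin rule and the function names are hypothetical, since the abstract only states that the replacement patch size is greater.

```python
from typing import List, Tuple

Frame = List[List[int]]


def extract_enlarged_patch(frame: Frame, top: int, left: int,
                           size: int, margin: int) -> Frame:
    """Cut a (size + 2*margin) square around the nominal patch position.
    The patch plus margin must lie fully inside the frame."""
    if (top - margin < 0 or left - margin < 0
            or top + size + margin > len(frame)
            or left + size + margin > len(frame[0])):
        raise ValueError("enlarged patch does not fit inside the frame")
    return [row[left - margin:left + size + margin]
            for row in frame[top - margin:top + size + margin]]


def crop_shifted(enlarged: Frame, size: int, margin: int,
                 shift: Tuple[int, int]) -> Frame:
    """Crop a size x size patch from the enlarged patch at a spatial shift
    (dy, dx). The margin bounds how far the patch can be shifted."""
    dy, dx = shift
    if abs(dy) > margin or abs(dx) > margin:
        raise ValueError("shift exceeds the available margin")
    y0, x0 = margin + dy, margin + dx
    return [row[x0:x0 + size] for row in enlarged[y0:y0 + size]]


# 6x6 frame whose values encode their position as row*10 + column.
frame = [[r * 10 + c for c in range(6)] for r in range(6)]
enlarged = extract_enlarged_patch(frame, top=2, left=2, size=2, margin=1)
unshifted = crop_shifted(enlarged, size=2, margin=1, shift=(0, 0))
shifted = crop_shifted(enlarged, size=2, margin=1, shift=(1, -1))
```

Without the margin, any nonzero shift would read outside the stored patch, which is why the sketch rejects shifts larger than the margin.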
Abstract:
Methods and apparatuses for data pruning for video compression using example-based super resolution are provided. A method and apparatus for encoding is provided in which patches of video are extracted from input video, grouped together using a clustering method, and representative patches are packed into patch frames. The original video is downsized and sent either along with, or in addition to, the patch frames. At a decoder, the method and apparatus provided extract patches from the patch frames and create a patch library. The regular video frames are upsized and the low resolution patches are replaced by patches from the patch library by searching the library using the patches in the decoded regular frames as keywords. If there are no appropriate patches, no replacement is made. A post processing procedure is used to enhance the spatiotemporal smoothness of the recovered video.
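The decoder-side lookup, including the rule that no replacement is made when no appropriate patch exists, can be sketched as follows. The sketch assumes the library stores pairs of a low resolution key and a high resolution patch, both flattened to lists, and that "appropriate" means a squared distance within a chosen limit; the key construction, the distance measure, and the limit are assumptions for illustration.

```python
from typing import List, Optional, Sequence, Tuple

Patch = List[int]  # patch pixels flattened in row-major order
LibraryEntry = Tuple[Patch, Patch]  # (low resolution key, high resolution patch)


def squared_distance(a: Sequence[int], b: Sequence[int]) -> int:
    """Sum of squared pixel differences between two equally sized patches."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def find_replacement(query: Patch, library: List[LibraryEntry],
                     max_distance: int) -> Optional[Patch]:
    """Return the high resolution patch whose key is closest to the query,
    or None when even the best match is farther than max_distance."""
    best_patch = None
    best_distance = None
    for key, high_res_patch in library:
        distance = squared_distance(query, key)
        if best_distance is None or distance < best_distance:
            best_distance = distance
            best_patch = high_res_patch
    if best_distance is None or best_distance > max_distance:
        return None  # no appropriate patch: leave the upsized patch as-is
    return best_patch


library = [
    ([10, 10, 10, 10], [9, 11, 10, 12]),
    ([200, 200, 200, 200], [198, 205, 199, 202]),
]
match = find_replacement([11, 10, 9, 10], library, max_distance=20)
no_match = find_replacement([100, 100, 100, 100], library, max_distance=20)
```

A linear scan is used for clarity; a larger patch library would likely need an indexed or approximate nearest-neighbor search.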