Abstract:
A method for processing a video sequence having a plurality of frames includes the steps of: extracting features from each of the frames, determining correspondences between the extracted features from two of the frames, estimating motion in the video sequence based on the determined correspondences, generating a background mosaic for the video sequence based on the estimated motion, and performing foreground-background segmentation on each of the frames based on the background mosaic.
Abstract:
An implementation provides a method including forming a metric surface in a particle-based framework for tracking an object, the metric surface relating to a particular image in a sequence of digital images. Multiple hypotheses are formed of a location of the object in the particular image, based on the metric surface. The location of the object is estimated based on probabilities of the multiple hypotheses.
Abstract:
The invention is made in the field of coding of images of high dynamic range.The invention is based on the concept of Frame Compatible format. The idea is to transport, in a frame, down-sampled LDR content together with additional information allowing reconstructing HDR content from the LDR content.Thus, it is proposed a method of encoding an HDR image of high dynamic range according to claim 1. Said method comprises down-sampling (DWN) an LDR image and additional data, the LDR image providing a lower dynamic range depiction of the HDR image content and the additional data allowing for reconstructing the HDR image from the LDR image.
Abstract:
A tone mapping graphical user interface is provided that allows a video engineer to process a video using a set of tools for changing high dynamic range data into lower dynamic range data.
Abstract:
A method of segmenting regions of an image wherein a number of partitions are determined based on a range of an image histogram in a logarithmic luminance domain. Regions are defined by the partitions. A mean value of each region is calculated by K-means clustering wherein the clustering is initialized, data is assigned and centroids are updated. Anchor points are determined based on the centroids and a weight of each pixel is computed based on the anchor points.
Abstract:
A method and associated apparatus for using a trajectory-based technique to detect a moving object in a video sequence at incorporates human interaction through a user interface. The method comprises steps of identifying and evaluating sets of connected components in a video frame, filtering the list of connected components by comparing features of the connected components to predetermined criteria, identifying candidate trajectories across multiple frames, evaluating the candidate trajectories to determine a selected trajectory, eliminating incorrect trajectories through use of the interface and processing images in said video sequence responsive to the evaluating and eliminating steps.
Abstract:
A method and apparatus are provided for reversible, polynomial based image scaling. The apparatus includes a video scaler for performing image scaling from a first base resolution image to a higher resolution image, and from the higher resolution image to a second base resolution image. The first and the second base resolution images are equal on a pixel-by-pixel basis for an entirety of the first and the second base resolution images. A scaling function used for the image scaling is based on a polynomial function having two or more degrees.
Abstract:
A method and apparatus are disclosed and described for providing a synchronized workstation with two-dimensional and three-dimensional outputs. The apparatus includes a video decoder (315) for decoding picture data. The video decoder includes a data manager (320) for receiving video production commands and managing a video playback of the picture data in at least one of a two-dimensional video output mode and a three-dimensional video output mode responsive to the video production commands. The two-dimensional video output mode and the three-dimensional video output mode are capable of being used independently and simultaneously.
Abstract:
A method and apparatus are disclosed and described for providing bit rate configuration for multi-view video coding. In the video encoder, the method includes encoding image data for at least one picture for at least two joint views of multi-view video content, the at least two joint views including a base view and at least one dependent view. The bit rate configuration for encoding the image data is determined to include an average bit rate and a maximum bit rate for the base view and the average bit rate and the maximum bit rate for the at least two joint views (235, 215, 220).
Abstract:
Film grain is simulated in an output image using pre-established blocks of film grain from a pool of pre-established blocks. Successive film grain blocks are selected by matching the average intensity of a block from the pool to the average intensity of a successive one of a set of M×N pixels in an incoming image. Once all of the successive pixel blocks from the image are matched to selected film grain blocks, the selected film grain blocks are “mosaiced”, that is composited into a larger image mapped to the incoming image.