Abstract:
Encoding and decoding architectures for 3D video delivery are described, such as 2D compatible 3D video delivery and frame compatible 3D video delivery. The architectures include pre-processing stages to pre-process the output of a base layer video encoder and/or decoder and input the pre-processed output into an enhancement layer video encoder and/or decoder of one or more enhancement layers. Multiplexing methods of how to combine the base and enhancement layer videos are also described.
Abstract:
There are provided a method and apparatus for adaptive Group of Pictures structure selection. The apparatus includes an encoder for encoding a video sequence using a Group of Pictures structure by performing, for each Group of Pictures for the video sequence, picture coding order selection, picture type selection, and reference picture selection. The selections are based upon a Group of Pictures length.
Abstract:
Enhancement methods for sampled and multiplexed image and video data are described. Each component picture is separately processed either after de-multiplexing or on the fly. Processing and de-multiplexing can be combined in a single joint step. The methods apply to both encoding and decoding system and include applications to scalable video coding systems.
Abstract:
A device includes a coder or a codec configured for interleaved image data utilizing diamond shaped blocks for motion estimation and/or motion compensation and utilizing square or orthogonal transforms of residual data. In various embodiments, the decoder may be configured, among others, to perform de-blocking on edges of the diamond shaped blocks and/or data padding at boundaries of the image data. Additionally a method is proposed in which at least one of a transform and quantization process to be applied to de-multiplexed data is modified. One application is to combine left and right stereoscopic images, interleaved in a checkerboard manner.
Abstract:
Full resolution graphic overlays (e.g., graphics, menus, arrows, buttons, captions, banners, picture in picture information) and subtitles in frame compatible 3D delivery for a scalable system are described.
Abstract:
Improved video coding is described to encode video data within a sequence of video frames. To this end, at least a portion of a reference frame is encoded to include motion information associated with the portion of the reference frame. At least a portion of a predictable frame that includes video data predictively correlated to said portion of said reference frame is defined based on the motion information. At least said portion of the predictable frame is encoded without including corresponding motion information and including mode identifying data. The mode identifying data indicate that the encoded portion of the predictable frame can be directly derived using at least the motion information associated with the portion of the reference frame.
Abstract:
There are provided a method and apparatus for reusing available motion information as a motion estimation predictor for video encoding. The apparatus includes an encoder for encoding an image block by determining a motion estimation predictor for the image block using motion information previously generated from an element other than the encoder, and using the motion estimation predictor in a motion estimation process to generate a motion vector for the image block. The motion estimation predictor is used in place of at least one predictor otherwise used in the motion estimation process. The at least one predictor is any of a search window predictor, a temporal predictor, and a block type predictor.
Abstract:
Error resilient rate distortion optimization (ERRDO) is used for transmitting high quality images and video over constrained bandwidth networks, e.g., in streaming applications. Transmitting high quality images and video by reducing computational complexity is described.
Abstract:
A scalable frame compatible three-dimensional video encoding and decoding system for use in a multiview video coding system is described. A base layer includes low resolution information from a plurality of views while one or more enhancement layers may include high resolution information for at least one of the plurality of views. Interpolation filters are derived based on a combination of low resolution information and high resolution information are discussed. For a given view, sending high resolution information at some times and low resolution information at other times are also described.
Abstract:
There are provided a method and apparatus for adaptive weight selection for motion compensated prediction. The apparatus includes an encoder for encoding a picture by deriving a set of weighting parameters, selecting at least one weighting parameter in the set based upon a selection criteria, and applying the selected at least one weighting parameter to a reference picture used to encode the picture.