摘要:
A method and apparatus for processing descriptions of audiovisual content are described. According to one embodiment, a description of audiovisual content is created, and information pertaining to reuse of the description of audiovisual content is defined. Further, the description of audiovisual content and the reuse information are stored in a repository of descriptive data to enable subsequent reuse of this description.
摘要:
A system and method for allowing synchronized execution of a set of commands is provided. A first system contains a content description tree that may be dynamically updated by a set of commands from a second system. In the set of commands, commands may be executed in parallel or in sequence. If the group of commands are tagged to be executed in parallel, parallel execution of commands implies that there are no dependencies between the execution of the commands. As the result, the decoder will have more flexibility to schedule the execution of these commands. If the group of commands are tagged to be executed sequentially, then the order of execution matters. Therefore, in order to synchronize the execution of the group of commands in a set, a parallel or sequential tag is included identifying whether this group of commands should be executed in parallel or in sequence.
摘要:
An apparatus or method of efficiently coding high resolution video using texture analysis and synthesis techniques in a scalable video coding framework. A high-resolution video signal is spatially downsampled and encoded into a base-layer. Texture and structure information are extracted from the downsampled signal and base-layer for use by a texture synthesizer. The texture synthesizer is structurally and texturally aware utilizing edge information from a base-layer of the synthesizer to improve synthesis. After synthesis, a video quality assessor directs enhancement-layer coding of unacceptably synthesized areas by alternate (non-textural) coding means, such as conventional AVC or MPEG-2 coding. In one mode, the quality assessor iteratively improves synthesis of certain blocks to make them acceptable for enhancement-layer coding. The apparatus or method outputs a bit stream, or bit streams, containing both the coded base-layer and the enhancement-layer.
摘要:
An encoding system includes a video coding layer (VCL) to generate slices when encoding multimedia data, a generic adaptation layer (GAL) to create, from the slices, a set of GAL units having a format that is generic to various transport systems, and a network adaptation layer (NAL) associated with a specific transport system to map the set of GAL units to the format of the specific transport system.
摘要:
Switch sample metadata defining switch sample sets associated with multimedia data is created. Each switch sample set includes samples that have identical decoding values. Further, a file associated with the multimedia data is formed. This file includes the switch sample metadata, as well as other information pertaining to the multimedia data.
摘要:
Implementations generally relate to pre-charge phase data compression. In some implementations, a method includes computing prediction values for image data, where the image data is pre-charge phase data. The method also includes computing residual data based on the prediction values. The method also includes quantizing the residual data. The method also includes entropy encoding the quantized residual data. The method also includes refining an inverse quantized residual data based on one or more of the residual data and a number of left-over bit-budget after entropy encoding.
摘要:
Motion Compensated Prediction (MCP) has been a key factor in most advanced video compression schemes. For further reduction in the residual signal energy in B-frames, bidirectional prediction where two motion-compensated signals are superimposed has also been utilized in most prior video coding standards such as MPEG-2 or MPEG-4/AVC. Syntax changes and appropriate motion vector prediction that allows efficient use of multi-parameter MCP is described. The prediction signal is constructed by linearly combining the motion-compensated signals from each parameter (or motion vector).
摘要:
A method of estimating motion by estimating motion parameters of a complex motion model using reconstructed neighboring pixels of the current block and then applying the estimated motion parameters to the coordinates of the current block to determine motion compensated in a corresponding reference block. The parameters are not transmitted to a decoder and are derived at the decoder side. The encoder only sends a 1-bit control-flag to indicate whether the derived parameters should be used.
摘要:
An adaptive loop filter utilizing separable filters within a quadtree-based adaptive loop filter (QALF). The filters for at least the P and B pictures are replaced with a separable filter to provide Separable QALF (SQALF). In a preferred embodiment the filter comprises a Wiener filter as a best fit approximation of the non-separable filter. In response to decoding with the SQALF approach, computational complexity in the decoder is reduced without lowering perceived visual quality. Estimation of the separable filter is obtained in the encoder by iteratively minimizing the sum of squares error with the separable filter. The SQALF filtering is described within a video decoder.
摘要:
Spatial domain directional intra prediction has been shown to be very effective to remove the correlation between the pixels in the current block and reconstructed neighbors. In AVC, 8 directional prediction modes (plus the DC prediction mode) are defined. The prediction mode number is signaled to the decoder using a simple predictive coding method. The previous intra prediction methods have two major disadvantages: lack of precision for arbitrary directional patterns and accuracy lack of accuracy to exploit geometric dependency between blocks. To address these issues, a new method accurately predicts the intra directions from reconstructed neighboring pixels and differentially encode the intra directions. This allows a more precise directional prediction without the significant increase in the cost for transmitting the side information.