Abstract:
A method and apparatus for organizing data pertaining to audiovisual content are described. According to one embodiment, an exemplary method for organizing data pertaining to audiovisual content includes defining at least one descriptive list for a descriptive portion of the data pertaining to audiovisual content, defining at least one accessing list for an accessing portion of the data pertaining to audiovisual content, and generating a matrix that connects the accessing list to the descriptive list.
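The relationship between the two lists can be pictured with a small sketch. It assumes the matrix is a boolean table with one row per accessing entry and one column per descriptive entry; the class name `ContentIndex`, its methods, and the sample entries are hypothetical and are not taken from the abstract.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ContentIndex:
    """Hypothetical container: two lists plus a matrix linking them."""
    descriptive: List[str]
    accessing: List[str]
    links: List[List[bool]] = field(init=False)

    def __post_init__(self):
        # One row per accessing entry, one column per descriptive entry.
        self.links = [[False] * len(self.descriptive) for _ in self.accessing]

    def connect(self, access_item: str, descr_item: str) -> None:
        row = self.accessing.index(access_item)
        col = self.descriptive.index(descr_item)
        self.links[row][col] = True

    def describe(self, access_item: str) -> List[str]:
        """Return the descriptive entries linked to one accessing entry."""
        row = self.links[self.accessing.index(access_item)]
        return [d for d, linked in zip(self.descriptive, row) if linked]


index = ContentIndex(["title", "genre", "cast"], ["clip_001", "clip_002"])
index.connect("clip_001", "title")
index.connect("clip_001", "cast")
print(index.describe("clip_001"))
```

A dense boolean matrix is chosen here only for readability; a sparse mapping would serve equally well when most entries are unlinked.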
Abstract:
Adaptive joint source channel coding associates multiple predictors with a reference data unit, such as a macroblock or frame of video data. An encoder determines a sub-codebook in which each of the selected multiple predictors decodes to the reference data unit. An identifier for the sub-codebook is transmitted through a channel to a decoder for subsequent decoding of the reference data unit. The reference data unit itself does not need to be sent. The multiple predictors are contained within a decoding region and the identifier for the sub-codebook specifies the decoding region. The decoder uses the identified sub-codebook and one of the predictors to decode the reference data unit. If none of the original predictors are correctly received, different types of error handling are employed based on the type of channel.
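The idea of sending only a sub-codebook identifier can be illustrated with a toy scalar example. This is a minimal sketch, assuming integer samples and sub-codebooks formed as cosets modulo a fixed spacing; the function names and the modulo construction are illustrative assumptions, not the scheme the abstract claims.

```python
def encode(sample: int, num_cosets: int) -> int:
    """Return the identifier of the coset (sub-codebook) containing the sample."""
    return sample % num_cosets


def decode(coset_id: int, predictor: int, num_cosets: int) -> int:
    """Return the member of the identified coset that is nearest to the predictor."""
    # Largest coset member that does not exceed the predictor.
    below = predictor - ((predictor - coset_id) % num_cosets)
    above = below + num_cosets
    return min((below, above), key=lambda c: abs(c - predictor))


sample = 137
coset_id = encode(sample, num_cosets=8)   # only this identifier is transmitted
for predictor in (134, 136, 139):         # any one of these may reach the decoder
    assert decode(coset_id, predictor, num_cosets=8) == sample
```

In this toy setting the decoding region is the set of predictors closer to the sample than half the coset spacing. Any predictor inside that region recovers the same sample, which is why receiving a single one of them suffices.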
Abstract:
A joint optimization iterative algorithm determines optimized mode pairs. Each mode pair includes an intra-predictor and a transform pair that are iteratively modified to determine an optimized intra-predictor and an optimized transform that forms the optimized mode pair. A set of training videos and a set of quantization parameters (QPs) are used as the base data for determining the optimized mode pairs. Each video includes a plurality of pixel blocks, herein referred to as blocks. Block statistics associated with each mode pair are accumulated by separately encoding each block using each mode pair, and selecting the best mode pair for each block according to a measured characteristic of each encoding. The accumulated block statistics are used to modify the intra-predictor and the transform within each mode pair.
Abstract:
The invention is an apparatus and method for estimating an optimized sub-pixel interpolation filter through iterative estimation, as needed for sub-pixel motion compensation and motion estimation in a video codec, in order to improve coding efficiency. Multiple iterations of adaptive interpolation filter estimation are performed, including more than one iteration based on sub-pixel motion vectors. In tests of the inventive apparatus and method on various video segments, average bit rate reductions of approximately 5% were observed.
Abstract:
Quantization (scaling) matrices for HEVC standards using an HVS-based mathematical model and data analysis are described herein. A quadratic parameter model-based quantization matrix design is also included.
Abstract:
The currently existing ISO/AVC file format is modified by providing extensions to store and access video content currently being defined by the SVC standard. Specifically, extensions to the AVC file format are made to provide a new SVC file format that enables the storage and access of scalable video data. The scalable video data is stored as a single track within a media data section of the SVC file format. New extensions are defined for description entries and boxes within a metadata section of the SVC file format. These extensions provide means for extracting sub-streams or layers from the single track of scalable video data stored in the media data section.
Abstract:
Apparatus and methods are described for coding images using geometric vector quantization (GVQ) with an over-complete dictionary, which produces a sparse vector of coefficients containing large runs of zeros. The sparse encoding is particularly well suited for use with run-length entropy coding techniques. Image blocks are sparse coded using GVQ, with the vector of coefficients converted to RUN-LENGTH symbols and binarized into a set of binary symbols. At least a portion of the binary symbols are used as contexts that can be selected when performing binary arithmetic coding of the binary coded RUN and LENGTH data, generating a bit stream that contains the encoded image and provides enhanced compression.
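The conversion from a sparse coefficient vector to run-based symbols can be sketched as follows. The abstract does not define the RUN and LENGTH symbols precisely, so this sketch assumes a generic pairing of a zero-run count with the nonzero value that ends the run; the function names are hypothetical, and the binarization and arithmetic-coding stages are omitted.

```python
def to_run_symbols(coeffs):
    """Convert a sparse vector into (zero_run, value) pairs plus a trailing zero count."""
    pairs, run = [], 0
    for c in coeffs:
        if c == 0:
            run += 1                 # extend the current run of zeros
        else:
            pairs.append((run, c))   # a nonzero value closes the run
            run = 0
    return pairs, run                # 'run' now counts the trailing zeros


def from_run_symbols(pairs, trailing_zeros):
    """Rebuild the original vector from the run symbols."""
    out = []
    for run, value in pairs:
        out.extend([0] * run)
        out.append(value)
    out.extend([0] * trailing_zeros)
    return out


coeffs = [0, 0, 3, 0, 0, 0, -1, 0]
pairs, trailing = to_run_symbols(coeffs)
assert from_run_symbols(pairs, trailing) == coeffs
```

The sparser the vector, the fewer pairs are produced, which is the property that makes run-based symbols a good match for sparse coding.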
Abstract:
The encoding and distribution of schema for multimedia content descriptions are described. Before the schema is sent to a system for use, it is encoded by assigning a first token for each component in the schema and a second token for each attribute of each component. In one aspect, a signifier is sent first to signal that the schema is to be sent in a particular format. In one aspect, the schema is sent as a whole entity first. In another aspect, some of the components of the schema are sent first, followed by the schema that refers to these components. In another aspect, the schema is sent first, followed by a set of component types. When the decoder receives the signifier and encoded schema, it looks up values for the first and second tokens in a lookup table for all components and attributes to decode the schema.
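The two-level token assignment and the table lookup on the decoder side can be sketched in a few lines. This assumes the schema is a plain mapping from component names to attribute lists and that tokens are small integers; the function names, the integer tokens, and the sample schema are all hypothetical, and the signifier and the different sending orders are left out.

```python
def build_tables(schema):
    """Assign a first token per component and a second token per attribute."""
    component_tokens, attribute_tokens = {}, {}
    for c_tok, (component, attrs) in enumerate(schema.items()):
        component_tokens[component] = c_tok
        for a_tok, attr in enumerate(attrs):
            attribute_tokens[(component, attr)] = a_tok
    return component_tokens, attribute_tokens


def encode_schema(schema, component_tokens, attribute_tokens):
    """Replace every name in the schema with its token."""
    return [
        (component_tokens[c], [attribute_tokens[(c, a)] for a in attrs])
        for c, attrs in schema.items()
    ]


def decode_schema(encoded, component_tokens, attribute_tokens):
    """Invert the tables and look up each token to recover the names."""
    comp_lookup = {tok: name for name, tok in component_tokens.items()}
    attr_lookup = {(c, tok): a for (c, a), tok in attribute_tokens.items()}
    decoded = {}
    for c_tok, a_toks in encoded:
        name = comp_lookup[c_tok]
        decoded[name] = [attr_lookup[(name, t)] for t in a_toks]
    return decoded


schema = {"VideoSegment": ["id", "duration"], "Creator": ["name"]}
tables = build_tables(schema)
encoded = encode_schema(schema, *tables)
assert decode_schema(encoded, *tables) == schema
```

Attribute tokens are scoped to their component here, so the same small integer can be reused across components without ambiguity.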
Abstract:
An encoding system includes a video coding layer (VCL) to generate slices when encoding multimedia data, a generic adaptation layer (GAL) to create, from the slices, a set of GAL units having a format that is generic to various transport systems, and a network adaptation layer (NAL) associated with a specific transport system to map the set of GAL units to the format of the specific transport system.
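The three-layer split can be pictured as a generic intermediate unit with one adapter per transport. This is a rough sketch under stated assumptions: the `GALUnit` type, the two adapter functions, the start-code prefix, and the fixed-size packet split are all hypothetical stand-ins, since the abstract does not specify any unit format or transport mapping.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class GALUnit:
    """Hypothetical transport-independent unit produced from one slice."""
    unit_type: str
    payload: bytes


def gal_from_slices(slices: List[bytes]) -> List[GALUnit]:
    """Generic adaptation step: wrap each VCL slice in a generic unit."""
    return [GALUnit("slice", s) for s in slices]


def nal_byte_stream(units: List[GALUnit]) -> bytes:
    """Hypothetical adapter for a byte-stream transport: prefix each unit with a start code."""
    return b"".join(b"\x00\x00\x01" + u.payload for u in units)


def nal_packets(units: List[GALUnit], max_size: int) -> List[bytes]:
    """Hypothetical adapter for a packet transport: split payloads into bounded chunks."""
    return [
        u.payload[i:i + max_size]
        for u in units
        for i in range(0, len(u.payload), max_size)
    ]


units = gal_from_slices([b"abcde", b"fg"])   # stand-ins for encoded slices
stream = nal_byte_stream(units)              # same units, byte-stream format
packets = nal_packets(units, max_size=2)     # same units, packet format
```

The point of the sketch is that both adapters consume the same list of generic units, so supporting another transport means adding an adapter rather than changing the encoder.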
Abstract:
Method and apparatus for hierarchical bi-directional intra-prediction in a video encoder are described. In some examples, initial reference blocks are selected in an image. Sets of blocks in the image are predicted successively over a plurality of stages based on a respective plurality of templates, each of the plurality of templates including reference blocks selected from at least one of: the initial reference blocks; or previously encoded and reconstructed blocks of the image. The image is encoded in the video encoder using the initial reference blocks and the sets of blocks as predicted.