摘要:
A method of coarse representation of a visual object's shape for search/query/filtering applications uses a binding box that fully encompasses the object of interest within the image to extract a feature vector. Once the feature vector is available, matching based on specific queries may be performed using a search engine to compare the query number to an appropriate element of the feature vector, performing sorting to pick the best matches.
摘要:
A system and method of coding (encoding and/or decoding) video content to extend file formats for storage. The system and method utilizes the concept to define additional sample group description entries. By way of example the method can comprise the steps of: (1) receiving a file with encoded media data as a scalable video codec stream; (2) extracting information identifying the various spatial resolutions, temporal resolutions, quality resolutions or combinations of spatio-temporal-quality resolutions from the media data; (3) generating new description entries and dependency grouping box; (4) populating boxes with extracted metadata; and (5) incorporating metadata into a file associated with the media data using a specific media file format.
摘要:
A method and apparatus for organizing data pertaining to audiovisual content are described. According to one embodiment, an exemplary method for organizing data pertaining to audiovisual content includes defining at least one descriptive list for a descriptive portion of the data pertaining to audiovisual content, defining at least one accessing list for an accessing portion of the data pertaining to audiovisual content, and generating a matrix that connects the accessing list to the descriptive list.
摘要:
Adaptive joint source channel coding associates multiple predictors with a reference data unit, such as a macroblock or frame of video data. An encoder determines a sub-codebook in which each of the selected multiple predictors decodes to the reference data unit. An identifier for the sub-codebook is transmitted through a channel to a decoder for subsequent decoding of the reference data unit. The reference data unit itself does not need to be sent. The multiple predictors are contained within a decoding region and the identifier for the sub-codebook specifies the decoding region. The decoder uses the identified sub-codebook and one of the predictors to decode the reference data unit. If none of the original predictors are correctly received, different types of error handling are employed based on the type of channel.
摘要:
A method of modifying a group of pictures (GOP) structure in an MPEG video signal from a low-delay mode bitstream having I and P pictures to a non-low-delay bitstream having I, P and B pictures uses the motion vectors from the low-delay mode bitstream to derive the motion vectors for the non-low-delay mode bitstream. Motion vectors for anchor pictures for the non-low-delay mode bitstream are converted from the motion vectors for the corresponding pictures in the low-delay mode bitstream. Motion vectors for the B pictures in the non-low-delay mode bitstream are converted from the motion vectors for the corresponding P pictures in the low-delay mode bitstream. The converted motion vectors for the non-low-delay mode bitstream are used in recoding an uncompressed video signal derived from the low-delay mode bitstream to produce the non-low-delay mode bitstream.
摘要:
A joint optimization iterative algorithm determines optimized mode pairs. Each mode pair includes an intra-predictor and a transform pair that are iteratively modified to determine an optimized intra-predictor and an optimized transform that forms the optimized mode pair. A set of training videos and a set of quantization parameters (QPs) are used as the base data for determining the optimized mode pairs. Each video includes a plurality of pixel blocks, herein referred to as blocks. Block statistics associated with each mode pair are accumulated by separately encoding each block using each mode pair, and selecting the best mode pair for each block according to a measured characteristic of each encoding. The accumulated block statistics are used to modify the intra-predictor and the transform within each mode pair.
摘要:
The invention is an apparatus and method for estimating an optimized sub-pixel interpolation filter using iterative estimations as needed for sub-pixel motion compensation and motion estimation in a video codec for improving coding efficiency. Multiple iterations of adaptive interpolation filter estimation are performed including more than one iteration based on sub-pixel motion vectors. During testing of the inventive apparatus and method on various video segments, average bit rate reductions were exhibited of approximately 5%.
摘要:
Quantization (scaling) matrices for HEVC standards using an HVS-based mathematical model and data analysis are described herein. A quadratic parameter model-based quantization matrix design is also included.
摘要:
The currently existing ISO/AVC file format is modified by providing extensions to store and access video content currently being defined by the SVC standard. Specifically, extensions to the AVC file format are made to provide a new SVC file format that enables the storage and access of scalable video data. The scalable video data is stored as a single track within a media data section of the SVC file format. New extensions are defined for description entries and boxes within a metadata section of the SVC file format. These extensions provide means for extracting sub-streams or layers from the single track of scalable video data stored in the media data section.
摘要:
Apparatus and methods for coding images geometric vector quantization (GVQ) having an over-complete dictionary which produces a sparse vector of coefficients as it contains large runs of zeros. The sparse encoding is particularly well suited for use with run-length entropy coding techniques. Image blocks are sparse coded using GVQ, with the vector of coefficients converted to RUN-LENGTH symbols, and binarized into a set of binary symbols. At least a portion of the binary symbols are used as contexts which can be selected when performing binary arithmetic coding of the binary coded RUN and LENGTH data to generate a bit stream containing the encoded image that provides enhanced compression.