Abstract:
The invention provides a standardized interface facility for MPEG-4 authoring, bitstream manipulation, editing and interpretation, with associated tools and interfaces, resulting in coded bitstreams that are easier to test, check and debug while conforming to the MPEG-4 standard. The specified interfaces can also facilitate graceful degradation in the face of decreased resources by allowing editing of bitstreams. They can also allow creation of decodable bitstreams in response to user requests that are either directly or indirectly embedded in audiovisual applications, as well as future services. The invention specifies a bitstream input/output package in the Java programming language to facilitate bitstream encoding and decoding of audio-visual media objects, especially when coding uses the MPEG-4 standard. The invention separates fixed-length and variable-length coding, and allows flexible parsing, which offers the potential for optimized implementation as needed to aid real-time or near-real-time operation.
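The separation of fixed-length and variable-length reading can be illustrated with a minimal sketch. The class `BitReader`, its method names and the string-keyed code table are hypothetical and chosen only for illustration; they are not the package interface the abstract refers to.

```java
import java.util.Map;

/** Hypothetical bit-level reader; names are illustrative assumptions only. */
final class BitReader {
    private final byte[] data;
    private int bitPos; // index of the next unread bit, MSB of each byte first

    BitReader(byte[] data) {
        this.data = data;
    }

    /** Reads a single bit, most significant bit of each byte first. */
    int readBit() {
        if (bitPos >= data.length * 8) {
            throw new IllegalStateException("end of bitstream");
        }
        int current = data[bitPos >> 3] & 0xFF;
        int bit = (current >> (7 - (bitPos & 7))) & 1;
        bitPos++;
        return bit;
    }

    /** Fixed-length path: reads n bits (0..31) as an unsigned integer. */
    int readFixed(int n) {
        if (n < 0 || n > 31) {
            throw new IllegalArgumentException("n must be in 0..31");
        }
        int value = 0;
        for (int i = 0; i < n; i++) {
            value = (value << 1) | readBit();
        }
        return value;
    }

    /** Variable-length path: matches bits against a prefix-free code table. */
    int readVariable(Map<String, Integer> table, int maxLen) {
        StringBuilder code = new StringBuilder();
        for (int i = 0; i < maxLen; i++) {
            code.append(readBit());
            Integer symbol = table.get(code.toString());
            if (symbol != null) {
                return symbol;
            }
        }
        throw new IllegalStateException("no codeword matched");
    }
}
```

Keeping the two paths in separate methods means the variable-length lookup (here a simple map, assumed for brevity) could be swapped for a faster table-driven decoder without touching the fixed-length path.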
Abstract:
The invention provides a system and method allowing the adaptation of a nonadaptive system for playing/browsing coded audiovisual objects, such as the parametric system of MPEG-4. The system of the invention is referred to as the programmatic system, and incorporates adaptivity on top of the parametric system. The parametric system of MPEG-4 consists of a Systems Demultiplex (Demux) overseen by the digital media integration framework (DMIF), scene graph and media decoders, buffers, a compositor and a renderer. Adaptations possible with the invention include interfaces in the categories of media decoding, user functionalities and authoring, thus allowing a number of enhanced functionalities in response to user input as well as graceful degradation in response to limited system resources. The invention includes a specification of an interfacing method in the form of an application programming interface (API). Hot object, directional, trick mode, transparency and other interfaces are specified.
Abstract:
A fundamental limitation in the exchange of audiovisual information today is that its representation is extremely low level. It is composed of coded video or audio samples (often as blocks) arranged in a commercial format. In contrast, new-generation multimedia requires flexible formats that allow quick adaptation to requirements in terms of access, bandwidth scalability, streaming and general data reorganization. The Flexible-Integrated Intermedia Format (Flexible-IIF or F-IIF) is an advanced extension to the Integrated Intermedia Format (IIF). The Flexible-IIF data structures, file formats, systems and methods provide a framework that integrates advanced concepts, such as object-based audio-visual representation, metadata and object-oriented programming, to achieve a flexible and generic representation of the audiovisual information and the associated methods to operate on it.
Abstract:
Audiovisual data storage is enhanced using an expanded physical object table that holds, for each object, an ordered list of unique identifiers, one for every instance of that object contained in the segments of a data file. Two object instances of the same object in the same segment have different object identifiers. Different instances of the same object can therefore be differentiated from one another for access, editing and transmission. The memory required for randomly accessing data contained in files using the expanded physical object table may be reduced by distributing the necessary information within a header of a file to simplify the structure of the physical object table. In this way, a given object may be randomly accessed by means of an improved physical object table/segment object table mechanism.
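The idea of giving every instance of an object its own identifier can be sketched as follows. The class `PhysicalObjectTable`, its fields and the use of a simple counter for identifiers are assumptions made for illustration; the abstract does not specify the table layout.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical in-memory table; the layout is an assumption, not the patented format. */
final class PhysicalObjectTable {
    /** One stored instance of an object: its unique id and the segment holding it. */
    static final class Instance {
        final int instanceId;
        final int segment;

        Instance(int instanceId, int segment) {
            this.instanceId = instanceId;
            this.segment = segment;
        }
    }

    private final Map<String, List<Instance>> byObject = new LinkedHashMap<>();
    private int nextId = 1; // assumed id scheme: a monotonically increasing counter

    /** Registers a new instance of an object and returns its unique identifier. */
    int addInstance(String objectName, int segment) {
        int id = nextId++;
        byObject.computeIfAbsent(objectName, k -> new ArrayList<>())
                .add(new Instance(id, segment));
        return id;
    }

    /** Returns the instances of an object in the order they were registered. */
    List<Instance> instancesOf(String objectName) {
        List<Instance> found = byObject.get(objectName);
        return found == null ? new ArrayList<>() : new ArrayList<>(found);
    }
}
```

Because identifiers are assigned per instance rather than per object, two instances of the same object in the same segment remain distinguishable.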
Abstract:
File formats, systems and methods are disclosed that provide a framework that integrates concepts, such as object-based audio-visual representation, metadata and object-oriented programming, to achieve a flexible and generic representation of the audiovisual information and the associated methods to operate on it. A system and method are disclosed for storing data processed from presentation data. The data is stored according to a method comprising coding input presentation data by identifying objects from within the presentation data, coding each object individually and organizing the coded data into access layer data units. The access layer data units are stored throughout a plurality of segments, each segment comprising a segment table in a header portion thereof and those access layer data units that are members of the respective segment, there being one entry in the segment table for each access layer data unit therein. A plurality of extended segments are also stored, each of the extended segments further comprising one or more of the access layer data units that include protocol-specific data, the extended segments each represented by an extended segment header. The data of an accessible object is also stored, including an accessible object header and identifiers of the plurality of extended segments, each of the extended segments being a member of the same object.
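The relationship between a segment table and the access layer data units it indexes can be sketched as below. The class `Segment`, the offset/length entry layout and the in-memory payload are assumptions for illustration; the abstract states only that the table holds one entry per data unit.

```java
import java.io.ByteArrayOutputStream;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical segment: a table of entries plus the data units they describe. */
final class Segment {
    /** Segment table entry: where one access layer data unit sits in the payload. */
    static final class Entry {
        final int offset;
        final int length;

        Entry(int offset, int length) {
            this.offset = offset;
            this.length = length;
        }
    }

    private final List<Entry> table = new ArrayList<>();
    private final ByteArrayOutputStream payload = new ByteArrayOutputStream();

    /** Appends a data unit, records exactly one table entry for it, returns its index. */
    int append(byte[] unit) {
        table.add(new Entry(payload.size(), unit.length));
        payload.write(unit, 0, unit.length);
        return table.size() - 1;
    }

    /** Random access: looks up the table entry, then copies that unit out. */
    byte[] read(int index) {
        Entry e = table.get(index);
        byte[] all = payload.toByteArray();
        return Arrays.copyOfRange(all, e.offset, e.offset + e.length);
    }

    /** Number of table entries, which equals the number of stored data units. */
    int entryCount() {
        return table.size();
    }
}
```

With the table kept in the header, a reader can locate any data unit without scanning the units that precede it.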
Abstract:
The invention provides a system and method for processing object-based audiovisual information which is capable of flexibly encoding, storing and accessing a variety of data objects. Audiovisual objects, illustratively prepared in MPEG-4 format, are stored onto physical media using a file consisting of segments and access layer primary data units (PDUs), which are accessed using index information. The processes of encoding, storing and accessing audiovisual information are decoupled from monolithic storage in a strict linear order, so that reordering and other editorial manipulations are possible.
Abstract:
Video-coded information is transmitted over a network at a priority level that is determined based on feedback from the network. In an embodiment, the feedback comprises a response to a request for information on whether the network currently has the available capacity to transmit additional high priority traffic. In an embodiment, a candidate base layer frame is transmitted over the network as a base layer frame if permission to send high priority data was granted and is transmitted over the network as an enhancement layer frame if permission to send high priority data was denied. In a further embodiment, the candidate base layer frame is deleted if permission to send high priority data was denied.
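The decision described for a candidate base layer frame can be sketched as a small function. The names `LayerSelector`, `Action` and the `dropOnDenial` flag are hypothetical; the flag simply selects between the two embodiments mentioned above.

```java
/** Hypothetical decision helper; names and the boolean flag are illustrative assumptions. */
final class LayerSelector {
    enum Action { SEND_AS_BASE, SEND_AS_ENHANCEMENT, DROP }

    /**
     * Chooses what to do with a candidate base layer frame.
     *
     * @param granted      network feedback: was permission to send high priority data granted?
     * @param dropOnDenial true selects the further embodiment that deletes the frame on denial
     */
    static Action decide(boolean granted, boolean dropOnDenial) {
        if (granted) {
            return Action.SEND_AS_BASE;
        }
        return dropOnDenial ? Action.DROP : Action.SEND_AS_ENHANCEMENT;
    }
}
```

For example, `LayerSelector.decide(false, false)` yields `SEND_AS_ENHANCEMENT`, matching the embodiment in which a denied frame is demoted rather than discarded.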
Abstract:
A predictive video coder performs gradient prediction based on previous blocks of image data. For a new block of image data, the prediction determines a horizontal gradient and a vertical gradient from a block diagonally above the new block (vertically above a previous horizontally adjacent block). Based on these gradients, the encoder predicts image information based on image information of either the horizontally adjacent block or a block vertically adjacent to the new block. The encoder determines a residual that is transmitted in an output bitstream. The decoder performs the identical gradient prediction and predicts image information without need for overhead information. The decoder computes the actual information based on the predicted information and the residual from the bitstream.
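The gradient test can be sketched for a single value per block, such as a DC coefficient. The selection rule used here (compare the two absolute differences and predict from the neighbour along the direction of smaller change) is an assumption for illustration, as are the class and parameter names; the abstract does not give the exact comparison.

```java
/** Hypothetical predictor operating on one value per block (for example a DC value). */
final class GradientPredictor {
    /**
     * Selects the prediction for the new block.
     *
     * @param left     value of the horizontally adjacent (left) block
     * @param diagonal value of the block diagonally above, which sits above the left block
     * @param above    value of the vertically adjacent (above) block
     */
    static int predict(int left, int diagonal, int above) {
        // Change measured between the diagonal block and the left block below it.
        int verticalGradient = Math.abs(left - diagonal);
        // Change measured between the diagonal block and the above block beside it.
        int horizontalGradient = Math.abs(diagonal - above);
        // Assumed rule: little vertical change suggests predicting from the block above.
        return verticalGradient < horizontalGradient ? above : left;
    }

    /** Encoder side: the residual that would be written to the bitstream. */
    static int residual(int actual, int left, int diagonal, int above) {
        return actual - predict(left, diagonal, above);
    }

    /** Decoder side: repeats the same prediction and adds the residual back. */
    static int reconstruct(int residual, int left, int diagonal, int above) {
        return predict(left, diagonal, above) + residual;
    }
}
```

Because the decoder calls the same `predict` on blocks it has already decoded, no side information about the chosen direction needs to be transmitted.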