摘要:
The invention provides a system and method allowing the adaptation of a nonadaptive system for playing/browsing coded audiovisual objects, such as the parametric system of MPEG-4. The system of the invention is referred to as the programmatic system, and incorporates adaptivity on top of the parametric system. The parametric system of MPEG-4 consists of a Systems Demultiplex (Demux) overseen by digital media integration framework (DMIF), scene graph and media decoders, buffers, compositer and renderer. Adaptations possible with the invention include interfaces in the categories of media decoding, user functionalities and authoring, thus allowing a number of enhanced functionalities in response to use input as well as graceful degradation in response to limited system resources. The invention includes a specification of an interfacing method in the form of an application programming interface (API). Hot object, directional, trick mode, transparency and other interfaces are specified.
摘要:
Systems and methods for performing videoconferencing using endpoints with multiple monitors and multiple cameras are disclosed herein. These endpoints are comprised of, where each node is comprised of a control unit and one or more node units, each connected to at least one monitor, camera, speaker, or microphone. Video is encoded using scalable coding, and endpoints are connected to each other over a network using an SVCS. Algorithms are described for layout management, tagging of individual streams, and use of tags for dynamic and prioritized layout management.
摘要:
Systems and methods are provided for communicating timely information related to the scalability layer structure of signals received by decoders and other components in a video and/or audio communication system. For a communication system, which uses the Standard H.264 SVC coding format, standard SSEI messages are modified or supplemented to include the ability to signal scalability layer structure information and changes thereof. Recipients can use the signal scalability layer information to properly process or decode received signals.
摘要:
As information to be processed at an object-based video or audio-visual (AV) terminal, an object-oriented bitstream includes objects, composition information, and scene demarcation information. Such bitstream structure allows on-line editing, e.g. cut and paste, insertion/deletion, grouping, and special effects. In the interest of ease of editing, AV objects and their composition information are transmitted or accessed on separate logical channels (LCs). Objects which have a lifetime in the decoder beyond their initial presentation time are cached for reuse until a selected expiration time. The system includes a de-multiplexer (1), a controller (2) which controls the operation of the AV terminal, input buffers (3), AV objects decoders (4), buffers (4′) for decoded data, a composer (5), a display (6), and an object cache (7).
摘要:
In a digital video compression system that includes both a real buffer of size B.sub.max, and a smaller virtual buffer of size B.sub.virtual, a method for controlling the generated bit rate of compressed video information to keep within the maximum buffer capacity is disclosed. The method includes the steps of (a) receiving blocks of digital information, (b) determining whether a current block can be compressed by one of one or more shortcuts and using the shortcut if possible; (c) if the current block cannot be compressed with any shortcut, determining whether the virtual buffer capacity will be exceeded if the current received block is compressed only by arithmetic coding and using only arithmetic coding if possible; (d) if the current block cannot be compressed with any shortcut and the virtual buffer capacity would be exceeded with only arithmetic coding, determining whether the virtual buffer capacity will be exceeded if the current block is compressed with both downsampling and arithmetic coding and using both downsampling and arithmetic coding if possible; and (e) if all else fails, compressing the current block with a default mode of compression even if the virtual buffer capacity is exceeded with such compression.
摘要:
The invention provides a standardized interface facility for MPEG-4 authoring, bitstream manipulation, editing and interpretation, with associated tools and interfaces to, resulting in coded bitstreams which are easier to test, check and debug while conforming to the MPEG-4 standard. The specified interfaces can also facilitate graceful degradation in the face of decreased resources by allowing editing of bitstreams. The specified interfaces can also allow creation of decodable bitstreams in response to the user requests either directly or indirectly embedded in audiovisual applications, as well as future services. The invention specifies a bitstream input/output package in the Java programming language to facilitates bitstream encoding and decoding of audio-visual media objects, especially when coding uses the MPEG-4 standard. The invention separates fixed length and variable length coding, and allows flexible parsing which offers the potential of optimized implementation as needed to aid real-time or near real-time operation.
摘要:
As information to be processed at an object-based video or audio-visual (AV) terminal, an object-oriented bitstream includes objects, composition information, and scene demarcation information. Such bitstream structure allows on-line editing, e.g. cut and paste, insertion/deletion, grouping, and special effects. In the interest of ease of editing, AV objects and their composition information are transmitted or accessed on separate logical channels (LCs). Objects which have a lifetime in the decoder beyond their initial presentation time are cached for reuse until a selected expiration time.
摘要:
Systems and methods are provided for communicating timely information related to the scalability layer structure of signals received by decoders and other components in a video and/or audio communication system. For a communication system, which uses the Standard H.264 SVC coding format, standard SSEI messages are modified or supplemented to include the ability to signal scalability layer structure information and changes thereof. Recipients can use the signal scalability layer information to properly process or decode received signals.
摘要:
Media communication systems and methods for media encoded using scalable coding with temporal scalability are provided. Transmitting endpoints include switching information in their transmitted media to indicate if temporal level switching at a decoder can occur at any frame of the transmitted encoded media.
摘要:
Systems and methods for error resilient transmission and for random access in video communication systems are provided. The video communication systems are based on single-layer, scalable video, or simulcast video coding with temporal scalability, which may be used in video communication systems. A set of video frames or pictures in a video signal transmission is designated for reliable or guaranteed delivery to receivers using secure or high reliability links, or by retransmission techniques. The reliably-delivered video frames are used as reference pictures for resynchronization of receivers with the transmitted video signal after error incidence and for random access.