Abstract:
A motion-compensated video encoding scheme employs progressive fine-granularity layered coding to encode macroblocks of video data into frames having multiple layers, including a base layer of comparatively low quality video and multiple enhancement layers of increasingly higher quality video. Some of the enhancement layers in a current frame are predicted from different quality layers in reference frames. The video encoding scheme estimates drifting errors during encoding and chooses a coding mode for each macroblock in the enhancement layer to maximize coding efficiency while minimizing drifting errors.
Abstract:
A resource allocation scheme for multiple compressed AV streams delivered over the Internet is disclosed that achieves end-to-end optimal quality through a multimedia streaming TCP-friendly transport (MSTFP) protocol that adaptively estimates the network bandwidth while smoothing the sending rate. Resources are allocated dynamically according to a media encoding distortion and network degradation algorithm. A scheme is also disclosed for dynamically estimating the available network bandwidth for streaming of objects, such as MPEG-4 multiple video objects, in conjunction with the MSTFP protocol. The scheme can account for packet-loss rates to minimize end-to-end distortion for media delivery.
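The abstract does not give the MSTFP rate formula, so the following is only a minimal sketch of the general idea of a TCP-friendly rate estimate combined with smoothing. It substitutes the well-known simplified TCP throughput model (rate proportional to packet size divided by RTT times the square root of the loss rate) for whatever MSTFP actually uses, and the function names, the `alpha` weight, and the `max_rate` cap are all hypothetical.

```python
import math


def tcp_friendly_rate(packet_size, rtt, loss_rate, max_rate=float("inf")):
    """Estimate a TCP-friendly sending rate in bytes per second.

    Uses the simplified TCP throughput model as a stand-in; this is an
    assumption, not the formula from the MSTFP protocol itself.
    """
    if loss_rate <= 0:
        # No observed loss: the model is undefined, so fall back to the cap.
        return max_rate
    rate = 1.22 * packet_size / (rtt * math.sqrt(loss_rate))
    return min(rate, max_rate)


def smooth_rate(previous_rate, estimated_rate, alpha=0.75):
    """Exponentially smooth the sending rate to avoid abrupt changes.

    alpha is a hypothetical weight given to the previous rate.
    """
    return alpha * previous_rate + (1 - alpha) * estimated_rate


if __name__ == "__main__":
    estimate = tcp_friendly_rate(packet_size=1000, rtt=0.1, loss_rate=0.01)
    print(smooth_rate(previous_rate=100_000, estimated_rate=estimate))
```

A larger `alpha` makes the sender react more slowly to a new bandwidth estimate, which trades responsiveness for a steadier stream.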
Abstract:
Various methods and arrangements are provided for transmitting adaptive multimedia content over networks that provide differential services. By way of example, one method includes compressing video objects, generating at least one corresponding elementary stream containing the compressed video objects, classifying information within each elementary stream based on importance, and assembling the classified information into packets associated with different classes of network packets. In classifying the information within each elementary stream based on importance, different priority levels can be assigned to shape, motion, and texture information. Methods and arrangements are also provided for use with multimedia content information that includes audio information, image information, textual information, and the like.
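The classification and packet assembly steps described above can be illustrated with a small sketch. The priority order (shape highest, then motion, then texture), the class numbers, and the payload limit are assumptions made for illustration; the abstract only states that different priority levels can be assigned to these three kinds of information.

```python
# Hypothetical mapping from information type to network class
# (0 = highest priority). The ordering is an assumption.
PRIORITY_CLASS = {"shape": 0, "motion": 1, "texture": 2}


def packetize_by_class(units, max_payload=1400):
    """Group (kind, payload) units into packets, one class per packet.

    Returns a list of (network_class, payload_bytes) tuples.
    """
    packets = []
    buffers = {}  # network class -> bytearray being filled
    for kind, data in units:
        cls = PRIORITY_CLASS[kind]
        buf = buffers.setdefault(cls, bytearray())
        # Flush the current packet if this unit would overflow it.
        if buf and len(buf) + len(data) > max_payload:
            packets.append((cls, bytes(buf)))
            buf.clear()
        buf.extend(data)
    # Flush remaining partial packets, highest priority first.
    for cls in sorted(buffers):
        if buffers[cls]:
            packets.append((cls, bytes(buffers[cls])))
    return packets


if __name__ == "__main__":
    stream = [("shape", b"S1"), ("texture", b"T1"),
              ("motion", b"M1"), ("shape", b"S2")]
    for cls, payload in packetize_by_class(stream, max_payload=3):
        print(cls, payload)
```

Keeping each packet to a single class lets a differentiated-services network drop or delay low-priority texture data without touching the shape and motion data.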
Abstract:
Apparatus and method for classifying regions of an image based on the relative “importance” of the various areas, and for adaptively using the importance information to allocate processing resources and input image formation.
Abstract:
Automatic video object extraction that defines substantially precise objects is disclosed. In one embodiment, color segmentation and motion segmentation are performed on a source video. The color segmentation segments the video by substantially uniform color regions thereof. The motion segmentation segments the video by moving regions thereof. The color regions and the moving regions are then combined to define the video objects. In varying embodiments, pre-processing and post-processing are performed to further clean the source video and the video objects defined, respectively.
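One plausible way to combine the two segmentations is sketched below: a color region is treated as part of a video object when most of its pixels fall inside the moving regions. The abstract does not specify the combination rule, so the majority vote, the `threshold` parameter, and the function name are assumptions.

```python
from collections import Counter


def combine_segmentations(color_labels, motion_mask, threshold=0.5):
    """Return a boolean object mask from a color label map and a motion mask.

    color_labels: 2D list of region ids from color segmentation.
    motion_mask:  2D list of truthy/falsy values from motion segmentation.
    A color region is kept if more than `threshold` of its pixels are moving
    (an assumed rule, not taken from the source).
    """
    total = Counter()
    moving = Counter()
    for label_row, motion_row in zip(color_labels, motion_mask):
        for label, is_moving in zip(label_row, motion_row):
            total[label] += 1
            if is_moving:
                moving[label] += 1
    object_regions = {
        label for label in total if moving[label] / total[label] > threshold
    }
    return [[label in object_regions for label in row] for row in color_labels]


if __name__ == "__main__":
    labels = [[1, 1, 2],
              [1, 2, 2]]
    motion = [[1, 1, 0],
              [0, 1, 0]]
    print(combine_segmentations(labels, motion))
```

Snapping the object boundary to color regions is what lets the noisy motion mask yield a clean outline: the stray moving pixel in region 2 is discarded, and the static pixel in region 1 is filled in.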
Abstract:
Methods and arrangements are provided for compressing and transporting image-based rendering (IBR) data using alignment and three-dimensional (3D) wavelet transform techniques, and selectively decompressing portions of the resulting compressed data to render various two-dimensional (2D) views of a 3D scene. A compression engine compresses the IBR data using a 3D wavelet transform and outputs a compressed bitstream comprising encoded frequency coefficients associated with the IBR data. This compressed bitstream can then be provided to a separate decompression engine that selectively decodes portions of the compressed bitstream based on an access request for image data associated with a desired view from a rendering engine. Through a mixed cache, the progressive inverse wavelet synthesis (PIWS) engine decompresses the decoded portions with minimal computation, and provides the decompressed IBR data to the rendering engine. The rendering engine is therefore able to render the decompressed IBR data without requiring the entire IBR bitstream to be decoded and decompressed at any one time.
Abstract:
Apparatus and method for encoding zerotrees in a wavelet-based coding technique. The method uses a depth-first pattern for traversing the zerotree, i.e., each branch of the tree, from parent to child to grandchild and so on, is fully traversed before a next branch is traversed. The depth-first tree traversal pattern is used to quantize the coefficients of the tree as well as to assign symbols to the quantized coefficients. The method assigns one of three symbols to each node: ZEROTREE ROOT, VALUED ZEROTREE ROOT, and VALUE. By using three symbols and the efficient tree traversal pattern, the method is substantially more efficient at encoding a zerotree than the prior art. Additionally, this concept is applied to the encoding of “vector” zerotrees.
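The depth-first traversal with three symbols can be sketched as follows. The abstract names the symbols but not their exact definitions, so this sketch assumes the usual reading: ZEROTREE ROOT for a zero coefficient whose descendants are all zero, VALUED ZEROTREE ROOT for a nonzero coefficient whose descendants are all zero, and VALUE for any coefficient with at least one nonzero descendant. The `Node` class and function names are hypothetical, and quantization is assumed to have been applied already.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    value: int  # already-quantized wavelet coefficient
    children: List["Node"] = field(default_factory=list)


def subtree_is_zero(node):
    """True if the node and every descendant are zero."""
    return node.value == 0 and all(subtree_is_zero(c) for c in node.children)


def encode_depth_first(root):
    """Assign symbols by fully traversing each branch before the next one."""
    symbols = []

    def visit(node):
        descendants_zero = all(subtree_is_zero(c) for c in node.children)
        if descendants_zero:
            if node.value == 0:
                symbols.append(("ZEROTREE_ROOT", None))
            else:
                symbols.append(("VALUED_ZEROTREE_ROOT", node.value))
            return  # descendants are implied zero, so the branch is pruned
        symbols.append(("VALUE", node.value))
        for child in node.children:
            visit(child)  # parent, then child, then grandchild, and so on

    visit(root)
    return symbols


if __name__ == "__main__":
    tree = Node(5, [
        Node(0, [Node(0), Node(0)]),
        Node(3, [Node(0), Node(2)]),
    ])
    for symbol in encode_depth_first(tree):
        print(symbol)
```

In this example the first branch collapses to a single ZEROTREE_ROOT symbol, which is where the coding gain comes from; the recursive zero check is recomputed at each node for clarity rather than speed.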
Abstract:
A method and apparatus for coding motion video (images) that produce a very low bit rate video representation and minimize the amount of data that must be transferred to the decoder. The apparatus comprises a 2-pass segmentation section, a 2-stage motion estimation/compensation section that optionally employs Affine Motion Compensation (AMC), a wavelet transform section, a Zero-Tree Wavelet Quantization (ZTQ) section, a code book, a comparator, a reconstruction section, a coder, a frame memory and an entropy coder.