摘要:
Systems and methods of performing rate control in scalable video encoders that can be used to provide videoconferencing, announcements, and live video streaming to multiple participant devices having diverse bandwidths, resolutions, and/or other device characteristics. The systems and methods can accommodate different target bit rates of the multiple participant devices by operating on scalable video bitstreams in a multi-layer video format, including a base layer having one or more reference video frames, and an enhancement layer having one or more disposable non-reference, predictive video frames. By adjusting the number of disposable non-reference, predictive video frames in the enhancement layer, as well as quantization parameters for the respective base and enhancement layers, the disclosed systems and methods can accommodate the different target bit rates for the respective participant devices, while enhancing the spatial and/or temporal qualities of the base and enhancement layers in the respective video bitstreams.
摘要:
Systems and methods of performing rate control in scalable video encoders that can be used to provide videoconferencing, announcements, and live video streaming to multiple participant devices having diverse bandwidths, resolutions, and/or other device characteristics. The systems and methods can accommodate different target bit rates of the multiple participant devices by operating on scalable video bitstreams in a multi-layer video format, including a base layer having one or more reference video frames, and an enhancement layer having one or more disposable non-reference, predictive video frames. By adjusting the number of disposable non-reference, predictive video frames in the enhancement layer, as well as quantization parameters for the respective base and enhancement layers, the disclosed systems and methods can accommodate the different target bit rates for the respective participant devices, while enhancing the spatial and/or temporal qualities of the base and enhancement layers in the respective video bitstreams.
摘要:
Systems and methods of performing rate control in scalable video encoders that can be used to provide videoconferencing, announcements, and live video streaming to multiple participant devices having diverse bandwidths, resolutions, and/or other device characteristics. The systems and methods can accommodate different target bit rates of the multiple participant devices by operating on scalable video bitstreams in a multi-layer video format, including a base layer having one or more reference video frames, and an enhancement layer having one or more disposable non-reference, predictive video frames. By adjusting the number of disposable non-reference, predictive video frames in the enhancement layer, as well as quantization parameters for the respective base and enhancement layers, the disclosed systems and methods can accommodate the different target bit rates for the respective participant devices, while enhancing the spatial and/or temporal qualities of the base and enhancement layers in the respective video bitstreams.
摘要:
Provided are methods and systems for rate-distortion optimized quantization and for skipping transform blocks that can take into account the rate-distortion tradeoff as well as the perceptual quality of reconstructed video.
摘要:
A rate control algorithm for a predictive video encoder is provided to accurately control the coding rate while requiring a small complexity. The algorithm assumes the video encoder to consist of two processors, pre-processor and main processor, and the rate control located in main processor uses prediction cost and residual cost for each picture that are computed in pre-processor. Prediction cost represents the estimated number of bits needed to generate an error image, and residual cost describes the difficulty of encoding the resultant error image. Accurate and simple rate control is made possible by using a new rate model that adapts to the local characteristic of a video sequence, and by targeting to maintain the quantization parameter as constant as possible in a given set of pictures, GOP, instead of the conventional two step approach.
摘要:
A technique is provided for use in a video processing system for selecting between an inter-frame coding mode and an intra-frame coding mode when processing a current portion of a video sequence. The technique includes identifying previously reconstructed samples associated with the current portion from a current frame and a reference frame of the video sequence. Then, a selection is made whether to implement the inter-frame coding technique or the intra-frame coding technique on the current portion based on comparisons associated with the previously reconstructed samples. The technique may be implemented in both a video encoder and a video decoder. As a result, the encoder does not need to transmit coding mode-related overhead information to a corresponding decoder. By way of example, the invention may be implemented in a pixel-based coding system or a block transform-based coding system.
摘要:
Systems and methods of handling packet errors associated with multimedia data received at data receivers from data senders, in which packet error feedback information is provided by the data receivers to the data senders in one or more real-time transport control protocol (RTCP) feedback packets. The data receivers can calculate, determine, or otherwise obtain costs associated with providing specific forms of packet error feedback information in the RTCP feedback packets, and further calculate, determine, or otherwise obtain wait times for handling out-of-order packets, as well as receiving retransmissions of lost packets. By handling packet errors associated with multimedia data while taking into account such costs and wait times, each of which can have an impact on system bandwidth and/or latency, the systems and methods can more reliably achieve the quality of experience (QoE) generally desired and/or required for multimedia data transmissions.
摘要:
A technique for use in a block transform-based coding system of processing (e.g., encoding and/or decoding) one or more block transform coefficients associated with at least one block of visual data (e.g., an image and/or video sequence) comprises the following steps. First, one or more previously reconstructed block transform coefficients associated with the visual data are identified. Then, a context selection value is computed for use in processing a block transform coefficient associated with the at least one block, the context selection value being based on the one or more previously reconstructed block transform coefficients. The context selection value may be computed as a function of one or more values respectively associated with one or more previously reconstructed block transform coefficients in near proximity, with respect to a scanning order, to the block transform coefficient to be processed. Further, the context selection value may be computed as a function of a spatial frequency associated with the block transform coefficient. Still further, the context selection value may be computed as a function of both the one or more values respectively associated with the one or more previously reconstructed block transform coefficients in near scanning order proximity and the spatial frequency associated with the block transform coefficient.
摘要:
Systems and methods that employ an application-assisted approach to adaptive spatio-temporal error concealment in video decoders. The systems and methods employ a video receiver that can receive real-time video frames from a video transmitter over a wired and/or wireless network. The video receiver includes an end system configurable as an application for consuming the content of the real-time video frames. The end system application can determine whether each received real-time video frame is complete or incomplete, whether each received slice is complete or incomplete, and provide such information to the video decoder for use in avoiding errors while decoding the respective real-time video frames. The video decoder can determine whether any pixel blocks included in real-time video packets are properly decoded or missing, generate statistics regarding the coding mode of the respective pixel blocks, and use such further information while performing adaptive spatio-temporal error concealment on the respective pixel blocks.
摘要:
Systems and methods of handling packet errors associated with multimedia data received at data receivers from data senders, in which packet error feedback information is provided by the data receivers to the data senders in one or more real-time transport control protocol (RTCP) feedback packets. The data receivers can calculate, determine, or otherwise obtain costs associated with providing specific forms of packet error feedback information in the RTCP feedback packets, and further calculate, determine, or otherwise obtain wait times for handling out-of-order packets, as well as receiving retransmissions of lost packets. By handling packet errors associated with multimedia data while taking into account such costs and wait times, each of which can have an impact on system bandwidth and/or latency, the systems and methods can more reliably achieve the quality of experience (QoE) generally desired and/or required for multimedia data transmissions.