Abstract:
This invention is a method of scalable video encoding and decoding. The scalable video encoding codes both a base layer and an enhanced layer having greater resolution and/or refresh rate. Upon decoding, some enhanced layer pictures may be dropped to reach the best resolution and refresh rate achievable within a target data rate. Upon encoding, a key picture in at least one group of pictures forming the video is coded as a combined base layer/enhanced layer key picture. Such a combined base layer/enhanced layer key picture cannot be dropped on decoding. This technique reduces drift in the decoder.
Abstract:
A method of assigning a priority for controlling a bit rate of a bitstream having a plurality of quality layers is provided. The method includes composing first quality layers for a reference picture, composing second quality layers for a current picture that is encoded with reference to the reference picture, and assigning a priority to each of the first and second quality layers, wherein a low priority is assigned to a quality layer whose truncation has only a small influence on the reduction in video quality of the current picture.
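The priority rule above can be illustrated with a minimal sketch. The `QualityLayer` class, the `quality_drop` field (an assumed measure, for example in dB, of how much the current picture degrades when that layer is truncated), and the sample values are all hypothetical; the abstract does not say how the influence is measured.

```python
from dataclasses import dataclass

@dataclass
class QualityLayer:
    picture: str         # "reference" or "current"
    index: int           # quality layer index within that picture
    quality_drop: float  # assumed: quality loss of the current picture if truncated

def assign_priorities(layers):
    """Map (picture, index) to a priority; 0 is lowest and is truncated first."""
    # Layers whose truncation hurts the current picture least get the lowest priority.
    ordered = sorted(layers, key=lambda layer: layer.quality_drop)
    return {(layer.picture, layer.index): rank for rank, layer in enumerate(ordered)}

layers = [
    QualityLayer("reference", 1, 0.9),
    QualityLayer("reference", 2, 0.4),
    QualityLayer("current", 1, 0.6),
    QualityLayer("current", 2, 0.1),
]
priorities = assign_priorities(layers)
```

Ranking the reference-picture and current-picture layers in one list reflects the abstract's point that both sets are prioritized by their effect on the current picture.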
Abstract:
A method and apparatus for audio bass enhancement is provided. The method includes band-pass filtering an input signal; generating at least one even and odd harmonics of the band-pass filtered input signal; and synthesizing the harmonics and the band-pass filtered input signal to be output.
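The three steps above (band-pass filtering, harmonic generation, synthesis) might be sketched as follows. Everything specific here is an assumption rather than the disclosed design: the biquad band-pass filter, the use of full-wave rectification for even harmonics and cubing for odd harmonics, the gains, and the 80 Hz center frequency.

```python
import math

def band_pass(samples, sample_rate, center_hz, q=1.0):
    """Second-order (biquad) band-pass filter with unity gain at center_hz."""
    w0 = 2.0 * math.pi * center_hz / sample_rate
    alpha = math.sin(w0) / (2.0 * q)
    a0 = 1.0 + alpha
    b0, b2 = alpha / a0, -alpha / a0          # the b1 coefficient is zero
    a1, a2 = -2.0 * math.cos(w0) / a0, (1.0 - alpha) / a0
    out = []
    x1 = x2 = y1 = y2 = 0.0
    for x in samples:
        y = b0 * x + b2 * x2 - a1 * y1 - a2 * y2
        out.append(y)
        x2, x1 = x1, x
        y2, y1 = y1, y
    return out

def enhance_bass(samples, sample_rate, center_hz=80.0, even_gain=0.5, odd_gain=0.5):
    """Band-pass the input, derive harmonics, and mix them with the filtered signal."""
    bass = band_pass(samples, sample_rate, center_hz)
    enhanced = []
    for b in bass:
        even = abs(b)   # assumed: full-wave rectification yields even harmonics
        odd = b ** 3    # assumed: cubing yields the third (odd) harmonic
        enhanced.append(b + even_gain * even + odd_gain * odd)
    return enhanced

# Usage: an 80 Hz tone sampled at 8 kHz.
tone = [math.sin(2.0 * math.pi * 80.0 * n / 8000.0) for n in range(800)]
enhanced_tone = enhance_bass(tone, 8000)
```

Rectification also introduces a DC offset, which a practical design would likely filter out before output.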
Abstract:
A method, system, and apparatus for a linear temporal reference scheme having non-reference predictive frames are disclosed. In one embodiment, a method of a temporal reference scheme includes creating a non-reference predictive frame and a reference predictive frame from a reference frame, increasing a quantization factor of the non-reference predictive frame by a threshold value, and decreasing the quantization factor of the reference predictive frame by the threshold value. The method may include toggling, based on a scenario, between the temporal reference scheme and another temporal reference scheme that has only reference predictive frames and no non-reference predictive frames. The method may also include calculating a target bit rate across a communication channel through a bit rate calculation algorithm that considers a bandwidth capacity across the communication channel.
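The quantization adjustment described above can be sketched in a few lines. The function names, the alternating reference/non-reference pattern, and the sample values are assumptions for illustration; the abstract only states that the two frame types move in opposite directions by the same threshold.

```python
def frame_quantization(base_q, threshold, is_reference):
    """Reference frames are quantized more finely, non-reference frames more coarsely."""
    if is_reference:
        return base_q - threshold
    return base_q + threshold

def plan_p_frames(num_frames, base_q, threshold):
    """Return (frame index, is_reference, quantization factor) for each predictive frame."""
    plan = []
    for i in range(num_frames):
        is_reference = (i % 2 == 1)  # assumed pattern: every second P frame is a reference
        plan.append((i, is_reference, frame_quantization(base_q, threshold, is_reference)))
    return plan

plan = plan_p_frames(4, base_q=28, threshold=2)
```

Spending fewer bits on frames that nothing else predicts from, and more on frames that serve as references, is the apparent motivation for the opposite adjustments.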
Abstract:
Image frames of a video stream are encoded with the aim of reducing flicker in the video stream when displayed. In one embodiment, the quantization parameter used to quantize an image frame is capped to be not greater than a quantization parameter used to quantize an immediately previous image frame. In another embodiment, the quantization step size used to quantize a macro-block of an image frame is computed based on the value of a quantization error of a co-located macro-block in an immediately previous image frame. In yet another embodiment, macro-block transform coefficients corresponding to high-frequency components are quantized using relatively higher quantization parameter values if the image frame is deemed to contain high activity. In yet another embodiment, flicker-prone macro-blocks of a source frame are replaced by corresponding best-match macro-blocks of a previous reconstructed frame, prior to being encoded.
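The first embodiment above, capping a frame's quantization parameter at the previous frame's value, can be illustrated as follows. The function name and the input list of rate-control proposals are hypothetical, and the abstract does not say when, if ever, the cap is reset.

```python
def cap_frame_qps(proposed_qps):
    """Cap each frame's QP so it is not greater than the QP used for the previous frame."""
    capped = []
    for qp in proposed_qps:
        if capped:
            qp = min(qp, capped[-1])  # never exceed the immediately previous frame's QP
        capped.append(qp)
    return capped

capped_qps = cap_frame_qps([30, 34, 28, 31])
```

Applied literally, the cap makes the sequence of quantization parameters non-increasing, which prevents the frame-to-frame jumps toward coarser quantization that can show up as flicker.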
Abstract:
A video system includes an encoder for generating a compressed bit stream in response to a received video signal. The encoder includes a mode decision processor that is arranged to determine whether the mode of a first pixel block in a first row is an “intra-mode” or an “inter-mode.” The encoder also includes a mode estimation processor that is arranged to estimate the mode of a left pixel block in a second row that is received after the first row in response to the determined mode of the first pixel block in the first row. The encoder also includes a pixel block processor that is arranged to process a pixel block in the second row that is to the right of the left pixel block in response to the estimated mode of the left pixel block.
Abstract:
A video encoder receives a macro-block of an image frame, and determines whether the macro-block contains text. The video encoder computes a quantization parameter for quantizing the macro-block, with the quantization parameter computed to be smaller if the macro-block is determined to contain text. The video encoder encodes the macro-block using the quantization parameter. Text quality in the encoded macro-block is preserved.
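A minimal sketch of the text-aware quantization above follows. The text detector is a stand-in: counting sharp horizontal transitions, along with the thresholds and the QP offset, is an assumption for illustration, since the abstract does not describe how text is detected or how much smaller the parameter becomes.

```python
def looks_like_text(block, edge_threshold=64, min_edge_fraction=0.2):
    """Assumed heuristic: text blocks contain many sharp horizontal luma transitions."""
    edges = 0
    pairs = 0
    for row in block:
        for left, right in zip(row, row[1:]):
            pairs += 1
            if abs(left - right) >= edge_threshold:
                edges += 1
    return pairs > 0 and edges / pairs >= min_edge_fraction

def macroblock_qp(block, base_qp=30, text_qp_offset=6, min_qp=0):
    """Use a smaller QP (finer quantization) when the macro-block appears to contain text."""
    if looks_like_text(block):
        return max(min_qp, base_qp - text_qp_offset)
    return base_qp

flat_block = [[128] * 16 for _ in range(16)]
striped_block = [[0, 255] * 8 for _ in range(16)]
flat_qp = macroblock_qp(flat_block)
striped_qp = macroblock_qp(striped_block)
```

Here the flat block keeps the base quantization parameter while the high-contrast striped block, which the heuristic treats as text, is quantized more finely.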
Abstract:
A client for playing a video stream, and a method thereof, adaptively control video playback in accordance with variable communication environments and the size of the video stream to be transmitted. The client includes a demultiplexer for generating elementary stream (ES) packets by demultiplexing a video stream; a FIFO buffer for temporarily storing the ES packets; a play timing modification unit for reducing a gap in play time caused by a burst error by modifying the time stamps of the ES packets in the FIFO buffer when it is judged that a burst error has occurred; and a video decoder for generating an output video by decoding the ES packets whose time stamps have been modified. The client can improve the quality of video streaming in multimedia communications over a wireless or wired network.
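The play timing modification above might look like the following sketch. The packet representation (a dict with a `pts` time stamp in milliseconds), the gap-based burst error test, and the choice to shrink each gap to one frame interval are all assumptions; the abstract does not specify how the burst error is judged or how the time stamps are rewritten.

```python
from collections import deque

def close_timestamp_gaps(fifo, frame_interval, max_gap):
    """Shift time stamps of buffered ES packets so oversized gaps shrink to one frame interval."""
    shift = 0
    prev_original = None
    for packet in fifo:
        original = packet["pts"]
        if prev_original is not None:
            gap = original - prev_original
            if gap > max_gap:                  # assumed test for a burst error
                shift += gap - frame_interval  # remove the excess from all later packets
        packet["pts"] = original - shift
        prev_original = original
    return fifo

# Usage: packets at 40 ms intervals with a 400 ms hole after the third one.
fifo = deque({"pts": t} for t in [0, 40, 80, 480, 520])
close_timestamp_gaps(fifo, frame_interval=40, max_gap=120)
new_pts = [packet["pts"] for packet in fifo]
```

After the adjustment the decoder sees evenly spaced time stamps, so playback continues without waiting out the hole left by the lost packets.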