摘要:
Techniques are described for performing near visually lossless video recompression. The disclosed techniques generate video frames having relatively small bitrates and relatively small file sizes while retaining approximately a same level of visually perceivable video quality as the originally recorded video frames. In general, recompression of a video frame takes an input video frame and produces a second copy of the video frame that has the same or lower bitrate. The proposed techniques address the problem of recompressing a video frame with no perceivable loss in visual quality (i.e., visually lossless recompression) compared to the original recording of the video frame. In addition, the disclosed techniques provide one-step recompression of video frames that includes a single decoding and encoding of each video frame.
摘要:
The motion estimation techniques and video encoding device(s) described use a two dimensional pipeline to generate accurate motion estimation parameters for a current video block. The two dimensional pipeline uses previously calculated motion estimation parameters of relevant neighboring video blocks, including a preceding video block on a same row as the current video block, prior to the generation of the accurate motion vectors, motion vector predictors, and mode decision of the current video block The use of the two dimensional pipeline allows accurate motion vector prediction from neighboring video blocks previously not available, in the computation of motion vectors. Three engines may be used in the two dimensional pipeline, a fetch engine, an integer search engine and a fractional and spatial search engine. While the fetch engine and fraction and spatial search engine operate on one row, the integer search engine operates on another row.
摘要:
The disclosure describes FGS video coding techniques that use cycle-aligned fragments (CAFs). The techniques may perform cycle-based coding of FGS video data block coefficients and syntax elements, and encapsulate cycles in fragments for transmission. The fragments may be cycle-aligned such that a start of a payload of each of the fragments substantially coincides with a start of one of the cycles. In this manner, cycles can be readily accessed via individual fragments. Some cycles may be controlled with a vector mode to scan to a predefined position within a block before moving to another block. In this manner, the number of cycles can be reduced, reducing the number of fragments and associated overhead. The CAFs may be entropy coded independently of one another so that each fragment may be readily accessed and decoded without waiting for decoding of other fragments. Independent entropy coding may permit parallel decoding and simultaneous processing of fragments.
摘要:
An embodiment is directed to a method for selecting a predictive macroblock partition from a plurality of candidate macroblock partitions in motion estimation and compensation in a video encoder including determining a bit rate signal for each of the candidate macroblock partitions, generating a distortion signal for each of the candidate macroblock partitions, calculating a cost for each of the candidate macroblock partitions based on respective bit rate and distortion signals to produce a plurality of costs, and determining a motion vector from the costs. The motion vector designates the predictive macroblock partition.
摘要:
Techniques for estimating distortion due to quantization of data are described. A histogram with multiple bins may be obtained for a set of coefficients to be quantized. Distortion due to quantization of the set of coefficients may be estimated based on the histogram and average distortions for the histogram bins. The number of coefficients in each bin may be multiplied with an average distortion for the bin to obtain a per-bin distortion. The per-bin distortions for all of the bins may be accumulated and scaled with a correction factor to obtain the estimated distortion. The techniques may be used to estimate distortions for a set of coding elements. Distortion and rate may be estimated for each coding element for each of multiple quantization steps. A set of quantization steps may be selected for the set of coding elements based on the estimated distortions and the estimated rates for the set of coding elements for different quantization steps.
摘要:
This disclosure describes rate control techniques that can improve video coding based on a 'two-pass' approach. The first pass codes a video sequence using a first set of quantization parameters (QPs) for the purpose of estimating rate-distortion characteristics of the video sequence based on the statistics of the first pass. A second set of QPs can then be defined for a second coding pass. The estimated rate-distortion characteristics of the first pass are used to select QPs for the second pass in a manner that minimizes distortion of the frames of the video sequence.
摘要:
A voice recognition (VR) system is disclosed that utilizes a combination of speaker independent (SI) (230 and 232) and speaker dependent (SD) (234) acoustic models. At least one SI acoustic model (230 and 232) is used in combination with at least one SD acoustic model (234) to provide a level of speech recognition performance that at least equals that of a purely SI acoustic model. The disclosed hybrid SI/SD VR system continually uses unsupervised training to update the acoustic templates in the one ore more SD acoustic models (234). The hybrid VR system then uses the updated SD acoustic models (234) in combination with the at least one SI acoustic model (230 and 232) to provide improved VR performance during VR testing.
摘要:
Motion estimation in video compressions systems. A programmable motion estimator may be used to estimate a motion vector for a macroblock in a current frame by searching for a matching macroblock in a previous frame. A controller may be used to program the motion estimator to perform a particular search.
摘要:
Techniques for intensity compensation in video processing are provided. In one configuration, a wireless communication device compliant with the VC1-SMPTE standard (e.g., cellular phone, etc.) comprises a processor that is configured to execute instructions operative to reconstruct reference frames from a received video bitstream. A non-intensity-compensated copy of a reference frame of the bitstream is stored in a memory of the device and used for defining the displayable images and for on-the-fly generation of a stream of intensity-compensated pixels to perform motion compensation calculations for frames of the video bitstream.