摘要:
This disclosure describes video encoding techniques and video encoding devices that implement such techniques. In one embodiment, this disclosure describes a video encoding device comprising a motion estimator that computes a motion vector predictor based on motion vectors previously calculated for video blocks in proximity to a current video block to be encoded, and uses the motion vector predictor in searching for a prediction video block used to encode the current video block, and a motion compensator that generates a difference block indicative of differences between the current video block to be encoded and the prediction video block.
摘要:
Motion estimation in video compressions systems. A programmable motion estimator may be used to estimate a motion vector for a macroblock in a current frame by searching for a matching macroblock in a previous frame. A controller may be used to program the motion estimator to perform a particular search.
摘要:
The motion estimation techniques and video encoding device(s) described use a two dimensional pipeline to generate accurate motion estimation parameters for a current video block. The two dimensional pipeline uses previously calculated motion estimation parameters of relevant neighboring video blocks, including a preceding video block on a same row as the current video block, prior to the generation of the accurate motion vectors, motion vector predictors, and mode decision of the current video block. The use of the two dimensional pipeline allows accurate motion vector prediction from neighboring video blocks previously not available, in the computation of motion vectors. Three engines may be used in the two dimensional pipeline, a fetch engine, an integer search engine and a fractional and spatial search engine. While the fetch engine and fraction and spatial search engine operate on one row, the integer search engine operates on another row.
摘要:
This disclosure describes rate control techniques that can improve video coding based on a “two-pass” approach. The first pass codes a video sequence using a first set of quantization parameters (QPs) for the purpose of estimating rate-distortion characteristics of the video sequence based on the statistics of the first pass. A second set of QPs can then be defined for a second coding pass. The estimated rate-distortion characteristics of the first pass are used to select QPs for the second pass in a manner that minimizes distortion of the frames of the video sequence.
摘要:
The disclosure is directed to video processing. The various video processing techniques include generating blocks of information for a frame of video, allocating bits from a bit budget to each of the blocks, the number of bits being allocated to each of the blocks being a function of the information contained therein, and using the bits allocated to each of the blocks to represent the information contained therein.
摘要:
This disclosure describes electronic video image stabilization techniques for imaging and video devices. The techniques involve determining motion and spatial statistics for individual macroblocks of a frame, and determining a global motion vector for the frame based on the statistics of each of the macroblocks. In one embodiment, a method of performing electronic image stabilization includes performing spatial estimation on each of a plurality of macroblocks within a frame of an image to obtain spatial statistics for each of the macroblocks, performing motion estimation on each of the plurality of macroblocks to obtain motion statistics for each of the macroblocks, integrating the spatial statistics and the motion statistics of each of the macroblocks to determine a global motion vector for the frame, and offsetting the image with respect to a reference window according to the global motion vector.
摘要:
This disclosure describes rate control techniques that can improve video coding based on a “two-pass” approach. The first pass codes a video sequence using a first set of quantization parameters (QPs) for the purpose of estimating rate-distortion characteristics of the video sequence based on the statistics of the first pass. A second set of QPs can then be defined for a second coding pass. The estimated rate-distortion characteristics of the first pass are used to select QPs for the second pass in a manner that minimizes distortion of the frames of the video sequence.
摘要:
This disclosure describes rate control techniques that can improve video encoding. In particular, the described rate control techniques exploit relationships between the number of bits encoded per frame and the number of non-zero coefficients of the video blocks after quantization. The number of number of non-zero coefficients of the video blocks after quantization is referred to as rho (ρ). The value of ρ is generally proportional to the number of bits used in the video encoding. This disclosure utilizes a relationship between ρ and a quantization parameter (QP) in order to achieve rate controlled video encoding. More specifically, this disclosure provides techniques for generating a lookup table (LUT) that maps values of ρ to different QPs.
摘要:
A method and system for speech recognition combines different types of engines in order to recognize user-defined digits and control words, predefined digits and control words, and nametags. Speaker-independent engines are combined with speaker-dependent engines. A Hidden Markov Model (HMM) engine is combined with Dynamic Time Warping (DTW) engines.
摘要:
Techniques for estimating distortion due to quantization of data are described. A histogram with multiple bins may be obtained for a set of coefficients to be quantized. Distortion due to quantization of the set of coefficients may be estimated based on the histogram and average distortions for the histogram bins. The number of coefficients in each bin may be multiplied with an average distortion for the bin to obtain a per-bin distortion. The per-bin distortions for all of the bins may be accumulated and scaled with a correction factor to obtain the estimated distortion. The techniques may be used to estimate distortions for a set of coding elements. Distortion and rate may be estimated for each coding element for each of multiple quantization steps. A set of quantization steps may be selected for the set of coding elements based on the estimated distortions and the estimated rates for the set of coding elements for different quantization steps.