Abstract:
Embodiments of the present invention provide a video encoding system that codes a video sequence into a multi-level hierarchy based on levels of long term reference (LTR) frames. According to the present invention, an encoder designates a reference frame as an LTR frame and transmits the LTR frame to a receiver. Upon receiving feedback from the receiver acknowledging receipt of the LTR frame, the encoder periodically codes subsequent frames as reference frames using the acknowledged LTR frame as a reference and designates those subsequent reference frames as secondary LTR frames. A determined number of frames after each secondary LTR frame may be coded using a preceding secondary LTR frame as a reference.
Abstract:
Embodiments of the present invention provide techniques for coding video data efficiently based on detection of objects within video sequences. A video coder may perform object detection on a frame and, when an object is detected, develop statistics of an area of the frame in which the object is located. The video coder may compare pixels adjacent to the object location to the object's statistics and may define an object region to include pixel blocks corresponding to the object's location and pixel blocks corresponding to adjacent pixels having statistics similar to those of the detected object. The coder may code the video frame according to a block-based compression algorithm wherein pixel blocks of the object region are coded according to coding parameters generating relatively high quality coding and pixel blocks outside the object region are coded according to coding parameters generating relatively lower quality coding.
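The region definition described above can be illustrated with a minimal sketch. It is not the claimed method: it assumes the "statistics" are simply per-block mean luma, that similarity is an absolute difference within a hypothetical `tolerance`, and that the region grows iteratively through 4-connected neighbors. The function names and the QP values are illustrative only.

```python
from collections import deque


def grow_object_region(block_means, seed_blocks, tolerance=10.0):
    """Grow an object region from detected blocks to similar adjacent blocks.

    block_means: 2-D list of per-block mean luma (assumed statistic).
    seed_blocks: list of (row, col) blocks covered by the detected object.
    """
    rows, cols = len(block_means), len(block_means[0])
    # Object statistic: mean of the blocks where the object was detected.
    object_mean = sum(block_means[r][c] for r, c in seed_blocks) / len(seed_blocks)
    region = set(seed_blocks)
    queue = deque(seed_blocks)
    while queue:
        r, c = queue.popleft()
        for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
            if 0 <= nr < rows and 0 <= nc < cols and (nr, nc) not in region:
                # Adjacent block joins the region if it resembles the object.
                if abs(block_means[nr][nc] - object_mean) <= tolerance:
                    region.add((nr, nc))
                    queue.append((nr, nc))
    return region


def assign_qp(rows, cols, region, qp_object=24, qp_background=36):
    """Lower quantization parameter (higher quality) inside the object region."""
    return [
        [qp_object if (r, c) in region else qp_background for c in range(cols)]
        for r in range(rows)
    ]


block_means = [
    [10, 10, 10],
    [10, 100, 105],
    [10, 10, 98],
]
region = grow_object_region(block_means, [(1, 1)])
qp_map = assign_qp(3, 3, region)
```

A real coder would likely use richer statistics (for example color histograms or variance) and its own rate control to choose the two quality levels.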
Abstract:
Techniques are provided for adjusting exposure parameters of a camera so that video data captured by the camera may be coded efficiently. A camera with auto exposure control may capture and output frames of video. A pre-processor may estimate brightness of the frames of the video output from the camera. A controller may estimate a rate of brightness change among the frames, and when the rate of change is lower than a predetermined threshold, the controller may reduce sensitivity of the auto exposure control. A coding engine may predictively code the video.
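The control loop described above might be sketched as follows. This is an illustration under stated assumptions, not the described system: brightness is taken as mean luma, the rate of change as the mean absolute frame-to-frame difference, and the hypothetical `threshold` and `damping` values are arbitrary.

```python
def mean_brightness(frame):
    """Estimate frame brightness as the mean of its luma samples."""
    pixels = [p for row in frame for p in row]
    return sum(pixels) / len(pixels)


def brightness_change_rate(brightness_history):
    """Mean absolute brightness difference between consecutive frames."""
    if len(brightness_history) < 2:
        return 0.0
    diffs = [
        abs(curr - prev)
        for prev, curr in zip(brightness_history, brightness_history[1:])
    ]
    return sum(diffs) / len(diffs)


def adjust_sensitivity(current, rate, threshold=2.0, damping=0.5):
    """Reduce auto-exposure sensitivity when brightness changes slowly."""
    if rate < threshold:
        return current * damping
    return current


frames = [
    [[100, 100], [100, 100]],
    [[101, 101], [101, 101]],
    [[100, 101], [100, 101]],
]
history = [mean_brightness(f) for f in frames]
rate = brightness_change_rate(history)
sensitivity = adjust_sensitivity(1.0, rate)
```

Damping the auto exposure during slow drifts keeps successive frames closer in brightness, which tends to help predictive coding.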
Abstract:
Video coders may perform perspective transformation of reference frames during coding in a manner that conserves processing resources. When a new input frame is available for coding, a camera position for the input frame may be estimated. A video coder may search for reference pictures whose camera positions are similar to that of the input frame and, among the reference pictures identified, may perform a prediction search to identify the reference picture that is the best prediction match for the input frame. Once the video coder identifies a reference picture to serve as a prediction source for the input frame, the video coder may derive a transform to match the reference frame data to the input frame data and may transform the reference picture accordingly. The video coder may code the input frame using the transformed reference picture as a prediction reference and may transmit coded frame data and the camera position of the input frame to a decoder. Thus, the video coder may perform derivation and execution of transforms on a limited basis, which conserves system resources.
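The two-stage reference selection described above can be sketched roughly as below. The sketch assumes camera positions are 3-D coordinates compared by Euclidean distance, that prediction quality is measured by sum of absolute differences (SAD) over whole frames, and that `max_distance` is a hypothetical tuning parameter. Deriving and applying the perspective transform is omitted.

```python
import math


def sad(a, b):
    """Sum of absolute differences between two equally sized pixel lists."""
    return sum(abs(x - y) for x, y in zip(a, b))


def select_reference(input_pixels, input_pos, references, max_distance=1.0):
    """Pick the best-matching reference among those with a nearby camera position.

    references: list of dicts with 'id', 'position' (tuple) and 'pixels' (list).
    Returns the chosen reference dict, or None if no camera position is close.
    """
    # Stage 1: keep only references captured from a similar camera position.
    candidates = [
        ref for ref in references
        if math.dist(input_pos, ref["position"]) <= max_distance
    ]
    if not candidates:
        return None
    # Stage 2: prediction search restricted to the surviving candidates.
    return min(candidates, key=lambda ref: sad(input_pixels, ref["pixels"]))


references = [
    {"id": 0, "position": (0.0, 0.0, 0.0), "pixels": [10, 10]},
    {"id": 1, "position": (0.5, 0.0, 0.0), "pixels": [20, 21]},
    {"id": 2, "position": (5.0, 0.0, 0.0), "pixels": [20, 20]},
]
best = select_reference([20, 20], (0.2, 0.0, 0.0), references)
```

Only the single selected picture would then be transformed, which is where the resource saving comes from: the costly transform is never derived for references that were filtered out.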
Abstract:
Embodiments of the present invention provide techniques for efficiently coding/decoding video data during circumstances where no single coding mode is appropriate. A coder may predict content of an input pixel block according to a prediction technique for intra-coding and obtain a first predicted pixel block therefrom. The coder may predict content of the input pixel block according to a prediction technique for inter-coding and obtain a second predicted pixel block therefrom. The coder may average the first and second predicted pixel blocks by weighted averaging. The weight of the first predicted pixel block may be inversely proportional to the weight of the second predicted pixel block. The coder may predictively code the input pixel block based on a third predicted pixel block obtained by the averaging.
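The weighted averaging step can be shown with a short sketch. It assumes the two weights are complementary (they sum to 1), which is one simple way to make one weight fall as the other rises; the abstract itself says only that they are inversely related. The function name and block layout are illustrative.

```python
def blend_predictions(intra_block, inter_block, intra_weight):
    """Weighted average of an intra-predicted and an inter-predicted block.

    Blocks are 2-D lists of non-negative pixel values with the same shape.
    intra_weight is in [0, 1]; the inter weight is assumed to be 1 - intra_weight.
    """
    if not 0.0 <= intra_weight <= 1.0:
        raise ValueError("intra_weight must be in [0, 1]")
    inter_weight = 1.0 - intra_weight
    return [
        # int(v + 0.5) rounds half up for non-negative pixel values.
        [int(intra_weight * a + inter_weight * b + 0.5) for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(intra_block, inter_block)
    ]


intra = [[100, 200], [40, 60]]
inter = [[50, 100], [60, 80]]
blended = blend_predictions(intra, inter, 0.5)
```

The residual that is actually coded would then be the input pixel block minus `blended`.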
Abstract:
A system and method for selecting frames from a video sequence that have high visual appeal and can be coded at high quality when frame rates of coded video drop to such low levels that perceptual sensations of moving video are lost. A metric is derived from a candidate input frame, and the metric is used to determine whether to increase or decrease a weight accorded to the candidate input frame. In an embodiment, the metric may be the auto-exposure data associated with the candidate input frame.
Abstract:
Embodiments of the present invention provide a video encoding system in which a video coding engine establishes coding quality metrics that govern its own operation as well as the operation of a camera and/or a pre-processor. An imaging system may include an image acquisition system, a pre-processor and a coding engine. The coding engine may output a quality indicator identifying, for each portion of a video sequence currently being coded, a relative level of coding quality that is being achieved. The imaging system further may include an image acquisition controller and a pre-processor controller that impose respective operating parameters upon the image acquisition system and the pre-processor in response to these quality indicators. In this manner, overall performance of the imaging system may be improved.
Abstract:
Video coding systems incorporate techniques for deriving scalars W and/or O for use in weighted prediction. W represents a scaling factor and O represents an offset value. Given a frame of input video to be coded, a prediction match may be established with one or more reference frames. The input frame may be parsed into a plurality of regions. Thereafter, the scaling factor W and/or offset value O may be derived by developing a system of equations relating a predicted pixel to the pixel in the input frame by the scaling factor W and/or offset value O. Equations within the system may be prioritized according to priority among regions, and scaling factor W and/or offset value O may be solved for. The scaling factor W and/or offset value O may be used during weighted prediction of the input frame.
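One way to picture the derivation is as a weighted least-squares fit, sketched below. This is an assumption, not the stated method: each equation is modeled as input ≈ W × reference + O, and each region's priority is treated as a numeric weight on its equations. The abstract does not specify the solver or how priorities enter it.

```python
def solve_weighted_prediction(samples):
    """Fit W and O so that input_pixel ~= W * ref_pixel + O.

    samples: iterable of (ref_pixel, input_pixel, priority) tuples, where
    priority is the (assumed) weight of the region the pixel belongs to.
    Returns (W, O) minimizing the priority-weighted squared error.
    """
    sw = sx = sy = sxx = sxy = 0.0
    for x, y, p in samples:
        sw += p
        sx += p * x
        sy += p * y
        sxx += p * x * x
        sxy += p * x * y
    denom = sw * sxx - sx * sx
    if abs(denom) < 1e-12:
        # Flat reference data: scale is unobservable, so fit an offset only.
        return 1.0, ((sy - sx) / sw if sw else 0.0)
    w = (sw * sxy - sx * sy) / denom
    o = (sy - w * sx) / sw
    return w, o


# Pixels generated from input = 0.5 * ref + 10; the middle sample
# belongs to a higher-priority region (weight 2).
samples = [(100, 60, 1.0), (200, 110, 2.0), (50, 35, 1.0)]
w, o = solve_weighted_prediction(samples)
```

With noisy data, raising a region's priority pulls W and O toward values that predict that region well at the expense of lower-priority regions.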