摘要:
A video coding/decoding system builds implied reference frames from a plurality of reference frames developed during coding. Coded data of reference pictures are decoded and stored in a reference picture cache. An implied reference frame may be derived from a plurality of reference frames and may be stored in the reference picture cache. Thereafter, coding of new input data may proceed using the implied reference frame as a source of prediction. The method may be used to identify visual elements such as background elements that may persist in video over a longer period that can be captured by the system under conventional reference frames assembly and eviction. Implied reference frames may be built at both an encoder and a decoder to serve as sources of prediction.
摘要:
A video coding/decoding system builds implied reference frames from a plurality of reference frames developed during coding. Coded data of reference pictures are decoded and stored in a reference picture cache. An implied reference frame may be derived from a plurality of reference frames and may be stored in the reference picture cache. Thereafter, coding of new input data may proceed using the implied reference frame as a source of prediction. The method may be used to identify visual elements such as background elements that may persist in video over a longer period that can be captured by the system under conventional reference frames assembly and eviction. Implied reference frames may be built at both an encoder and a decoder to serve as sources of prediction.
摘要:
Video coders may perform perspective transformation of reference frames during coding in a manner that conserves processing resources. When a new input frame is available for coding, a camera position for the input frame may be estimated. A video coder may search for reference pictures having similar camera positions as the position of the input frame and, for each reference picture identified, the video coder may perform a prediction search to identify a reference picture that is the best prediction match for the input frame. Once the video coder identifies a reference picture to serve as a prediction source for the input frame, the video coder may derive a transform to match the reference frame data to the input frame data and may transform the reference picture accordingly. The video coder may code the input frame using the transformed reference picture as a prediction reference and may transmit coded frame data and the camera position of the input frame to a decoder. Thus, the video coder may perform derivation and execution of transforms on a limited basis which conserves system resources.
摘要:
Embodiments of the present invention provide techniques for coding video data efficiently based on detection of objects within video sequences. A video coder may perform object detection on the frame and when an object is detected, develop statistics of an area of the frame in which the object is located. The video coder may compare pixels adjacent to the object location to the object's statistics and may define an object region to include pixel blocks corresponding to the object's location and pixel blocks corresponding to adjacent pixels having similar statistics as the detected object. The coder may code the video frame according to a block-based compression algorithm wherein pixel blocks of the object region are coded according to coding parameters generating relatively high quality coding and pixel blocks outside the object region are coded according to coding parameters generating relatively lower quality coding.
摘要:
Embodiments of the present invention provide techniques for coding video data efficiently based on detection of objects within video sequences. A video coder may perform object detection on the frame and when an object is detected, develop statistics of an area of the frame in which the object is located. The video coder may compare pixels adjacent to the object location to the object's statistics and may define an object region to include pixel blocks corresponding to the object's location and pixel blocks corresponding to adjacent pixels having similar statistics as the detected object. The coder may code the video frame according to a block-based compression algorithm wherein pixel blocks of the object region are coded according to coding parameters generating relatively high quality coding and pixel blocks outside the object region are coded according to coding parameters generating relatively lower quality coding.
摘要:
Embodiments of the present invention provide a video encoding system in which a video coding engine establishes coding quality metrics that govern its own operation as well as the operation of a camera and/or a pre-processor. An imaging system may include an image acquisition system, a pre-processor and a coding engine. The coding engine may output a quality indicator identifying, for each portion of a video sequence currently being coded, a relatively level of coding quality that is being achieved. The imaging system further may include an image acquisition controller and a pre-processor controller that impose respective operating parameters upon the image acquisition system and the pre-processor in response to these quality indicators. In this manner, overall performance of the imaging system may be improved.
摘要:
A system and method for coding video data wherein a pixel block may be coded for refresh with reference to an LTR frame that was successfully transmitted, or has a high probability of having been successfully transmitted from the encoder to the decoder. Not all pixel blocks in the frame may be refreshed at the same rate. Pixel blocks containing edge details, containing a significant object, or containing foreground image data may be refreshed more often than pixel blocks containing smooth, background, or relatively less significant image data.
摘要:
A coding technique is disclosed in which frames of a video sequence are assigned to one of a plurality of sub-channels to be transmitted to a decoder. The frames are coded according to predictive coding techniques such that ordinarily prediction references of the frames in each sub-channel only reach the reference frames that occur within the same sub-channel. Thus, if transmission errors arise with respect to one sub-channel, decoding may occur for another sub-channel until the transmission error is detected and corrected.
摘要:
A coding technique is disclosed in which frames of a video sequence are assigned to one of a plurality of sub-channels to be transmitted to a decoder. The frames are coded according to predictive coding techniques such that ordinarily prediction references of the frames in each sub-channel only reach the reference frames that occur within the same sub-channel. Thus, if transmission errors arise with respect to one sub-channel, decoding may occur for another sub-channel until the transmission error is detected and corrected.
摘要:
Some embodiments provide a method for allowing a first device that is in a video conference with a second mobile device to remotely control the second mobile device. The method sends images captured by a camera of the first device to the second device. The method receives images captured by a camera of the second device. The method sends a command through a communication channel of a real-time communication session to the second device. The command is for instructing the second device to perform an operation that modifies the images captured by the camera of the second device.