Abstract:
A video streaming method for transitioning between multiple sequences of coded video data may include receiving and decoding transmission units from a first sequence of coded video data. In response to a request to transition to a second sequence of coded video data, the method may determine whether a time to transition to the second sequence of coded video data can be reduced by transitioning to the second sequence of coded video data via an intermediate sequence of coded video data. If the time can be reduced, the method may include receiving at least one transmission unit from an intermediate sequence of coded video data that corresponds to the request to transition, decoding the transmission unit from the intermediate sequence, and transitioning from the first sequence to the second sequence via the decoded transmission unit from the intermediate sequence.
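The decision step described above can be illustrated with a minimal sketch. It assumes, purely for illustration, that a transition can only complete at a sync point (for example, an intra-coded frame) and that the intermediate sequence offers sync points more often than the second sequence. The function names, the list-of-timestamps representation, and the "sync point" model are hypothetical and are not taken from the abstract.

```python
from typing import Optional, Sequence, Tuple


def wait_until_sync(now: float, sync_times: Sequence[float]) -> Optional[float]:
    """Return the delay from `now` to the earliest sync point at or after it.

    Returns None when no sync point remains (hypothetical model).
    """
    upcoming = [t for t in sync_times if t >= now]
    return min(upcoming) - now if upcoming else None


def choose_transition(
    now: float,
    target_sync: Sequence[float],
    intermediate_sync: Sequence[float],
) -> Tuple[str, Optional[float]]:
    """Pick the path with the shorter wait before the transition can start."""
    direct = wait_until_sync(now, target_sync)
    via = wait_until_sync(now, intermediate_sync)
    # Use the intermediate sequence only if it strictly reduces the wait.
    if via is not None and (direct is None or via < direct):
        return ("intermediate", via)
    return ("direct", direct)
```

With a request at time 3, target sync points at 0, 10, and 20, and intermediate sync points every 2 time units, the sketch selects the intermediate path because its wait (1) is shorter than the direct wait (7).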
Abstract:
Video coding techniques are disclosed for resource-limited destination display devices. Input video data may be coded by converting a first representation of the input video to a resolution of a destination display and base layer coding the converted representation. Additionally, a region of interest may be predicted from within the input video. The predicted ROI may be converted to a resolution of the destination display, and the converted ROI may be enhancement layer coded. The base layer coded data and the enhancement layer data may be transmitted to the destination display where the coded base layer data is decoded and displayed until a zoom event occurs. When a zoom event occurs, both the coded base layer data and the coded enhancement layer data may be decoded and displayed. Thus, the switchover from a first field of view to an ROI view may be performed quickly.
Abstract:
Image and video processing techniques are disclosed for processing components of a color space individually by determining limits for each component based on the relationship between each component in a color space. These limits may then be used to clip each component such that the component values are within the determined range for that component. In this manner, more efficient processing of images and/or video may be achieved.
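One way to picture per-component limits that depend on another component is the sketch below. It assumes a normalized, full-range BT.709 YCbCr representation (Y in [0, 1], Cb and Cr in [-0.5, 0.5]) and derives chroma limits from the luma value so that the reconstructed red and blue channels stay within [0, 1]. The choice of color space, the function names, and the decision to ignore the green-channel constraint are assumptions made for brevity; the abstract does not specify them.

```python
from typing import Tuple

# BT.709 conversion factors: R = Y + 1.5748 * Cr, B = Y + 1.8556 * Cb
CR_TO_R = 1.5748
CB_TO_B = 1.8556


def clip(value: float, lo: float, hi: float) -> float:
    """Clamp value to the closed interval [lo, hi]."""
    return max(lo, min(hi, value))


def clip_chroma(y: float, cb: float, cr: float) -> Tuple[float, float, float]:
    """Clip Cb and Cr to limits derived from Y (illustrative only)."""
    y = clip(y, 0.0, 1.0)
    # Limits on Cb so that B = Y + CB_TO_B * Cb stays within [0, 1].
    cb_lo = max(-0.5, (0.0 - y) / CB_TO_B)
    cb_hi = min(0.5, (1.0 - y) / CB_TO_B)
    # Limits on Cr so that R = Y + CR_TO_R * Cr stays within [0, 1].
    cr_lo = max(-0.5, (0.0 - y) / CR_TO_R)
    cr_hi = min(0.5, (1.0 - y) / CR_TO_R)
    return y, clip(cb, cb_lo, cb_hi), clip(cr, cr_lo, cr_hi)
```

At full luma (Y = 1.0), for example, any positive Cb or Cr would push blue or red above 1, so the sketch clips positive chroma values to 0 while leaving negative ones unchanged.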
Abstract:
Video encoders often produce banding artifacts on areas with smooth gradients and low levels of detail/noise. In this disclosure, a video encoding system identifies the banded areas and adjusts coding parameters accordingly. The video coder may include a pre-coding banding detector and a post-coding banding detector. The pre-coding detector may identify regions in the input picture with smooth gradients that are likely to have banding artifacts after encoding. The post-coding detector may identify regions in the reconstructed picture with visible banding. Whether the pre-coding detector, the post-coding detector, or both are used depends on how the encoder operates. In single-pass encoding, or during the first pass of multi-pass encoding, the pre-coding detector maps are used. During picture re-encoding, or during later passes of multi-pass encoding, the post-coding detector maps are used.
Abstract:
A method of adaptive chroma downsampling is presented. The method comprises converting a source image to a converted image in an output color format, applying a plurality of downsample filters to the converted image, estimating a distortion for each filter, and choosing the filter that produces the minimum distortion. The distortion estimation includes applying an upsample filter, and a pixel is output based on the chosen filter. Methods for closed loop conversions are also presented.
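The filter-selection loop can be sketched as follows, under several simplifying assumptions: the input is a single row of chroma samples of even length, the selection is made once per row rather than per pixel, the two candidate downsample filters and the sample-repeating upsample filter are hypothetical stand-ins, and distortion is measured as a sum of squared errors. None of these specifics come from the abstract.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Filter = Callable[[Sequence[float]], List[float]]


def down_drop(row: Sequence[float]) -> List[float]:
    """Hypothetical filter: keep every even-indexed sample."""
    return [float(v) for v in row[0::2]]


def down_average(row: Sequence[float]) -> List[float]:
    """Hypothetical filter: average each adjacent pair of samples."""
    return [(row[i] + row[i + 1]) / 2.0 for i in range(0, len(row) - 1, 2)]


def upsample_repeat(row: Sequence[float]) -> List[float]:
    """Hypothetical upsample filter: repeat each sample twice."""
    out: List[float] = []
    for v in row:
        out.extend([v, v])
    return out


def sse(a: Sequence[float], b: Sequence[float]) -> float:
    """Sum of squared errors between two equal-length rows."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def choose_downsample(
    row: Sequence[float],
    filters: Dict[str, Filter],
    upsample: Filter = upsample_repeat,
) -> Tuple[str, List[float], float]:
    """Try each filter, reconstruct by upsampling, keep the lowest distortion."""
    best: Tuple[str, List[float], float] = ("", [], float("inf"))
    for name, down in filters.items():
        candidate = down(row)
        distortion = sse(row, upsample(candidate))
        if distortion < best[2]:
            best = (name, candidate, distortion)
    return best
```

For the row [10, 20, 30, 40], dropping samples reconstructs to [10, 10, 30, 30] (distortion 200), while pair averaging reconstructs to [15, 15, 35, 35] (distortion 100), so the averaging filter is chosen.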
Abstract:
In a video coding system, a common video sequence is coded multiple times to yield respective instances of coded video data. Each instance may be coded according to a set of coding parameters derived from a target bit rate of a respective tier of service. Each tier may be coded according to a constraint that limits a maximum coding rate of the tier to be less than a target bit rate of another predetermined tier of service. Coding according to the constraint facilitates dynamic switching among tiers by a requesting client device as its processing resources or communication bandwidth change. Improved coding systems to switch among different coding streams may increase the quality of streamed video while minimizing the transmission and storage size of such content.
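The tier constraint can be checked with a short sketch. It assumes that the "other predetermined tier" is the next-higher tier when tiers are ordered by target bit rate; the abstract does not say which tier serves as the reference, so `reference_offset`, the `Tier` class, and its field names are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Tier:
    name: str
    target_kbps: float  # target bit rate of the tier
    peak_kbps: float    # maximum coding rate of the tier


def tiers_violating_constraint(
    tiers: Sequence[Tier], reference_offset: int = 1
) -> List[str]:
    """List tiers whose peak rate is not below the reference tier's target.

    The reference tier is assumed to sit `reference_offset` positions higher
    in target-rate order; tiers without a reference are not checked.
    """
    ordered = sorted(tiers, key=lambda t: t.target_kbps)
    violations: List[str] = []
    for i, tier in enumerate(ordered):
        j = i + reference_offset
        if j < len(ordered) and tier.peak_kbps >= ordered[j].target_kbps:
            violations.append(tier.name)
    return violations
```

For example, with tiers at 500, 1000, and 2000 kbps targets, a 1000 kbps tier that peaks at 2500 kbps would be flagged, because its peak is not below the 2000 kbps target of the next tier.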
Abstract:
A video coding/decoding system codes data efficiently even when input video data exhibits changes in dynamic range. The system may map pixel values of a first frame from a dynamic range specific to the input image data to a second dynamic range that applies universally to a plurality of frames that have different dynamic ranges defined for them. The system may code the mapped pixel values to reduce bandwidth of the mapped frame data, and thereafter transmit the coded image data to a channel.
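The mapping step can be sketched as a simple range conversion. This assumes a linear mapping and a 10-bit common range of 0 to 1023; the abstract does not state the form of the mapping or the target range, so both, along with the function name, are illustrative assumptions.

```python
from typing import List, Sequence


def map_to_common_range(
    pixels: Sequence[float],
    src_min: float,
    src_max: float,
    dst_min: int = 0,
    dst_max: int = 1023,
) -> List[int]:
    """Linearly map pixels from a frame-specific range to a common range."""
    if src_max <= src_min:
        raise ValueError("src_max must be greater than src_min")
    scale = (dst_max - dst_min) / (src_max - src_min)
    mapped = []
    for p in pixels:
        # Clamp to the declared source range before mapping.
        p = max(src_min, min(src_max, p))
        mapped.append(round(dst_min + (p - src_min) * scale))
    return mapped
```

Two frames with different declared ranges, such as 0 to 100 and 100 to 200, end up on the same scale after mapping, so the coder downstream sees consistent values across frames.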
Abstract:
Embodiments of the present invention provide techniques for efficiently coding/decoding video data during circumstances when constraints are imposed on the video data. A frame from a video sequence may be marked as a delayed decoder refresh frame. Frames successive to the delayed decoder refresh frame in coding order may be predictively coded without reference to frames preceding the delayed decoder refresh frame in coding order. The distance between the delayed decoder refresh frame and the successive frames may exceed a distance threshold. Frames successive to a current frame in decoding order may be decoded without reference to frames preceding the current frame in decoding order. The distance between the current frame and the successive frames may exceed a distance threshold.
Abstract:
Systems and methods are disclosed for video compression that use neural networks for predictive video coding. The disclosed processes combine multiple banks of neural networks with codec system components to carry out the coding and decoding of video data.
Abstract:
Techniques are disclosed by which a coding parameter is determined to encode video data resulting in encoded video data possessing a highest possible video quality. Features may be extracted from an input video sequence. The extracted features may be compared to features described in a model of coding parameters generated by a machine learning algorithm from reviews of previously-coded videos, extracted features of the previously-coded videos, and coding parameters of the previously-coded videos. When a match is detected between the extracted features of the input video sequence and extracted features represented in the model, a determination may be made as to whether coding parameters that correspond to the matching extracted feature correspond to a tier of service to which the input video sequence is to be coded. When the coding parameters that correspond to the matching extracted feature correspond to the tier of service to which the input video sequence is to be coded, the input video sequence may be coded according to the coding parameters.