Abstract:
The present disclosure describes techniques for coding video data in a manner that provides consistency to portions of the video that have similar content. According to such techniques, a video sequence may be parsed into partitions and content of the partitions may be analyzed. Partitions may be grouped together based on detected similarities in content. Coding parameters may be selected for each partition based on the partition's membership in the groups. Thus, when the video sequence is coded, coding parameters for frames of two commonly-grouped partitions may be similar, which causes coded video data to have similar presentation.
Abstract:
A system comprises an encoder configured to compress attribute information for a point cloud and/or a decoder configured to decompress compressed attribute information for the point cloud. Attribute values for at least one starting point are included in a compressed attribute information file and attribute correction values used to correct predicted attribute values are included in the compressed attribute information file. Attribute values are predicted based, at least in part, on attribute values of neighboring points and distances between a particular point for whom an attribute value is being predicted and the neighboring points. The predicted attribute values are compared to attribute values of a point cloud prior to compression to determine attribute correction values. A decoder follows a similar prediction process as an encoder and corrects predicted values using attribute correction values included in a compressed attribute information file.
Abstract:
Methods of in-loop deblocking filter for high dynamic range (HDR) video compression are disclosed. HDR processing and standard dynamic range (SDR) processing adopt different electro-optical transfer function (EOTF) to convert digital code words to linear luminance. For compressing HDR video, EOTF is proposed to be involved in the selection of two deblocking parameters, β and tC, which control the strength of deblocking filter. In local activity checking for filter decisions, the calculated local signal characteristics and the thresholds are adjusted according to EOTF. After deblocking filter, the clipping range is modified based on EOTF. The chroma deblocking filter is also extended to inter-inter block boundary.
Abstract:
Methods of sample adaptive offset (SAO) for high dynamic range (HDR) video compression are disclosed. For band offset (BO), the sample range of each component is unequally divided into a number of bands based on the distribution of samples, the electro-optical transfer function (EOTF) of HDR source and one perceptually uniform EOTF. Each component may have a different number of bands, different number of signaled bands, and different range value for signaled offsets in BO mode. For edge offset (EO), Cb and Cr may use separate syntax elements. In the sample classification for EO, the difference between the current sample and its neighbor is compared with a threshold to determine whether it is a local valley, a local peak, or corners. A distortion measure for HDR source images are also disclosed.
Abstract:
Techniques are disclosed by which a coding parameter is determined to encode video data resulting in encoded video data possessing a highest possible video quality. Features may be extracted from an input video sequence. The extracted features may be compared to features described in a model of coding parameters generated by a machine learning algorithm from reviews of previously-coded videos, extracted features of the previously-coded videos, and coding parameters of the previously-coded videos. When a match is detected between the extracted features of the input video sequence and extracted features represented in the model, a determination may be made as to whether coding parameters that correspond to the matching extracted feature correspond to a tier of service to which the input video sequence is to be coded. When the coding parameters that correspond to the matching extracted feature correspond to the tier of service to which the input video sequence is to be coded, the input video sequence may be coded according to the coding parameters.
Abstract:
Embodiments of the present disclosure provide techniques for environment aware supervised HDR tone mapping. According to those techniques, responsive to metadata provided with HDR source video data, the received metadata may be compared to sensor date representing viewing conditions at a display device. Tone mapping corrections to the HDR source video data may be derived from the comparison of the received metadata to the sensor data. The HDR source video data may be altered based on the tone mapping corrections. The altered HDR source video data may be used to drive the display device.
Abstract:
Methods and systems provide video compression to reduce a “popping” effect caused by differences in quality between a refresh frame (e.g., I-frame or IDR-frame) and neighboring frame(s). In an embodiment, a quality of a pre-definable number of frames preceding a refresh frame (“preceding frames”) may be increased. A quantization parameter may be decreased for a region within at least one of the preceding frame(s). In an embodiment, preceding frames may be coded according to an open group of pictures (“GOP”) structure. In an embodiment, a refresh frame may be first coded as a P-frame, then coded based on the coded P-frame. In an embodiment, preceding frames may be first coded based on the refresh frame, then coded based on the coded frames without referencing the refresh frame. Each of these methods may increase consistency of quality in a sequence of frames, and, correspondingly, minimize or remove I-frame popping.
Abstract:
A method and system for caching and streaming media content, including predictively delivering and/or acquiring content is provided. In the system, client devices may be communicatively coupled in a network, and may access and share cached content. Video segments making up a media stream may be selectively delivered to the clients such that a complete media stream may be formed from the different segments delivered to the different clients. Video segments may be pushed by the server to the client or requested by the client according to a prioritization scheme, including downloading: partial items on a client's subscription log, lower quality version(s) of content before higher quality version(s), higher bitrate segments before lower bitrate segments, summaries of full-length content, advertisements and splash screens common to multiple video clips.
Abstract:
A system and method is presented to mask artifacts with content-adaptive comfort noise. Encoder side analysis may determine initial comfort noise characteristics. Noise parameters may then be developed for each frame or sequence of frames that define comfort noise patches that mask the artifacts. At the decoder, a comfort noise patch can be fetched from memory or created based on the amplitude and spatial characteristics of the comfort noise specified in the noise parameters. The noise patch may additionally be scaled or otherwise adjusted to accommodate the capabilities and/or limitations of the specific decoder.
Abstract:
A system comprises an encoder configured to compress attribute information and/or spatial information for a point cloud and/or a decoder configured to decompress compressed attribute and/or spatial information for the point cloud. The encoder is configured to convert a point cloud into an image based representation. The encoder packs patch images into an image frame and fills empty spaces in the image frame with a padding. The encoder is also configured to determine quantized minimum depths and/or maximum depths patch images in the image frames, wherein depth information is signaled relative to the quantized minimum depth.