摘要:
A system for decoding a video bitstream includes receiving a frame of the video that includes at least one slice and at least one tile and where each of the at least one slice and the at least one tile are not all aligned with one another.
摘要:
Multi-layered video structures are scaled over a range of perceived quality levels. An estimated Mean Opinion Score (eMOS)-based encoder control loop is utilized to determine one or more encoder key performance index (KPI) associated with a particular perceived quality level. A KPI-based encoder control loop is then utilized to guide generation of a hierarchical structure having quality and/or temporal and/or spatial enhancement layers, without recalculating eMOS for the scalable structure. In addition, eMOS is used to guide the generation of a hierarchical structure at best-perceived quality levels for a given bitrate budget. Rate adaptation may occur by dropping segments, changing hierarchical structure, or changing the KPI target values. With the structure scaled as a function of perceived quality, perceived quality is impacted predictably as the encoding rate is adapted.
摘要:
A system for decoding a video bitstream includes receiving a frame of the video that includes at least one slice and at least one tile and where each of the at least one slice and the at least one tile are not all aligned with one another.
摘要:
There are disclosed various methods, apparatuses and computer program products for video encoding. In some embodiments pictures are encoded into a bitstream. The bitstream comprises at least two scalability layers and pictures being associated with access units. A first indication and a second indication are encoded into the bitstream. The first indication is configured to indicate an output layer. And the second indication is configured to indicate at least one alternative output layer. A first picture of said at least one alternative output layer is output by a decoding process of the bitstream when no picture of the output layer is in an access unit containing said first picture of said at least one alternative output layer.
摘要:
An apparatus configured to code video information includes a memory unit and a processor in communication with the memory unit. The memory unit is configured to store video information associated with a video layer having a picture. The processor is configured to determine whether the picture is a non-picture-order-count (POC)-anchor picture, and based on the determination of whether the picture is a non-POC-anchor picture, perform one of (1) refraining from indicating a POC reset in connection with the picture, or (2) indicating the POC reset in connection with the picture. The processor may encode or decode the video information.
摘要:
A plurality of data streams are obtained; they may be compressed, uncompressed, or a mixture of compressed and uncompressed. Statistical parameters associated with each of the data streams are determined. A plurality of storage constraints are obtained. A plurality of output bit rates are determined for encoding or transcoding, as the case may be, each of the data streams, based on the statistical parameters and the storage constraints. The output bit rates are determined to jointly reduce (and preferably minimize) an overall cost. The overall cost includes the cost associated with storing compressed versions of the data streams. For each of the data streams, the encoding or transcoding into the compressed versions, is carried out in accordance with the output bit rates.
摘要:
A method and apparatus are provided for coding an image or a sequence of images, generating a data stream including data representative of pixel groups, referred to as blocks, in one of the images. The method includes: grouping blocks in a cluster of blocks according to the proximity of their respective values corresponding to at least one block parameter to be coded; determining a value of the parameter, the value being characteristic of said group of blocks; coding blocks of the cluster, where the values of the blocks for the parameter are coded implicitly by inheritance of the characteristic value or are coded as refinements relative to the characteristic value, and coding a data structure associated with the cluster of blocks, the data structure including data associated with the characteristic value.
摘要:
An apparatus and method for encoding video data and an apparatus and method for decoding video data are provided. The encoding method includes: splitting a current picture into at least one maximum coding unit; determining a coded depth to output an encoding result by encoding at least one split region of the at least one maximum coding unit according to operating mode of coding tool, respectively, based on a relationship among a depth of at least one coding unit of the at least one maximum coding unit, a coding tool, and an operating mode, wherein the at least one split region is generated by hierarchically splitting the at least one maximum coding unit according to depths; and outputting a bitstream including encoded video data of the coded depth, information regarding a coded depth of at least one maximum coding unit, information regarding an encoding mode, and information regarding the relationship.
摘要:
A method and interactive system for the on-line transmission of a high-resolution video sequence composed of a succession of T images includes a step of selecting relevant images comprising at least the following steps: split each image to be transmitted at the instant t into a number N of zones, for each zone n determined in the previous step, calculate a value representative of the content of said zone, for each image to be transmitted, generate a vector representative of the content of said image containing the values obtained in the previous step, calculate a normalized coefficient of correlation α between the reference vector determined for a previously selected image and that calculated for the current image, make a decision on the selection (or not) of the current image as a function of the value of the normalized correlation coefficient α.
摘要:
An image decoding method according to the present invention may comprise the steps of: deriving a merge candidate from a candidate block; generating a first merge candidate list including the merge candidate; specifying any one of a plurality of merge candidates included in the first merge candidate list; deriving affine vectors of a current block on the basis of motion information of the specified merge candidate; deriving a motion vector of a sub-block in the current block on the basis of the affine vectors; and performing motion compensation on the sub-block on the basis of the motion vector.