Abstract:
A method and apparatus for scalable video coding are disclosed, wherein the video data is configured into a Base Layer (BL) and an Enhancement Layer (EL), and wherein the EL has higher spatial resolution or better video quality than the BL. According to embodiments of the present invention, information from the base layer is exploited for coding the enhancement layer. The enhancement-layer coding information derived in this way includes CU structure, motion information, MVP/merge candidates, intra prediction mode, residual quadtree information, texture information, residual information, context adaptive entropy coding, Adaptive Loop Filter (ALF), Sample Adaptive Offset (SAO), and deblocking filter.
Abstract:
The invention is related to prediction of a lost or damaged block of an enhanced spatial layer frame. A method for predicting a lost or damaged block of an enhanced spatial layer frame (E5) comprises the steps of determining an adjuvant frame (E3) in the enhanced spatial layer (EL) with the help of reference information from a lower spatial layer frame (B5) corresponding to said enhanced spatial layer frame (E5), generating an information reduced block with the help of said adjuvant frame (E3), and predicting the lost or damaged block with the help of the information reduced block. The reference information of the corresponding lower spatial layer frame can be decoded independently from any lower spatial layer reference frame, and the adjuvant enhanced spatial layer frame is already decoded. Thus, the computational effort is reduced. Generating the information reduced block makes the prediction smoother, and therefore less vivid and less salient to a user.
Abstract:
A video decoder, a video decoding method, a video encoder and a video encoding method are disclosed. A video decoder for decoding a video bitstream for an image block includes a motion vector resolution reducer and a motion compensator. The motion vector resolution reducer is for receiving decoded high resolution motion vectors included in the video bitstream and for reducing an accuracy of the high resolution motion vectors to correspond to a low resolution. The motion compensator, in signal communication with the motion vector resolution reducer, is for forming a motion compensated high resolution prediction using the reduced accuracy motion vectors. The video encoder for encoding scalable video comprises a motion compensator for forming a motion compensated full resolution prediction and combining the motion compensated full resolution prediction with an image block to form a prediction residual. The prediction residual is downsampled to form a low resolution downsampled prediction residual and then coded.
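One possible reading of the motion vector resolution reducer is sketched below. The abstract does not say how the accuracy is reduced, so the rounding rule, the integer sub-pel representation of the vectors, and the names `reduce_mv_accuracy` and `drop_bits` are all assumptions made for illustration.

```python
def reduce_mv_accuracy(mv, drop_bits=1):
    """Round each MV component to a multiple of 2**drop_bits.

    Assumption: MVs are integers in sub-pel units (e.g. quarter-pel),
    and "reducing accuracy" means discarding the finest fractional bits.
    The vector keeps its high resolution scale; only its precision drops.
    """
    step = 1 << drop_bits

    def quantize(component):
        # Add half a step, then clear the low bits (rounds half toward +inf).
        return ((component + (step >> 1)) >> drop_bits) << drop_bits

    return (quantize(mv[0]), quantize(mv[1]))


# A quarter-pel vector reduced to half-pel accuracy.
reduced = reduce_mv_accuracy((5, -5), drop_bits=1)
print(reduced)
```

The reduced vector stays in the same units as the input, so a motion compensator working at high resolution could use it directly.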
Abstract:
A method of inter-layer motion vector scaling using an inter-layer MV scaling factor to reduce computational complexity is disclosed. In one embodiment, image size information regarding the EL picture and the BL picture of the video sequence is received. An inter-layer motion vector (MV) scaling factor is determined based on the image size information. Scaled MVs are determined based on the BL MVs and the inter-layer MV scaling factor. The scaled MVs are then provided for inter-layer coding of the EL picture. In another embodiment, an inter-layer position-mapping scaling factor is determined based on the image size information regarding the EL picture and the BL picture. BL mapping positions corresponding to EL pixel positions are determined based on the EL pixel positions and the inter-layer position-mapping scaling factor. The up-sampled BL picture at the BL mapping positions is then provided for inter-layer coding of the EL picture.
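The first embodiment can be illustrated with a minimal sketch: the scaling factor is computed once from the picture sizes, then reused for every BL motion vector so that no per-vector division is needed. The 8-bit fixed-point precision, the rounding rule, and the function names are assumptions, not details taken from the abstract.

```python
SHIFT = 8  # assumed fixed-point precision of the scaling factor


def mv_scale_factor(el_size, bl_size):
    """Fixed-point ratio el_size / bl_size, rounded to nearest."""
    return ((el_size << SHIFT) + bl_size // 2) // bl_size


def scale_component(mv, factor):
    """Scale one MV component, rounding the magnitude and keeping the sign."""
    product = mv * factor
    rounding = 1 << (SHIFT - 1)
    if product >= 0:
        return (product + rounding) >> SHIFT
    return -((-product + rounding) >> SHIFT)


def scale_bl_mv(bl_mv, el_dims, bl_dims):
    """Scale a BL motion vector (x, y) to the EL picture size (width, height)."""
    factor_x = mv_scale_factor(el_dims[0], bl_dims[0])
    factor_y = mv_scale_factor(el_dims[1], bl_dims[1])
    return (scale_component(bl_mv[0], factor_x),
            scale_component(bl_mv[1], factor_y))


# 2x spatial scalability: 960x540 base layer, 1920x1080 enhancement layer.
print(scale_bl_mv((3, -3), (1920, 1080), (960, 540)))
```

In a real codec the two factors would be computed once per picture pair and cached; they are recomputed per call here only to keep the sketch short.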
Abstract:
Methods, apparatuses and computer program products for video coding and decoding are provided. A first part of a first coded video representation is decoded, and information on decoding a second coded video representation is received and parsed. The second coded video representation differs from the first coded video representation in chroma format, sample bit depth, color gamut and/or spatial resolution, and the information indicates if the second coded video representation may be decoded using processed decoded pictures of the first coded video representation as reference pictures. If the information indicates that the second coded video representation may be decoded using processed decoded pictures of the first coded video representation as a prediction reference, decoded picture(s) of the first part is/are processed into processed decoded picture(s) by resampling and/or sample value scaling, and a second part of the second coded video representation is decoded using said processed decoded picture(s) as reference pictures.
Abstract:
Approaches to robust encoding and decoding of escape-coded pixels in a palette mode are described. For example, sample values of escape-coded pixels in palette mode are encoded/decoded using a binarization process that depends on a constant value of quantization parameter (“QP”) for the sample values. Or, as another example, sample values of escape-coded pixels in palette mode are encoded/decoded using a binarization process that depends on sample depth for the sample values. Or, as still another example, sample values of escape-coded pixels in palette mode are encoded/decoded using a binarization process that depends on some other fixed rule. In example implementations, these approaches avoid dependencies on unit-level QP values when parsing the sample values of escape-coded pixels, which can make encoding/decoding more robust to data loss.
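The sample-depth variant can be illustrated with a small sketch in which the code length is fixed by the bit depth alone, so a parser never needs a unit-level QP to know how many bits to read. The fixed-length code and the function names are assumptions; the abstract only says that the binarization depends on sample depth.

```python
def binarize_escape(value, bit_depth):
    """Write an escape-coded sample as a fixed-length binary string.

    Assumption: a plain fixed-length code of bit_depth bits.
    """
    if not 0 <= value < (1 << bit_depth):
        raise ValueError("sample value out of range for the given bit depth")
    return format(value, "0{}b".format(bit_depth))


def parse_escape(bits, bit_depth):
    """Read one sample; the number of bits consumed depends only on bit_depth."""
    return int(bits[:bit_depth], 2), bits[bit_depth:]


# Two 8-bit samples written back to back, then parsed without any QP.
stream = binarize_escape(5, 8) + binarize_escape(200, 8)
first, rest = parse_escape(stream, 8)
second, rest = parse_escape(rest, 8)
print(first, second)
```

Because the parse length is known up front, losing the data that carries a unit-level QP would not desynchronize the reading of these samples in this sketch.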
Abstract:
Ways to mitigate loss in inter-operability scenarios for digital video are presented. For example, a bitstream modification tool (such as a bitstream rewriter running on a network node of a videoconferencing system) receives an incoming bitstream of encoded video (e.g., from an encoder that uses a first loss recovery strategy). The bitstream modification tool processes the incoming bitstream of encoded video to produce an outgoing bitstream of encoded video. In doing so, the bitstream modification tool changes at least one syntax element between the incoming bitstream and the outgoing bitstream so as to mitigate picture loss effects during decoding of the outgoing bitstream under lossy delivery conditions. The bitstream modification tool outputs the outgoing bitstream. In this way, the bitstream modification tool can help avoid blank screens, frozen screens, or other failures during decoding under lossy delivery conditions (e.g., with a decoder that uses a different loss recovery strategy).
Abstract:
3D video coding/decoding supporting inter-component prediction is made operable over a broader range of settings regarding the accompaniment of individual views with depth and the activation/deactivation of inter-component prediction, by using one or more flags in the data stream to control whether inter-component prediction is available for coding units of a current picture.
Abstract:
A method for processing image and video data comprises providing high dynamic range (HDR) image data and corresponding low dynamic range (LDR) image data. The LDR image is encoded in a legacy code stream, while a residual code stream is generated and encoded containing the data necessary for reconstructing the HDR image in a decoding phase. To generate the residual code stream, the LDR image data are processed to obtain a processed LDR* image; a component of the HDR image and the corresponding component of the processed LDR* image are then selected, and residual data for the selected component are calculated by dividing the component of the processed LDR* image data by the component of the HDR image data. The residual data of the selected component are scaled into the range of values between 0 and 1 and are then encoded in a residual code stream. A decoding method is also disclosed which allows reconstruction of an HDR image from a legacy code stream and a residual data code stream.
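The residual computation can be sketched for a single component as follows. The abstract states the division and the scaling into the range 0 to 1 but not how the scaling is done, so the min-max normalization, the side information `(lo, hi)`, the `eps` guard against division by zero, and the function names are assumptions.

```python
def encode_residual(ldr_star, hdr, eps=1e-6):
    """Residual = LDR* / HDR per sample, then scaled into [0, 1].

    Assumption: min-max scaling, with (lo, hi) kept as side information.
    """
    ratios = [l / max(h, eps) for l, h in zip(ldr_star, hdr)]
    lo, hi = min(ratios), max(ratios)
    span = (hi - lo) or 1.0  # avoid dividing by zero for a flat residual
    scaled = [(r - lo) / span for r in ratios]
    return scaled, lo, hi


def decode_hdr(ldr_star, scaled, lo, hi, eps=1e-6):
    """Undo the scaling, then recover HDR = LDR* / residual."""
    span = (hi - lo) or 1.0
    ratios = [lo + s * span for s in scaled]
    return [l / max(r, eps) for l, r in zip(ldr_star, ratios)]


# One component of a three-sample image (hypothetical values).
ldr_star = [0.5, 0.8, 1.0]
hdr = [1.0, 4.0, 10.0]
scaled, lo, hi = encode_residual(ldr_star, hdr)
print(scaled)
print(decode_hdr(ldr_star, scaled, lo, hi))
```

The sketch omits the actual encoding of the scaled residual, so the round trip shown here is exact up to floating-point error.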