Abstract:
Stereoscopic video data and corresponding depth map data for multi-view auto-stereoscopic displays are coded using a multiplexed asymmetric image frame that combines an image data partition and a depth map data partition, wherein the size of the image data partition is different from the size of the depth map data partition. The image data partition comprises one or more of the input views, while the depth map partition comprises at least a portion of the depth map data rotated with respect to the orientation of the image data in the multiplexed output image frame.
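The packing described above could be sketched as follows. This is a minimal illustration only: the side-by-side layout, the 90-degree clockwise rotation, and the names `rotate_cw` and `multiplex` are assumptions, not details taken from the abstract.

```python
def rotate_cw(block):
    """Rotate a 2D list of samples by 90 degrees clockwise."""
    return [list(row) for row in zip(*block[::-1])]


def multiplex(image, depth):
    """Pack `image` on the left and the rotated `depth` map on the right.

    Hypothetical layout: after rotation the depth partition must have as
    many rows as the image partition, but it may be narrower (asymmetric).
    """
    rotated = rotate_cw(depth)
    if len(rotated) != len(image):
        raise ValueError("rotated depth map height must match image height")
    return [img_row + d_row for img_row, d_row in zip(image, rotated)]


image = [[1, 2, 3, 4],
         [5, 6, 7, 8]]          # image partition: 2 rows x 4 columns
depth = [[9, 8]]                # depth map: 1 row x 2 columns
frame = multiplex(image, depth)  # depth partition becomes 2 rows x 1 column
```

A decoder would undo the packing by splitting each row at the image width and rotating the depth partition back.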
Abstract:
In one implementation, an apparatus is provided for encoding or decoding video information. The apparatus comprises a memory unit configured to store reference layer pictures associated with a reference layer, an enhancement layer, or both. The apparatus further comprises a processor operationally coupled to the memory unit. In one embodiment, the processor is configured to restrict usage, as an inter-layer reference picture, to at most one reference layer picture that has a different spatial resolution than a current picture, and to predict the current picture using inter-layer prediction and the inter-layer reference picture.
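One way to picture the restriction above is a filter over candidate reference layer pictures. The sketch below is hypothetical: the `Picture` record, the function name, and the policy of keeping the first differently sized candidate are all assumptions, since the abstract only states that at most one such picture may be used.

```python
from collections import namedtuple

# Hypothetical picture record; only the fields needed for the check.
Picture = namedtuple("Picture", ["layer_id", "width", "height"])


def select_inter_layer_refs(current, candidates):
    """Keep same-resolution candidates plus at most one that differs in size."""
    selected = []
    resampled_used = False
    for pic in candidates:
        same_size = (pic.width, pic.height) == (current.width, current.height)
        if same_size:
            selected.append(pic)
        elif not resampled_used:
            # Assumption: the first differently sized candidate wins.
            selected.append(pic)
            resampled_used = True
    return selected


current = Picture(layer_id=2, width=1920, height=1080)
candidates = [
    Picture(0, 960, 540),
    Picture(1, 1920, 1080),
    Picture(0, 1280, 720),
]
refs = select_inter_layer_refs(current, candidates)
```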
Abstract:
An apparatus configured to code video information comprises a memory unit and a processor in communication with the memory unit. The memory unit is configured to store video information associated with a reference layer (RL) and an enhancement layer (EL). The EL comprises an EL video unit and the RL comprises an RL video unit corresponding to the EL video unit. The processor is configured to perform upsampling and bit-depth conversion on pixel information of the RL video unit in a single combined process to determine predicted pixel information of the EL video unit, and determine the EL video unit using the predicted pixel information.
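The idea of doing upsampling and bit-depth conversion in one step can be illustrated with a small fixed-point sketch. It assumes 2x upsampling of a single row with a two-tap linear filter whose taps sum to 64, and 8-bit to 10-bit conversion; none of these choices come from the abstract. The point is that one right shift covers both the filter normalization and the bit-depth change.

```python
def upsample_and_convert(row, rl_bits=8, el_bits=10):
    """2x upsample a row and raise its bit depth with a single shift.

    Assumption: filter taps sum to 64 (6 fractional bits). A separate
    process would shift right by 6 and then left by (el_bits - rl_bits);
    here both are folded into one right shift.
    """
    filter_bits = 6
    shift = filter_bits - (el_bits - rl_bits)
    if shift < 1:
        raise ValueError("bit-depth increase too large for this sketch")
    offset = 1 << (shift - 1)          # rounding offset
    max_val = (1 << el_bits) - 1
    out = []
    for i, a in enumerate(row):
        b = row[min(i + 1, len(row) - 1)]      # repeat the last sample at the edge
        for t0, t1 in ((64, 0), (32, 32)):     # integer and half-sample phases
            v = (a * t0 + b * t1 + offset) >> shift
            out.append(min(max(v, 0), max_val))
    return out


predicted = upsample_and_convert([0, 128, 255])
```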
Abstract:
In one implementation, an apparatus is provided for encoding or decoding video information. The apparatus comprises a memory unit configured to store video information associated with a base layer and/or an enhancement layer. The apparatus further comprises a processor operationally coupled to the memory unit. In one embodiment, the processor is configured to determine a scaling factor based on spatial dimension values associated with the base and enhancement layers such that the scaling factor is constrained within a predetermined range. The processor is also configured to spatially scale an element associated with the base layer or enhancement layer using the scaling factor and a temporal motion vector scaling process.
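A constrained scaling factor of this kind might look like the sketch below. The fixed-point form with 8 fractional bits and the clipping ranges are modeled loosely on HEVC-style temporal motion vector scaling and are assumptions here; the abstract only says the factor is constrained within a predetermined range.

```python
def clip3(lo, hi, x):
    """Clamp x to the inclusive range [lo, hi]."""
    return max(lo, min(hi, x))


def scale_factor(bl_size, el_size, lo=-4096, hi=4095):
    """Fixed-point ratio el_size / bl_size with 8 fractional bits, clipped.

    The range [lo, hi] is an assumed 'predetermined range'.
    """
    return clip3(lo, hi, ((el_size << 8) + (bl_size >> 1)) // bl_size)


def scale_mv(mv, factor):
    """Scale one motion vector component by a fixed-point factor."""
    prod = factor * mv
    sign = -1 if prod < 0 else 1
    return clip3(-32768, 32767, sign * ((abs(prod) + 127) >> 8))


factor = scale_factor(bl_size=960, el_size=1920)   # 2x spatial ratio
scaled = scale_mv(-7, factor)
```

Keeping the factor in a bounded range means an existing temporal scaling routine with fixed operand widths can be reused for spatial scaling without overflow.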
Abstract:
The present disclosure generally relates to a method and device for encoding a frame. The device comprises a processor configured for: - encoding (12) a backlight frame determined (11) from the frame to be encoded; - obtaining (13) at least one component of a residual frame by dividing each component of the frame by a decoded version of the backlight frame; and - encoding (19) the residual frame. The method is characterized in that the processor is further configured so that the backlight frame is represented by a weighted linear combination of at least one set of 2D shape functions, the 2D shape functions of each set forming a partition of unity of a domain defined by the union of their supports. The disclosure also relates to a method and device for decoding a bitstream representing a residual frame calculated by dividing a frame by a backlight frame.
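The backlight model above can be illustrated in one dimension. The sketch assumes triangular "hat" shape functions on a regular grid, which form a partition of unity between the first and last centers, and it skips the lossy coding of the backlight, so the "decoded" backlight equals the original. The function names, weights, and sample values are made up for illustration.

```python
def hat(x, center, width):
    """Triangular shape function: 1 at `center`, 0 beyond `width` away."""
    return max(0.0, 1.0 - abs(x - center) / width)


def backlight(x, weights, spacing):
    """Weighted linear combination of hat functions centered at k * spacing."""
    return sum(w * hat(x, k * spacing, spacing) for k, w in enumerate(weights))


weights = [100.0, 200.0, 50.0]   # hypothetical backlight coefficients
spacing = 4                      # centers at x = 0, 4, 8
frame = [90.0, 120.0, 150.0, 160.0, 180.0, 100.0, 75.0, 60.0, 40.0]

# Partition of unity: the shape functions sum to 1 at every sample.
unity = [sum(hat(x, k * spacing, spacing) for k in range(len(weights)))
         for x in range(len(frame))]

ba = [backlight(x, weights, spacing) for x in range(len(frame))]
residual = [f / b for f, b in zip(frame, ba)]      # encoder side: divide
restored = [r * b for r, b in zip(residual, ba)]   # decoder side: multiply
```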
Abstract:
Scalable coding concepts are described. One aspect improves parallel decoding of inter-dependent layers of a multi-layer video data stream by introducing a long-term syntax element structure for guaranteeing that during a predetermined time period the pictures of the dependent layer are subdivided so that borders of the spatial segments of the pictures of the second layer and the spatial segments of the first layer overlay each other. Another aspect concerns upsampling from base layer to enhancement layer. Another aspect introduces a long-term syntax element structure allowing the decoder to determine the inter-layer offset for a predetermined time period. Another aspect introduces a type indicator field that changes the way a layer indicator field within the NAL unit headers is to be interpreted. Another aspect allows different codecs/standards to be used for the different layers. Another aspect concerns a syntax element structure which indicates the inter-layer offset in units of the base layer blocks.
Abstract:
A frame sequence of moving picture data is divided into a tile image sequence 250, and the color space of the tile image sequence 250 is converted to generate a YCbCr image sequence 252 (S10). Each frame is reduced to 1/2 size in the vertical and horizontal directions (S12), and a compression process is carried out to generate compression data 260 of a reference image (S14). The compression data 260 of the reference image is decoded and decompressed in the same way as at image display to restore a YCbCr image as the reference image, and a difference image sequence 262 is generated from the reference image and the original YCbCr image 252 (S16). Then, compression data 266 of a difference image is generated (S18), and compression data 268 obtained by connecting the compression data 260 of the reference image and the compression data 266 of the difference image is generated for every four frames of a tile image (S20).
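The reference-plus-difference split in steps S12 to S16 can be sketched on a single plane. The example assumes 2x2 averaging for the reduction and nearest-neighbour enlargement for the restored reference, and it leaves out the actual compression and decompression of both images; those choices and all function names are assumptions.

```python
def downsample2(img):
    """Reduce a plane to 1/2 size in both directions by 2x2 averaging."""
    return [[(img[y][x] + img[y][x + 1] + img[y + 1][x] + img[y + 1][x + 1]) // 4
             for x in range(0, len(img[0]), 2)]
            for y in range(0, len(img), 2)]


def upsample2(img):
    """Enlarge a plane 2x in both directions by sample repetition."""
    out = []
    for row in img:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def split(img):
    """Return (reference image, difference image) for an even-sized plane."""
    ref = downsample2(img)
    pred = upsample2(ref)   # stands in for the decoded reference image
    diff = [[o - p for o, p in zip(orow, prow)] for orow, prow in zip(img, pred)]
    return ref, diff


def reconstruct(ref, diff):
    """Display side: enlarge the reference and add the difference image."""
    pred = upsample2(ref)
    return [[p + d for p, d in zip(prow, drow)] for prow, drow in zip(pred, diff)]


plane = [[10, 12, 20, 22],
         [14, 16, 24, 26]]
ref, diff = split(plane)
```

Computing the difference against the decoded reference, as the abstract describes, keeps the encoder and the display side working from the same prediction.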
Abstract:
An objective of the present invention is to make a video image frame into a hierarchical structure represented with a plurality of resolutions. With a zeroth layer (30), a first layer (32), a second layer (34), and a third layer (36) in order of increasing resolution, layer data which denotes a frame at a time (t1) treats the zeroth layer (30) and the second layer (34) as base image layers, and the first layer (32) and the third layer (36) as difference image layers. In such a circumstance, when a region (124a) is displayed at the resolution of the third layer (36), an image of a corresponding region (126a) which the second layer (34) retains is enlarged to the resolution of the third layer (36), and each pixel value is added to the difference image of the region (124a) which the third layer (36) retains. The layer which is treated as the difference image layer is switched with the passage of time (t2, t3, ...).
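Displaying a region held by a difference image layer could be sketched as below. The 2x resolution ratio between adjacent layers, the nearest-neighbour enlargement, and the name `display_region` are assumptions; the abstract does not fix any of them.

```python
def display_region(base_layer, diff_layer, x0, y0, w, h, scale=2):
    """Rebuild a region of a difference layer from the base layer below it.

    (x0, y0, w, h) is the region in difference-layer coordinates. Each pixel
    takes the co-located base-layer pixel (enlarged by `scale`) plus the
    stored difference value.
    """
    out = []
    for y in range(y0, y0 + h):
        row = []
        for x in range(x0, x0 + w):
            base = base_layer[y // scale][x // scale]
            row.append(base + diff_layer[y][x])
        out.append(row)
    return out


second_layer = [[10, 20],
                [30, 40]]                 # base image layer
third_layer = [[0, 1, 2, 3],
               [4, 5, 6, 7],
               [8, 9, 10, 11],
               [12, 13, 14, 15]]          # difference image layer
region = display_region(second_layer, third_layer, x0=2, y0=0, w=2, h=2)
```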
Abstract:
The present invention generally relates to a method and device for encoding an image. The method is characterized in that it comprises: - obtaining (100) a luminance component of the image, said luminance component belonging to a perceptual space having a metric whose values are a function of the differences between the visual perceptions of two points of said perceptual space, and a perceptual threshold ($\Delta E$) defining an upper bound of said metric and enabling control of the visual losses in a displayed decoded version of the image; - obtaining (101) a low-spatial-frequency version ($L_{lf}$) of the luminance component of the image; - encoding (108) the low-spatial-frequency version ($L_{lf}$) of the luminance component of the image; - obtaining (102) a first differential image ($Diff_1$) which comprises a differential luminance component ($L_r$) obtained by calculating the difference between the luminance component ($L$) of the image and a decoded version ($\widehat{L_{lf}}$) of the low-spatial-frequency version ($L_{lf}$) of the luminance component of the image; and - encoding (105) the first differential image ($Diff_1$), the precision of the encoding of the first differential image ($Diff_1$) depending on the perceptual threshold ($\Delta E$). The invention also relates to a method and apparatus for decoding a bitstream representing an image.
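The dependence of coding precision on the perceptual threshold can be illustrated with a toy luminance-only example. It assumes the low-spatial-frequency version is the block mean, that this version is transmitted losslessly (so the decoded version equals the original), and that the differential luminance is uniformly quantized with a step of twice the threshold, which bounds the reconstruction error by the threshold. These are illustrative assumptions, not the method of the abstract.

```python
def encode_luma(luma, delta_e):
    """Return (low-frequency value, quantized differential luminance)."""
    l_lf = sum(luma) / len(luma)       # assumed low-frequency version: block mean
    step = 2.0 * delta_e               # assumed link between threshold and precision
    l_r = [v - l_lf for v in luma]     # differential luminance component
    return l_lf, [round(r / step) for r in l_r]


def decode_luma(l_lf, levels, delta_e):
    """Rebuild luminance from the low-frequency value and quantized levels."""
    step = 2.0 * delta_e
    return [l_lf + k * step for k in levels]


luma = [50.0, 52.5, 61.0, 47.25]
delta_e = 1.0
l_lf, levels = encode_luma(luma, delta_e)
decoded = decode_luma(l_lf, levels, delta_e)
max_error = max(abs(a - b) for a, b in zip(luma, decoded))
```

A larger threshold gives a coarser step and fewer distinct levels, at the cost of a larger but still bounded luminance error.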