摘要:
A device converts three dimensional [3D] image data arranged for a source spatial viewing configuration to a 3D display signal (56) for a 3D display in a target spatial viewing configuration. 3D display metadata has target width data indicative of a target width W t of the 3D display in the target spatial viewing configuration. A processor (52,18) changes the mutual horizontal position of images L and R by an offset O to compensate differences between the source spatial viewing configuration and the target spatial viewing configuration. The processor (52) retrieves source offset data provided for the 3D image data for calculating the offset O, and determines the offset O in dependence of the source offset data. Advantageously the 3D perception for the viewer is automatically adapted based on the source offset data as retrieved to be substantially equal irrespective of the screen size.
摘要:
A 3-D picture is provided by providing a pair of pictures that includes a first picture for one eye of a viewer, and a second picture for the other eye of the viewer. In addition, a depth map specifically dedicated to the first picture is provided. The depth map includes depth indication values. A depth indication value relates to a particular portion of the first picture and indicates a distance between an object at least partially represented by that portion of the first picture and the viewer. The 3-D picture is supplemented with rendering guidance data that specifies respective parameters for respective rendering contexts. These respective parameters relate to generating a shifted viewpoint picture from the first picture and the depth map.
摘要:
A 3-D picture is provided as follows. A pair of pictures (LP, RP) is provided that comprises a first picture (LP) being intended for one eye of a viewer, and a second picture (RP) being intended for the other eye of the viewer. In addition, a depth map (DM) specifically dedicated to the first picture (LP) is provided. The depth map (DM) comprises depth indication values. A depth indication value relates to a particular portion of the first picture (LP) and indicates a distance between an object at least partially represented by that portion of the first picture and the viewer. Such a 3-D picture allows a satisfactory 3-D visual rendering on a great variety of display devices. Preferably, the 3-D picture is supplemented with rendering guidance data (GD) that specifies respective parameters for respective rendering contexts. These respective parameters preferably relate to generating a shifted viewpoint picture from the first picture (LP) and the depth map (DM).
摘要:
A method of encoding a video data signal (15) is provided, the method comprising providing at least a first image (21) of a scene (100) as seen from a first viewpoint, providing rendering information (22) for enabling the generation of at least one rendered image of the scene (100) as seen from a rendering viewpoint, providing a preferred direction indicator (23), defining a preferred orientation of the rendering viewpoint relative to the first viewpoint, and generating (24) the video data signal (15) comprising encoded data representing the first image, the rendering information and the preferred direction indicator.
摘要:
A method and apparatus is disclosed for enhancing the efficiency of spatial scalable compression schemes by lowering the bitrate of the enhancement layer. The complete image is used during motion estimation and/or motion compensation in the enhancement layer by inserting both the reconstructed video stream from the base layer and the input video stream into the motion estimation unit or both the motion estimation unit and the motion compensation unit in the enhancement layer.
摘要:
The concept of B-frames gives the MPEG video compression standard its high encoding efficiency. However, B-frame encoding roughly doubles the complexity of an MPEG encoder. In view thereof, MPEG encoders have been developed which produce I-frames and P-frames only. They are less complex but also less efficient. To improve the efficiency of such “IPP encoders”, selected P-frames are quantized more coarsely than other P-frames, for example, by multiplying the conventional quantization step size by 1.4. Although this results in isolated frames (“virtual B-frames”) being encoded with a lower quality, the overall perceptual quality is not affected. It has been found that the gain in bit rate obtained by the coarser quantization is not lost in subsequent P-frames, even though the subsequent frames are encoded with reference to the lower quality frames.
摘要:
A method of encoding a video data signal (15) is provided, together with a method for decoding. The encoding comprises providing color information (51) for pixels in an image, providing a depth map with depth information (52) for the pixels, providing transition information (56, 57, 60, 70, 71) being representative of a width (63, 73) of a transition region (61, 72) in the image, the transition region (61, 72) comprising a depth transition (62) and blended pixels in which colors of a foreground object and a background object are blended, and generating (24) the video data signal (15) comprising encoded data representing the color information (51), the depth map (52) and the transition information (56, 57, 60, 70, 71). The decoding comprises using the transition information (56, 57, 60, 70, 71) for determining the width (63, 73) of the transition regions (61, 72) and for determining alpha values (53) for pixels inside the transition regions (61, 72). The determined alpha values (53) are used for determining the color of a blended pixel at the transition of a foreground object and a background object.
摘要:
In a method for encoding and an encoder for a 3D video signal, centre view frames, a depth map for centre view frames and an occlusion data frame are encoded. On the basis of the depth map for the centre view frame a distinction is made between functional and non-functional data in an occlusion data frame. This allows a strong reduction in bits needed for the encoded occlusion data frame. In the decoder a combined data stream is made of functional data in the encoded occlusion data frames and the centre view frames. Preferably the centre view frames are used as reference frames in encoding the occlusion data frames.
摘要:
The invention is a method of modifying data for obtaining a scalable video signal composed of a base video signal and a set of enhancement video signals from a non-scalable video signal. The method includes a set of attenuation steps applied to coefficients composing the non-scalable video signal, the attenuation steps being assembled in cascaded or in series for delivering the base video signal. The method further includes a re-encoding step associated with each one of said attenuation steps for delivering one of said enhancement video signals, from the coding error generated in each attenuation step.
摘要:
A method of encoding a video data signal (15) is provided, the method comprising providing at least a first image (21) of a scene (100) as seen from a first viewpoint, providing rendering information (22) for enabling the generation of at least one rendered image of the scene (100) as seen from a rendering viewpoint, providing a preferred direction indicator (23), defining a preferred orientation of the rendering viewpoint relative to the first viewpoint, and generating (24) the video data signal (15) comprising encoded data representing the first image, the rendering information and the preferred direction indicator.