Abstract:
There are disclosed methods and apparatuses for multi-view video encoding, decoding and display. A depth map is provided for each of the available views. The depth maps of the available views are used to synthesize a target view for rendering an image from the perspective of the target view based on images of the available views.
Abstract:
A method includes organizing a first media source block in a media container file; calculating forward error correction (FEC) redundancy data based on the first media source block; organizing the FEC redundancy data in at least one FEC reservoir in the media container file; providing, in the media container file, metadata providing an association between the first media source block and the at least one FEC reservoir; storing the first media source block as a first elementary item in the media container file; and providing, in the media container file, information that the first elementary item comprises the first media source block.
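The steps of this abstract can be illustrated with a minimal sketch. It assumes a single XOR parity symbol as the FEC redundancy data and a plain dictionary standing in for the media container file; the abstract specifies neither, and the names `build_container`, `item-1` and `reservoir-1` are hypothetical.

```python
def xor_parity(symbols):
    """Return the byte-wise XOR of equal-length symbols (one parity symbol)."""
    if not symbols:
        raise ValueError("at least one symbol is required")
    length = len(symbols[0])
    if any(len(s) != length for s in symbols):
        raise ValueError("symbols must have equal length")
    parity = bytearray(length)
    for symbol in symbols:
        for i, byte in enumerate(symbol):
            parity[i] ^= byte
    return bytes(parity)


def split_symbols(source_block, symbol_size):
    """Zero-pad the source block and cut it into fixed-size symbols."""
    padded = source_block + b"\x00" * (-len(source_block) % symbol_size)
    return [padded[i:i + symbol_size] for i in range(0, len(padded), symbol_size)]


def build_container(source_block, symbol_size):
    """Assumed container layout: items, FEC reservoirs, and their association."""
    symbols = split_symbols(source_block, symbol_size)
    return {
        # The source block is stored as an elementary item, and the item
        # record states that it holds a media source block.
        "items": {"item-1": {"kind": "media_source_block", "data": source_block}},
        # Redundancy data lives in a separate reservoir.
        "fec_reservoirs": {"reservoir-1": xor_parity(symbols)},
        # Metadata linking the source block to its reservoir.
        "associations": [{"source_item": "item-1", "reservoir": "reservoir-1"}],
    }


def recover_symbol(symbols_with_gap, parity):
    """Rebuild the single missing symbol (marked None) from the rest plus parity."""
    present = [s for s in symbols_with_gap if s is not None]
    return xor_parity(present + [parity])
```

A single parity symbol can repair only one lost symbol per block; a practical system would likely use a stronger code, which this sketch does not attempt to model.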
Abstract:
There is disclosed a method for encoding at least two views of a video scene into a multiview video bitstream, where said views have different spatial resolutions. The method comprises prediction between pictures belonging to different views after resampling of one of these pictures. There is also disclosed a method for decoding a multiview video bitstream comprising at least two views having different spatial resolutions. The method comprises prediction between pictures belonging to different views after resampling of one of these pictures. There are also disclosed corresponding apparatuses and computer program products.
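As a rough illustration of prediction between views of different spatial resolutions, the sketch below upsamples a lower-resolution picture with nearest-neighbour resampling and codes the higher-resolution picture as a residual against it. The resampling filter, the integer scaling factor and the function names are assumptions made for this example; the abstract does not specify them.

```python
def upsample_nearest(picture, factor):
    """Resample a picture (list of rows of ints) by an integer factor."""
    out = []
    for row in picture:
        wide = [px for px in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


def encode_residual(target, reference_lowres, factor):
    """Predict the target view from the resampled reference; keep the residual."""
    prediction = upsample_nearest(reference_lowres, factor)
    return [[t - p for t, p in zip(t_row, p_row)]
            for t_row, p_row in zip(target, prediction)]


def decode_from_residual(residual, reference_lowres, factor):
    """Rebuild the target view by adding the residual to the same prediction."""
    prediction = upsample_nearest(reference_lowres, factor)
    return [[r + p for r, p in zip(r_row, p_row)]
            for r_row, p_row in zip(residual, prediction)]
```

Encoder and decoder must apply the identical resampling so that both form the same prediction; only then does adding the residual reproduce the target picture exactly.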
Abstract:
A method, apparatus, and computer program product are provided for identifying a person or people in a media file by using object recognition and near-field communication to detect nearby devices that may be associated with a person or people featured in the media file. Associating a nearby device with a person or people featured in a media file may add to the confidence level with which a person is identified within the media file using object recognition, which may include facial recognition and/or speaker recognition.
Abstract:
A method comprises forming a packet payload by encapsulating at least one data unit associated with media data; determining whether a size of the packet payload is less than a predetermined threshold; and if the size of the packet payload is less than the predetermined threshold, appending an enhancement data unit to the packet payload.
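The threshold test described in this abstract can be sketched as follows. The two-byte length prefix used for encapsulation and the name `form_payload` are assumptions made for illustration; the abstract does not define the encapsulation format.

```python
import struct


def encapsulate(unit):
    """Assumed encapsulation: a 2-byte big-endian length prefix, then the unit."""
    return struct.pack(">H", len(unit)) + unit


def form_payload(data_units, enhancement_unit, threshold):
    """Form a payload and append the enhancement unit only if it is too small."""
    payload = b"".join(encapsulate(u) for u in data_units)
    if len(payload) < threshold:
        payload += encapsulate(enhancement_unit)
    return payload
```

For example, `form_payload([b"abc"], b"EE", 10)` yields a 9-byte payload because the initial 5 bytes fall below the threshold, whereas a threshold of 5 leaves the payload unchanged.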
Abstract:
Embodiments of the present invention relate to video coding for multi-view video content. The embodiments provide a coding system enabling scalability for the multi-view video content. In one embodiment, a method is provided for encoding at least two views representative of a video scene, each of the at least two views being encoded in at least two scalable layers, wherein one of the at least two scalable layers representative of one view of the at least two views is encoded with respect to a scalable layer representative of the other view of the at least two views.
Abstract:
There is disclosed a method, apparatus and computer program product for adaptive streaming. At least one file comprising media data is generated, wherein a first segment and a second segment are received, and a first instruction and a second instruction are received. The first segment and the second segment are modified on the basis of the first instruction and the second instruction. The at least one file is created on the basis of the modified first segment and the modified second segment.
Abstract:
A method, apparatus, system and computer program product are provided for supplying switching point information that facilitates switching between different representations of media content. In an instance in which a content consumption device determines that a switch from a first representation to a second representation is merited, the content consumption device may identify the appropriate switching point from the switching point information provided by a server. The content consumption device may then request the second representation of the media content beginning at the switching point.
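The client-side selection described in this abstract might look like the sketch below. It assumes the switching point information is a sorted list of presentation times (in seconds) per representation, and that the appropriate switching point is the first one at or after the current playback time; both are assumptions, as are the function and key names.

```python
from bisect import bisect_left


def next_switching_point(switch_points, current_time):
    """Return the first switching point at or after current_time, else None."""
    i = bisect_left(switch_points, current_time)
    return switch_points[i] if i < len(switch_points) else None


def plan_request(switching_info, target_representation, current_time):
    """Describe the request for the target representation from its switching point."""
    point = next_switching_point(switching_info[target_representation], current_time)
    if point is None:
        return None  # no remaining switching point; stay on the current representation
    return {"representation": target_representation, "start_time": point}
```

With switching points `[0.0, 4.0, 8.0]` for a hypothetical "high" representation and playback at 2.5 seconds, the client would request "high" starting at 4.0 seconds.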
Abstract:
In accordance with an example embodiment of the present invention, an apparatus comprises a processing unit configured to receive information related to available camera views of a three-dimensional scene, request a synthetic view that is different from any available camera view and is determined by the processing unit, and receive media data comprising video data associated with the synthetic view.