Abstract:
A method comprising: determining a number of units that are to be assigned to partitions and are initialized as unassigned; indicating or inferring a number of explicitly sized partitions to be assigned; indicating sizes for the explicitly sized partitions and accordingly marking unassigned units as assigned to partitions in a pre-defined scan order; indicating a count of units; repetitively assigning the count of units to partitions and accordingly marking unassigned units as assigned in the pre-defined scan order until the number of unassigned units is less than the count of units; and assigning, if the number of unassigned units is greater than 0, the unassigned units to a last partition.
Abstract:
A method includes maintaining a set of parameters or weights derived through online learning for a neural net; transmitting an update of the parameters or weights to a decoder; deriving a first prediction block based on an output of the neural net using the parameters or weights; deriving a first encoded prediction error block through encoding a difference of the first prediction block and a first input block; encoding the first encoded prediction error block into a bitstream; deriving a reconstructed prediction error block based on the first encoded prediction error block; deriving a second prediction block based on an output of the neural net using the parameters or weights and the reconstructed prediction error block; deriving a second encoded prediction error block through encoding a difference of the second prediction block and a second input block; and encoding the second encoded prediction error block into a bitstream.
Abstract:
A method, apparatus and computer program product enable multiple timeline support in playback of omnidirectional media content with overlay. The method, apparatus and computer program product receive a visual overlay configured to be rendered as a multi-layer visual content with an omnidirectional media content file (30). The omnidirectional media content file is associated with a first presentation timeline. The visual overlay is associated with a second presentation timeline. The method, apparatus and computer program product construct an overlay behavior definition file associated with the visual overlay (32). The overlay behavior definition file indicates a behavior of the second presentation timeline with respect to the first presentation in an instance that a pre-defined user interaction switch occurs during a playback of the omnidirectional media content file.
Abstract:
A method comprising: obtaining a block of a picture or a picture in an encoder; determining if the block/picture is used for on-line learning; if affirmative, encoding the block/picture; reconstructing a coarse version of the block/picture or the respective prediction error block/picture; enhancing the coarse version using a neural net; fine-tuning the neural net with a training signal based on the coarse version; determining if the block/picture is enhanced using the neural net; and if affirmative, encoding the block/picture with enhancing using the neural net.
Abstract:
There are disclosed various methods, apparatuses and computer program products for video encoding. In some embodiments a method comprises: obtaining a coded first image and a coded second image, wherein the coded second image depends on the coded first image; determining that, when displaying a decoded second image that is decoded from the coded second image, it is allowed to first display a decoded first image that is decoded from the coded first image; and in response to said determination: including the coded first image into a first image container file and including the coded second image into a second image container file; indicating, in a markup language code, that a first image container file and a second image container file are alternatives; and indicating, in the markup language code, that the second image container file depends on the first image container file.
Abstract:
A method for decoding an encoded video representation from a bitstream, the method comprising decoding an identifier indicating that all samples within a scope of the bitstream have been coded with a prediction restriction; determining that the scope covers a region of interest within a picture; decoding at least a first coding unit preceding said region of interest in decoding order in a parse mode such that syntax elements belonging to said at least first coding unit are parsed, but a sample reconstruction process of said syntax elements is omitted; and decoding at least a second coding unit belonging to said region of interest such that syntax elements belonging to said at least second coding unit are parsed and a sample reconstruction process is performed to said syntax elements.
Abstract:
A method is provided for defining a metadata box of a neural network representation (NNR) item data, wherein the NNR item data comprises an NNR bitstream; and defining an association between the NNR item data and an NNR configuration by using a configuration item property, wherein the NNR configuration item property comprises information about stored NNR item data. Corresponding apparatuses and computer program products are also provided.
Abstract:
A method, apparatus and computer program product to signal and store various segment properties in omnidirectional media, including sizes, priority ranks, and the like. The method, apparatus and computer program product access one or more media segments (30). The one or more media segments includes media data configured for streaming playback. The method, apparatus and computer program product access size information (32) or priority rank information of one or more tiles for each of the one or more media segments. The method, apparatus and computer program product encode the size information (34) or priority rank information for each of the one or more media segments in a media file within one or more metadata tracks associated with the one or more media segments or one or more sub-segments associated with the one or more media segments.
Abstract:
The embodiments relate to a method comprising receiving visual data in a file format compatible with ISO base media file format; processing the visual data to detect one or more content elements; storing the detected one or more content elements and information on the used process as a metadata; and including the metadata to the media file in association with the visual data. The embodiments also relate to a technical equipment for implementing the method.
Abstract:
A method, apparatus and computer program product are provided to store and signal pre-derivation properties in an image container file (24, 26). Relative to the construction of image package comprising an image container file, the method, apparatus and computer program product assign a pre-derivation property identifier data structure identifying one or more pre-derivation properties of one or more pre-derived images (22). With respect to the processing of an image container file, the method, apparatus and computer program product permit an image container file and a pre-derivation property identifier data structure identifying one or more pre-derivation properties of one or more pre-derived images in the image be processed to cause one or more pre-derived image items from the image container file to be rendered or edited and regenerated in accordance with the pre-derivation properties.