Abstract:
A file format design supports storage of multi-source multimedia presentations via the inclusion of indications as to whether a presentation is a multi-source presentation, and for one media type, the tracks of which are from different sources and should be played simultaneously. If a multi-source presentation exists, additional indications may be provided including: an indication of a multi-source presentation type being stored; indications regarding the source of each track and which tracks have the same source; indications of different parties' information such as phone numbers, etc. Thus, a player may play back a recorded presentation in the same or substantially the same manner as it was presented during the actual session, and may automatically manipulate the presentation to be more informative or efficient. The file format design further supports storage of other types of multi-source presentations that render more than one media stream for at least one type of media.
Abstract:
In one example, a video coder is configured to code information indicative of whether view synthesis prediction is enabled for video data. When the information indicates that view synthesis prediction is enabled for the video data, the video coder may generate a view synthesis picture using the video data and code at least a portion of a current picture relative to the view synthesis picture. The at least portion of the current picture may comprise, for example, a block (e.g., a PU, a CU, a macroblock, or a partition of a macroblock), a slice, a tile, a wavefront, or the entirety of the current picture. On the other hand, when the information indicates that view synthesis prediction is not enabled for the video data, the video coder may code the current picture using at least one of intra-prediction, temporal inter-prediction, and inter-view prediction without reference to any view synthesis pictures.
Abstract:
An encoder for encoding a video signal, wherein the encoder is configured to generate an encoded scalable data stream comprising a base layer and at least one enhancement layer, wherein the encoder is further configured to generate information associated with each of the base layer and the at least one enhancement layer.
Abstract:
The exemplary embodiments of this invention provide in one aspect thereof an ability to signal multiple decoding times for each sample in a file format level in order to allow, for example, different decoding times for each sample (or sample subset) between decoding an entire stream and decoding a subset of the stream. An alternate decoding time box is specified to allow for the signaling of multiple decoding times for each sample. Such a box can contain a compact version of a table that allows indexing from an alternate decoding time to a sample number, where an alternate decoding time is a decoding time to be used with a sample when only a subset of an elementary stream stored in a track is to be decoded. Furthermore, each entry in the table provides the number of consecutive samples with the same time delta, and the delta between those consecutive samples. By adding the deltas a complete time-to-sample map can be constructed.
Abstract:
Methods and systems for coordinating user terminals are disclosed. A user terminal may receive a user terminal identifier and a sensor identifier from a user terminal, determine a group topology based on the user terminal identifier and the sensor identifier to identify a spatial relationship relative to the user terminal, receive a media signal, and identify a subsection of the media signal. The user terminal also may generate subsection information to assign a subsection of the media signal to the user terminal corresponding to the spatial relationship, and may communicate the subsection information to the user terminal.
Abstract:
A system and method for conveying information that is helpful for a network middlebox or a media player to decided which coded data units to forward or process within an RTP payload or a file format data unit in an easy-to-access manner. This mechanism can be used to provide indications of items such as redundant coded pictures, temporal level switching points, gradual decoding refresh access points, view identifiers, and view random access points. A middlebox and/or receiver can then use this information to determine whether certain coded data units need to be processed and/or transmitted.
Abstract:
An encoder comprising an input for inputting video signal to be encoded to form an encoded video signal comprising pictures of at least a first coded video sequence and a second coded video sequence, a hypothetical decoder for hypothetically decoding encoded video signal, an encoded picture buffer, and a decoded picture buffer, and a definer for defining a parameter indicative of the temporal difference between the last picture of the first coded video sequence and the first picture of the second coded video sequence in output/display order.
Abstract:
A method for signaling ROI scalability information in a file format. The present invention provides an efficient signaling of ROI scalability information in the file format, wherein the signaling comprises providing the geometrical information of a ROI and an indication to identify the ROI each coded data unit is associated with within a tier or layer.
Abstract:
Techniques are described related to coding of long-term reference pictures for a reference picture set. In some examples, a video coder may code candidate long-term reference pictures in a parameter set. The video coder also code syntax elements that indicate which long-term reference pictures from the candidate long-term reference pictures belong in the reference picture set.
Abstract:
Techniques are described related to deriving a reference picture set. A reference picture set may identify reference pictures that can potentially be used to inter-predict a current picture and picture following the current picture in decoding order. In some examples, deriving the reference picture set may include constructing a plurality of reference picture subsets that together form the reference picture set.