Abstract:
This disclosure describes techniques for coding 3D video block units. In one example, a video encoder is configured to receive one or more texture components from at least a portion of an image representing a view of three-dimensional video data, receive a depth map component for at least the portion of the image, and code a block unit indicative of pixels of the one or more texture components for the portion of the image and the depth map component. The coding comprises receiving texture data for a temporal instance of a view of video data, receiving depth data corresponding to the texture data for the temporal instance of the view of video data, and encapsulating the texture data and the depth data in a view component for the temporal instance of the view, such that the texture data and the depth data are encapsulated within a common bitstream.
Abstract:
This disclosure describes techniques relevant to HTTP streaming of media data. According to these techniques, a server device may signal a byte range for at least one intra-decodable frame (I-frame) of a video fragment. A client device may communicate a request to the server device to retrieve the at least one I-frame based on the signaled byte range, and use the retrieved I-frame to provide a high speed version of a video presentation that includes the at least one I-frame. A high speed version of a video presentation may be a trick mode of the video presentation, such as a fast forward or fast rewind version of the video presentation.
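As a rough illustration of the client side, the following minimal sketch turns signaled I-frame byte ranges into an HTTP partial GET request. The function names, the example URL, and the byte offsets are hypothetical and do not come from the disclosure. How the server signals the ranges is not modeled here, and the request is only built, not sent.

```python
from urllib.request import Request

def iframe_range_header(byte_ranges):
    """Format inclusive (first, last) byte offsets as an HTTP Range header value."""
    return "bytes=" + ",".join(f"{first}-{last}" for first, last in byte_ranges)

def build_iframe_request(url, byte_ranges):
    # Build (but do not send) a partial GET that asks only for the I-frame bytes.
    return Request(url, headers={"Range": iframe_range_header(byte_ranges)})

# Hypothetical signaled ranges for two I-frames in one fragment.
signaled = [(0, 4095), (90000, 97999)]
req = build_iframe_request("http://example.com/video/fragment1.mp4", signaled)
print(req.get_header("Range"))
```

A client in fast forward mode could issue one such request per fragment and display only the returned I-frames.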
Abstract:
In an example, aspects of this disclosure relate to a method of coding data that includes coding a sequence of bins according to a context adaptive entropy coding process. A current coding cycle used to code at least one bin of the sequence of bins includes determining a context for the bin; selecting a probability model based on the context, wherein the probability model is updated based on a value of a previous bin coded with the context and coded at least two coding cycles prior to the current coding cycle; applying the probability model to code the bin; and updating the probability model based on a value of the bin.
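The delayed update described above can be sketched with a toy probability model. This is an illustration under stated assumptions, not the method of the disclosure: the class name, the count-based probability estimate, and the use of a single context for all bins are hypothetical simplifications. Each bin's update is queued and applied so that the model used in a given cycle reflects only bins coded at least two cycles earlier.

```python
from collections import deque

class DelayedContextModel:
    """Toy model for one context; updates take effect two cycles later."""

    def __init__(self, delay=2):
        self.ones = 1          # smoothed count of bins equal to 1 (assumed estimator)
        self.total = 2         # smoothed count of all bins
        self.delay = delay
        self.pending = deque() # bin values whose updates are not yet applied

    def p_one(self):
        return self.ones / self.total

    def code_bin(self, bin_value):
        # The probability used now ignores the most recent (delay - 1) bins.
        p = self.p_one()
        self.pending.append(bin_value)
        if len(self.pending) >= self.delay:
            oldest = self.pending.popleft()
            self.ones += oldest
            self.total += 1
        return p

model = DelayedContextModel()
probs = [model.code_bin(b) for b in [1, 1, 1, 1]]
print(probs)
```

In this sketch the first two bins are both coded with the initial probability, because the update from the first bin only becomes visible in the third cycle.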
Abstract:
This disclosure describes techniques for coding refinement coefficients of an enhancement layer in a scalable video coding (SVC) scheme. According to this disclosure, a method may comprise evaluating a history of transform coefficient values associated with one or more previous layers of the SVC scheme, and estimating one or more refinement coefficient values associated with a current layer of the SVC scheme based on the history. On the encoding side, the coding process may include excluding information for one or more refinement coefficient values from the bitstream and signaling to the decoder that such information is excluded from the bitstream. On the decoding side, the coding process may include parsing the bitstream to identify information that signals to the decoder that information is excluded from the bitstream, and generating such information based on the history associated with one or more previous layers of the SVC scheme.
Abstract:
This disclosure presents methods and systems for coding video in merge mode of a motion vector prediction process. A method of coding video data may include determining a merge candidate set for a current prediction unit of a current coding unit, wherein the merge candidate set is determined without comparing motion information of a merge candidate in the merge candidate set to motion information of any other prediction units, and performing a merge motion vector prediction process for the current prediction unit using the merge candidate set. The method may further comprise excluding merge candidates from the merge candidate set that are within another prediction unit of the current coding unit.
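The exclusion rule can be sketched as a purely geometric test. The candidate names and positions below are assumptions chosen for illustration (a 16x16 coding unit split into two 8x16 prediction units), not values from the disclosure. Note that the set is built from positions alone, with no comparison of motion information.

```python
def inside(rect, pos):
    """True if pixel position pos lies within rect = (x, y, width, height)."""
    x, y, w, h = rect
    px, py = pos
    return x <= px < x + w and y <= py < y + h

def merge_candidates(other_pus, candidate_positions):
    # Keep a candidate unless it falls inside another PU of the same CU.
    return [name for name, pos in candidate_positions
            if not any(inside(pu, pos) for pu in other_pus)]

# Hypothetical layout: PU0 is the left half of the CU, PU1 (current) the right half.
pu0 = (0, 0, 8, 16)
candidates_for_pu1 = [
    ("left", (7, 15)),
    ("above", (15, -1)),
    ("above_right", (16, -1)),
    ("below_left", (7, 16)),
    ("above_left", (7, -1)),
]
print(merge_candidates([pu0], candidates_for_pu1))
```

Here only the "left" candidate is dropped, because its position lies inside PU0.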
Abstract:
In an example, a process for coding video data includes coding, with a variable length code, a syntax element indicating depth modeling mode (DMM) information for coding a depth block of video data. The process also includes coding the depth block based on the DMM information.
Abstract:
A video encoder may encode video data by adaptively selecting between one-eighth-pixel and one-quarter-pixel precision motion vectors, and signal the selected precision. In one example, an apparatus includes a video encoder to encode a block of video data using a one-eighth-pixel precision motion vector when use of the one-eighth-pixel precision motion vector is determined to be preferable for the block over a one-quarter-pixel precision motion vector, and to generate a signal value indicative of the use of the one-eighth-pixel precision motion vector for the block, and an output interface to output the encoded block and the signal value. A video decoder may be configured to receive the signal value and the encoded block, analyze the signal value to determine whether the block was encoded using one-eighth-pixel precision or one-quarter-pixel precision, and decode the block based on the determination.
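A minimal sketch of the precision signaling follows. The cost inputs, the one-bit flag convention (1 for one-eighth-pixel, 0 for one-quarter-pixel), and the rounding rule are assumptions made for illustration. The disclosure does not specify how the preference is determined or how the signal value is represented.

```python
def encode_mv(mv_eighth, cost_quarter, cost_eighth):
    """mv_eighth is (x, y) in 1/8-pel units. Returns (flag, coded_mv)."""
    if cost_eighth < cost_quarter:
        return 1, mv_eighth  # keep full one-eighth-pixel precision
    # Otherwise store the vector in 1/4-pel units, rounding halves upward.
    return 0, tuple((c + 1) // 2 for c in mv_eighth)

def decode_mv(flag, coded_mv):
    """Return the motion vector in 1/8-pel units regardless of coded precision."""
    scale = 1 if flag else 2
    return tuple(c * scale for c in coded_mv)

flag, coded = encode_mv((5, -3), cost_quarter=120, cost_eighth=100)
print(flag, coded, decode_mv(flag, coded))

flag, coded = encode_mv((5, -3), cost_quarter=90, cost_eighth=100)
print(flag, coded, decode_mv(flag, coded))
```

The decoder only needs the flag to know how to scale the coded vector, which mirrors the "analyze the signal value" step in the abstract.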
Abstract:
During a prediction stage of video coding, a video coder may use relatively longer interpolation filters to generate predictive sub-pixel values using values of reference integer pixels of a reference block of video data positioned in parallel relative to a scanning order associated with the block, and may use relatively shorter interpolation filters to generate predictive sub-pixel values using values of reference integer pixels of the block positioned perpendicular relative to the scanning order. A longer interpolation filter generally refers to a filter with relatively more filter coefficients, or “taps,” and a shorter filter generally refers to a filter with relatively fewer taps.
Abstract:
A receiver receives coded coefficient values of enhancement layer video blocks. A control unit defines one or more vectors of transform coefficients for vectorized entropy decoding of the enhancement layer blocks, and selects a prediction mode for the enhancement layer blocks based on the vectorized entropy decoding. Each of the vectors comprises one or more of the transform coefficients in a scan order having an end position indicated by a vector control signal. The control unit selects weighted prediction when the vectorized entropy decoding establishes two or more vectors, and selects non-weighted prediction when the vectorized entropy decoding establishes a single vector. A prediction unit performs predictive decoding based on the prediction mode. An entropy decoding unit performs the vectorized entropy decoding. A scanning unit scans the enhancement layer video blocks from the vectors into two-dimensional blocks of transform coefficients, and separately entropy decodes the vectors.
Abstract:
This disclosure describes techniques for estimating a depth of image objects for a two-dimensional (2D) view of a video presentation. For example, an initial indication of depth (e.g., an optical flow) may be determined for a 2D view. The initial indication of depth may be used to estimate global motion, e.g., motion of an observer (e.g., camera), of the 2D view. The initial indication of depth may be modified based on the estimation of global motion to create a global motion-adjusted indication of depth. The global motion-adjusted depth indication may be used to create a depth map for the 2D view, which may be used to generate an alternative view of the video presentation that may be used to display a three-dimensional (3D) video presentation.
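The pipeline above can be sketched on a tiny optical-flow field. Several choices here are assumptions made only for illustration and are not taken from the disclosure: global motion is estimated as the per-axis median flow, the adjusted depth indication is the magnitude of the residual flow, and the depth map is scaled to the range 0 to 255.

```python
from math import hypot
from statistics import median

def depth_from_flow(flow):
    """flow is a 2D list of (dx, dy) optical-flow vectors, one per pixel."""
    # Assumed global (camera) motion estimate: per-axis median of the flow.
    gx = median(dx for row in flow for dx, _ in row)
    gy = median(dy for row in flow for _, dy in row)
    # Global motion-adjusted indication: flow left over after removing camera motion.
    residual = [[hypot(dx - gx, dy - gy) for dx, dy in row] for row in flow]
    peak = max(max(row) for row in residual) or 1.0  # avoid division by zero
    return [[round(255 * r / peak) for r in row] for row in residual]

# Hypothetical 2x3 flow field: a panning camera plus one independently moving pixel.
flow = [
    [(2, 0), (2, 0), (2, 0)],
    [(2, 0), (5, 4), (2, 0)],
]
print(depth_from_flow(flow))
```

In this example the uniform pan is removed as global motion, so only the independently moving pixel receives a nonzero depth value.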