Abstract:
Super-transform coding may include identifying a plurality of sub-blocks for prediction coding a current block, determining whether to encode the current block using a super-transform, and super-prediction coding the current block. Super-prediction coding may include generating a super-prediction block for the current block by generating a prediction block for each unpartitioned sub-block of the current block, generating a super-prediction block for each partitioned sub-block of the current block by super-prediction coding the sub-block, and including the prediction blocks and super-prediction blocks for the sub-blocks in a super-prediction block for the current block. Including the prediction blocks and super-prediction blocks for the sub-blocks in a super-prediction block for the current block may include filtering at least a portion of each prediction block and each super-prediction block based on a spatially adjacent prediction block. Super-transform coding may include transforming the super-prediction block for the current block using a corresponding super-transform.
Abstract:
Encoding and decoding using prediction dependent transform coding are provided. Encoding and decoding using prediction dependent transform coding may include identifying a current input block from a current input frame from an input video stream, generating a prediction block for the current input block, generating a residual block based on a difference between the current input block and the prediction block, generating, by a processor in response to instructions stored on a non-transitory computer readable medium, an encoded block by encoding the residual block based on the prediction block using the using prediction dependent transform coding, including the encoded block in an output bitstream, and outputting or storing the output bitstream.
Abstract:
A system and method provides transcoding a video in parallel by multiple transcoders based on a measure of video coding complexity (VCC). The system has a video pre-processing module, a master transcoder, a transcoding controller and multiple transcoders. The video pre-processing module transcodes a source video into an intermediate video. The master transcoder partitions the intermediate video into multiple transcoding units, where a transcoding unit identifies a portion of data of the intermediate video for transcoding. The master transcoder further generates a transcoding request for each transcoding unit. The transcoding controller distributes the transcoding units to the multiple transcoders, which transcode the data of the transcoding units in parallel into transcoded data
Abstract:
Systems and methods can facilitate identifying 2D content in media that is suitable for conversion into 3D content, and converting the suitable 2D content into 3D content. The identifying can be based on quality criteria applied to the 2D content. For 2D content converted into 3D content, a user can be offered a choice between viewing the content in its original 2D form, or viewing the content in its converted, 3D form. The systems and methods can thereby provide users with greater choice of content and an enhanced viewing experience.
Abstract:
A system and method provide a video description length (VDL) guided constant quality video encoding strategy with bitrate constraint and a video coding system for optimizing encoding bitrate, distortion and complexity of an input video. The method obtains an overall VDL, temporal VDL and spatial VDL of the input video and compares the overall VDL, temporal VDL and spatial VDL of the input video with a reference VDL, temporal VDL and spatial VDL. Based on the comparison, the method adjusts the encoding bitrate, the overall encoding complexity, temporal encoding complexity and spatial encoding complexity of the input video and encodes the input video with the adjusted encoding bitrate, overall encoding complexity, temporal encoding complexity and spatial encoding complexity of the input video.
Abstract:
Encoding and decoding using spatial proximity context entropy coding may include identifying a plurality of transform coefficients for a current block of a current frame of a video stream, wherein the plurality of transform coefficients is ordered based on a scan order, identifying a current transform coefficient from the plurality of transform coefficients, identifying a plurality of context coefficients from the plurality of transform coefficients, wherein each context coefficient from the plurality of context coefficients is spatially proximate to the current transform coefficient and is available for entropy coding the current transform coefficient, identifying an entropy coding probability for the current transform coefficient based on the scan order and the plurality of context coefficients, entropy coding the current transform coefficient based on the entropy coding probability, including the entropy coded current transform coefficient in an output bitstream, and storing or transmitting the output bitstream.
Abstract:
A code is received. The code conveys data about a quantized coefficient that corresponds to a pixel of an image file. A decoding mode and a version of a code mapping format for decoding the received code is determined. The decoding mode and the version of the code mapping format dynamically change based on a previously-decoded code. The received code is decoded, during the decoding mode, according to the version of the code mapping format to obtain the data about the quantized coefficient.
Abstract:
An image converter compiles three-dimensional content into a data store, identifies a number of stereo image pairs from the three-dimensional content, computes a depth map for each of the stereo image pairs from the three-dimensional content, and partitions the stereo image pairs in the data store into multiple categories. The image converter determines a depth cue for each of the categories based on the depth map for each of the stereo image pairs in each category. The image converter computes a depth map for a category associated with a two-dimensional input image based on the determined depth cue and renders a three-dimensional output image from the two-dimensional input image using the depth map for the category.
Abstract:
An image converter compiles three-dimensional content into a data store, identifies a number of stereo image pairs from the three-dimensional content, computes a depth map for each of the stereo image pairs from the three-dimensional content, and partitions the stereo image pairs in the data store into multiple categories. The image converter determines a depth cue for each of the categories based on the depth map for each of the stereo image pairs in each category. The image converter computes a depth map for a category associated with a two-dimensional input image based on the determined depth cue and renders a three-dimensional output image from the two-dimensional input image using the depth map for the category.
Abstract:
Encoding or decoding a video signal includes determining a first partitioning of at least a portion of a frame before a current frame in a video sequence. The first partitioning is modified to a second partitioning using a motion vector that predicts at least one prediction sub-region of a first region from the first partitioning. The second partitioning is used to encode or decode a current region of the current frame. The current region may be collocated with the first region. The technique eliminates the need to re-partition an entire frame each time, and can be used with both rectangular-shaped block partitioning and partitioning using masks and other partitioning techniques.