Abstract:
Several implementations relate, for example, to depth encoding and/or filtering for 3D video (3DV) coding formats. A sparse dyadic mode for partitioning macroblocks (MBs) along edges in a depth map is provided as well as techniques for trilateral (or bilateral) filtering of depth maps that may include adaptive selection between filters sensitive to changes in video intensity and/or changes in depth. One implementation partitions a depth picture, and then refines the partitions based on a corresponding image picture. Another implementation filters a portion of a depth picture based on values for a range of pixels in the portion. For a given pixel in the portion that is being filtered, the filter weights a value of a particular pixel in the range by a weight that is based on one or more of location distance, depth difference, and image difference.
Abstract:
An apparatus includes a video/image encoder configured to design a two-dimensional star-shaped spatial filter and encode image/video information using the X-shaped spatial filter. The star-shaped spatial filter includes a first linear arrangement of coefficients that extend outwardly in a first diagonal direction from a center pixel coefficient, a second linear arrangement of coefficients that extend outwardly in a second diagonal direction, and two linear arrangements of coefficients that extend outwardly in horizontal and vertical directions from the center pixel coefficient from the center pixel coefficient. The second diagonal direction is oriented in a different direction relative to the first diagonal direction.
Abstract:
A remote control device is operative to enable and facilitate user control of video systems that are operative to provide one or more three-dimensional (3D) viewing effects. According to an exemplary embodiment, the remote control device includes a user input terminal having an input element operative to receive user inputs to adjust at least one of a volume setting and a channel setting of a video system, and further operative to receive user inputs to adjust a three-dimensional (3-D) viewing effect of the video system. A transmitter is operative to transmit control signals to the video system in response to the user inputs.
Abstract:
An apparatus includes a video/image encoder configured to design a two-dimensional star-shaped spatial filter and encode image/video information using the X-shaped spatial filter. The star-shaped spatial filter includes a first linear arrangement of coefficients that extend outwardly in a first diagonal direction from a center pixel coefficient, a second linear arrangement of coefficients that extend outwardly in a second diagonal direction, and two linear arrangements of coefficients that extend outwardly in horizontal and vertical directions from the center pixel coefficient from the center pixel coefficient. The second diagonal direction is oriented in a different direction relative to the first diagonal direction.
Abstract:
Various implementations are directed to providing inter-layer dependency information. In one implementation, syntax elements are generated that indicate an inter-layer dependency structure among three-dimensional video (3DV) layers. Based on the inter-layer dependency structure, an inter-layer reference is identified for a picture from a layer of the 3DV layers. The picture is encoded based, at least in part, on the inter-layer reference. Corresponding decoding implementations are also provided. Additionally, in another implementation, a transmission priority and an indication of network congestion are used to determine whether to transmit data for a particular 3DV layer. The transmission priority is based on an inter-layer dependency structure among multiple 3DV layers. Another implementation is directed to a network abstraction layer unit that can explicitly identify and convey inter-layer references and corresponding dependencies.
Abstract:
A method of RDPCM (Residue Differential Pulse-Code Modulation) coding incorporating unified Intra and Inter RDPCM coding is disclosed. The same RDPCM encoding and decoding steps in the same processing order are applied to blocks coded in the Intra prediction mode and Inter prediction mode/IntraBC prediction mode (Intra-block copy mode). A method of extending Inter RDPCM coding to blocks coded in the IntraBC prediction mode. RDPCM processing in the vertical or horizontal direction is applied to IntraBC residue.
Abstract:
An apparatus and method for video fingerprinting are provided. The method includes, for each frame of a video sequence including a plurality of frames, removing a portion of the frame, dividing a remaining portion of the frame into blocks, dividing each block into sub-blocks, computing a block level feature as a mean of pixels in each sub-block within the block, concatenating all block level features in the frame, and concatenating features of all frames in the video sequence.
Abstract:
A remote control device is operative to enable and facilitate user control of video systems that are operative to provide one or more three-dimensional (3D) viewing effects. According to an exemplary embodiment, the remote control device includes a user input terminal having an input element operative to receive user inputs to adjust at least one of a volume setting and a channel setting of a video system, and further operative to receive user inputs to adjust a three-dimensional (3D) viewing effect of the video system. A transmitter is operative to transmit control signals to the video system in response to the user inputs.
Abstract:
Methods and apparatuses for coding occlusion layers, such as occlusion video data and occlusion depth data in 3D video, are disclosed. A decoding method comprising the steps of: extracting an indicator representative of an original format for received occlusion data, the original format selected from a one of a sparse occlusion data format and a filled occlusion data format; arranging 2D data, which is associated with said occlusion data, at location after temporal and inter-view pictures in a reference picture list; identifying at least one of an occlusion area macroblock and a non-occlusion area macroblock for the occlusion data; decoding said occlusion data to produce decoded occlusion data, wherein said decoding includes: for each non-occlusion macroblock, when said indicator indicates the filled occlusion data format, replacing the occlusion data in said non-occlusion macroblock with a corresponding macroblock of associated 2D data to produce a decoded occlusion data; and when said indicator indicates the sparse occlusion data format, filling said non-occlusion macroblock with data indicative of a defined characteristic to produce decoded occlusion data; and otherwise for each occlusion macroblock, decoding said occlusion macroblock to produce decoded occlusion data; and outputting the decoded occlusion data.
Abstract:
An apparatus and method for video fingerprinting are provided. The method includes, for each frame of a video sequence including a plurality of frames, removing a portion of the frame, dividing a remaining portion of the frame into blocks, dividing each block into sub-blocks, computing a block level feature as a mean of pixels in each sub-block within the block, concatenating all block level features in the frame, and concatenating features of all frames in the video sequence.