Multi-view video compression and streaming based on viewpoints of remote viewer

    Publication No.: US09648346B2

    Publication date: 2017-05-09

    Application No.: US12491775

    Filing date: 2009-06-25

    Abstract: Multi-view video that is being streamed to a remote device in real time may be encoded. Frames of a real-world scene captured by respective video cameras are received for compression. A virtual viewpoint, positioned relative to the video cameras, is used to determine the expected contributions of individual portions of the frames to an image of the scene synthesized from those frames at the viewpoint position. For each frame, compression rates for individual blocks of the frame are computed based on the determined contributions of the individual portions of the frame. The frames are compressed by compressing their blocks according to the respective determined compression rates. The frames are transmitted in compressed form via a network to a remote device, which is configured to render the scene using the compressed frames.
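
The per-block rate computation described in this abstract can be pictured with a minimal Python sketch. It assumes each block already has a contribution weight between 0 and 1 and that the compression rate is expressed as a quantization parameter (QP). The function name `block_qps`, the QP range, and the linear mapping are illustrative assumptions, not details taken from the patent.

```python
def block_qps(contributions, qp_min=20, qp_max=44):
    """Map per-block contribution weights (0..1) to quantization parameters.

    Higher contribution -> lower QP (finer quantization, less compression).
    The linear mapping and the QP range are assumptions for illustration.
    """
    qps = []
    for row in contributions:
        qp_row = []
        for w in row:
            w = min(max(w, 0.0), 1.0)  # clamp the weight to [0, 1]
            qp_row.append(round(qp_max - w * (qp_max - qp_min)))
        qps.append(qp_row)
    return qps


# Hypothetical 2x3 grid of block contributions for one camera frame.
weights = [[1.0, 0.5, 0.0],
           [0.25, 0.75, 1.2]]
print(block_qps(weights))
```

Blocks that contribute little to the synthesized view receive a high QP and therefore fewer bits, which is the general idea the abstract describes.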

    Particle-based tracking of objects within images
    4.
    Granted patent
    Legal status: in force

    Publication No.: US09478033B1

    Publication date: 2016-10-25

    Application No.: US12848653

    Filing date: 2010-08-02

    Abstract: Objects are tracked within images. According to an example embodiment, video data is processed for tracking one or more objects. A computer circuit executes instructions to process a series of ordered video frames in a set of video data as follows. A region of interest is defined in an initial one of the video frames, and a particle mesh is formed from a set of feature points for an object in the defined region. The particle mesh is propagated to an adjacent video frame using motion vectors of points in the mesh to form a coarse boundary at the adjacent frame. Occlusion and scene boundaries are detected in the adjacent video frame, and the detected boundaries are used to set a boundary and a number of feature points within the boundary in the adjacent video frame to maintain mesh coherence and track the object in the subsequent video frames.
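
The propagation step can be sketched in a few lines of Python. This is a simplified illustration only: the function names are hypothetical, the motion vectors are assumed to be given per point, and an axis-aligned bounding box stands in for the coarse boundary. Occlusion and scene-boundary detection are not modelled.

```python
def propagate_mesh(points, motion_vectors):
    """Move each mesh point by its motion vector into the adjacent frame."""
    return [(x + dx, y + dy) for (x, y), (dx, dy) in zip(points, motion_vectors)]


def coarse_boundary(points):
    """Axis-aligned bounding box (x_min, y_min, x_max, y_max) of the points.

    A bounding box is an assumed stand-in for the coarse boundary.
    """
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return (min(xs), min(ys), max(xs), max(ys))


# Hypothetical feature points and per-point motion vectors.
mesh = [(10, 10), (20, 10), (15, 20)]
vectors = [(2, 1), (3, 1), (2, 2)]
moved = propagate_mesh(mesh, vectors)
print(moved)
print(coarse_boundary(moved))
```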

    Affine motion prediction in video coding
    5.
    Granted patent
    Legal status: in force

    Publication No.: US09438910B1

    Publication date: 2016-09-06

    Application No.: US14204100

    Filing date: 2014-03-11

    Applicant: Google Inc.

    Abstract: A block of a video frame may be predicted and encoded using a local affine motion prediction scheme. At least three points within the video frame are selected, each point being adjacent to a respective corner of the current block to be encoded. For each point, a motion search is performed for a template block centered about the point within a reference frame to find a template reference block. The template reference block has a center point that forms an anchor point pair with the corresponding point adjacent to the current block. Using coordinates of the anchor point pairs, affine coefficients for an affine transformation are generated. Using the affine transformation, a prediction block for the current block is generated and used to encode the current block within a bitstream. Providing the local affine motion prediction scheme can improve the prediction of certain blocks with complex motion, increasing compression.
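
Generating affine coefficients from three anchor point pairs amounts to solving two 3x3 linear systems. The Python sketch below shows one way to do that with Cramer's rule, assuming the model x' = a*x + b*y + c and y' = d*x + e*y + f. The function names and the sample coordinates are hypothetical, and the motion search that produces the pairs is not shown.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def solve3(m, rhs):
    """Solve a 3x3 linear system with Cramer's rule."""
    d = det3(m)
    if d == 0:
        raise ValueError("anchor points are collinear")
    out = []
    for col in range(3):
        mc = [row[:] for row in m]
        for r in range(3):
            mc[r][col] = rhs[r]  # replace one column with the right-hand side
        out.append(det3(mc) / d)
    return out


def affine_from_pairs(src, dst):
    """Return (a, b, c, d, e, f) with x' = a*x + b*y + c and y' = d*x + e*y + f."""
    m = [[x, y, 1.0] for x, y in src]
    a, b, c = solve3(m, [p[0] for p in dst])
    d, e, f = solve3(m, [p[1] for p in dst])
    return a, b, c, d, e, f


def apply_affine(coeffs, point):
    """Map a point of the current frame to its position in the reference frame."""
    a, b, c, d, e, f = coeffs
    x, y = point
    return (a * x + b * y + c, d * x + e * y + f)


# Hypothetical anchor pairs: points near three corners of a 16x16 block,
# and the matching template centers found in the reference frame.
current_points = [(0, 0), (16, 0), (0, 16)]
reference_points = [(2, 1), (18, 1), (2, 17)]
coeffs = affine_from_pairs(current_points, reference_points)
print(coeffs)
print(apply_affine(coeffs, (4, 4)))
```

Three non-collinear pairs give exactly six equations for the six coefficients. With more than three pairs, a least-squares fit would be the usual alternative.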

    System, components and method for parametric motion vector prediction for hybrid video coding
    7.
    Granted patent
    Legal status: in force

    Publication No.: US09066108B2

    Publication date: 2015-06-23

    Application No.: US13452364

    Filing date: 2012-04-20

    CPC classification: H04N19/54 H04N19/52 H04N19/61

    Abstract: Parametric Motion Vector Prediction (PMVP) methodologies, and components and systems for performing those methodologies, are provided to encode video content that includes complex motion, such as zoom or rotation, more effectively and efficiently. The PMVP is substituted for a collocated MVP used in HEVC in order to reduce the bit rate increase required when including the PMVP analysis in the bit stream. Further, compression of the motion vectors is provided in a three-stage approach based on transformation, quantization and difference coding.

    METHOD FOR PERFORMING HYBRID MULTIHYPOTHESIS PREDICTION DURING VIDEO CODING OF A CODING UNIT, AND ASSOCIATED APPARATUS
    8.
    Patent application
    Legal status: in force

    Publication No.: US20150131724A1

    Publication date: 2015-05-14

    Application No.: US14601247

    Filing date: 2015-01-21

    Applicant: MEDIATEK INC.

    Abstract: A method and apparatus for performing hybrid multihypothesis prediction during video coding of a coding unit includes: processing a plurality of sub-coding units in the coding unit; and performing disparity vector (DV) derivation when the coding unit is processed by a 3D or multi-view coding tool or performing block vector (BV) derivation when the coding unit is processed by intra picture block copy (IntraBC) mode. The step of performing DV or BV derivation includes deriving a plurality of vectors for multihypothesis motion-compensated prediction of a specific sub-coding unit from at least one other sub-coding/coding unit. The one other sub-coding/coding unit is coded before the corresponding DV or BV is derived for multihypothesis motion-compensated prediction of the specific sub-coding unit. A linear combination of a plurality of pixel values derived from the plurality of vectors is used as a predicted pixel value of the specific sub-coding unit.
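
The final step, a linear combination of pixel values fetched with several vectors, can be illustrated with a short Python sketch. It assumes the vectors have already been derived, stores the reference picture as a list of rows, and defaults to equal weights. The function names and the sample data are hypothetical, and bounds checking is left out.

```python
def fetch_block(frame, top, left, size):
    """Copy a size x size block from a frame stored as a list of rows."""
    return [row[left:left + size] for row in frame[top:top + size]]


def multihypothesis_predict(frame, top, left, size, vectors, weights=None):
    """Predict a block as a weighted sum of blocks displaced by each vector.

    vectors: list of (dy, dx) displacements; weights default to an equal average.
    Bounds checking is omitted for brevity.
    """
    if weights is None:
        weights = [1.0 / len(vectors)] * len(vectors)
    pred = [[0.0] * size for _ in range(size)]
    for (dy, dx), w in zip(vectors, weights):
        block = fetch_block(frame, top + dy, left + dx, size)
        for r in range(size):
            for c in range(size):
                pred[r][c] += w * block[r][c]
    return pred


# Hypothetical 6x6 reference picture and two derived vectors.
frame = [[r * 10 + c for c in range(6)] for r in range(6)]
print(multihypothesis_predict(frame, 2, 2, 2, [(-1, 0), (0, -1)]))
```

Passing explicit `weights` turns the plain average into a general linear combination.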

    VIDEO COMPRESSION REPOSITORY AND MODEL REUSE
    9.
    Patent application
    Legal status: in force

    Publication No.: US20150124874A1

    Publication date: 2015-05-07

    Application No.: US14527477

    Filing date: 2014-10-29

    IPC classification: H04N19/17 H04N19/136

    CPC classification: H04N19/54 H04N19/167

    Abstract: Systems and methods of improving video encoding/decoding efficiency may be provided. A feature-based processing stream is applied to video data having a series of video frames. Computer-vision-based feature and object detection algorithms identify regions of interest throughout the video datacube. The detected features and objects are modeled with a compact set of parameters, and similar feature/object instances are associated across frames. Associated features/objects are formed into tracks, and each track is given a representative, characteristic feature. Similar characteristic features are clustered and then stored in a model library, for reuse in the compression of other videos. A model-based compression framework makes use of the preserved model data by detecting features in a new video to be encoded, relating those features to specific blocks of data, and accessing similar model information from the model library. The formation of model libraries can be specialized to include personal, “smart” model libraries, differential libraries, and predictive libraries. Predictive model libraries can be modified to handle a variety of demand scenarios.

    Progressive mesh decoding apparatus and method
    10.
    Granted patent
    Legal status: in force

    Publication No.: US08928660B2

    Publication date: 2015-01-06

    Application No.: US12714069

    Filing date: 2010-02-26

    Applicant: Sung-yul Choe

    Inventor: Sung-yul Choe

    Abstract: A progressive mesh decoding apparatus, a progressive mesh decoding method and a computer readable recording medium using the same are provided. In the progressive mesh decoding method, a first desired resolution mesh is progressively reconstructed from a base mesh through vertex addition. Vertex level information indicating a resolution level of a mesh corresponding to a predetermined process of the first desired resolution mesh reconstructing process is recorded for respective vertices added at the predetermined process. In response to a second desired resolution mesh having a second desired resolution level lower than the first desired resolution level being requested, the second desired resolution mesh is reconstructed by deleting vertices from the first desired resolution mesh, wherein vertices to be deleted in a predetermined process of the second desired resolution mesh reconstructing process are determined based on the vertex level information.
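
The role of the vertex level information can be shown with a toy Python sketch. It tracks only which vertices exist and the level at which each was added. Mesh connectivity (edges and faces) is deliberately ignored, and the class and method names are hypothetical rather than taken from the patent.

```python
class ProgressiveMesh:
    """Toy vertex store that records the resolution level at which each vertex was added."""

    def __init__(self, base_vertices):
        # Base-mesh vertices are assigned level 0 (an assumption of this sketch).
        self.levels = {v: 0 for v in base_vertices}

    def refine(self, new_vertices, level):
        """Add vertices for one refinement step and record their level."""
        for v in new_vertices:
            self.levels[v] = level

    def coarsen_to(self, level):
        """Delete every vertex whose recorded level exceeds the requested level."""
        self.levels = {v: l for v, l in self.levels.items() if l <= level}

    def vertices(self):
        return sorted(self.levels)


mesh = ProgressiveMesh(["a", "b", "c"])
mesh.refine(["d"], level=1)
mesh.refine(["e", "f"], level=2)   # first desired resolution: level 2
mesh.coarsen_to(1)                 # second, lower resolution: level 1
print(mesh.vertices())
```

Because each vertex carries its level, the lower-resolution mesh is obtained by deletion alone, without rebuilding from the base mesh.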
