Systems and methods with error resilience in enhancement layer bitstream of scalable video coding
    81.
    发明申请
    Systems and methods with error resilience in enhancement layer bitstream of scalable video coding 有权
    在可扩展视频编码的增强层比特流中具有错误弹性的系统和方法

    公开(公告)号:US20050069036A1

    公开(公告)日:2005-03-31

    申请号:US10977773

    申请日:2004-10-29

    IPC分类号: G06T9/00 H04N19/89 H04N7/12

    摘要: A scalable layered video coding scheme that encodes video data frames into multiple layers, including a base layer of comparatively low quality video and multiple enhancement layers of increasingly higher quality video, adds error resilience to the enhancement layer. Unique resynchronization marks are inserted into the enhancement layer bitstream in headers associated with each video packet, headers associated with each bit plane, and headers associated with each video-of-plane (VOP) segment. Following transmission of the enhancement layer bitstream, the decoder tries to detect errors in the packets. Upon detection, the decoder seeks forward in the bitstream for the next known resynchronization mark. Once this mark is found, the decoder is able to begin decoding the next video packet. With the addition of many resynchronization marks within each frame, the decoder can recover very quickly and with minimal data loss in the event of a packet loss or channel error in the received enhancement layer i bitstream. The video coding scheme also facilitates redundant encoding of header information from the higher-level VOP header down into lower level bit plane headers and video packet headers. Header extension codes are added to the bit plane and video packet headers to identify whether the redundant data is included.

    摘要翻译: 将视频数据帧编码成多层的可分级的分层视频编码方案,包括具有越来越高质量视频的较低质量视频和多个增强层的基本层,向增强层增加了错误恢复能力。 将独特的再同步标记插入到与每个视频分组相关联的报头中的增强层比特流中,与每个位平面相关联的报头以及与每个视频平面(VOP)段相关联的报头。 在传输增强层比特流之后,解码器尝试检测分组中的错误。 在检测到时,解码器在比特流中向前寻找下一个已知的再同步标记。 一旦找到该标记,解码器就可以开始对下一个视频数据包进行解码。 通过在每个帧内添加许多重新同步标记,解码器可以在接收到的增强层i比特流中的分组丢失或信道错误的情况下非常快速地恢复并且具有最小的数据丢失。 视频编码方案还有助于将标题信息从较高级别的VOP头部冗余编码到较低级位平面头部和视频分组头部。 标题扩展码被添加到位平面和视频分组报头,以标识是否包括冗余数据。

    Systems and methods with error resilience in enchancement layer bitstream of scalable video coding
    82.
    发明申请
    Systems and methods with error resilience in enchancement layer bitstream of scalable video coding 失效
    可扩展视频编码的增强层比特流中具有错误弹性的系统和方法

    公开(公告)号:US20050063463A1

    公开(公告)日:2005-03-24

    申请号:US10977417

    申请日:2004-10-29

    IPC分类号: G06T9/00 H04N19/89 H04N7/12

    摘要: A scalable layered video coding scheme that encodes video data frames into multiple layers, including a base layer of comparatively low quality video and multiple enhancement layers of increasingly higher quality video, adds error resilience to the enhancement layer. Unique resynchronization marks are inserted into the enhancement layer bitstream in headers associated with each video packet, headers associated with each bit plane, and headers associated with each video-of-plane (VOP) segment. Following transmission of the enhancement layer bitstream, the decoder tries to detect errors in the packets. Upon detection, the decoder seeks forward in the bitstream for the next known resynchronization mark. Once this mark is found, the decoder is able to begin decoding the next video packet. With the addition of many resynchronization marks within each frame, the decoder can recover very quickly and with minimal data loss in the event of a packet loss or channel error in the received enhancement layer bitstream. The video coding scheme also facilitates redundant encoding of header information from the higher-level VOP header down into lower level bit plane headers and video packet headers. Header extension codes are added to the bit plane and video packet headers to identify whether the redundant data is included.

    摘要翻译: 将视频数据帧编码成多层的可分级的分层视频编码方案,包括具有越来越高质量视频的较低质量视频和多个增强层的基本层,向增强层增加了错误恢复能力。 将独特的再同步标记插入到与每个视频分组相关联的报头中的增强层比特流中,与每个位平面相关联的报头以及与每个视频平面(VOP)段相关联的报头。 在传输增强层比特流之后,解码器尝试检测分组中的错误。 在检测到时,解码器在比特流中向前寻找下一个已知的再同步标记。 一旦找到该标记,解码器就可以开始对下一个视频数据包进行解码。 通过在每个帧内添加许多重新同步标记,解码器可以在接收到的增强层比特流中的分组丢失或信道错误的情况下非常快速地恢复并且具有最小的数据丢失。 视频编码方案还有助于将标题信息从较高级别的VOP头部冗余编码到较低级位平面头部和视频分组头部。 标题扩展码被添加到位平面和视频分组报头,以标识是否包括冗余数据。

    Video coding using wavelet transform of pixel array formed with motion information
    83.
    发明申请
    Video coding using wavelet transform of pixel array formed with motion information 失效
    使用运动信息形成的像素阵列的小波变换的视频编码

    公开(公告)号:US20050008076A1

    公开(公告)日:2005-01-13

    申请号:US10901740

    申请日:2004-07-29

    IPC分类号: H04N7/12 H04N7/26

    摘要: A video encoding system and method utilizes a three-dimensional (3-D) wavelet transform and entropy coding that utilize motion information in a way to reduce the sensitivity to motion. In one implementation, the coding process initially estimates motion trajectories of pixels in a video object from frame to frame in a video sequence to account for motion of the video object throughout the frames. After motion estimation, a 3-D wavelet transform is applied in two parts. First, a temporal 1-D wavelet transform is applied to the corresponding pixels along the motion trajectories in a time direction. The temporal wavelet transform produces decomposed frames of temporal wavelet transforms, where the spatial correlation within each frame is well preserved. Second, a spatial 2-D wavelet transform is applied to all frames containing the temporal wavelet coefficients. The wavelet transforms produce coefficients within different sub-bands. The process then codes wavelet coefficients. In particular, the coefficients are assigned various contexts based on the significance of neighboring samples in previous, current, and next frame, thereby taking advantage of any motion information between frames. The wavelet coefficients are coded independently for each sub-band to permit easy separation at a decoder, making resolution scalability and temporal scalability natural and easy. During the coding, bits are allocated among sub-bands according to a technique that optimizes rate-distortion characteristics.

    摘要翻译: 视频编码系统和方法利用利用运动信息的三维(3-D)小波变换和熵编码来降低对运动的敏感度。 在一个实施方案中,编码过程最初估计视频序列中的视频对象中的像素的运动轨迹,以考虑视频对象在整个帧中的运动。 运动估计后,三维小波变换分为两部分。 首先,沿时间方向沿着运动轨迹将时间1-D小波变换应用于相应的像素。 时间小波变换产生时间小波变换的分解帧,其中每帧内的空间相关性保持良好。 其次,将空间二维小波变换应用于包含时间小波系数的所有帧。 小波变换产生不同子带内的系数。 然后,该过程编码小波系数。 特别地,根据前一帧,当前帧和下一帧中相邻采样的重要性,将系数分配给各种上下文,从而利用帧之间的任何运动信息。 对于每个子带,小波系数被独立地编码,以便在解码器处容易地分离,使得分辨率可扩展性和时间可扩展性自然而简单。 在编码期间,根据优化速率失真特性的技术,在子带中分配比特。

    System and method with advance predicted bit-plane coding for progressive fine-granularity scalable (PFGS) video coding
    84.
    发明授权
    System and method with advance predicted bit-plane coding for progressive fine-granularity scalable (PFGS) video coding 有权
    具有用于逐行细粒度可缩放(PFGS)视频编码的预测位平面编码的系统和方法

    公开(公告)号:US06700933B1

    公开(公告)日:2004-03-02

    申请号:US09505254

    申请日:2000-02-15

    IPC分类号: H04N712

    摘要: A video encoding scheme employs progressive fine-granularity layered coding to encode video data frames into multiple layers, including a base layer of comparatively low quality video and multiple enhancement layers of increasingly higher quality video. Some of the enhancement layers in a current frame are predicted from at least one same or lower quality layer in a reference frame, whereby the lower quality layer is not necessarily the base layer. Use of multiple reference layers of different quality results in occasional fluctuations in the encoded image data. The video encoding scheme efficiently eliminates such fluctuations by predicting higher quality data from the lower quality data encoded in the base layer and a low quality enhancement layer.

    摘要翻译: 视频编码方案采用逐行细粒度分层编码将视频数据帧编码为多层,包括具有较低质量视频的基本层和越来越高质量视频的多个增强层。 当前帧中的一些增强层从参考帧中的至少一个相同质量层或较低质量层预测,由此较低质量层不一定是基本层。 使用不同质量的多个参考层导致编码图像数据的偶尔波动。 视频编码方案通过从基层编码的较低质量数据和低质量增强层预测更高质量的数据来有效地消除这种波动。

    System and method for robust image representation over error-prone channels
    86.
    发明授权
    System and method for robust image representation over error-prone channels 有权
    用于在容易出错的信道上进行鲁棒图像表示的系统和方法

    公开(公告)号:US06621935B1

    公开(公告)日:2003-09-16

    申请号:US09561686

    申请日:2000-05-01

    IPC分类号: G06K936

    CPC分类号: G06T9/008

    摘要: An image distribution system has a source that encodes digital images and transmits them over an error-prone channel to a destination. The source has an image coder that processes the digital images using vector transformation followed by vector quantization. This produces groups of vectors and quantized values that are representative of the images. The image coder orders the vectors in the codebooks and assigns vector indexes to the vectors such that a bit error occurring at a less significant bit in a vector index results in less distortion than a bit error occurring at a more significant bit. Depending upon the format and the capabilities of the source and destination, the image coder may allocate different numbers of bits to different groups of vectors according to a bit allocation map for this allocation process. The source also has a UEP (Unequal Error Protection) coder that layers the vector indexes according to their significance. Two possible approaches include frequency-based UEP and bit-plane based UEP. The source transmits a bitstream that includes the image values, a bit allocation map, and the layered vector indexes. The destination receives the bitstream and recovers the vectors using the vector indexes and bit allocation map. The destination then reconstructs the image from the image values and the vectors.

    摘要翻译: 图像分配系统具有对数字图像进行编码并将其通过易出错的信道发送到目的地的源。 源具有图像编码器,其使用向量变换处理数字图像,随后进行矢量量化。 这产生代表图像的矢量组和量化值。 图像编码器对码本中的向量进行排序,并向向量分配向量索引,使得在矢量索引中以较低有效位发生的位错误导致比在更高有效位处发生的位错误更少的失真。 根据源和目的地的格式和能力,图像编码器可以根据用于该分配处理的比特分配图来向不同的向量组分配不同数量的比特。 该源还具有UEP(不等错误保护)编码器,其根据其意义对向量索引进行分层。 两种可能的方法包括基于频率的UEP和基于位平面的UEP。 源发送包括图像值,比特分配图和分层向量索引的比特流。 目的地接收比特流并使用向量索引和比特分配映射来恢复向量。 目的地然后从图像值和向量重建图像。

    System and method for robust video coding using progressive fine-granularity scalable (PFGS) coding
    87.
    发明授权
    System and method for robust video coding using progressive fine-granularity scalable (PFGS) coding 有权
    用于使用逐行细粒度可扩展(PFGS)编码的鲁棒视频编码的系统和方法

    公开(公告)号:US06614936B1

    公开(公告)日:2003-09-02

    申请号:US09454489

    申请日:1999-12-03

    IPC分类号: G06K936

    摘要: A video encoding scheme employs progressive fine-granularity layered coding to encode video data frames into multiple layers, including a base layer of comparatively low quality video and multiple enhancement layers of increasingly higher quality video. Some of the enhancement layers in a current frame are predicted from at least one lower quality layer in a reference frame, whereby the lower quality layer is not necessarily the base layer.

    摘要翻译: 视频编码方案采用逐行细粒度分层编码将视频数据帧编码为多层,包括具有较低质量视频的基本层和越来越高质量视频的多个增强层。 当前帧中的一些增强层从参考帧中的至少一个较低质量层预测,由此较低质量层不一定是基本层。

    Three-dimensional shape-adaptive wavelet transform for efficient object-based video coding
    88.
    发明授权
    Three-dimensional shape-adaptive wavelet transform for efficient object-based video coding 有权
    用于有效的基于对象的视频编码的三维形状自适应小波变换

    公开(公告)号:US06597739B1

    公开(公告)日:2003-07-22

    申请号:US09597976

    申请日:2000-06-20

    IPC分类号: H04N730

    摘要: A three-dimensional (3D) shape-adaptive discrete wavelet transform (SA-DWT) is provided for efficient object-based video coding. In a first stage, a one-dimensional SA-DWT is performed along the temporal direction among pixels that have temporal correspondence. The correspondence can be established by motion estimation or other matching approaches. SA-DWT in the temporal direction is used to treat emerging pixels, terminating pixels or pixels that have colliding correspondence pixels. After the temporal SA-DWT transform, the resulting temporal wavelet coefficients are placed in the spatial positions corresponding to the original pixels to maintain the spatial correlation within each frame. Then, in a second stage, a two-dimensional SA-DWT is applied to the temporal SA-DWT coefficients within each frame. The 3D SA-DWT can handle arbitrarily shaped video objects while providing flexible spatial and temporal scalability as in any wavelet-based coding scheme. The 3D SA-DWT can also track the video object motion and perform the wavelet transform among corresponding pixels for that object while keeping the spatial correlation within a frame.

    摘要翻译: 提供三维(3D)形状自适应离散小波变换(SA-DWT),用于高效的基于对象的视频编码。 在第一阶段中,沿着具有时间对应关系的像素之间的时间方向执行一维SA-DWT。 通信可以通过运动估计或其他匹配方法建立。 时间方向的SA-DWT用于处理新出现的像素,终止具有碰撞对应像素的像素或像素。 在时间SA-DWT变换之后,将所得到的时间小波系数放置在与原始像素对应的空间位置中,以保持每帧内的空间相关性。 然后,在第二阶段中,将二维SA-DWT应用于每帧内的时间SA-DWT系数。 3D SA-DWT可以处理任意形状的视频对象,同时提供灵活的空间和时间可伸缩性,如在任何基于小波的编码方案中。 3D SA-DWT还可以跟踪视频对象运动,并在该对象的相应像素之间执行小波变换,同时保持帧内的空间相关性。

    Apparatus and method for quadtree based variable block size motion
estimation
    90.
    发明授权
    Apparatus and method for quadtree based variable block size motion estimation 失效
    用于基于四叉树的可变块大小运动估计的装置和方法

    公开(公告)号:US6084908A

    公开(公告)日:2000-07-04

    申请号:US735869

    申请日:1996-10-23

    摘要: A method and apparatus for determining an optimal quadtree structure for quadtree-based variable block size (VBS) motion estimation. The method computes the motion vectors for the entire quadtree from the largest block-size to the smallest block-size. Next, the method may optionally select an optimal quantizer scale for each block. The method then compares from "bottom-up" the sum of the distortion from encoding all sub-blocks or sub-nodes (children) as compared to the distortion from encoding the block or node (parent) from which the subnodes are partitioned from. If the sum of the distortion from encoding the children is greater than that of the parent then the node is "merged". Conversely, if the sum of the distortion from encoding the children is less than that of the parent then the node is "split" and the Lagrangian cost for the parent node is set as the sum of the Lagrangian cost of its children. This step is repeated for the all nodes through every level until an optimal quadtree structure is obtained.

    摘要翻译: 一种用于确定用于基于四叉树的可变块大小(VBS)运动估计的最佳四叉树结构的方法和装置。 该方法从最大块大小到最小块大小计算整个四叉树的运动矢量。 接下来,该方法可以可选地为每个块选择最佳量化器标度。 然后,该方法从与所编码的所有子块或子节点(子节点)中的失真和的总和相比,从编码从子节点被分割的块或节点(父节点)的失真相比较“自下而上”。 如果来自编码孩子的失真的总和大于父节点的失真的总和,则节点被“合并”。 相反,如果来自编码孩子的失真的总和小于父母的失真之和,则节点“分裂”,并且父节点的拉格朗日代价被设置为其孩子的拉格朗日代价的总和。 通过每个级别对所有节点重复此步骤,直到获得最佳四叉树结构。