Video coding system and method using 3-D discrete wavelet transform and entropy coding with motion information
    31.
    发明授权
    Video coding system and method using 3-D discrete wavelet transform and entropy coding with motion information 失效
    使用3-D离散小波变换和运动信息熵编码的视频编码系统和方法

    公开(公告)号:US07680189B2

    公开(公告)日:2010-03-16

    申请号:US10984159

    申请日:2004-11-09

    摘要: A video encoding system and method utilizes a three-dimensional (3-D) wavelet transform and entropy coding that utilize motion information in a way to reduce the sensitivity to motion. In one implementation, the coding process initially estimates motion trajectories of pixels in a video object from frame to frame in a video sequence to account for motion of the video object throughout the frames. After motion estimation, a wavelet transform is applied to produce coefficients within different sub-bands. The wavelet coefficients are coded independently for each sub-band to permit easy separation at a decoder, making resolution scalability and temporal scalability natural and easy. In particular, the coefficients are assigned various contexts based on the significance of neighboring samples in previous, current, and next frame, thereby taking advantage of any motion information between frames.

    摘要翻译: 视频编码系统和方法利用利用运动信息的三维(3-D)小波变换和熵编码来降低对运动的敏感度。 在一个实施方案中,编码过程最初估计视频序列中的视频对象中的像素的运动轨迹,以考虑视频对象在整个帧中的运动。 运动估计后,应用小波变换产生不同子带内的系数。 对于每个子带,小波系数被独立地编码,以便在解码器处容易地分离,使得分辨率可扩展性和时间可扩展性自然而简单。 特别地,根据前一帧,当前帧和下一帧中相邻采样的重要性,将系数分配给各种上下文,从而利用帧之间的任何运动信息。

    Systems and methods with error resilience in enhancement layer bitstream of scalable video coding
    32.
    发明授权
    Systems and methods with error resilience in enhancement layer bitstream of scalable video coding 失效
    在可扩展视频编码的增强层比特流中具有错误弹性的系统和方法

    公开(公告)号:US07664185B2

    公开(公告)日:2010-02-16

    申请号:US10978277

    申请日:2004-10-29

    IPC分类号: H04N7/18

    摘要: A scalable layered video coding scheme that encodes video data frames into multiple layers, including a base layer of comparatively low quality video and multiple enhancement layers of increasingly higher quality video, adds error resilience to the enhancement layer. Unique resynchronization marks are inserted into the enhancement layer bitstream in headers associated with each video packet, headers associated with each bit plane, and headers associated with each video-of-plane (VOP) segment. Following transmission of the enhancement layer bitstream, the decoder tries to detect errors in the packets. Upon detection, the decoder seeks forward in the bitstream for the next known resynchronization mark. Once this mark is found, the decoder is able to begin decoding the next video packet. With the addition of many resynchronization marks within each frame, the decoder can recover very quickly and with minimal data loss in the event of a packet loss or channel error in the received enhancement layer bitstream. The video coding scheme also facilitates redundant encoding of header information from the higher-level VOP header down into lower level bit plane headers and video packet headers. Header extension codes are added to the bit plane and video packet headers to identify whether the redundant data is included.

    摘要翻译: 将视频数据帧编码成多层的可分级的分层视频编码方案,包括具有越来越高质量视频的较低质量视频和多个增强层的基本层,向增强层增加了错误恢复能力。 将独特的再同步标记插入到与每个视频分组相关联的报头中的增强层比特流中,与每个位平面相关联的报头以及与每个视频平面(VOP)段相关联的报头。 在传输增强层比特流之后,解码器尝试检测分组中的错误。 在检测到时,解码器在比特流中向前寻找下一个已知的再同步标记。 一旦找到该标记,解码器就可以开始对下一个视频数据包进行解码。 通过在每个帧内添加许多重新同步标记,解码器可以在接收到的增强层比特流中的分组丢失或信道错误的情况下非常快速地恢复并且具有最小的数据丢失。 视频编码方案还有助于将标题信息从较高级别的VOP头部冗余编码到较低级位平面头部和视频分组头部。 标题扩展码被添加到位平面和视频分组报头,以标识是否包括冗余数据。

    Video coding using wavelet transform of pixel array formed with motion information
    33.
    发明授权
    Video coding using wavelet transform of pixel array formed with motion information 失效
    使用运动信息形成的像素阵列的小波变换的视频编码

    公开(公告)号:US07653134B2

    公开(公告)日:2010-01-26

    申请号:US10901740

    申请日:2004-07-29

    摘要: A video encoding system and method utilizes a three-dimensional (3-D) wavelet transform and entropy coding that utilize motion information in a way to reduce the sensitivity to motion. In one implementation, the coding process initially estimates motion trajectories of pixels in a video object from frame to frame in a video sequence to account for motion of the video object throughout the frames. After motion estimation, a 3-D wavelet transform is applied in two parts. First, a temporal 1-D wavelet transform is applied to the corresponding pixels along the motion trajectories in a time direction. The temporal wavelet transform produces decomposed frames of temporal wavelet transforms, where the spatial correlation within each frame is well preserved. Second, a spatial 2-D wavelet transform is applied to all frames containing the temporal wavelet coefficients. The wavelet transforms produce coefficients within different sub-bands. The process then codes wavelet coefficients. In particular, the coefficients are assigned various contexts based on the significance of neighboring samples in previous, current, and next frame, thereby taking advantage of any motion information between frames. The wavelet coefficients are coded independently for each sub-band to permit easy separation at a decoder, making resolution scalability and temporal scalability natural and easy. During the coding, bits are allocated among sub-bands according to a technique that optimizes rate-distortion characteristics.

    摘要翻译: 视频编码系统和方法利用利用运动信息的三维(3-D)小波变换和熵编码来降低对运动的敏感度。 在一个实施方案中,编码过程最初估计视频序列中的视频对象中的像素的运动轨迹,以考虑视频对象在整个帧中的运动。 运动估计后,三维小波变换分为两部分。 首先,沿时间方向沿着运动轨迹将时间1-D小波变换应用于相应的像素。 时间小波变换产生时间小波变换的分解帧,其中每帧内的空间相关性保持良好。 其次,将空间二维小波变换应用于包含时间小波系数的所有帧。 小波变换产生不同子带内的系数。 然后,该过程编码小波系数。 特别地,根据前一帧,当前帧和下一帧中相邻采样的重要性,将系数分配给各种上下文,从而利用帧之间的任何运动信息。 对于每个子带,小波系数被独立地编码,以便在解码器处容易地分离,使得分辨率可扩展性和时间可扩展性自然而简单。 在编码期间,根据优化速率失真特性的技术,在子带中分配比特。

    Generalized differentiation methods and arrangements for adaptive multimedia communications
    36.
    发明授权
    Generalized differentiation methods and arrangements for adaptive multimedia communications 有权
    广义差分方法和自适应多媒体通信的安排

    公开(公告)号:US07389356B2

    公开(公告)日:2008-06-17

    申请号:US09862410

    申请日:2001-05-21

    摘要: Various methods and arrangements are provided for transmitting adaptive multimedia content over networks that provide differential services. By way of example, one method includes compressing video objects, generating at least one corresponding elementary stream containing the compressed video objects, classifying information within each elementary stream based on importance, and assembling the classified information into packets associated with different classes of network packets. In classifying the information within each elementary stream based on importance, different priority levels can be assigned to shape, motion, and texture information. Methods and arrangements are also provided for use with multimedia content information that includes audio information, image information, textual information, and the like.

    摘要翻译: 提供各种方法和装置用于通过提供差分服务的网络传输自适应多媒体内容。 作为示例,一种方法包括压缩视频对象,生成包含压缩视频对象的至少一个对应基本流,基于重要性对每个基本流内的信息进行分类,以及将分类信息组合成与不同类别的网络分组相关联的分组。 在基于重要性对每个基本流中的信息进行分类时,可以将不同的优先级分配给形状,运动和纹理信息。 还提供了与包括音频信息,图像信息,文本信息等的多媒体内容信息一起使用的方法和装置。

    Automatic video object extraction
    40.
    发明授权
    Automatic video object extraction 有权
    视频对象自动提取

    公开(公告)号:US06785329B1

    公开(公告)日:2004-08-31

    申请号:US09468985

    申请日:1999-12-21

    IPC分类号: H04N732

    摘要: Automatic video object extraction that defines substantially precise objects is disclosed. In one embodiment, color segmentation and motion segmentation are performed on a source video. The color segmentation segments the video by substantially uniform color regions thereof. The motion segmentation segments the video by moving regions thereof. The color regions and the moving regions are then combined to define the video objects. In varying embodiments, pre-processing and post-processing is performed to further clean the source video and the video objects defined, respectively.

    摘要翻译: 公开了定义基本精确对象的自动视频对象提取。 在一个实施例中,对源视频执行颜色分割和运动分割。 颜色分割通过其基本上均匀的颜色区域来分割视频。 运动分割通过其移动区域来分割视频。 然后组合颜色区域和移动区域以定义视频对象。 在不同的实施例中,执行预处理和后处理以进一步清洁分别定义的源视频和视频对象。