Intra compression of pixel blocks using predicted mean
    1.
    发明授权
    Intra compression of pixel blocks using predicted mean 有权
    使用预测平均值的像素块的内部压缩

    公开(公告)号:US07162091B2

    公开(公告)日:2007-01-09

    申请号:US10403780

    申请日:2003-03-31

    IPC分类号: G06K9/36 G06K9/46

    摘要: An apparatus and method for encoding video frames is provided. The video frames are divided into blocks for encoding. Encoding of the video blocks utilizes motion detection, motion estimation and adaptive compression, to obtain the desired compression for a particular bit rate. Adaptive compression includes intra compression (without regard to other frames) and inter compression (with regard to other frames). Intra compression, inter compression with motion detection, and inter compression with motion estimation are performed on a block by block basis, as needed. Segmentation is provided to compare encoding of a block with encoding of its sub-blocks, and to select the best block size for encoding.

    摘要翻译: 提供了一种用于编码视频帧的装置和方法。 视频帧被分成块进行编码。 视频块的编码利用运动检测,运动估计和自适应压缩,以获得针对特定比特率的期望的压缩。 自适应压缩包括帧内压缩(不考虑其他帧)和帧间压缩(关于其他帧)。 根据需要逐块地执行内部压缩,具有运动检测的帧间压缩和具有运动估计的帧间压缩。 提供分段以比较块的编码与其子块的编码,并选择用于编码的最佳块大小。

    Receiver-driven layered error correction multicast over heterogeneous packet networks
    2.
    发明授权
    Receiver-driven layered error correction multicast over heterogeneous packet networks 有权
    接收器驱动的分层纠错多播在异构分组网络上

    公开(公告)号:US06996097B1

    公开(公告)日:2006-02-07

    申请号:US09315810

    申请日:1999-05-21

    IPC分类号: H04L12/56

    摘要: A system and method for correcting errors and losses occurring during a receiver-driven layered multicast (RLM) of real-time media over a heterogeneous packet network such as the Internet. This is accomplished by augmenting RLM with one or more layers of error correction information. This allows each receiver to separately optimize the quality of received audio and video information by subscribing to at least one error correction layer. Ideally, each source layer in a RLM would have one or more multicasted error correction data streams (i.e., layers) associated therewith. Each of the error correction layers would contain information that can be used to replace lost packets from the associated source layer. More than one error correction layer is proposed as some of the error correction packets contained in the data stream needed to replace the packets lost in the associated source stream may themselves be lost in transmission. A preferred process for generating the error correction streams involves the use of a unique adaptation of the Forward Error Correction (FEC) techniques. This process encodes the transmission data using a linear transform which adds redundant elements. The redundancy permits losses to be corrected because any of the original data elements can be derived from any of the encoded elements. Thus, as long as enough of the encoded data elements are received so as to equal the number of the original data elements, it is possible to derive all the original elements.

    摘要翻译: 一种用于在异构分组网络(例如因特网)下校正在实时媒体的接收机驱动分层多播(RLM)期间发生的错误和损失的系统和方法。 这是通过用一层或多层纠错信息增强RLM来实现的。 这允许每个接收机通过订阅至少一个纠错层来分别优化所接收的音频和视频信息的质量。 理想地,RLM中的每个源层将具有与其相关联的一个或多个多播的纠错数据流(即,层)。 每个纠错层将包含可用于替换相关源层丢失的分组的信息。 提出了多于一个纠错层,因为包含在替换相关源流中丢失的分组所需的数据流中的一些纠错分组本身可能在传输中丢失。 用于产生纠错流的优选过程涉及使用前向纠错(FEC)技术的唯一适配。 该过程使用添加冗余元素的线性变换对传输数据进行编码。 冗余允许修正损失,因为任何原始数据元素可以从任何编码元素导出。 因此,只要接收到足够的编码数据元素以便等于原始数据元素的数量,就有可能导出所有的原始元素。

    Apparatus and method for unequal error protection in multiple-description coding using overcomplete expansions
    3.
    发明授权
    Apparatus and method for unequal error protection in multiple-description coding using overcomplete expansions 有权
    使用不完全扩展的多描述编码中的不等错误保护的装置和方法

    公开(公告)号:US06460153B1

    公开(公告)日:2002-10-01

    申请号:US09276955

    申请日:1999-03-26

    IPC分类号: H03M1300

    CPC分类号: H04N19/89

    摘要: A projection onto convex sets (POCS)-based method for consistent reconstruction of a signal from a subset of quantized coefficients received from an N×K overcomplete transform. By choosing a frame operator F to be the concatenization of two or more K×K invertible transforms, the POCS projections are calculated in RK space using only the K×K transforms and their inverses, rather than the larger RN space using pseudo inverse transforms. Practical reconstructions are enabled based on, for example, wavelet, subband, or lapped transforms of an entire image. In one embodiment, unequal error protection for multiple description source coding is provided. In particular, given a bit-plane representation of the coefficients in an overcomplete representation of the source, one embodiment of the present invention provides coding the most significant bits with the highest redundancy and the least significant bits with the lowest redundancy. In one embodiment, this is accomplished by varying the quantization stepsize for the different coefficients. Then, the available received quantized coefficients are decoded using a method based on alternating projections onto convex sets.

    摘要翻译: 基于凸集(POCS)的方法的投影,用于从从NxK过完全变换接收的量化系数的子集的信号的一致重构。 通过选择一个帧运算符F作为两个或多个KxK可逆变换的并置,POCS投影在RK空间中仅使用KxK变换及其反转而不是使用伪逆变换的较大的RN空间来计算。 基于例如整个图像的小波,子带或重叠变换来实现实际重建。 在一个实施例中,提供了用于多描述源编码的不等差错保护。 特别地,给定源的过完整表示中的系数的位平面表示,本发明的一个实施例提供了具有最高冗余度的最高有效位和具有最低冗余度的最低有效位的编码。 在一个实施例中,这通过改变不同系数的量化步长来实现。 然后,使用基于在凸集上的交替投影的方法对可用的接收量化系数进行解码。

    Receiver-driven layered error correction multicast over heterogeneous packet networks
    4.
    发明授权
    Receiver-driven layered error correction multicast over heterogeneous packet networks 有权
    接收器驱动的分层纠错多播在异构分组网络上

    公开(公告)号:US06594798B1

    公开(公告)日:2003-07-15

    申请号:US09316696

    申请日:1999-05-21

    IPC分类号: G06F702

    摘要: Correction of errors and losses in a receiver-driven layered multicast (RLM) of real-time media over a network is augmented using one or more layers of error correction information. Each receiver separately optimizes the quality of received information by subscribing to at least one error correction layer. Ideally, each source layer in a RLM has one or more associated multicasted error correction data streams. Each error correction layer contains information for replacing lost packets from the associated source layer. More than one error correction layer is proposed to correct for lost packets in other error correction layers. Error correction streams are preferably generated using a pseudo-Automatic Repeat Request (ARQ) wherein a broadcaster sends both the source packets in a primary stream and delayed versions thereof in one or more redundant streams. A hybrid technique combines the psuedo-ARQ method with an adaptation of Forward Error Correction (FEC) techniques.

    摘要翻译: 使用一层或多层纠错信息来增强网络中实时媒体的接收机驱动的分层多播(RLM)中的错误和损失的校正。 每个接收器通过订阅至少一个纠错层分别优化接收信息的质量。 理想地,RLM中的每个源层具有一个或多个关联的多播纠错数据流。 每个纠错层包含用于替换相关源层丢失数据包的信息。 提出了多个纠错层来纠正其他纠错层中的丢包。 优选地,使用伪自动重传请求(ARQ)生成错误校正流,其中广播者在一个或多个冗余流中发送主流中的源分组和其延迟版本。 混合技术将伪秩ARQ方法与前向纠错(FEC)技术相结合。

    Intra compression of pixel blocks using predicted mean
    5.
    发明授权
    Intra compression of pixel blocks using predicted mean 失效
    使用预测平均值的像素块的内部压缩

    公开(公告)号:US06571016B1

    公开(公告)日:2003-05-27

    申请号:US08850957

    申请日:1997-05-05

    IPC分类号: G06K936

    摘要: An apparatus and method for encoding video frames is provided. The video frames are divided into blocks for encoding. Encoding of the video blocks utilizes motion detection, motion estimation and adaptive compression, to obtain the desired compression for a particular bit rate. Adaptive compression includes intra compression (without regard to other frames) and inter compression (with regard to other frames). Intra compression, inter compression with motion detection, and inter compression with motion estimation are performed on a block by block basis, as needed. Segmentation is provided to compare encoding of a block with encoding of its sub-blocks, and to select the best block size for encoding.

    摘要翻译: 提供了一种用于编码视频帧的装置和方法。 视频帧被分成块进行编码。 视频块的编码利用运动检测,运动估计和自适应压缩,以获得针对特定比特率的期望的压缩。 自适应压缩包括帧内压缩(不考虑其他帧)和帧间压缩(关于其他帧)。 根据需要逐块地执行内部压缩,具有运动检测的帧间压缩和具有运动估计的帧间压缩。 提供分段以比较块的编码与其子块的编码,并选择用于编码的最佳块大小。

    Receiver-driven layered error correction multicast over heterogeneous packet networks
    7.
    发明授权
    Receiver-driven layered error correction multicast over heterogeneous packet networks 有权
    接收器驱动的分层纠错多播在异构分组网络上

    公开(公告)号:US06532562B1

    公开(公告)日:2003-03-11

    申请号:US09316869

    申请日:1999-05-21

    IPC分类号: G06F1100

    摘要: “Correction of errors and losses occurring during a receiver-driven layered multicast (RLM) of real-time media over a heterogeneous packet network such as the Internet is accomplished by augmenting RLM with one or more layers of error correction information. Each receiver separately optimizes the quality of received audio and video information by subscribing to at least one error correction layer. Ideally, each source layer in a RLM would have one or more associated multicasted error correction data streams (i.e., layers). Each error correction layer contains information that can be used to replace lost packets from the associated source layer. More than one error correction layer is proposed as some of the error correction packets contained in the data stream needed to replace the packets lost in the associated source stream may themselves be lost in transmission.”

    摘要翻译: 通过异构分组网络(如Internet)的实时媒体接收机驱动分层多播(RLM)中发生的错误和损失的纠正是通过用一层或多层纠错信息增强RLM来实现的,每个接收机分别优化 通过订阅至少一个纠错层,接收的音频和视频信息的质量理想地,RLM中的每个源层将具有一个或多个关联的多播纠错数据流(即,层),每个纠错层包含 可以使用来自相关源层的丢失分组来替代不止一个纠错层,因为包含在数据流中的一些纠错分组被包含在替换相关源流中丢失的分组所需的数据流本身可能在传输中丢失 “。

    Reconstruction of missing coefficients of overcomplete linear transforms using projections onto convex sets
    8.
    发明授权
    Reconstruction of missing coefficients of overcomplete linear transforms using projections onto convex sets 失效
    使用投影到凸集上重建缺失的完全线性变换系数

    公开(公告)号:US06470469B1

    公开(公告)日:2002-10-22

    申请号:US09276842

    申请日:1999-03-26

    IPC分类号: H03M1300

    摘要: A projection onto convex sets (POCS)-based method for consistent reconstruction of a signal from a subset of quantized coefficients received from an N×K overcomplete transform. By choosing a frame operator F to be the concatenization of two or more K×K invertible transforms, the POCS projections are calculated in RK space using only the K×K transforms and their inverses, rather than the larger RN space using pseudo inverse transforms. Practical reconstructions are enabled based on, for example, wavelet, subband, or lapped transforms of an entire image. In one embodiment, unequal error protection for multiple description source coding is provided. In particular, given a bit-plane representation of the coefficients in an overcomplete representation of the source, one embodiment of the present invention provides coding the most significant bits with the highest redundancy and the least significant bits with the lowest redundancy. In one embodiment, this is accomplished by varying the quantization stepsize for the different coefficients. Then, the available received quantized coefficients are decoded using a method based on alternating projections onto convex sets.

    摘要翻译: 基于凸集(POCS)的方法的投影,用于从从NxK过完全变换接收的量化系数的子集的信号的一致重构。 通过选择一个帧运算符F作为两个或多个KxK可逆变换的并置,POCS投影在RK空间中仅使用KxK变换及其反转而不是使用伪逆变换的较大的RN空间来计算。 基于例如整个图像的小波,子带或重叠变换来实现实际重建。 在一个实施例中,提供了用于多描述源编码的不等差错保护。 特别地,给定源的过完整表示中的系数的位平面表示,本发明的一个实施例提供了具有最高冗余度的最高有效位和具有最低冗余度的最低有效位的编码。 在一个实施例中,这通过改变不同系数的量化步长来实现。 然后,使用基于在凸集上的交替投影的方法对可用的接收量化系数进行解码。

    Efficient coding of digital media spectral data using wide-sense perceptual similarity
    10.
    发明授权
    Efficient coding of digital media spectral data using wide-sense perceptual similarity 有权
    使用广义感知相似性对数字媒体光谱数据进行高效编码

    公开(公告)号:US08645127B2

    公开(公告)日:2014-02-04

    申请号:US12324689

    申请日:2008-11-26

    IPC分类号: G10L11/04

    摘要: Traditional audio encoders may conserve coding bit-rate by encoding fewer than all spectral coefficients, which can produce a blurry low-pass sound in the reconstruction. An audio encoder using wide-sense perceptual similarity improves the quality by encoding a perceptually similar version of the omitted spectral coefficients, represented as a scaled version of already coded spectrum. The omitted spectral coefficients are divided into a number of sub-bands. The sub-bands are encoded as two parameters: a scale factor, which may represent the energy in the band; and a shape parameter, which may represent a shape of the band. The shape parameter may be in the form of a motion vector pointing to a portion of the already coded spectrum, an index to a spectral shape in a fixed code-book, or a random noise vector. The encoding thus efficiently represents a scaled version of a similarly shaped portion of spectrum to be copied at decoding.

    摘要翻译: 传统的音频编码器可以通过编码少于所有频谱系数来节省编码比特率,这可以在重建中产生模糊的低通声音。 使用广义感知相似性的音频编码器通过编码被忽略的频谱系数的感知相似版本来提高质量,表示为已编码频谱的缩放版本。 省略的频谱系数被划分为多个子带。 子带被编码为两个参数:比例因子,其可以表示频带中的能量; 以及形状参数,其可以表示带的形状。 形状参数可以是指向已编码频谱的一部分的运动矢量的形式,固定码本中的频谱形状的索引或随机噪声向量。 因此,编码有效地表示在解码时要复制的类似形状的频谱部分的缩放版本。