Apparatus and method for quadtree based variable block size motion
estimation
    12.
    发明授权
    Apparatus and method for quadtree based variable block size motion estimation 失效
    用于基于四叉树的可变块大小运动估计的装置和方法

    公开(公告)号:US6084908A

    公开(公告)日:2000-07-04

    申请号:US735869

    申请日:1996-10-23

    摘要: A method and apparatus for determining an optimal quadtree structure for quadtree-based variable block size (VBS) motion estimation. The method computes the motion vectors for the entire quadtree from the largest block-size to the smallest block-size. Next, the method may optionally select an optimal quantizer scale for each block. The method then compares from "bottom-up" the sum of the distortion from encoding all sub-blocks or sub-nodes (children) as compared to the distortion from encoding the block or node (parent) from which the subnodes are partitioned from. If the sum of the distortion from encoding the children is greater than that of the parent then the node is "merged". Conversely, if the sum of the distortion from encoding the children is less than that of the parent then the node is "split" and the Lagrangian cost for the parent node is set as the sum of the Lagrangian cost of its children. This step is repeated for the all nodes through every level until an optimal quadtree structure is obtained.

    摘要翻译: 一种用于确定用于基于四叉树的可变块大小(VBS)运动估计的最佳四叉树结构的方法和装置。 该方法从最大块大小到最小块大小计算整个四叉树的运动矢量。 接下来,该方法可以可选地为每个块选择最佳量化器标度。 然后,该方法从与所编码的所有子块或子节点(子节点)中的失真和的总和相比,从编码从子节点被分割的块或节点(父节点)的失真相比较“自下而上”。 如果来自编码孩子的失真的总和大于父节点的失真的总和,则节点被“合并”。 相反,如果来自编码孩子的失真的总和小于父母的失真之和,则节点“分裂”,并且父节点的拉格朗日代价被设置为其孩子的拉格朗日代价的总和。 通过每个级别对所有节点重复此步骤,直到获得最佳四叉树结构。

    Computer system for statistical multiplexing of bitstreams
    13.
    发明授权
    Computer system for statistical multiplexing of bitstreams 有权
    用于比特流统计复用的计算机系统

    公开(公告)号:US06754241B1

    公开(公告)日:2004-06-22

    申请号:US09478128

    申请日:2000-01-05

    IPC分类号: H04J302

    摘要: A PC-type computer has a system bus (e.g., a PCI bus) configured with a main CPU board, a statistical multiplexing (stat-mux) board, and a plurality of video/audio encoder boards, each configured to receive and compress a corresponding video/audio stream. The stat-mux board performs statistical multiplexing on the different compressed bitstreams to transmit multiple bitstreams over individual shared communication channels. Although each of the boards is configured to the system bus, each encoder board has a digital signal processor (DSP) with a synchronized serial interface (SSI) output port that is directly connected to an SSI input port on a DSP on the stat-mux board (which, in one embodiment, has four such DSPs each with six such SSI input ports). As such, (up to 24) compressed video/audio bitstreams generated on the various encoder boards can be transmitted directly to the stat-mux board without having to go through the system bus. In this way, the computer system can provide statistical multiplexing of low-latency video/audio bitstreams without having to suffer the processing delays associated with conventional transmission over PCI system buses.

    摘要翻译: PC型计算机具有配置有主CPU板,统计复用(stat-mux)板和多个视频/音频编码器板的系统总线(例如,PCI总线),每个配置用于接收和压缩 相应的视频/音频流。 统计复用板对不同的压缩比特流执行统计复用,以在各个共享通信信道上传输多个比特流。 虽然每个板都配置为系统总线,但每个编码器板都有一个数字信号处理器(DSP),具有同步的串行接口(SSI)输出端口,它直接连接到数模转换器上的DSP上的SSI输入端口 板(在一个实施例中,具有四个这样的DSP,每个具有六个这样的SSI输入端口)。 因此,在各种编码器板上生成的(多达24个)压缩视频/音频比特流可以直接发送到统一复用板,而不必通过系统总线。 以这种方式,计算机系统可以提供低延迟视频/音频比特流的统计复用,而不必忍受与PCI系统总线上的常规传输相关联的处理延迟。

    Latency-based statistical multiplexing
    14.
    发明授权
    Latency-based statistical multiplexing 失效
    基于延迟的统计复用

    公开(公告)号:US06665872B1

    公开(公告)日:2003-12-16

    申请号:US09478127

    申请日:2000-01-05

    IPC分类号: H04J316

    摘要: When two or more different video streams a e compressed for concurrent transmission of multiple compressed video bitstreams over a single shared communication channel, control over both (1) the transmission of data over the shared channel and (2) the compression processing that generates the bitstreams is exercised taking into account the differing levels of latency required for the corresponding video applications. For example, interactive video games typically require lower latency than other video applications such as video streaming, web browsing, and electronic mail. A multiplexer and traffic controller takes these differing latency requirements, along with bandwidth and image fidelity requirements, into account when controlling both traffic flow and compression processing. In addition, an off-line profiling tool analyzes typical video applications off-line in order to generate profiles of different types of video applications that are then accessed in real-time by a call admission manager responsible to controlling the admission of new video application sessions as well as the assignment of admitted applications to specific available video encoders, which themselves may differ in video compression processing power as well as in the degree to which they allow external processors (like the multiplexer and traffic controller) to control their internal compression processing.

    摘要翻译: 当两个或多个不同的视频流被压缩以通过单个共享通信信道同时传输多个压缩视频比特流时,可以对共享信道上的数据传输进行控制(2)产生比特流的压缩处理是 考虑到相应视频应用程序所需的等待时间不同的行为。 例如,交互式视频游戏通常比其他视频应用(例如视频流,网页浏览和电子邮件)需要更低的延迟。 多路复用器和流量控制器在控制流量和压缩处理两者时考虑到这些不同的延迟要求以及带宽和图像保真度要求。 此外,离线分析工具可以离线分析典型的视频应用程序,以便生成不同类型的视频应用程序的配置文件,然后由负责控制新视频应用程序会话的呼叫接纳管理员实时访问 以及将允许的应用分配给特定可用的视频编码器,它们本身可能在视频压缩处理能力方面可能不同,以及允许外部处理器(如多路复用器和流量控制器)控制其内部压缩处理的程度。

    Method and apparatus for motion estimation with all binary representation
    15.
    发明授权
    Method and apparatus for motion estimation with all binary representation 失效
    具有所有二进制表示的运动估计的方法和装置

    公开(公告)号:US07020201B2

    公开(公告)日:2006-03-28

    申请号:US10301415

    申请日:2002-11-20

    IPC分类号: H04B1/66 H04N7/12

    CPC分类号: H04N5/145 H04N19/53

    摘要: A method of motion estimation for video encoding constructs a binary pyramid structure having three binary layers. A state update module registers and updates repeat occurrence of final motion vectors and a static-state checking module determines if the method is in a static mode or a normal mode based on the repeat occurrence. In a normal mode, the first binary layer is searched within a ±3 pixel refinement window to determine a first level motion vector. In the second binary layer, a search range is computed based on six motion vector candidates. By checking every point within in the search range, a second binary layer search generates a second level motion vector. Finally, a third binary layer search within a ±2 pixel refinement window generates a final motion vector according to the second level motion vector. In a static mode, a fine tuning module performs search within a ±1 pixel refinement window and generates a final motion vector.

    摘要翻译: 视频编码的运动估计方法构成具有三个二进制层的二进制金字塔结构。 状态更新模块注册和更新最终运动矢量的重复发生,并且静态状态检查模块基于重复发生来确定该方法是处于静态模式还是正常模式。 在正常模式中,在±3像素细化窗口内搜索第一二进制层以确定第一级运动矢量。 在第二二进制层中,基于六个运动矢量候选来计算搜索范围。 通过检查搜索范围内的每个点,第二二进制层搜索生成第二级运动矢量。 最后,±2像素细化窗口内的第三二进制层搜索根据第二级运动矢量生成最终运动矢量。 在静态模式中,微调模块在±1像素细化窗口内进行搜索,并生成最终的运动矢量。

    Mode decision using approximate 1/2 pel interpolation
    16.
    发明授权
    Mode decision using approximate 1/2 pel interpolation 有权
    使用近似1/2像素插值的模式决策

    公开(公告)号:US08265151B1

    公开(公告)日:2012-09-11

    申请号:US11300185

    申请日:2005-12-14

    IPC分类号: H04N7/12

    摘要: A method for video encoding is disclosed. The method generally includes the steps of (A) generating first sub-pel data for at least one of (i) a motion estimation and (ii) a mode decision by first filtering reference data and (B) generating second sub-pel data for a motion compensation by second filtering the reference data. Wherein a first performance of the first filtering may be different than a second performance of the second filtering.

    摘要翻译: 公开了一种用于视频编码的方法。 该方法通常包括以下步骤:(A)为(i)运动估计和(ii)通过首先过滤参考数据的模式决定和(B)生成第二子像素数据中的至少一个生成第一子像素数据, 通过对参考数据进行二次过滤来进行运动补偿。 其中,第一过滤的第一性能可以不同于第二过滤的第二性能。

    MPEG-4 Streaming System With Adaptive Error Concealment
    17.
    发明申请
    MPEG-4 Streaming System With Adaptive Error Concealment 审中-公开
    具有自适应误差隐藏的MPEG-4流系统

    公开(公告)号:US20090086819A1

    公开(公告)日:2009-04-02

    申请号:US12326104

    申请日:2008-12-02

    IPC分类号: H04N7/32

    摘要: An MPEG-4 system with error concealment is provided for video service under the network with packet loss. The MPEG-4 system includes an encoder and a decoder. The encoder uses an intra-refreshment technique is used to make coded bitstream more robust against noise in order to stop error propagation. The rate-distortion optimization criterion is also introduced to adaptively update in synchronization with intra-coded blocks adaptively based on the true network condition with minimal overhead. The Lagrange multiplier is modified to achieve the best rate-distortion balance. In addition, a decoder loop is used in the encoder and is synchronized with the true decoder to achieve the best performance and avoid mismatch with the decoder used in the MPEG-4 system. The decoder is able to achieve resilient decoding from any kind of noise and enhance the reconstructed image quality with spatial and temporal hybrid concealment method. The result shows that a 3.65-9.71 dB further improvement on peak-signal-to-noise-ratio (PSNR) can be achieved in comparison with the existing methods that adopt spatial copy and zero motion concealment in decoding.

    摘要翻译: 为具有丢包的网络下的视频服务提供了具有错误隐藏的MPEG-4系统。 MPEG-4系统包括编码器和解码器。 编码器使用内部刷新技术来使编码比特流更加鲁棒地抵抗噪声,以便阻止错误传播。 还引入了速率失真优化标准,以最小的开销,基于真实的网络条件自适应地自适应地更新与帧内编码块的内部编码块。 修改拉格朗日乘数以获得最佳的速率 - 失真平衡。 另外,在编码器中使用解码器环路,并且与真实解码器同步以实现最佳性能并避免与MPEG-4系统中使用的解码器不匹配。 解码器能够通过任何种类的噪声实现弹性解码,并通过空间和时间混合隐藏方法增强重建的图像质量。 结果表明,与采用空间复制和零运动隐藏解码的现有方法相比,可以实现峰值信噪比(PSNR)的3.65-9.71dB进一步改善。

    Fast motion estimation using N-queen pixel decimation

    公开(公告)号:US07027511B2

    公开(公告)日:2006-04-11

    申请号:US10118370

    申请日:2002-04-09

    IPC分类号: H04B1/66

    CPC分类号: H04N5/145 H04N19/53

    摘要: The present invention proposes a fast motion estimation using N-queen pixel decimation, whereby after a reference block and a block to be processed are selected in a video sequence, an N×N queens pattern is used for pixel decimation to perform block match, thereby obtaining a good enough block difference value. The present invention combines pixel decimation with fast motion estimation for search points reduction to achieve the object of simplifying computational complexity of motion estimation. Therefore, the present invention can sieve out sufficiently representative pixels and will not increase extra computational complexity.

    Architecture for stack robust fine granularity scalability
    19.
    发明申请
    Architecture for stack robust fine granularity scalability 审中-公开
    堆栈的架构稳健的细粒度可扩展性

    公开(公告)号:US20050195896A1

    公开(公告)日:2005-09-08

    申请号:US10793830

    申请日:2004-03-08

    IPC分类号: H04N7/12

    CPC分类号: H04N19/36 H04N19/34 H04N19/61

    摘要: The present invention relates to an architecture for stack robust fine granularity scalability (SRFGS), more particularly, SRFGS providing simultaneously temporal scalability and SNR scalability. SRFGS first simplifies the RFGS temporal prediction architecture and then generalizes the prediction concept as the following: the quantization error of the previous layer can be inter-predicted by the reconstructed image in the previous time instance of the same layer. With this concept, the RFGS architecture can be extended to multiple layers that forming a stack to improve the temporal prediction efficiency. SRFGS can be optimized at several operating points to fit the requirements of various applications while the fine granularity and error robustness of RFGS are still remained. The experiment results show that SRFGS can improve the performance of RFGS by 0.4 to 3.0 dB in PSNR.

    摘要翻译: 本发明涉及用于堆栈鲁棒的精细粒度可伸缩性(SRFGS)的架构,更具体地说,涉及提供同时时间可扩展性和SNR可伸缩性的SRFGS。 SRFGS首先简化RFGS时间预测架构,然后将预测概​​念概括如下:先前层的量化误差可以由同一层的先前时间实例中的重建图像进行预测。 利用这个概念,RFGS架构可以扩展到形成堆栈的多个层,以提高时间预测效率。 SRFGS可以在几个操作点进行优化,以适应各种应用的要求,同时仍保持RFGS的细粒度和误差鲁棒性。 实验结果表明,SRFGS可以在PSNR中将RFGS的性能提高0.4〜3.0dB。

    Mode decision using approximate ½ pel interpolation
    20.
    发明授权
    Mode decision using approximate ½ pel interpolation 有权
    使用近似½贝尔插值的模式决策

    公开(公告)号:US08498338B1

    公开(公告)日:2013-07-30

    申请号:US13585091

    申请日:2012-08-14

    IPC分类号: H04N7/12

    摘要: A method for video encoding is disclosed. The method generally includes the steps of (A) generating first sub-pel data for at least one of (i) a motion estimation and (ii) a mode decision by first filtering reference data and (B) generating second sub-pel data for a motion compensation by second filtering the reference data. Wherein a first performance of the first filtering may be different than a second performance of the second filtering.

    摘要翻译: 公开了一种用于视频编码的方法。 该方法通常包括以下步骤:(A)为(i)运动估计和(ii)通过首先过滤参考数据的模式决定和(B)生成第二子像素数据中的至少一个生成第一子像素数据, 通过对参考数据进行二次过滤来进行运动补偿。 其中,第一过滤的第一性能可以不同于第二过滤的第二性能。