Video Decoding with Reduced IDCT Calculations
    11.
    发明申请
    Video Decoding with Reduced IDCT Calculations 有权
    视频解码与减少的IDCT计算

    公开(公告)号:US20080215654A1

    公开(公告)日:2008-09-04

    申请号:US12121018

    申请日:2008-05-15

    IPC分类号: G06F17/14

    CPC分类号: G06F17/147 G06T9/007

    摘要: Reduced complexity inverse discrete cosine transform (IDCT) masks and a method for reducing the number of IDCT calculations in video decoding are provided. The method comprises: accepting an n×m matrix of DCT coefficients; performing (n−y) horizontal IDCT operations, where y is greater than 0; performing y scaling operations; and, generating an n×m block of pixel information. Some aspects of the method further comprise: performing (m−z) vertical IDCT operations, where z is in the range between 0 and m/2. In some aspects, performing (n−y) horizontal ICDT operations includes performing IDCT operations for the first (n−y) horizontal rows. Then, performing y scaling operations includes: selecting the DC component from the first position of each horizontal row; scaling the selected DC component; and, copying the scaled DC component into the remaining positions of each of horizontal row.

    摘要翻译: 提供了复杂度降低的离散余弦变换(IDCT)掩模和减少视频解码中IDCT计算次数的方法。 该方法包括:接受DCT系数的n×m矩阵; 执行(n-y)水平IDCT操作,其中y大于0; 执行y缩放操作; 并且生成像素信息的n×m块。 该方法的一些方面还包括:执行(m-z)垂直IDCT操作,其中z在0和m / 2之间的范围内。 在一些方面,执行(n-y)个水平ICDT操作包括执行第一(n-y)个水平行的IDCT操作。 然后,执行y缩放操作包括:从每个水平行的第一位置选择DC分量; 缩放选定的直流分量; 并且将缩放的直流分量复制到水平行中的每一个的剩余位置。

    Temporal scalable coding using AVC coding tools
    16.
    发明申请
    Temporal scalable coding using AVC coding tools 审中-公开
    使用AVC编码工具的时间可伸缩编码

    公开(公告)号:US20060013305A1

    公开(公告)日:2006-01-19

    申请号:US10951863

    申请日:2004-09-27

    申请人: Shijun Sun

    发明人: Shijun Sun

    摘要: A temporally scalable video coding method is provided to interleave pictures from all layers of a video sequence including video sub-sequences organized using enhancement layers following a set of rules: (1) pictures in each layer are to be coded sequentially within the layer; (2) a picture from an upper layer should be coded when its temporally closest neighboring pictures among all lower layers (in both forward and backward directions if available) have been already coded; in other words, coding of an upper-layer picture requires the temporally closest neighboring pictures among all lower layers (in both forward and backward directions if available) be coded before hand. To ensure a reasonable coding efficiency, for each picture, its qualified reference pictures may be reordered so that the reference pictures are ordered using their relative temporal distance from the current picture instead of the default picture coding order.

    摘要翻译: 提供了一种时间上可分级的视频编码方法,用于根据一组规则来交织包括使用增强层组织的视频子序列的视频序列的所有层的图像:(1)每层中的图像将被顺序地编码在该层内; (2)当上层图片的时间上最接近的相邻图片(如果有的话)向前和向后方向(如果有的话)已经被编码,则编码来自上层的图片; 换句话说,上层图像的编码需要所有下层之间的时间上最接近的相邻图像(在前向和后向两个方向(如果可用))之前被编码。 为了确保合理的编码效率,对于每个图像,其合格的参考图片可以被重新排序,使得使用它们与当前图片的相对时间距离而不是默认图片编码顺序来排序参考图片。

    Template matching in 3 dimensions using correlative auto-predictive search
    17.
    发明授权
    Template matching in 3 dimensions using correlative auto-predictive search 失效
    使用相关自动预测搜索的三维模板匹配

    公开(公告)号:US06243494B1

    公开(公告)日:2001-06-05

    申请号:US09216691

    申请日:1998-12-18

    IPC分类号: G06K962

    CPC分类号: G06K9/6203

    摘要: A template is analyzed to determine step sizes for searching within a search area. The template is analyzed by first padding the template with data points to increase its size. Cross-correlation between the padded template and the original template leads to identification of an effective step size along multiple axes. Step sizes for each of a horizontal, vertical and a third axis are derived. Third axis step sizes may correspond to rotation, scaling factor, subsampling factor, linear distance, time or frequency. Windows of the search area, selected based on the step sizes, then are tested in a fast search by correlating the template to selected windows to derive correlation coefficients. Any tested window which has a correlation coefficient exceeding a given value is a potential match for the template and is subject to a refined stage of comparison.

    摘要翻译: 分析模板以确定在搜索区域内搜索的步长。 通过首先用数据点填充模板来分析模板以增加其大小。 填充模板和原始模板之间的相互关系导致沿着多个轴的有效步长的识别。 导出水平,垂直和第三轴各自的步长。 第三轴步长可以对应于旋转,缩放因子,子采样因子,线性距离,时间或频率。 基于步长选择的搜索区域的Windows,然后通过将模板与所选择的窗口相关联来进行快速搜索来测试以导出相关系数。 具有超过给定值的相关系数的任何测试窗口是模板的潜在匹配,并且经过精细的比较阶段。

    Reducing DC leakage in HD photo transform
    18.
    发明授权
    Reducing DC leakage in HD photo transform 有权
    降低高清摄影中的直流泄漏

    公开(公告)号:US08369638B2

    公开(公告)日:2013-02-05

    申请号:US12165474

    申请日:2008-06-30

    IPC分类号: G06K9/36

    摘要: In certain embodiments, to eliminate DC leakage into surrounding AC values, scaling stage within a photo overlap transform operator is modified such that the off-diagonal elements of the associated scaling matrix have the values of 0. In certain embodiments, the on-diagonal scaling matrix are given the values (0.5, 2). In some embodiments, the scaling is performed using a combination of reversible modulo arithmetic and lifting steps. In yet other embodiments, amount of DC leakage is estimated at the encoder, and preprocessing occurs to mitigate amount of leakage, with the bitstream signaling that preprocessing has occurred. A decoder may then read the signal and use the information to mitigate DC leakage.

    摘要翻译: 在某些实施例中,为了将DC泄漏消除到周围的AC值中,修改照片重叠变换算子内的缩放阶段,使得相关联的缩放矩阵的非对角元素具有值0.在某些实施例中, 矩阵给出值(0.5,2)。 在一些实施例中,使用可逆模运算和提升步骤的组合来执行缩放。 在其他实施例中,在编码器处估计DC泄漏量,并且发生预处理以减轻泄漏量,同时比特流指示已经发生预处理。 然后,解码器可以读取信号并使用该信息来减轻DC泄漏。

    Multi-level representation of reordered transform coefficients
    20.
    发明授权
    Multi-level representation of reordered transform coefficients 有权
    重排序变换系数的多级表示

    公开(公告)号:US08179974B2

    公开(公告)日:2012-05-15

    申请号:US12151069

    申请日:2008-05-02

    IPC分类号: H04B1/66

    摘要: Techniques and tools for encoding and decoding a block of frequency coefficients are presented. An encoder selects a scan order from multiple available scan orders and then applies the selected scan order to a two-dimensional matrix of transform coefficients, grouping non-zero values of the frequency coefficients together in a one-dimensional string. The encoder entropy encodes the one-dimensional string of coefficient values according to a multi-level nested set representation. In decoding, a decoder entropy decodes the one-dimensional string of coefficient values from the multi-level nested set representation. The decoder selects the scan order from among multiple available scan orders and then reorders the coefficients back into a two-dimensional matrix using the selected scan order.

    摘要翻译: 提出了用于编码和解码频率系数块的技术和工具。 编码器从多个可用扫描顺序中选择扫描顺序,然后将所选择的扫描顺序应用于变换系数的二维矩阵,将频率系数的非零值在一维串中分组。 编码器根据多级嵌套集合表示对一维系列值串进行编码。 在解码中,解码器熵从多级嵌套集合表示解码系数值的一维串。 解码器从多个可用扫描顺序中选择扫描顺序,然后使用所选择的扫描顺序将系数重新排序成二维矩阵。