Method and apparatus for modeling quantization matrices for image/video encoding
    1.
    发明授权
    Method and apparatus for modeling quantization matrices for image/video encoding 有权
    用于建模图像/视频编码的量化矩阵的方法和装置

    公开(公告)号:US08326068B1

    公开(公告)日:2012-12-04

    申请号:US11512736

    申请日:2006-08-30

    申请人: Huipin Zhang Guy Cote

    发明人: Huipin Zhang Guy Cote

    IPC分类号: G06K9/00

    摘要: A method for encoding an image is disclosed. The method generally includes the steps, of (A) generating a quantization matrix as a function of at least four parameters, (B) optimizing the parameters to maximize a quality metric for encoding the image and (C) encoding the image with the quantization matrix as optimized.

    摘要翻译: 公开了一种图像编码方法。 该方法通常包括以下步骤:(A)生成作为至少四个参数的函数的量化矩阵,(B)优化参数以最大化用于编码图像的质量度量;(C)使用量化矩阵对图像进行编码 优化。

    Methods and/or apparatus for controlling zero-residual coding in predictive image/video coding
    2.
    发明授权
    Methods and/or apparatus for controlling zero-residual coding in predictive image/video coding 有权
    用于在预测图像/视频编码中控制零残差编码的方法和/或装置

    公开(公告)号:US09036712B1

    公开(公告)日:2015-05-19

    申请号:US11430127

    申请日:2006-05-08

    申请人: Guy Cote Huipin Zhang

    发明人: Guy Cote Huipin Zhang

    IPC分类号: H04N7/12 H04N7/52

    摘要: A method for coding video is disclosed. The method generally includes the steps of (A) receiving a video signal having a series of pictures, each of the pictures having a plurality of blocks, (B) analyzing the blocks to forecast if coding the blocks in a zero-residual coding mode would generate a plurality of artifacts, (C) disabling the zero-residual coding mode for the blocks forecasted to generate at least one of the artifacts and (D) enabling the zero-residual coding mode for the blocks forecasted to generate none of the artifacts.

    摘要翻译: 公开了一种视频编码方法。 该方法通常包括以下步骤:(A)接收具有一系列图像的视频信号,每个图像具有多个块,(B)分析块以预测是否以零残差编码模式编码块 产生多个伪像,(C)禁止预测的块的零残余编码模式以产生伪像中的至少一个,以及(D)为预测的块生成无伪像的零残余编码模式。

    Wavelet based multiresolution video representation with spatially scalable motion vectors
    3.
    发明授权
    Wavelet based multiresolution video representation with spatially scalable motion vectors 有权
    基于小波的多分辨率视频表示与空间可缩放的运动矢量

    公开(公告)号:US08477849B2

    公开(公告)日:2013-07-02

    申请号:US11952721

    申请日:2007-12-07

    摘要: Wavelet based multiresolution video representations generated by multi-scale motion compensated temporal filtering (MCTF) and spatial wavelet transform are disclosed. Since temporal filtering and spatial filtering are separated in generating such representations, there are many different ways to intertwine single-level MCTF and single-level spatial filtering, resulting in many different video representation schemes with spatially scalable motion vectors for the support of different combination of spatial scalability and temporal scalability. The problem of design of such a video representation scheme to full the spatial/temporal scalability requirements is studied. Signaling of the scheme to the decoder is also investigated. Since MCTF is performed subband by subband, motion vectors are available for reconstructing video sequences of any possible reduced spatial resolution, restricted by the dyadic decomposition pattern and the maximal spatial decomposition level. It is thus clear that the family of decomposition schemes provides efficient and versatile multiresolution video representations for fully scalable video coding.

    摘要翻译: 公开了通过多尺度运动补偿时间滤波(MCTF)和空间小波变换产生的基于小波的多分辨率视频表示。 由于时间滤波和空间滤波在生成这种表示中是分开的,所以有许多不同的方式来交织单级MCTF和单级空间滤波,导致许多不同的视频表示方案具有空间可缩放的运动矢量,用于支持不同组合 空间可扩展性和时间可扩展性。 研究了这样的视频表示方案的设计问题,以满足空间/时间的可扩展性要求。 还调查了解码器的方案的信令。 由于MCTF通过子带进行子带,运动矢量可用于重建由二分解模式和最大空间分解水平限制的任何可能的降低的空间分辨率的视频序列。 因此,清楚的是,分解方案族为完全可扩展的视频编码提供了有效且通用的多分辨率视频表示。

    WAVELET BASED MULTIRESOLUTION VIDEO REPRESENTATION WITH SPATIALLY SCALABLE MOTION VECTORS
    4.
    发明申请
    WAVELET BASED MULTIRESOLUTION VIDEO REPRESENTATION WITH SPATIALLY SCALABLE MOTION VECTORS 有权
    基于波形的多分辨率视频表示与空间可调运动矢量

    公开(公告)号:US20080152011A1

    公开(公告)日:2008-06-26

    申请号:US11952721

    申请日:2007-12-07

    IPC分类号: H04N7/32

    摘要: Wavelet based multiresolution video representations generated by multi-scale motion compensated temporal filtering (MCTF) and spatial wavelet transform are disclosed. Since temporal filtering and spatial filtering are separated in generating such representations, there are many different ways to intertwine single-level MCTF and single-level spatial filtering, resulting in many different video representation schemes with spatially scalable motion vectors for the support of different combination of spatial scalability and temporal scalability. The problem of design of such a video representation scheme to full the spatial/temporal scalability requirements is studied. Signaling of the scheme to the decoder is also investigated. Since MCTF is performed subband by subband, motion vectors are available for reconstructing video sequences of any possible reduced spatial resolution, restricted by the dyadic decomposition pattern and the maximal spatial decomposition level. It is thus clear that the family of decomposition schemes provides efficient and versatile multiresolution video representations for fully scalable video coding.

    摘要翻译: 公开了通过多尺度运动补偿时间滤波(MCTF)和空间小波变换产生的基于小波的多分辨率视频表示。 由于时间滤波和空间滤波在生成这种表示中是分开的,所以有许多不同的方式来交织单级MCTF和单级空间滤波,导致许多不同的视频表示方案具有空间可缩放的运动矢量,用于支持不同组合 空间可扩展性和时间可扩展性。 研究了这样的视频表示方案的设计问题,以满足空间/时间的可扩展性要求。 还对该解码器的信令进行了调查。 由于MCTF通过子带进行子带,运动矢量可用于重建由二分解模式和最大空间分解水平限制的任何可能的降低的空间分辨率的视频序列。 因此,清楚的是,分解方案族为完全可扩展的视频编码提供了有效和通用的多分辨率视频表示。

    Wavelet based multiresolution video representation with spatially scalable motion vectors
    5.
    发明授权
    Wavelet based multiresolution video representation with spatially scalable motion vectors 有权
    基于小波的多分辨率视频表示与空间可缩放的运动矢量

    公开(公告)号:US07321625B2

    公开(公告)日:2008-01-22

    申请号:US10318802

    申请日:2002-12-13

    摘要: Wavelet based multiresolution video representations generated by multi-scale motion compensated temporal filtering (MCTF) and spatial wavelet transform are disclosed. Since temporal filtering and spatial filtering are separated in generating such representations, there are many different ways to intertwine single-level MCTF and single-level spatial filtering, resulting in many different video representation schemes with spatially scalable motion vectors for the support of different combination of spatial scalability and temporal scalability. The problem of design of such a video representation scheme to full the spatial/temporal scalability requirements is studied. Signaling of the scheme to the decoder is also investigated. Since MCTF is performed subband by subband, motion vectors are available for reconstructing video sequences of any possible reduced spatial resolution, restricted by the dyadic decomposition pattern and the maximal spatial decomposition level. It is thus clear that the family of decomposition schemes provides efficient and versatile multiresolution video representations for fully scalable video coding.

    摘要翻译: 公开了通过多尺度运动补偿时间滤波(MCTF)和空间小波变换产生的基于小波的多分辨率视频表示。 由于时间滤波和空间滤波在生成这种表示中是分开的,所以有许多不同的方式来交织单级MCTF和单级空间滤波,导致许多不同的视频表示方案具有空间可缩放的运动矢量,用于支持不同组合 空间可扩展性和时间可扩展性。 研究了这样的视频表示方案的设计问题,以满足空间/时间的可扩展性要求。 还对该解码器的信令进行了调查。 由于MCTF通过子带进行子带,运动矢量可用于重建由二分解模式和最大空间分解水平限制的任何可能的降低的空间分辨率的视频序列。 因此,清楚的是,分解方案族为完全可扩展的视频编码提供了有效且通用的多分辨率视频表示。

    Method and apparatus for selecting optimal video encoding parameter configurations
    6.
    发明授权
    Method and apparatus for selecting optimal video encoding parameter configurations 有权
    用于选择最佳视频编码参数配置的方法和装置

    公开(公告)号:US08831089B1

    公开(公告)日:2014-09-09

    申请号:US11496410

    申请日:2006-07-31

    申请人: Huipin Zhang

    发明人: Huipin Zhang

    IPC分类号: H04N11/02

    摘要: A method for determining optimal video encoding parameters is disclosed. The method generally includes the steps of (A) storing a plurality of configurable parameters each comprising a respective trial value, (B) generating a bitstream by encoding a test sequence of pictures using (i) a plurality of non-configurable parameters fixed in a design of the encoder, (ii) the configurable parameters and (iii) a plurality of dynamic parameters adjustable in real time by the encoder, (C) generating a reconstructed sequence of pictures by decoding the bitstream, (D) generating a quality metric based on the reconstructed sequence of pictures compared with the test sequence of pictures and (E) adjusting the respective trial values to optimize the quality metric.

    摘要翻译: 公开了一种用于确定最佳视频编码参数的方法。 该方法通常包括以下步骤:(A)存储多个可配置参数,每个可配置参数包括各自的试用值,(B)通过使用(i)多个不可配置参数固定在 编码器的设计,(ii)可配置参数和(iii)由编码器实时调节的多个动态参数,(C)通过解码比特流来产生重构的图像序列,(D)基于质量度量生成 对图像的重建序列与图像的测试序列进行比较,(E)调整各个试验值以优化质量度量。

    Method and/or apparatus for detecting homogeneity of blocks in an image processing system
    7.
    发明授权
    Method and/or apparatus for detecting homogeneity of blocks in an image processing system 有权
    用于检测图像处理系统中块的均匀性的方法和/或装置

    公开(公告)号:US07715652B1

    公开(公告)日:2010-05-11

    申请号:US11232459

    申请日:2005-09-21

    IPC分类号: G06K9/36

    CPC分类号: G06T7/44 G06T7/11 G06T2200/28

    摘要: An apparatus comprising a first circuit and a second circuit. The first circuit may be configured to (i) receive an image data stream comprising a plurality of frames each having a plurality of regions, (ii) select a particular region to be marked as being homogeneous or not homogeneous, and (iii) determine whether a group of neighboring regions to the selected region are qualified or not qualified. The second circuit may be configured to mark the selected region as being homogeneous when one or more of the adjacent regions are (i) qualified and (ii) previously marked as being homogeneous.

    摘要翻译: 一种包括第一电路和第二电路的装置。 第一电路可以被配置为(i)接收包括多个帧的图像数据流,每个帧具有多个区域,(ii)选择要标记为均匀或不均匀的特定区域,以及(iii)确定是否 所选地区的一组邻近地区有资格或不合格。 第二电路可以被配置为当一个或多个相邻区域(i)合格并且(ii)先前标记为均匀时,将所选择的区域标记为均匀。

    Method of performing sub-pixel based edge-directed image interpolation
    8.
    发明授权
    Method of performing sub-pixel based edge-directed image interpolation 有权
    执行基于子像素的边缘向量图像插值的方法

    公开(公告)号:US07136541B2

    公开(公告)日:2006-11-14

    申请号:US10273781

    申请日:2002-10-18

    IPC分类号: G06K9/32

    CPC分类号: G06T3/403

    摘要: A method of generating a value for a missing pixel “x” by determining a “least harmful” local edge direction between pixels, or sub-pixels, on substantially opposing sides of the missing pixel, and interpolating the difference to arrive at a value for pixel “x”. The method involves generating sub-pixel values for locations within neighboring pixels, the sub-pixels may comprise half-pixels, quarter-pixels, three-quarter pixels, and so forth, wherein any fractional pixel quantity may be created. Absolute difference values are calculated between neighboring pixels, or sub-pixel values, to determine a least harmful local edge direction along which a value is generated for pixel “x” by interpolation.

    摘要翻译: 通过在缺失像素的基本相对的侧面上确定像素或子像素之间的“最不利的”局部边缘方向来产生缺失像素“x”的值的方法,并且内插差值以得到 像素“x”。 该方法涉及为相邻像素内的位置生成子像素值,子像素可以包括半像素,四分之一像素,四分之三像素等,其中可以创建任何分数像素数量。 在相邻像素或子像素值之间计算绝对差值,以确定通过插值为像素“x”生成值的最不利的局部边缘方向。

    VIDEO WHISPER SESSIONS DURING ONLINE COLLABORATIVE COMPUTING SESSIONS
    9.
    发明申请
    VIDEO WHISPER SESSIONS DURING ONLINE COLLABORATIVE COMPUTING SESSIONS 审中-公开
    在线协作计算会议期间的视频会议

    公开(公告)号:US20120017149A1

    公开(公告)日:2012-01-19

    申请号:US12837042

    申请日:2010-07-15

    IPC分类号: G06F3/01 G06F17/00 G06F15/16

    摘要: In one embodiment, a plurality of attendee devices may participate in an online collaborative computing session to receive video and audio content for the online collaborative computing session. A particular attendee device may then either initiate or receive a communicated signal between a “whisperer” and “whisperee” that indicates a desire of the whisperer to establish a video whisper session with the whisperee. In response, the video whisper session may be established between the whisperer and whisperee devices, such as through a mutual subscription by the whisperer and whisperee to a video channel and audio channel of each other corresponding device. In this manner, users of the whisperer and whisperee devices may see and hear each other via the video whisper session, and attendee devices other than the whisperer and whisperee are prevented from playing audio from the video whisper session between the whisperer and whisperee.

    摘要翻译: 在一个实施例中,多个参与者设备可以参与在线协作计算会话以接收用于在线协作计算会话的视频和音频内容。 然后,特定的与会者设备可以在“低语者”和“whisperee”之间发起或接收传达的信号,其指示小孩与耳语建立视频耳语会话的愿望。 作为响应,视频耳语会话可以在小声和耳机设备之间建立,例如通过低音者的相互订阅和对彼此对应的设备的视频频道和音频频道的通话。 以这种方式,低声用户和耳机设备的用户可以通过视频耳语会话来看到和听到对方,并且防止低语者和耳语之外的与会者设备从窃听者和耳语之间的视频耳语会话中播放音频。

    Method of performing quantization within a multimedia bitstream utilizing division-free instructions

    公开(公告)号:US07065546B2

    公开(公告)日:2006-06-20

    申请号:US10120210

    申请日:2002-04-09

    IPC分类号: G06F7/52

    CPC分类号: G06F7/4873 G06F2207/3828

    摘要: Methods for enhancing the performance of quantization operations by converting division operations to a combination of multiplication and shift operations, which are preferably performed on a processor supporting single-instruction multiple-data (SIMD) instructions. A table of mantissa and exponent values is created for a sufficient range of values for 1/a. During quantization, the mantissa and exponent values are found in the table 1/a for associated with a given quantization division operation given by b/a which is found according to the formula b/a=(b×A)>>n. Aspects are described for application to processors that do not support non-uniform shift operations, and for reducing the necessary bit-width of the operations to increase efficiency. The quantization method may be applied to protocols such as MPEG-2 and other similar formats.