VIDEO CODING QUANTIZATION PARAMETER DETERMINATION SUITABLE FOR VIDEO CONFERENCING
    1.
    Patent application
    Status: Under examination (published)

    Publication No.: US20150326857A1

    Publication date: 2015-11-12

    Application No.: US14275109

    Filing date: 2014-05-12

    Abstract: Systems and methods for determining a quantization parameter (QP) for video coding. Embodiments may be particularly advantageous for strongly temporally correlated frames, such as for video conferencing applications. An initial QP for a frame of a video sequence may be modified based on a spatial complexity or a temporal complexity associated with the video frame, and/or based on an inter-predicted frame bitrate target cycle, as a function of whether the frame is intra- or inter-predicted. The inter-predicted frame bitrate target cycle includes a sequence of two or more inter-predicted frame bitrate targets that are assigned to the frame according to the inter-predicted frame bitrate target cycle. A reference frame for an inter-predicted frame may be selected based on the bitrate target associated with candidate reference frames. The initial QP of an inter-predicted frame with a scene change may be modified in a manner independent of an inter-predicted frame bitrate target cycle.
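
The bitrate target cycle described in this abstract can be illustrated with a minimal sketch. The function name, the frame-type labels, the target values, and the choice to restart the cycle after an intra frame are all assumptions made for illustration; the abstract does not specify them.

```python
def assign_inter_targets(frame_types, target_cycle):
    """Return one bitrate target per frame; intra ('I') frames get None."""
    if len(target_cycle) < 2:
        raise ValueError("the cycle needs two or more bitrate targets")
    targets = []
    position = 0  # index into the cycle, advanced only by inter-predicted frames
    for frame_type in frame_types:
        if frame_type == "I":
            targets.append(None)  # intra frames are handled outside the cycle
            position = 0          # assumption: the cycle restarts after an intra frame
        else:
            targets.append(target_cycle[position % len(target_cycle)])
            position += 1
    return targets


# Hypothetical targets in bits per frame: one larger target, then two smaller ones.
print(assign_inter_targets(["I", "P", "P", "P", "P"], [60000, 20000, 20000]))
```

With these inputs, the fourth inter-predicted frame wraps around to the first target of the cycle.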

    METHOD AND SYSTEM OF TRANSFORM BLOCK PROCESSING ACCORDING TO QUANTIZATION MATRIX IN VIDEO CODING
    2.
    Patent application
    Status: In force

    Publication No.: US20140177728A1

    Publication date: 2014-06-26

    Application No.: US14234505

    Filing date: 2012-08-21

    Applicant: Ximin Zhang, Shan Liu

    Inventor: Ximin Zhang, Shan Liu

    Abstract: A method and system for processing transform blocks according to quantization matrices in a video coding system are disclosed. Embodiments of the present invention derive one or more derived quantization matrices from one or more initial quantization matrices or from one previously derived quantization matrix. In one embodiment, the initial quantization matrices include 4×4 and 8×8 quantization matrices, which can be either default or user-defined. All quantization matrices larger than 8×8 can be derived from the 4×4 and 8×8 initial quantization matrices. Non-square quantization matrices can be derived from at least one initial square quantization matrix or at least one derived square quantization matrix. Individual initial quantization matrices may be used to derive respective larger quantization matrices. Furthermore, the individual initial quantization matrices may be derived from larger quantization matrices designed for corresponding transform sizes. Syntax design to enable the quantization matrix representation is also disclosed.
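
One way to picture deriving larger and non-square matrices from a small initial matrix is sketched below. The abstract does not state the derivation rule, so replication (each entry repeated into a tile) and row subsampling are assumptions, and both function names are hypothetical. A 2×2 matrix stands in for the 4×4 or 8×8 initial matrix to keep the output short.

```python
def upsample_matrix(base, factor):
    """Replicate each entry of a square matrix into a factor x factor tile."""
    size = len(base) * factor
    return [[base[r // factor][c // factor] for c in range(size)] for r in range(size)]


def take_rows(square, row_count):
    """Form a non-square matrix by keeping evenly spaced rows of a square one."""
    step = len(square) // row_count
    return square[::step][:row_count]


initial = [[1, 2],
           [3, 4]]                      # stand-in for an initial matrix
larger = upsample_matrix(initial, 2)    # derived 4x4 matrix
non_square = take_rows(larger, 2)       # derived 2x4 matrix
print(larger)
print(non_square)
```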

    METHOD AND APPARATUS FOR INTRA PREDICTION USING NON-SQUARE BLOCKS
    3.
    Patent application
    Status: In force

    Publication No.: US20140140404A1

    Publication date: 2014-05-22

    Application No.: US14234152

    Filing date: 2012-07-26

    Abstract: A method and apparatus for Intra prediction of a block based on neighboring pixels around the block are disclosed. Embodiments according to the present invention use square blocks as well as non-square blocks for Intra prediction. For a 2N×2N Luma CU (coding unit), the CU can be partitioned into 2N×N, N×2N, 2N×2N or N×N PUs. The 2N×N and N×2N PUs can be further processed by either square transforms only or both non-square and square transforms. In one embodiment, the 2N×N PU or the N×2N PU is processed as two N×N TUs (transform units) and each of the N×N TUs is further split into smaller N×N TUs based on quad-tree split. In another embodiment, the 2N×N and N×2N PUs are processed as two 2N×0.5N TUs and two 0.5N×2N TUs, respectively.

    Method and Apparatus of Adaptive Inter Mode Coding Using Variable Length Codes
    4.
    Patent application
    Status: Under examination (published)

    Publication No.: US20120195366A1

    Publication date: 2012-08-02

    Application No.: US13108055

    Filing date: 2011-05-16

    IPC classification: H04N7/26

    Abstract: A method and apparatus for adaptive inter prediction mode coding are disclosed. In the current HEVC, a fixed set of variable length codes is used for the underlying video data, which may not optimally match the statistics of underlying video data. Consequently, the compression efficiency associated with the fixed set of variable length codes will be compromised. Accordingly, an adaptive coding scheme for inter prediction modes is disclosed. The variable length codes used for each inter prediction mode in each coding unit depth are adaptively determined by their respective statistics. The statistics can be measured as the frequency of occurrence of each mode. In one embodiment according to the present invention, counters are used to collect the statistics. According to one embodiment of the present invention, the statistics of inter prediction modes are collected from the previous slice and the set of variable length codes is determined for the subsequent slice (immediately following the previous slice) accordingly. According to another embodiment of the present invention, the statistics of inter prediction modes are updated for each coding unit and the variable length code for each mode is adjusted according to the statistics change during the coding process. According to another embodiment of the present invention, the variable length code for each mode is reset at the beginning of each slice. The reset code word table is either a predefined code word table for the whole sequence or a code word table determined by the previous slice.
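
The counter-based adaptation in this abstract can be sketched as follows: modes counted in the previous slice are ranked by frequency, and the most frequent mode receives the shortest codeword for the next slice. The codeword table, the mode names, and the tie-breaking rule (ties keep the predefined order) are hypothetical; the abstract gives none of them.

```python
from collections import Counter

# Hypothetical prefix-free codeword table, shortest codeword first.
CODEWORDS = ["1", "01", "001", "0001", "0000"]
# Hypothetical mode names, listed in their predefined (default) order.
MODES = ["SKIP", "MERGE", "INTER_2Nx2N", "INTER_2NxN", "INTER_Nx2N"]


def build_code_table(mode_counts, modes=MODES, codewords=CODEWORDS):
    """Map each mode to a codeword, giving shorter codewords to more frequent modes."""
    if len(modes) > len(codewords):
        raise ValueError("not enough codewords for the modes")
    # sorted() is stable, so modes with equal counts keep their predefined order.
    ranked = sorted(modes, key=lambda mode: -mode_counts[mode])
    return dict(zip(ranked, codewords))


# Statistics collected from the previous slice (made-up data).
previous_slice = ["INTER_2Nx2N"] * 5 + ["SKIP"] * 3 + ["MERGE"]
table = build_code_table(Counter(previous_slice))
print(table["INTER_2Nx2N"], table["SKIP"], table["MERGE"])

# With empty counters the table falls back to the predefined order,
# which corresponds to the reset case at the start of a slice.
default_table = build_code_table(Counter())
print(default_table["SKIP"])
```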

    System and method for effectively performing an adaptive encoding procedure
    5.
    Granted patent
    Status: In force

    Publication No.: US07885476B2

    Publication date: 2011-02-08

    Application No.: US11638810

    Filing date: 2006-12-14

    Applicant: Ximin Zhang

    Inventor: Ximin Zhang

    IPC classification: G06K9/36 G06K9/46

    Abstract: A system and method for effectively performing an adaptive encoding procedure includes a texture analyzer that initially determines texture characteristics for blocks of input image data. An image transformer converts the blocks of image data into sets of coefficients that represent the various blocks. A block categorizer utilizes the texture characteristics to associate texture categories with the sets of coefficients from the various blocks. Deadzone tables are provided for storing deadzone values that define deadzone regions for performing appropriate quantization procedures. A quantizer may then access the deadzone values from the deadzone tables to adaptively convert the coefficients into quantized coefficients according to their corresponding texture characteristics.
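
A minimal sketch of texture-dependent deadzone quantization follows. The category names, the deadzone values, and the rounding rule are assumptions chosen for illustration; the abstract only states that deadzone values are looked up per texture category.

```python
# Hypothetical deadzone table: texture category -> deadzone half-width.
DEADZONE_TABLE = {"flat": 4.0, "edge": 6.0, "texture": 9.0}


def quantize(coeff, step, deadzone):
    """Zero out coefficients inside the deadzone, round the rest to the nearest level."""
    magnitude = abs(coeff)
    if magnitude < deadzone:
        return 0
    level = int(magnitude / step + 0.5)
    return level if coeff >= 0 else -level


def quantize_block(coeffs, step, category):
    """Quantize a block's coefficients using the deadzone of its texture category."""
    deadzone = DEADZONE_TABLE[category]
    return [quantize(c, step, deadzone) for c in coeffs]


coeffs = [3, -8, 14, -26]
print(quantize_block(coeffs, step=10, category="flat"))
print(quantize_block(coeffs, step=10, category="texture"))
```

In this sketch the wider deadzone of the "texture" category zeroes the coefficient -8, which the "flat" category keeps as level -1.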

    CONTENT ADAPTIVE DETECTION OF IMAGES WITH STAND-OUT OBJECT
    6.
    Patent application
    Status: In force

    Publication No.: US20100321513A1

    Publication date: 2010-12-23

    Application No.: US12486487

    Filing date: 2009-06-17

    IPC classification: H04N5/228 G06K9/48 G06K9/34

    CPC classification: G06K9/00

    Abstract: Content adaptive detection of images having stand-out objects involves block variance-based detection and determining if an object includes a stand-out object. The images with a stand-out object are further processed to isolate an object of interest. The images without a detected stand-out object are further processed with a transition map-based detection method which includes generating a transition map. If an object portrait is determined from the transition map, then the image is further processed to isolate the object of interest.

    ORIENTATION-BASED APPROACH FOR FORMING A DEMOSAICED IMAGE, AND FOR COLOR CORRECTING AND ZOOMING THE DEMOSAICED IMAGE
    7.
    Patent application
    Status: In force

    Publication No.: US20100253817A1

    Publication date: 2010-10-07

    Application No.: US12418207

    Filing date: 2009-04-03

    IPC classification: H04N5/335 G06K9/00

    Abstract: A method and apparatus for forming a demosaiced image from a color-filter-array (“CFA”) image is provided. The CFA image comprises a first set of pixels colored according to a first (e.g., a green) color channel, a second set of pixels colored according to a second (e.g., a red) color channel and a third set of pixels colored according to a third (e.g., blue) color channel. The method may include obtaining an orientation map, which includes, for each pixel of the color-filter-array image, an indicator of orientation of an edge bounding such pixel. The method may further include interpolating the first color channel at the second and third sets of pixels as a function of the orientation map so as to form a fourth set of pixels. The method may also include interpolating the second color channel at the first and third sets of pixels as a function of the orientation map and the fourth set of pixels; and interpolating the third color channel at the first and second sets of pixels as a function of the orientation map and the fourth set of pixels.

    VIDEO SYSTEM WITH BLOCKING ARTIFACT FILTERING
    8.
    Patent application
    Status: Lapsed

    Publication No.: US20100111435A1

    Publication date: 2010-05-06

    Application No.: US12266551

    Filing date: 2008-11-06

    IPC classification: G06K9/40

    Abstract: A video system includes: analyzing video data, having a block; performing a transition change detection for determining a spatial intensity transition within the block; performing a block-wise similarity measurement on the block in the video data for identifying a blocking artifact; and filtering with a two dimensional cross filter every pixel in the block for removing the blocking artifact.

    Block based codec friendly edge detection and transform selection
    9.
    Patent application
    Status: In force

    Publication No.: US20090262800A1

    Publication date: 2009-10-22

    Application No.: US12148582

    Filing date: 2008-04-18

    IPC classification: H04N7/30

    Abstract: A low-complexity edge detection and DCT type selection method to improve the visual quality of an H.264/AVC encoded video sequence is described. Encoding-generated information is reused to detect an edge macroblock. The variance and Mean Absolute Difference (MAD) of one macroblock show a certain relationship that can be used to differentiate the edge macroblock from the non-edge macroblock. Also, the variance difference of neighboring macroblocks provides a hint for edge existence. Then, a block-based edge detection method uses this information. To determine the DCT type for each block, the detected edges are differentiated as visually obvious edge, texture-like edge, soft edge and strong edge. The 8×8 DCT is used for texture-like edges and the 4×4 DCT is used for all the other edges. The result is an efficient and accurate edge detection and transform selection method.
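
The two block statistics and the transform choice named in this abstract can be sketched as below. The abstract does not define how MAD is computed or which variance-to-MAD relationship marks an edge, so this sketch assumes MAD is the mean absolute deviation from the block mean and leaves the edge classification itself out. Function names and edge-class labels are hypothetical.

```python
def block_statistics(pixels):
    """Return (variance, MAD) of a flat list of pixel values.

    Assumption: MAD is taken as the mean absolute deviation from the block mean.
    """
    count = len(pixels)
    mean = sum(pixels) / count
    variance = sum((p - mean) ** 2 for p in pixels) / count
    mad = sum(abs(p - mean) for p in pixels) / count
    return variance, mad


def select_dct(edge_class):
    """8x8 DCT for texture-like edges, 4x4 DCT for every other edge class."""
    return "8x8" if edge_class == "texture-like" else "4x4"


print(block_statistics([0, 0, 10, 10]))
print(select_dct("texture-like"), select_dct("strong"))
```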

    CODING TOOL SELECTION IN VIDEO CODING BASED ON HUMAN VISUAL TOLERANCE
    10.
    Patent application
    Status: Under examination (published)

    Publication No.: US20090074058A1

    Publication date: 2009-03-19

    Application No.: US11855888

    Filing date: 2007-09-14

    IPC classification: H04N7/12

    Abstract: In one embodiment, a coding mode selection method is provided to improve the visual quality of an encoded video sequence. The coding mode is selected based on a human visual tolerance level. Picture data may be received for a video coding process. The picture data is then analyzed to determine human visual tolerance adjustment information. For example, parameters of a cost equation may be adjusted based on the human visual tolerance level, which may be a tolerance that is based on a distortion bound that the human visual system can tolerate. The picture data may be analyzed in places that are considered visually sensitive areas, such as trailing suspicious areas, stripping suspicious areas, picture boundary areas, and/or blocking suspicious areas. Depending on what kind of visually sensitive area is found in the picture data, a parameter in a cost equation may be adjusted based on different visual tolerance thresholds. The coding mode is then determined based on the cost.
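
To illustrate how adjusting a cost-equation parameter can change the selected mode, the sketch below assumes the common rate-distortion form J = D + λR and scales λ down in visually sensitive areas so that distortion weighs more. The cost form, the scale factor, the mode names, and the distortion and rate numbers are all assumptions; the abstract does not give the equation.

```python
def select_mode(candidates, lagrange_multiplier, sensitive=False, sensitive_scale=0.25):
    """Pick the mode with the lowest cost J = D + lambda * R.

    candidates maps a mode name to a (distortion, rate) pair.
    In a visually sensitive area, lambda is scaled by sensitive_scale (assumed value).
    """
    lam = lagrange_multiplier * (sensitive_scale if sensitive else 1.0)
    costs = {mode: distortion + lam * rate
             for mode, (distortion, rate) in candidates.items()}
    return min(costs, key=costs.get)


# Made-up (distortion, rate) pairs for three hypothetical modes.
candidates = {"skip": (100, 2), "inter": (60, 10), "intra": (20, 40)}
print(select_mode(candidates, lagrange_multiplier=4))
print(select_mode(candidates, lagrange_multiplier=4, sensitive=True))
```

With these numbers the ordinary area selects "inter", while the sensitive area selects "intra" because the lower λ makes its small distortion decisive.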
