VISUALLY MASKED METRIC FOR PIXEL BLOCK SIMILARITY
    31.
    发明申请
    VISUALLY MASKED METRIC FOR PIXEL BLOCK SIMILARITY 审中-公开
    用于像素块的可视化掩模公差

    公开(公告)号:US20120207212A1

    公开(公告)日:2012-08-16

    申请号:US13075282

    申请日:2011-03-30

    IPC分类号: H04N7/26

    摘要: Selecting a coding mode for coding video data by measuring a distortion sensitivity threshold for a pixel block, calculating a distortion threshold representative of the maximum distortion that may be effectively masked by the brightness and texture of the pixel block, estimating the distortion induced by coding the pixel block according to skip mode and coding the source pixel block with a predictive coding technique if the estimated distortion value exceeds the distortion threshold. The distortion sensitivity threshold may include, for example, a brightness value or a texture value. The contrast between the pixel block and the surrounding pixel blocks may also be considered such that if the contrast exceeds a contrast threshold calculated based on the measurement of brightness and texture, the source pixel block may be coded with a predictive coding technique even if the estimated distortion value does not exceed the distortion threshold.

    摘要翻译: 通过测量像素块的失真灵敏度阈值来选择用于编码视频数据的编码模式,计算代表由像素块的亮度和纹理有效地屏蔽的最大失真的失真阈值,估计由编码 像素块,并且如果估计的失真值超过失真阈值,则用预测编码技术对源像素块进行编码。 失真灵敏度阈值可以包括例如亮度值或纹理值。 还可以考虑像素块和周围像素块之间的对比度,使得如果对比度超过基于亮度和纹理的测量计算的对比阈值,则可以用预测编码技术对源像素块进行编码,即使估计 失真值不超过失真阈值。

    JOINT FRAME RATE AND RESOLUTION ADAPTATION
    32.
    发明申请
    JOINT FRAME RATE AND RESOLUTION ADAPTATION 有权
    联合框架速度和分辨率适应

    公开(公告)号:US20120195372A1

    公开(公告)日:2012-08-02

    申请号:US13018298

    申请日:2011-01-31

    IPC分类号: H04N7/32 H04N7/12

    摘要: A video coder employs techniques for applying frame rate adaptation and variable resolution adaptation in response to environmental coding factors present at the coding terminal. According to such techniques, a coder may estimate a coding quality level to be applied based on the environmental coding factors. The coder may retrieve from a controller table, settings for resolution and frame rate based on the estimated quality level. Optionally, the coder further may retrieve settings identifying a range of quantization parameters that may be used during coding. Prior to coding, the coder may configure input video data to match the resolution and frame rate settings retrieved from the controller table. Thereafter, the coder may code the reconfigured input video data by motion-compensation prediction constrained, as applicable, by the retrieved quantization parameter range.

    摘要翻译: 视频编码器根据存在于编码终端处的环境编码因素,采用应用帧速率适配和可变分辨率自适应的技术。 根据这样的技术,编码器可以基于环境编码因素估计要应用的编码质量等级。 编码器可以根据估计的质量水平从控制器表检索分辨率和帧速率的设置。 可选地,编码器还可以检索识别可能在编码期间使用的量化参数的范围的设置。 在编码之前,编码器可以配置输入视频数据以匹配从控制器表检索的分辨率和帧率设置。 此后,编码器可以通过运动补偿预测约束(如适用)通过检索的量化参数范围对重新配置的输入视频数据进行编码。

    METHOD AND APPARATUS FOR ERROR RESILIENT LONG TERM REFERENCING BLOCK REFRESH
    33.
    发明申请
    METHOD AND APPARATUS FOR ERROR RESILIENT LONG TERM REFERENCING BLOCK REFRESH 审中-公开
    用于错误恢复长期参考块修改的方法和装置

    公开(公告)号:US20120106632A1

    公开(公告)日:2012-05-03

    申请号:US12914650

    申请日:2010-10-28

    IPC分类号: H04N7/50

    摘要: A system and method for coding video data wherein a pixel block may be coded for refresh with reference to an LTR frame that was successfully transmitted, or has a high probability of having been successfully transmitted from the encoder to the decoder. Not all pixel blocks in the frame may be refreshed at the same rate. Pixel blocks containing edge details, containing a significant object, or containing foreground image data may be refreshed more often than pixel blocks containing smooth, background, or relatively less significant image data.

    摘要翻译: 一种用于对视频数据进行编码的系统和方法,其中像素块可以被编码用于参考已经成功发送的LTR帧进行刷新,或者具有从编码器成功发送到解码器的高概率。 不是帧中的所有像素块可以以相同的速率刷新。 包含重要对象或包含前景图像数据的边缘细节的像素块可能比包含平滑,背景或相对不太重要的图像数据的像素块更频繁地刷新。

    Scene-aware automatic-exposure control
    34.
    发明授权
    Scene-aware automatic-exposure control 有权
    场景感知自动曝光控制

    公开(公告)号:US08077256B1

    公开(公告)日:2011-12-13

    申请号:US12793848

    申请日:2010-06-04

    IPC分类号: H04N5/238

    CPC分类号: H04N5/2351

    摘要: A scene-aware auto-exposure control process stabilizes changes in a camera's auto-exposure settings so as to reduce lighting and color flicker during image capture operations. A metric, referred to as the Modified Adjusted Luminance (MAL) metric, is defined to remain relatively constant as long as the lighting of the scene being captured remains relatively constant. Thus, scene changes such as an object moving into, out of, or around in a scene do not significantly affect the MAL metric's value and do not, therefore, trigger an exposure adjustment. Once the MAL metric indicates a scene's lighting is stable, the camera's auto-exposure operation may be suppressed. As long as incoming frames indicate a stable lighting condition (based on the MAL metric), auto-exposure operation may remain suppressed. When incoming frames result in a substantially different MAL over a specified number of frames, auto-exposure operation may be restored.

    摘要翻译: 场景感知自动曝光控制过程可稳定照相机自动曝光设置的变化,以减少图像拍摄过程中的照明和颜色闪烁。 被称为修正调整亮度(MAL)度量的度量被定义为保持相对恒定,只要被捕获的场景的照明保持相对恒定。 因此,诸如移动到场景中的,离开或在场景中的对象的场景改变不会显着影响MAL度量的值,因此不会触发曝光调整。 一旦MAL指标表明场景的照明是稳定的,则可能会抑制相机的自动曝光操作。 只要进入的帧指示稳定的照明条件(基于MAL度量),自动曝光操作可能仍然被抑制。 当输入帧在指定数量的帧上导致基本上不同的MAL时,可以恢复自动曝光操作。

    H.264/AVC coder incorporating rate and quality controller
    36.
    发明授权
    H.264/AVC coder incorporating rate and quality controller 有权
    H.264 / AVC编码器并入速率和质量控制器

    公开(公告)号:US07986731B2

    公开(公告)日:2011-07-26

    申请号:US10811983

    申请日:2004-03-30

    IPC分类号: H04N7/12

    摘要: A rate control system is disclosed for video coding applications. The rate controller assigns a quantization parameter for video data in a picture in response to complexity indicators indicative of spatial complexity, motion complexity and/or bits per pel of the picture. A virtual buffer based quantizer parameter is proposed based on a virtual buffer fullness analysis and a target rate estimate, which is derived from the complexity indicators. A second quantizer parameter is proposed from a linear regression analysis of quantizer parameters used to code previously coded pictures of similar type (e.g., I pictures, P pictures or B pictures). A coding policy decision unit defines a final quantizer parameter from a comparison of the two proposed quantizer parameters.

    摘要翻译: 公开了一种用于视频编码应用的速率控制系统。 速率控制器响应于表示图像的空间复杂度,运动复杂度和/或每像素的复杂度指示符,为图像中的视频数据分配量化参数。 基于虚拟缓冲区丰满度分析和目标速率估计提出了一种基于虚拟缓冲器的量化器参数,该参数是从复杂性指标中得出的。 从用于对先前编码的类似类型的图像(例如,I图像,P图像或B图像)进行编码的量化器参数的线性回归分析提出了第二量化参数。 编码策略决定单元根据两个提出的量化器参数的比较来定义最终的量化器参数。

    Facial Pose Improvement with Perspective Distortion Correction
    37.
    发明申请
    Facial Pose Improvement with Perspective Distortion Correction 有权
    透视畸变修正的面部姿态改善

    公开(公告)号:US20110090303A1

    公开(公告)日:2011-04-21

    申请号:US12581043

    申请日:2009-10-16

    IPC分类号: H04N7/15 H04N5/217

    摘要: Methods, systems, and apparatus are presented for reducing distortion in an image, such as a video image. A video image can be captured by an image capture device, e.g. during a video conferencing session. Distortion correction processing, such as the application of one or more warping techniques, can be applied to the captured image to produce a distortion corrected image, which can be transmitted to one or more participants. The warping techniques can be performed in accordance with one or more warp parameters specifying a transformation of the captured image. Further, the warp parameters can be generated in accordance with an orientation of the image capture device, which can be determined based on sensor data or can be a fixed value. Additionally or alternatively, the warp parameters can be determined in accordance with a reference image or model to which the captured image should be warped.

    摘要翻译: 呈现了用于减少诸如视频图像的图像中的失真的方法,系统和装置。 视频图像可以由图像捕获设备捕获,例如, 在视频会议期间。 畸变校正处理,例如应用一个或多个翘曲技术,可以应用于所捕获的图像,以产生可以发送到一个或多个参与者的失真校正图像。 翘曲技术可以根据指定捕获图像的变换的一个或多个翘曲参数来执行。 此外,可以根据可以基于传感器数据确定的图像捕获装置的取向来生成翘曲参数,或者可以是固定值。 附加地或替代地,可以根据捕获的图像应该翘曲的参考图像或模型来确定翘曲参数。

    Method for implementing a quantizer in a multimedia compression and encoding system
    38.
    发明授权
    Method for implementing a quantizer in a multimedia compression and encoding system 有权
    在多媒体压缩和编码系统中实现量化器的方法

    公开(公告)号:US07769084B1

    公开(公告)日:2010-08-03

    申请号:US10427843

    申请日:2003-04-30

    IPC分类号: H04N7/18

    摘要: Method For Implementing A Quantizer In A Multimedia Compression And Encoding System is disclosed. In the Quantizer system of the present invention, several new quantization ideas are disclosed. In one embodiment, adjacent macroblocks are grouped together into macroblock groups. The macroblock groups are then assigned a common quantizer value. The common quantizer value may be selected based upon how the macroblocks are encoded, the type of macroblocks within the macroblock group (intra-blocks or inter-blocks), the history of the motion vectors associated with the macroblocks in the macroblock group, the residuals of the macroblocks in the macroblock group, and the energy of the macroblocks in the macroblock group. The quantizer value may be adjusted in a manner that is dependent on the current quantizer value. Specifically, if the quantizer value is at the low end of the quantizer scale, then only small adjustments are made. If the quantizer value is at the high end then larger adjustments may be made to the quantizer. Finally, in one embodiment, the quantizer is implemented along with an inverse quantizer for efficient operation.

    摘要翻译: 公开了一种在多媒体压缩和编码系统中实现量化器的方法。 在本发明的量化器系统中,公开了几种新的量化思想。 在一个实施例中,相邻宏块被分组在一起成为宏块组。 然后向宏块组分配一个公共量化器值。 可以基于宏块如何编码,宏块组(块内或块内)中的宏块的类型,与宏块组中的宏块相关联的运动向量的历史来选择公共量化器值,残差 的宏块组中的宏块的能量,以及宏块组中的宏块的能量。 量化器值可以以取决于当前量化器值的方式进行调整。 具体地,如果量化器值处于量化器标尺的低端,则仅进行小的调整。 如果量化器值处于高端,则可以对量化器进行较大的调整。 最后,在一个实施例中,量化器与用于有效操作的逆量化器一起被实现。

    Encoding and decoding data arrays using separate pre-multiplication stages
    39.
    发明申请
    Encoding and decoding data arrays using separate pre-multiplication stages 有权
    使用单独的预乘法阶段对数据阵列进行编码和解码

    公开(公告)号:US20080147765A1

    公开(公告)日:2008-06-19

    申请号:US12037061

    申请日:2008-02-25

    IPC分类号: G06F17/14

    CPC分类号: G06F17/147

    摘要: Some embodiments of the invention provide a method of performing a Discrete Cosine Transform (“DCT”) encoding or decoding coefficients of a data array by (1) multiplying the coefficients by a scalar value before the encoding or decoding, and then (2) dividing the encoded or decoded coefficients by the scalar value. When used in conjunction with fixed-point arithmetic, this method increases the precision of the encoded and decoded results. In addition, some embodiments provide a method of performing a two-dimensional (2D) Inverse Discrete Cosine Transform (“iDCT”). This method splits a pre-multiplication operation of the iDCT into two or more separate stages. When used in conjunction with fixed-point arithmetic, this splitting increases the precision of the decoded results of the iDCT.

    摘要翻译: 本发明的一些实施例提供了一种通过以下步骤对数据阵列的系数进行编码或解码的离散余弦变换(“DCT”)的方法:(1)在编码或解码之前将系数乘以标量值,然后(2) 编码或解码的系数乘以标量值。 当与定点算术结合使用时,该方法提高了编码和解码结果的精度。 此外,一些实施例提供了执行二维(2D)逆离散余弦变换(“iDCT”)的方法。 该方法将iDCT的预乘法运算分为两个或多个独立的阶段。 当与定点算术结合使用时,该分割增加了iDCT的解码结果的精度。