Motion estimation techniques for video encoding
    1.
    Patent application
    Legal status: under examination (published)

    Publication No.: US20060120612A1

    Publication date: 2006-06-08

    Application No.: US11008699

    Filing date: 2004-12-08

    IPC classification: G06K9/36

    CPC classification: H04N19/53 H04N19/56

    Abstract: This disclosure describes video encoding techniques and video encoding devices that implement such techniques. In one embodiment, this disclosure describes a video encoding device comprising a motion estimator that computes a motion vector predictor based on motion vectors previously calculated for video blocks in proximity to a current video block to be encoded, and uses the motion vector predictor in searching for a prediction video block used to encode the current video block, and a motion compensator that generates a difference block indicative of differences between the current video block to be encoded and the prediction video block.
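The flow described in this abstract (predict a motion vector from neighbouring blocks, search around the prediction, then form a difference block) can be illustrated with a minimal sketch. The abstract does not say how the predictor is computed or which matching cost is used; the component-wise median, the sum-of-absolute-differences cost, the small square search window, and all function names below are assumptions made for illustration only.

```python
from statistics import median

def predict_mv(neighbor_mvs):
    """Assumed rule: component-wise median of neighbouring motion vectors."""
    if not neighbor_mvs:
        return (0, 0)
    return (int(median(mv[0] for mv in neighbor_mvs)),
            int(median(mv[1] for mv in neighbor_mvs)))

def sad(cur, ref, bx, by, dx, dy, n):
    """Sum of absolute differences between the n x n block of `cur` at
    (bx, by) and the block of `ref` displaced by (dx, dy)."""
    return sum(abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
               for y in range(n) for x in range(n))

def search(cur, ref, bx, by, n, mvp, radius=1):
    """Test every displacement within `radius` of the predictor `mvp` and
    return (cost, motion_vector) for the best one, or None if none fits."""
    h, w = len(ref), len(ref[0])
    best = None
    for dy in range(mvp[1] - radius, mvp[1] + radius + 1):
        for dx in range(mvp[0] - radius, mvp[0] + radius + 1):
            inside = (0 <= bx + dx and bx + dx + n <= w and
                      0 <= by + dy and by + dy + n <= h)
            if not inside:
                continue  # candidate block would fall outside the reference
            cost = sad(cur, ref, bx, by, dx, dy, n)
            if best is None or cost < best[0]:
                best = (cost, (dx, dy))
    return best

def difference_block(cur, ref, bx, by, n, mv):
    """Residual between the current block and the chosen prediction block."""
    dx, dy = mv
    return [[cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x]
             for x in range(n)] for y in range(n)]
```

Centring the search on the predictor keeps the window small when neighbouring blocks move coherently, which is the motivation the abstract suggests; a real encoder would typically add sub-pixel refinement and a rate term to the cost.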

    Video coding with fine granularity scalability using cycle-aligned fragments
    2.
    Granted patent
    Legal status: in force

    Publication No.: US08233544B2

    Publication date: 2012-07-31

    Application No.: US11776679

    Filing date: 2007-07-12

    IPC classification: H04N7/12 H04N11/02

    CPC classification: H04N19/34

    Abstract: The disclosure describes FGS video coding techniques that use cycle-aligned fragments (CAFs). The techniques may perform cycle-based coding of FGS video data block coefficients and syntax elements, and encapsulate cycles in fragments for transmission. The fragments may be cycle-aligned such that a start of a payload of each of the fragments substantially coincides with a start of one of the cycles. In this manner, cycles can be readily accessed via individual fragments. Some cycles may be controlled with a vector mode to scan to a predefined position within a block before moving to another block. In this manner, the number of cycles can be reduced, reducing the number of fragments and associated overhead. The CAFs may be entropy coded independently of one another so that each fragment may be readily accessed and decoded without waiting for decoding of other fragments. Independent entropy coding may permit parallel decoding and simultaneous processing of fragments.
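The alignment property described in this abstract (every fragment payload begins at the start of a cycle) can be illustrated with a small packing sketch. The size limit, the greedy packing rule, and the name `pack_cycle_aligned` are assumptions for illustration; the abstract does not specify how cycles are assigned to fragments, and entropy coding is left out entirely.

```python
def pack_cycle_aligned(cycles, max_payload):
    """Group already-coded cycles (bytes objects) into fragments so that
    every fragment starts at a cycle boundary.

    Assumed policy: cycles are never split; a cycle longer than
    `max_payload` simply gets a fragment of its own.
    """
    fragments, current, size = [], [], 0
    for cycle in cycles:
        # Start a new fragment rather than splitting a cycle across two.
        if current and size + len(cycle) > max_payload:
            fragments.append(current)
            current, size = [], 0
        current.append(cycle)
        size += len(cycle)
    if current:
        fragments.append(current)
    return fragments
```

Because no fragment begins in the middle of a cycle, a receiver could in principle hand each fragment to a separate decoder, which is the parallel-decoding benefit the abstract mentions.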

    Distortion estimation for quantized data
    3.
    Granted patent
    Legal status: lapsed

    Publication No.: US08711926B2

    Publication date: 2014-04-29

    Application No.: US11672550

    Filing date: 2007-02-08

    IPC classification: H04N11/02

    CPC classification: H04N19/154 H04N19/124

    Abstract: Techniques for estimating distortion due to quantization of data are described. A histogram with multiple bins may be obtained for a set of coefficients to be quantized. Distortion due to quantization of the set of coefficients may be estimated based on the histogram and average distortions for the histogram bins. The number of coefficients in each bin may be multiplied by an average distortion for the bin to obtain a per-bin distortion. The per-bin distortions for all of the bins may be accumulated and scaled with a correction factor to obtain the estimated distortion. The techniques may be used to estimate distortions for a set of coding elements. Distortion and rate may be estimated for each coding element for each of multiple quantization steps. A set of quantization steps may be selected for the set of coding elements based on the estimated distortions and the estimated rates for the set of coding elements for different quantization steps.
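The estimate described in this abstract amounts to a weighted sum: count the coefficients in each histogram bin, multiply each count by that bin's average distortion, add up the products, and scale by a correction factor. A minimal sketch follows; binning by coefficient magnitude, the bin edges, the per-bin averages, and the function name are assumptions for illustration and are not taken from the patent.

```python
from bisect import bisect_right

def estimate_distortion(coeffs, bin_edges, avg_bin_distortion, correction=1.0):
    """Estimate quantization distortion from a histogram of coefficients.

    bin_edges: ascending upper bounds (exclusive) of all bins but the last,
               so len(avg_bin_distortion) == len(bin_edges) + 1.
    avg_bin_distortion: assumed precomputed average distortion per bin.
    """
    counts = [0] * len(avg_bin_distortion)
    for c in coeffs:
        # Assumption: coefficients are binned by magnitude.
        counts[bisect_right(bin_edges, abs(c))] += 1
    per_bin = [n * d for n, d in zip(counts, avg_bin_distortion)]
    return correction * sum(per_bin)
```

For example, with edges `[4, 8]` the coefficients `[1, -2, 5, 9, -12]` fall into the three bins with counts 2, 1, and 2, so per-bin averages of 1.0, 4.0, and 9.0 give an uncorrected estimate of 2 + 4 + 18 = 24.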

    DISTORTION ESTIMATION FOR QUANTIZED DATA
    4.
    Patent application
    Legal status: lapsed

    Publication No.: US20080192821A1

    Publication date: 2008-08-14

    Application No.: US11672550

    Filing date: 2007-02-08

    IPC classification: H04N7/26

    CPC classification: H04N19/154 H04N19/124

    Abstract: Techniques for estimating distortion due to quantization of data are described. A histogram with multiple bins may be obtained for a set of coefficients to be quantized. Distortion due to quantization of the set of coefficients may be estimated based on the histogram and average distortions for the histogram bins. The number of coefficients in each bin may be multiplied by an average distortion for the bin to obtain a per-bin distortion. The per-bin distortions for all of the bins may be accumulated and scaled with a correction factor to obtain the estimated distortion. The techniques may be used to estimate distortions for a set of coding elements. Distortion and rate may be estimated for each coding element for each of multiple quantization steps. A set of quantization steps may be selected for the set of coding elements based on the estimated distortions and the estimated rates for the set of coding elements for different quantization steps.

    VIDEO CODING WITH FINE GRANULARITY SCALABILITY USING CYCLE-ALIGNED FRAGMENTS
    5.
    Patent application
    Legal status: in force

    Publication No.: US20080013622A1

    Publication date: 2008-01-17

    Application No.: US11776679

    Filing date: 2007-07-12

    CPC classification: H04N19/34

    Abstract: The disclosure describes FGS video coding techniques that use cycle-aligned fragments (CAFs). The techniques may perform cycle-based coding of FGS video data block coefficients and syntax elements, and encapsulate cycles in fragments for transmission. The fragments may be cycle-aligned such that a start of a payload of each of the fragments substantially coincides with a start of one of the cycles. In this manner, cycles can be readily accessed via individual fragments. Some cycles may be controlled with a vector mode to scan to a predefined position within a block before moving to another block. In this manner, the number of cycles can be reduced, reducing the number of fragments and associated overhead. The CAFs may be entropy coded independently of one another so that each fragment may be readily accessed and decoded without waiting for decoding of other fragments. Independent entropy coding may permit parallel decoding and simultaneous processing of fragments.

    Interactive speech recognition apparatus and method with conditioned voice prompts
    6.
    Granted patent
    Legal status: in force

    Publication No.: US07328159B2

    Publication date: 2008-02-05

    Application No.: US10050378

    Filing date: 2002-01-15

    IPC classification: G10L21/00 G10L15/20 G10L13/00

    CPC classification: G10L15/22 G10L25/78

    Abstract: An improved system for an interactive voice recognition system (400) includes a voice prompt generator (401) for generating a voice prompt in a first frequency band (501). A speech detector (406) detects the presence of speech energy in a second frequency band (502). The first and second frequency bands (501, 502) are essentially conjugate frequency bands. A voice data generator (412) generates voice data based on an output of the voice prompt generator (401) and audible speech of a voice response generator (402). A control signal (422) controls the voice prompt generator (401) based on whether the speech detector (406) detects the presence of speech energy in the second frequency band (502). A back end (405) of the interactive voice recognition system (400) is configured to operate on an extracted front end voice feature based on whether the speech detector (406) detects the presence of speech energy in the second frequency band (502).
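The control idea in this abstract (play the prompt in one band, watch a different band for the user's speech, and use the result to control the prompt) can be sketched with a plain band-energy detector. This is only an illustration under stated assumptions: "conjugate" is read here as non-overlapping bands, the detector is a naive DFT energy threshold, and the function names, band limits, and threshold are hypothetical rather than taken from the patent.

```python
import cmath
import math

def band_energy(frame, sample_rate, lo_hz, hi_hz):
    """Energy of `frame` in [lo_hz, hi_hz), computed with a naive DFT."""
    n = len(frame)
    energy = 0.0
    for k in range(n // 2 + 1):
        freq = k * sample_rate / n
        if lo_hz <= freq < hi_hz:
            coeff = sum(s * cmath.exp(-2j * math.pi * k * i / n)
                        for i, s in enumerate(frame))
            energy += abs(coeff) ** 2
    return energy

def speech_detected(frame, sample_rate, detect_band, threshold):
    """True when the detection band (the band the prompt does not occupy,
    by assumption) carries more energy than `threshold`."""
    lo_hz, hi_hz = detect_band
    return band_energy(frame, sample_rate, lo_hz, hi_hz) > threshold
```

A controller built on this sketch would stop or attenuate the prompt once `speech_detected` returns true; since the prompt's own energy lies outside the detection band, its echo should not trigger the detector.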

    Intensity compensation techniques in video processing
    7.
    Granted patent
    Legal status: in force

    Publication No.: US08599920B2

    Publication date: 2013-12-03

    Application No.: US12185889

    Filing date: 2008-08-05

    IPC classification: H04N7/12

    Abstract: Techniques for intensity compensation in video processing are provided. In one configuration, a wireless communication device (e.g., cellular phone, etc.) compliant with the VC1-SMPTE standard comprises a processor that is configured to execute instructions operative to reconstruct reference frames from a received video bitstream. A non-intensity-compensated copy of a reference frame of the bitstream is stored in a memory of the device and used for defining the displayable images and for on-the-fly generation of a stream of intensity-compensated pixels to perform motion compensation calculations for frames of the video bitstream.
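The storage scheme in this abstract (keep only the uncompensated reference frame and derive compensated pixels when motion compensation needs them) can be sketched as a lookup table applied lazily to the pixels being fetched. The linear scale-and-shift mapping below is a generic stand-in and not the exact VC-1 formula; `build_lut`, `compensated_block`, and their parameters are hypothetical names.

```python
def build_lut(scale, shift):
    """256-entry table mapping a stored pixel to its compensated value.
    Assumption: a simple linear mapping clamped to the 8-bit range."""
    return [min(255, max(0, round(scale * p + shift))) for p in range(256)]

def compensated_block(ref, lut, x0, y0, w, h):
    """Yield rows of intensity-compensated pixels for one w x h block.
    `ref` is read but never modified, so the same stored frame can still
    be used for display."""
    for y in range(y0, y0 + h):
        yield [lut[p] for p in ref[y][x0:x0 + w]]
```

Generating compensated pixels per block avoids holding a second, compensated copy of every reference frame, which appears to be the memory saving the abstract has in mind.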

    Two pass rate control techniques for video coding using a min-max approach
    9.
    Granted patent
    Legal status: in force

    Publication No.: US08379721B2

    Publication date: 2013-02-19

    Application No.: US11303618

    Filing date: 2005-12-15

    IPC classification: H04N11/04

    Abstract: This disclosure describes rate control techniques that can improve video coding based on a “two-pass” approach. The first pass codes a video sequence using a first set of quantization parameters (QPs) for the purpose of estimating rate-distortion characteristics of the video sequence based on the statistics of the first pass. A second set of QPs can then be defined for a second coding pass. The estimated rate-distortion characteristics of the first pass are used to select QPs for the second pass in a manner that minimizes quality fluctuation between the frames of the video sequence. Furthermore, selection of the second set of QPs may also substantially maximize quality of the frames at the substantially minimized quality fluctuation in order to achieve low average frame distortion with the minimized quality fluctuation.
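One way to read the min-max selection in this abstract is: given per-frame rate-distortion estimates from the first pass, choose second-pass QPs so that the worst frame distortion is as small as the bit budget allows. The sketch below implements that reading only. The table layout (`{qp: (rate, distortion)}` per frame), the exhaustive threshold scan, and the function name are assumptions, and the patent's actual selection procedure may differ.

```python
def select_qps_minmax(rd_tables, rate_budget):
    """Pick one QP per frame minimizing the maximum frame distortion
    subject to sum(rates) <= rate_budget.

    rd_tables: list (one entry per frame) of {qp: (rate, distortion)},
               assumed to be estimated from first-pass statistics.
    Returns (list_of_qps, max_distortion), or None if nothing fits.
    """
    thresholds = sorted({d for table in rd_tables
                         for (_, d) in table.values()})
    for t in thresholds:  # try the tightest distortion cap first
        choice, total = [], 0
        for table in rd_tables:
            feasible = [(r, qp) for qp, (r, d) in table.items() if d <= t]
            if not feasible:
                break  # this frame cannot meet the cap at any QP
            rate, qp = min(feasible)  # cheapest QP that meets the cap
            choice.append(qp)
            total += rate
        else:
            if total <= rate_budget:
                return choice, t
    return None
```

Capping every frame at the same distortion level pulls the frames toward similar quality, which corresponds to the reduced quality fluctuation described above; a fuller implementation would spend any leftover bits to lower distortion further.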

    Electronic video image stabilization
    10.
    Granted patent
    Legal status: in force

    Publication No.: US07840085B2

    Publication date: 2010-11-23

    Application No.: US11487078

    Filing date: 2006-07-14

    IPC classification: G06K9/40

    Abstract: This disclosure describes electronic video image stabilization techniques for imaging and video devices. The techniques involve determining motion and spatial statistics for individual macroblocks of a frame, and determining a global motion vector for the frame based on the statistics of each of the macroblocks. In one embodiment, a method of performing electronic image stabilization includes performing spatial estimation on each of a plurality of macroblocks within a frame of an image to obtain spatial statistics for each of the macroblocks, performing motion estimation on each of the plurality of macroblocks to obtain motion statistics for each of the macroblocks, integrating the spatial statistics and the motion statistics of each of the macroblocks to determine a global motion vector for the frame, and offsetting the image with respect to a reference window according to the global motion vector.
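The integration step in this abstract (combine per-macroblock spatial and motion statistics into one global motion vector, then shift the image relative to a reference window) can be sketched as a weighted average. The specific statistics (`activity` as a texture measure, `sad` as the motion-match error), the weighting formula, the flat-block cutoff, and the function names are all assumptions for illustration; the abstract does not define them.

```python
def global_motion_vector(macroblocks, min_activity=10.0):
    """Combine per-macroblock statistics into one frame-level vector.

    Each macroblock is a dict with hypothetical keys:
      "mv":       (dx, dy) from motion estimation
      "sad":      matching error of that vector (motion statistic)
      "activity": texture measure of the block (spatial statistic)
    """
    sum_w = sum_x = sum_y = 0.0
    for mb in macroblocks:
        if mb["activity"] < min_activity:
            continue  # flat blocks give unreliable motion vectors
        # Assumed weighting: trust textured blocks with a good match more.
        w = mb["activity"] / (1.0 + mb["sad"])
        sum_w += w
        sum_x += w * mb["mv"][0]
        sum_y += w * mb["mv"][1]
    if sum_w == 0.0:
        return (0, 0)
    return (round(sum_x / sum_w), round(sum_y / sum_w))

def offset_window(window_origin, gmv):
    """Shift the reference window origin by the global motion vector."""
    return (window_origin[0] + gmv[0], window_origin[1] + gmv[1])
```

Whether the window is moved along the global motion vector or against it depends on the sign convention of the motion estimator, so the addition in `offset_window` is just one convention.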
