Method and apparatus for using coding scheme selection patterns in a predictive speech coder to reduce sensitivity to frame error conditions
    1.
    发明授权
    Method and apparatus for using coding scheme selection patterns in a predictive speech coder to reduce sensitivity to frame error conditions 有权
    在预测语音编码器中使用编码方案选择模式以降低对帧错误状况的灵敏度的方法和装置

    公开(公告)号:US06438518B1

    公开(公告)日:2002-08-20

    申请号:US09429754

    申请日:1999-10-28

    IPC分类号: G10L1904

    CPC分类号: G10L19/18 G10L19/02

    摘要: A method and apparatus for using coding scheme selection patterns in a predictive speech coder to reduce sensitivity to frame error conditions includes a speech coder configured to select from among various predictive coding modes. After a predefined number of speech frames have been predictively coded, the speech coder codes one frame with a nonpredictive coding mode or a mildly predictive coding mode. The predefined number of frames can be determined in advance from the subjective standpoint of a listener. The predefined number of frames may be varied periodically. An average coding bit rate may be maintained for the speech coder by ensuring that an average coding bit rate is maintained for each successive pattern, or group, of predictively coded speech frames including at least one nonpredictively coded or mildly predictively coded speech frame.

    摘要翻译: 一种用于在预测语音编码器中使用编码方案选择模式以降低对帧错误状况的灵敏度的方法和装置包括配置成从各种预测编码模式中进行选择的语音编码器。 在预定数量的语音帧已被预测编码之后,语音编码器以非预测编码模式或轻度预测编码模式对一帧进行编码。 可以从收听者的主观角度预先确定预定数量的帧。 预定数量的帧可以周期性地改变。 可以通过确保针对包括至少一个非预测编码或温和预测编码的语音帧的预测编码语音帧的每个连续模式或组维持平均编码比特率来维持语音编码器的平均编码比特率。

    Frame erasure compensation method in a variable rate speech coder
    2.
    发明授权
    Frame erasure compensation method in a variable rate speech coder 有权
    可变速率语音编码器中的帧擦除补偿方法

    公开(公告)号:US06584438B1

    公开(公告)日:2003-06-24

    申请号:US09557283

    申请日:2000-04-24

    IPC分类号: G10L1300

    摘要: A frame erasure compensation method in a variable-rate speech coder includes quantizing, with a first encoder, a pitch lag value for a current frame and a first delta pitch lag value equal to the difference between the pitch lag value for the current frame and the pitch lag value for the previous frame. A second, predictive encoder quantizes only a second delta pitch lag value for the previous frame (equal to the difference between the pitch lag value for the previous frame and the pitch lag value for the frame prior to that frame). If the frame prior to the previous frame is processed as a frame erasure, the pitch lag value for the previous frame is obtained by subtracting the first delta pitch lag value from the pitch lag value for the current frame. The pitch lag value for the erasure frame is then obtained by subtracting the second delta pitch lag value from the pitch lag value for the previous frame. Additionally, a waveform interpolation method may be used to smooth discontinuities caused by changes in the coder pitch memory.

    摘要翻译: 可变速率语音编码器中的帧擦除补偿方法包括:利用第一编码器量化当前帧的音调滞后值,以及等于当前帧的音调滞后值与第 前一帧的音调滞后值。 第二预测编码器仅量化前一帧的第二增量音调滞后值(等于先前帧的音调滞后值与该帧之前的帧的音调滞后值之间的差)。 如果先前帧之前的帧被作为帧擦除处理,则通过从当前帧的音调滞后值中减去第一增量音调滞后值来获得先前帧的音调滞后值。 然后通过从前一帧的音调滞后值减去第二增量音调滞后值来获得擦除帧的音调滞后值。 此外,可以使用波形插值方法来平滑由编码器音调存储器的变化引起的不连续性。

    Amplitude quantization scheme for low-bit-rate speech coders
    6.
    发明授权
    Amplitude quantization scheme for low-bit-rate speech coders 有权
    低比特率语音编码器的幅度量化方案

    公开(公告)号:US06324505B1

    公开(公告)日:2001-11-27

    申请号:US09356756

    申请日:1999-07-19

    IPC分类号: G10L2102

    CPC分类号: G10L19/0204 G10L25/18

    摘要: An amplitude quantization scheme for low-bit-rate speech coders includes the first step of extracting a vector of spectral information from a frame. The energy of the vector is normalized to generate gain factors. The gain factors are differentially vector quantized. The normalized gain factors are non-uniformly downsampled to generate a fixed-dimension vector with elements associated with a set of non-uniform frequency bands. The fixed-dimension vector is split into two or more sub-vectors. The sub-vectors are differentially quantized, to best advantage with a harmonic cloning process.

    摘要翻译: 用于低比特率语音编码器的幅度量化方案包括从帧提取频谱信息的向量的第一步骤。 向量的能量被归一化以产生增益因子。 增益因子是差分矢量量化的。 归一化的增益因子被非均匀地下采样以产生具有与一组非均匀频带相关联的元素的固定维度向量。 固定维度向量被分成两个或多个子向量。 子矢量被差分量化,以利用谐波克隆过程的最佳优势。

    Adaptive intra-refresh for digital video encoding
    7.
    发明授权
    Adaptive intra-refresh for digital video encoding 有权
    适用于数字视频编码的内部刷新

    公开(公告)号:US08948266B2

    公开(公告)日:2015-02-03

    申请号:US11025297

    申请日:2004-12-28

    摘要: An adaptive Intra-refresh (IR) technique for digital video encoding adjusts IR rate based on video content, or a combination of video content and channel condition. The IR rate may be applied at the frame level or macroblock (MB) level. At the frame level, the IR rate specifies the percentage of MBs to be Intra-coded within the frame. At the MB level, the IR rate defines a statistical probability that a particular MB is to be Intra-coded. The IR rate is adjusted in proportion to a combined metric that weighs estimated channel loss probability, frame-to-frame variation, and texture information. The IR rate can be determined using a close-form solution that requires relatively low implementation complexity. For example, such a close-form does not require iteration or an exhaustive search. In addition, the IR rate can be determined from parameters that are available before motion estimation and compensation are performed.

    摘要翻译: 用于数字视频编码的自适应内部刷新(IR)技术基于视频内容或视频内容和频道条件的组合来调整IR速率。 可以在帧级或宏块(MB)级应用IR速率。 在帧级别,IR速率指定帧内帧内编码的百分比。 在MB级别,IR率定义了特定MB被内部编码的统计概率。 IR速率与重量估计的信道丢失概率,帧到帧变化和纹理信息的组合度量成比例地调整。 IR速率可以使用需要较低实现复杂度的紧密形式的解决方案来确定。 例如,这种关闭形式不需要迭代或穷尽搜索。 另外,可以在执行运动估计和补偿之前可用的参数来确定IR速率。

    3D video encoding
    8.
    发明授权
    3D video encoding 有权
    3D视频编码

    公开(公告)号:US08594180B2

    公开(公告)日:2013-11-26

    申请号:US11677335

    申请日:2007-02-21

    IPC分类号: G06F21/00

    摘要: A stereo 3D video frame includes left and right components that are combined to produce a stereo image. For a given amount of distortion, the left and right components may have different impacts on perceptual visual quality of the stereo image due to asymmetry in the distortion response of the human eye. A 3D video encoder adjusts an allocation of coding bits between left and right components of the 3D video based on a frame-level bit budget and a weighting between the left and right components. The video encoder may generate the bit allocation in the rho (ρ) domain. The weighted bit allocation may be derived based on a quality metric that indicates overall quality produced by the left and right components. The weighted bit allocation compensates for the asymmetric distortion response to reduce overall perceptual distortion in the stereo image and thereby enhance or maintain visual quality.

    摘要翻译: 立体3D视频帧包括组合以产生立体图像的左和右组件。 对于给定量的失真,由于人眼的失真响应的不对称,左和右分量可能对立体图像的感知视觉质量具有不同的影响。 3D视频编码器基于帧级比特预算和左右分量之间的加权来调整3D视频的左和右分量之间的编码比特的分配。 视频编码器可以在rho(rho)域中生成比特分配。 可以基于指示左组件和右组件产生的总体质量的质量度量来导出加权比特分配。 加权比特分配补偿非对称失真响应,以减少立体图像中的整体感知失真,从而增强或维持视觉质量。

    Methods of performing error concealment for digital video
    9.
    发明授权
    Methods of performing error concealment for digital video 有权
    对数字视频执行错误隐藏的方法

    公开(公告)号:US08379734B2

    公开(公告)日:2013-02-19

    申请号:US11690132

    申请日:2007-03-23

    IPC分类号: H04N7/68

    摘要: Error concealment is used to hide the effects of errors detected within digital video information. A complex error concealment mode decision is disclosed to determine whether spatial error concealment (SEC) or temporal error concealment (TEC) should be used. The error concealment mode decision system uses different methods depending on whether the damaged frame is an intra-frame or an inter-frame. If the video frame is an intra-frame then a similarity metric is used to determine if the intra-frame represents a scene-change or not. If the video frame is an intra-frame, a complex multi-termed equation is used to determine whether SEC or TEC should be used. A novel spatial error concealment technique is disclosed for use when the error concealment mode decision determines that spatial error concealment should be used for reconstruction. The novel spatial error concealment technique divides a corrupt macroblock into four different regions, a corner region, a row adjacent to the corner region, a column adjacent to the corner region, and a remainder main region. Those regions are then reconstructed in that order and information from earlier reconstructed regions may be used in later reconstructed regions. Finally, a macroblock refreshment technique is disclosed for preventing error propagation from harming non-corrupt inter-blocks. Specifically, an inter-macroblock may be ‘refreshed’ using spatial error concealment if there has been significant error caused damage that may cause the inter-block to propagate the errors.

    摘要翻译: 错误隐藏用于隐藏数字视频信息中检测到的错误的影响。 公开了一种复杂的错误隐藏模式决定,以确定是否应使用空间误差隐藏(SEC)或时间误差隐藏(TEC)。 错误隐藏模式决策系统使用不同的方法,取决于损坏的帧是帧内还是帧间。 如果视频帧是帧内帧,则使用相似性度量来确定帧内是否表示场景改变。 如果视频帧是帧内帧,则使用复数多方程来确定是否应使用SEC或TEC。 当错误隐藏模式决定确定空间误差隐藏应用于重建时,公开了一种新颖的空间误差隐藏技术。 新颖的空间误差隐藏技术将腐败的宏块分为四个不同的区域,一个角区域,一个与拐角区域相邻的一行,一个邻近拐角区域的列以及一个剩余的主区域。 然后按照该顺序重建那些区域,并且可以在稍后的重建区域中使用来自较早重建区域的信息。 最后,公开了一种宏块刷新技术,用于防止错误传播损害非损坏的块间。 具体地,如果存在可能导致块间传播错误的严重错误引起的损坏,则可以使用空间错误隐藏来刷新宏块间宏块。