Adaptive intra-refresh for digital video encoding

    公开(公告)号:US20060078051A1

    公开(公告)日:2006-04-13

    申请号:US11025297

    申请日:2004-12-28

    摘要: An adaptive Intra-refresh (IR) technique for digital video encoding adjusts IR rate based on video content, or a combination of video content and channel condition. The IR rate may be applied at the frame level or macroblock (MB) level. At the frame level, the IR rate specifies the percentage of MBs to be Intra-coded within the frame. At the MB level, the IR rate defines a statistical probability that a particular MB is to be Intra-coded. The IR rate is adjusted in proportion to a combined metric that weighs estimated channel loss probability, frame-to-frame variation, and texture information. The IR rate can be determined using a close-form solution that requires relatively low implementation complexity. For example, such a close-form does not require iteration or an exhaustive search. In addition, the IR rate can be determined from parameters that are available before motion estimation and compensation are performed.

    Tandem-free intersystem voice communication
    2.
    发明授权
    Tandem-free intersystem voice communication 有权
    无串联系统间语音通信

    公开(公告)号:US08432935B2

    公开(公告)日:2013-04-30

    申请号:US12181972

    申请日:2008-07-29

    IPC分类号: H04J3/16 G10L11/06

    CPC分类号: G10L19/173 H04W88/181

    摘要: Techniques are presented herein to provide tandem-free operation between two wireless terminals through two otherwise incompatible wireless networks. Specifically, embodiments provide tandem-free operation between a wireless terminal communicating through a continuous transmission (CTX) wireless channel to a wireless terminal communicating through a discontinuous transmission (DTX) wireless channel. In a first aspect, inactive speech frames are translated between DTX and CTX formats. In a second aspect, each wireless terminal includes an active speech decoder that is compatible with the active speech encoder on the opposite end of the mobile-to-mobile connection.

    摘要翻译: 本文提供了技术,以通过两个否则不兼容的无线网络在两个无线终端之间提供无串联操作。 具体地,实施例在通过连续传输(CTX)无线信道通信到通过不连续传输(DTX)无线信道进行通信的无线终端的无线终端之间提供无串联操作。 在第一方面,非活动语音帧在DTX和CTX格式之间被转换。 在第二方面,每个无线终端包括与移动到移动连接的相对端上的活动语音编码器兼容的活动语音解码器。

    Sub-sampled excitation waveform codebooks
    3.
    发明授权
    Sub-sampled excitation waveform codebooks 有权
    次采样激励波形码本

    公开(公告)号:US07698132B2

    公开(公告)日:2010-04-13

    申请号:US10322245

    申请日:2002-12-17

    IPC分类号: G10L19/00 G10L19/12 G10L21/02

    摘要: Methods and apparatus are presented for reducing the number of bits needed to represent an excitation waveform. An acoustic signal in an analysis frame is analyzed to determine whether it is a band-limited signal. A sub-sampled sparse codebook is used to generate the excitation waveform if the acoustic signal is a band-limited signal. The sub-sampled sparse codebook is generated by decimating permissible pulse locations from the codebook track in accordance with the frequency characteristic of the acoustic signal.

    摘要翻译: 提出了用于减少表示激励波形所需的位数的方法和装置。 分析分析帧中的声信号以确定其是否是带限信号。 如果声信号是带限信号,则使用子采样稀疏码本来产生激励波形。 通过根据声信号的频率特性对来自码本磁道的允许脉冲位置进行抽取,产生子采样稀疏码本。

    Content-adaptive background skipping for region-of-interest video coding
    4.
    发明申请
    Content-adaptive background skipping for region-of-interest video coding 有权
    针对感兴趣区域视频编码的内容自适应背景跳过

    公开(公告)号:US20060204113A1

    公开(公告)日:2006-09-14

    申请号:US11200407

    申请日:2005-08-09

    IPC分类号: G06K9/36 H04N11/02

    摘要: The disclosure is directed to techniques for content-adaptive background skipping for region-of-interest (ROI) video coding. The techniques may be useful in video telephony (VT) applications such as video streaming and videoconferencing, and especially useful in low bit-rate wireless communication applications, such as mobile VT. The disclosed techniques analyze content information of a video frame to dynamically determine whether to skip a non-ROI area within the frame. For example, the skipping determination may be based on content activity, such as ROI shape deformation, ROI motion, non-ROI motion, non-ROI texture complexity, and accumulated distortion due to non-ROI skipping. The skip determination may operate in conjunction with either frame-level or macroblock-level bit allocation.

    摘要翻译: 本公开涉及用于感兴趣区域(ROI)视频编码的内容自适应背景跳过技术。 这些技术在诸如视频流和视频会议的视频电话(VT)应用中可能是有用的,并且在诸如移动VT的低比特率无线通信应用中尤其有用。 所公开的技术分析视频帧的内容信息以动态地确定是否跳过帧内的非ROI区域。 例如,跳过确定可以基于诸如ROI形状变形,ROI运动,非ROI运动,非ROI纹理复杂度以及由于非ROI跳过而导致的累积失真的内容活动。 跳过确定可以与帧级或宏块级位分配一起操作。

    Adaptive frame skipping techniques for rate controlled video encoding

    公开(公告)号:US20060198443A1

    公开(公告)日:2006-09-07

    申请号:US11193249

    申请日:2005-07-29

    摘要: The disclosure is directed to adaptive frame skipping techniques for rate controlled video encoding of a video sequence. According to the disclosed techniques, an encoder performs frame skipping in an intelligent manner that can improve video quality of the encoded sequence relative to encoding using conventional frame skipping. In particular, the disclosed frame skipping scheme is adaptive and considers motion activity of the video frames in order to identify certain frames that can be skipped without sacrificing significant video quality. The described frame skipping techniques may take into account the tradeoff between spatial and temporal quality of different video frames. In this manner, the techniques can allocate limited resources between the spatial and temporal quality in a way that can improve the visual appearance of a video sequence.

    Systems, methods and apparatus for context descriptor transmission
    7.
    发明授权
    Systems, methods and apparatus for context descriptor transmission 有权
    用于上下文描述符传输的系统,方法和装置

    公开(公告)号:US08600740B2

    公开(公告)日:2013-12-03

    申请号:US12129525

    申请日:2008-05-29

    IPC分类号: G10L21/00 G10L21/02

    摘要: Configurations disclosed herein include systems, methods and apparatus that may be applied in a voice communications and/or storage application to remove, enhance, and/or replace the existing context. Example embodiments may first remove any existing context from a digital audio signal to obtain a context suppressed signal. The context suppressed signal may then be encoded. An audio context may be selected from among a plurality of audio contexts, with the selected audio context inserted into a signal based on the encoded context suppressed signal.

    摘要翻译: 本文公开的配置包括可以应用于语音通信和/或存储应用中以去除,增强和/或替换现有上下文的系统,方法和装置。 示例性实施例可以首先从数字音频信号中去除任何现有上下文以获得上下文抑制信号。 然后可以编码上下文抑制信号。 可以从多个音频上下文中选择音频上下文,其中所选择的音频上下文基于编码的上下文抑制信号插入到信号中。

    Systems, methods, and apparatus for context processing using multi resolution analysis
    8.
    发明授权
    Systems, methods, and apparatus for context processing using multi resolution analysis 失效
    使用多分辨率分析的上下文处理的系统,方法和装置

    公开(公告)号:US08554550B2

    公开(公告)日:2013-10-08

    申请号:US12129466

    申请日:2008-05-29

    IPC分类号: G10L21/02 G10L19/00 G10L11/00

    摘要: Configurations disclosed herein include systems, methods, and apparatus that may be applied in a voice communications and/or storage application to remove, enhance, and/or replace the existing context. Particularly, certain embodiments contemplate suppressing the context component from the digital audio signal to obtain a context-suppressed signal; generating an audio context signal that is based on a first filter and a first plurality of sequences, each of the first plurality of sequences having a different time resolution and mixing a first signal that is based on the generated audio context signal with a second signal that is based on the context-suppressed signal to obtain a context-enhanced signal, wherein generating an audio context signal includes applying the first filter to each of the first plurality of sequences.

    摘要翻译: 本文公开的配置包括可以应用于语音通信和/或存储应用中以去除,增强和/或替代现有上下文的系统,方法和装置。 具体地,某些实施例考虑从数字音频信号抑制上下文成分以获得上下文抑制信号; 生成基于第一滤波器和第一多个序列的音频上下文信号,所述第一多个序列中的每一个具有不同的时间分辨率,并且将基于所生成的音频上下文信号的第一信号与第二信号混合,所述第二信号 基于所述上下文抑制信号以获得上下文增强信号,其中生成音频上下文信号包括将所述第一滤波器应用于所述第一多个序列中的每一个。

    Systems, methods, and apparatus for context processing using multiple microphones
    9.
    发明授权
    Systems, methods, and apparatus for context processing using multiple microphones 有权
    使用多个麦克风进行上下文处理的系统,方法和装置

    公开(公告)号:US08483854B2

    公开(公告)日:2013-07-09

    申请号:US12129421

    申请日:2008-05-29

    IPC分类号: G06F17/00

    摘要: Configurations disclosed herein include systems, methods, and apparatus that may be applied in a voice communications and/or storage application to remove, enhance, and/or replace the existing context. In one aspect, a method of processing a digital audio signal that includes a first audio context is disclosed. The method comprises based on a first audio signal that is produced by a first microphone, suppressing the first audio context from the digital audio signal to obtain a context-suppressed signal. The method may further comprise selecting a second context based on the first audio context, and mixing the second audio context with a signal that is based on the context-suppressed signal to obtain a context-enhanced signal.

    摘要翻译: 本文公开的配置包括可以应用于语音通信和/或存储应用中以去除,增强和/或替代现有上下文的系统,方法和装置。 一方面,公开了一种处理包括第一音频环境的数字音频信号的方法。 该方法包括基于由第一麦克风产生的第一音频信号,从数字音频信号抑制第一音频上下文以获得上下文抑制信号。 该方法还可以包括基于第一音频上下文选择第二上下文,并且将第二音频上下文与基于上下文抑制信号的信号进行混合以获得上下文增强信号。

    Video demultiplexer and decoder with efficient data recovery
    10.
    发明申请
    Video demultiplexer and decoder with efficient data recovery 审中-公开
    视频解复用器和解码器,具有高效的数据恢复功能

    公开(公告)号:US20060062312A1

    公开(公告)日:2006-03-23

    申请号:US10947981

    申请日:2004-09-22

    摘要: A video demultiplexer and video decoder include features for efficient video data recovery in the event of channel error. The demultiplexer detects a boundary between physical layer data units and adds boundary information to the bitstream produced by the demultiplexer. The demultiplexer produces adaptation layer data units, which are processed by the adaptation layer to produce an application layer bitstream. When the video decoder encounters an error in the bitstream, it uses the boundary information to limit the amount of data that must be concealed. In particular, the boundary information permits the error to be associated with a small segment of data. The video decoder conceals data from the beginning of the segment of data, rather than an entire slice or frame in which the segment resides. In this manner, the video decoder provides efficient data recovery, limiting the loss of useful data that otherwise would be purposely discarded for concealment purposes.

    摘要翻译: 视频解复用器和视频解码器包括在频道错误的情况下有效的视频数据恢复的特征。 解复用器检测物理层数据单元之间的边界,并将边界信息添加到由解复用器产生的位流。 解复用器产生适配层数据单元,其由适配层处理以产生应用层比特流。 当视频解码器在比特流中遇到错误时,它使用边界信息来限制必须隐藏的数据量。 特别地,边界信息允许错误与一小段数据相关联。 视频解码器从数据段的开头隐藏数据,而不是段所在的整个片或帧。 以这种方式,视频解码器提供有效的数据恢复,限制为了隐藏目的而有意丢弃的有用数据的丢失。