Video demultiplexer and decoder with efficient data recovery
    1.
    发明申请
    Video demultiplexer and decoder with efficient data recovery 审中-公开
    视频解复用器和解码器,具有高效的数据恢复功能

    公开(公告)号:US20060062312A1

    公开(公告)日:2006-03-23

    申请号:US10947981

    申请日:2004-09-22

    摘要: A video demultiplexer and video decoder include features for efficient video data recovery in the event of channel error. The demultiplexer detects a boundary between physical layer data units and adds boundary information to the bitstream produced by the demultiplexer. The demultiplexer produces adaptation layer data units, which are processed by the adaptation layer to produce an application layer bitstream. When the video decoder encounters an error in the bitstream, it uses the boundary information to limit the amount of data that must be concealed. In particular, the boundary information permits the error to be associated with a small segment of data. The video decoder conceals data from the beginning of the segment of data, rather than an entire slice or frame in which the segment resides. In this manner, the video decoder provides efficient data recovery, limiting the loss of useful data that otherwise would be purposely discarded for concealment purposes.

    摘要翻译: 视频解复用器和视频解码器包括在频道错误的情况下有效的视频数据恢复的特征。 解复用器检测物理层数据单元之间的边界,并将边界信息添加到由解复用器产生的位流。 解复用器产生适配层数据单元,其由适配层处理以产生应用层比特流。 当视频解码器在比特流中遇到错误时,它使用边界信息来限制必须隐藏的数据量。 特别地,边界信息允许错误与一小段数据相关联。 视频解码器从数据段的开头隐藏数据,而不是段所在的整个片或帧。 以这种方式,视频解码器提供有效的数据恢复,限制为了隐藏目的而有意丢弃的有用数据的丢失。

    Efficient video slicing
    2.
    发明申请
    Efficient video slicing 有权
    高效的视频切片

    公开(公告)号:US20060039483A1

    公开(公告)日:2006-02-23

    申请号:US11018076

    申请日:2004-12-20

    摘要: The disclosure is directed to a video slicing technique that promotes low complexity, bandwidth efficiency and error resiliency. A video encoder places an RM close to the beginning of each logical transmission unit (LTU) so that all but a very small end segment of each video slice fits substantially within an LTU. Instead of requiring placement of RMs exactly at the LTU boundaries, a video encoder applies an approximate alignment technique. Video slices are encoded so that RMs are placed close to the beginning of each LTU, e.g., at the end of the first MB falling within the LTU. A portion of the last MB from the preceding slice carries over into the next LTU. Loss of an LTU results in loss of virtually the entire current slice plus a very small portion of the previous slice.

    摘要翻译: 本公开涉及一种促进低复杂度,带宽效率和错误弹性的视频切片技术。 视频编码器使RM靠近每个逻辑传输单元(LTU)的开始,使得每个视频片段的除了非常小的端段之外的所有基本上都在LTU内。 视频编码器不需要将RM完全放置在LTU边界,而是应用近似对准技术。 视频片段被编码,使得RM被放置在每个LTU的开始附近,例如在第一MB的结尾落在LTU内。 来自前一个片段的最后一个MB的一部分进入下一个LTU。 LTU的丢失导致几乎整个当前切片的损失加上前一切片的非常小的部分。

    Region-of-interest extraction for video telephony

    公开(公告)号:US20060215752A1

    公开(公告)日:2006-09-28

    申请号:US11182432

    申请日:2005-07-15

    IPC分类号: H04N7/12

    摘要: The disclosure is directed to techniques for region-of-interest (ROI) processing for video telephony (VT) applications. According to the disclosed techniques, a recipient device defines ROI information for video information transmitted by a sender device, i.e., far-end video information. The recipient device transmits the ROI information to the sender device. Using the ROI information transmitted by the recipient device, the sender device applies preferential encoding to an ROI within a video scene. ROI extraction may be applied to process a user description of a region of interest (ROI) to generate information specifying the ROI based on the description. The user description may be textual, graphical, or speech-based. An extraction module applies appropriate processing to generated the ROI information from the user description. The extraction module may locally reside with a video communication device, or reside in a distinct intermediate server configured for ROI extraction.

    Video source rate control for video telephony
    4.
    发明申请
    Video source rate control for video telephony 有权
    视频电话的视频源速率控制

    公开(公告)号:US20070097257A1

    公开(公告)日:2007-05-03

    申请号:US11314428

    申请日:2005-12-20

    IPC分类号: H04B1/66 H04N11/02

    摘要: The disclosure relates to techniques for video source rate control for video telephony (VT) applications. The source video encoding rate may controlled using a dual-buffer based estimation of a frame budget that defines a number of encoding bits available for a frame of the video. The dual-buffer based estimation technique may track the fullness of a physical video buffer and the fullness of the virtual video buffer. The source video encoding rate is then controlled based on the resulting frame budget. The contents of the virtual buffer depend on constraints imposed by a target encoding rate, while the contents of the physical buffer depend on constraints imposed by varying channel conditions. Consideration of physical video buffer fullness permits the video source rate control technique to be channel-adaptive. Consideration of virtual video buffer fullness permits the video source rate control technique to avoid encoding excessive video that could overwhelm the channel.

    摘要翻译: 本公开涉及用于视频电话(VT)应用的视频源速率控制的技术。 可以使用定义可用于视频帧的编码比特数的帧预算的基于双缓冲器的估计来控制源视频编码率。 基于双缓冲器的估计技术可以跟踪物理视频缓冲器的丰满度和虚拟视频缓冲器的丰满度。 然后基于所得到的帧预算来控制源视频编码速率。 虚拟缓冲器的内容取决于由目标编码速率施加的约束,而物理缓冲器的内容取决于由变化的信道条件施加的约束。 考虑物理视频缓冲器的丰满度使得视频源速率控制技术成为频道自适应的。 考虑虚拟视频缓冲区丰满度允许视频源速率控制技术避免编码可能压倒频道的过多视频。

    Picture-in-picture processing for video telephony
    5.
    发明申请
    Picture-in-picture processing for video telephony 有权
    视频电话的画中画处理

    公开(公告)号:US20070040898A1

    公开(公告)日:2007-02-22

    申请号:US11208176

    申请日:2005-08-19

    IPC分类号: H04N7/14

    CPC分类号: H04N7/147 H04N2007/145

    摘要: The disclosure is directed to techniques for picture-in-picture (PIP) processing for video telephony (VT). According to the disclosed techniques, a local video communication device transmits PIP information to a remote video communication device. Using the PIP information, the remote video communication device applies preferential encoding to non-PIP regions of video transmitted to the local video communication device.

    摘要翻译: 本公开涉及用于视频电话(VT)的画中画(PIP)处理技术。 根据所公开的技术,本地视频通信设备将PIP信息发送到远程视频通信设备。 使用PIP信息,远程视频通信设备对发送到本地视频通信设备的视频的非PIP区域应用优先编码。

    Region-of-interest processing for video telephony
    6.
    发明申请
    Region-of-interest processing for video telephony 有权
    视频电话领域的兴趣处理

    公开(公告)号:US20060215753A1

    公开(公告)日:2006-09-28

    申请号:US11183072

    申请日:2005-07-15

    IPC分类号: H04N7/12

    摘要: The disclosure is directed to techniques for region-of-interest (ROI) processing for video telephone (VT) applications. According to the disclosed techniques, a recipient device defines ROI information for video information transmitted by a sender device, i.e., far-end video information. The recipient device transmits the ROI information to the sender device. Using the ROI information transmitted by the recipient device, the sender device applies preferential encoding to an ROI within a video scene. In this manner, the recipient device is able to remotely control ROI encoding of far-end video information by the sender device.

    摘要翻译: 本公开涉及用于视频电话(VT)应用的感兴趣区域(ROI)处理的技术。 根据所公开的技术,接收者设备定义由发送者设备发送的视频信息的ROI信息,即远端视频信息。 收件人设备将ROI信息发送给发送者设备。 使用由收件人设备发送的ROI信息,发送者设备对视频场景内的ROI应用优先编码。 以这种方式,接收方设备能够远程地控制发送者设备对远端视频信息的ROI编码。

    Encoder-assisted adaptive video frame interpolation
    7.
    发明申请
    Encoder-assisted adaptive video frame interpolation 有权
    编码器辅助自适应视频帧插值

    公开(公告)号:US20060039471A1

    公开(公告)日:2006-02-23

    申请号:US11112531

    申请日:2005-04-22

    摘要: The disclosure is directed to techniques for encoder-assisted adaptive interpolation of video frames. According to the disclosed techniques, an encoder generates information to assist a decoder in interpolation of a skipped video frame, i.e., an S frame. The information permits the decoder to reduce visual artifacts in the interpolated frame and thereby achieve improved visual quality. The information may include interpolation equation labels that identify selected interpolation equations to be used by the decoder for individual video blocks. As an option, to conserve bandwidth, the equation labels may be transmitted for only selected video blocks that meet a criterion for encoder-assisted interpolation. Other video blocks without equation labels may be interpolated according to a default interpolation technique.

    摘要翻译: 本公开涉及视频帧的编码器辅助自适应插值技术。 根据所公开的技术,编码器生成信息以帮助解码器插入跳过的视频帧,即S帧。 该信息允许解码器减少内插帧中的视觉伪影,从而实现改善的视觉质量。 该信息可以包括识别用于各个视频块的由解码器使用的所选内插方程的内插方程标签。 作为选择,为了节省带宽,可以仅针对满足编码器辅助插值的准则的所选择的视频块来发送等式标签。 没有等式标签的其他视频块可以根据默认内插技术进行内插。

    Systems, methods and apparatus for context descriptor transmission
    8.
    发明授权
    Systems, methods and apparatus for context descriptor transmission 有权
    用于上下文描述符传输的系统,方法和装置

    公开(公告)号:US08600740B2

    公开(公告)日:2013-12-03

    申请号:US12129525

    申请日:2008-05-29

    IPC分类号: G10L21/00 G10L21/02

    摘要: Configurations disclosed herein include systems, methods and apparatus that may be applied in a voice communications and/or storage application to remove, enhance, and/or replace the existing context. Example embodiments may first remove any existing context from a digital audio signal to obtain a context suppressed signal. The context suppressed signal may then be encoded. An audio context may be selected from among a plurality of audio contexts, with the selected audio context inserted into a signal based on the encoded context suppressed signal.

    摘要翻译: 本文公开的配置包括可以应用于语音通信和/或存储应用中以去除,增强和/或替换现有上下文的系统,方法和装置。 示例性实施例可以首先从数字音频信号中去除任何现有上下文以获得上下文抑制信号。 然后可以编码上下文抑制信号。 可以从多个音频上下文中选择音频上下文,其中所选择的音频上下文基于编码的上下文抑制信号插入到信号中。

    Systems, methods, and apparatus for context processing using multi resolution analysis
    9.
    发明授权
    Systems, methods, and apparatus for context processing using multi resolution analysis 失效
    使用多分辨率分析的上下文处理的系统,方法和装置

    公开(公告)号:US08554550B2

    公开(公告)日:2013-10-08

    申请号:US12129466

    申请日:2008-05-29

    IPC分类号: G10L21/02 G10L19/00 G10L11/00

    摘要: Configurations disclosed herein include systems, methods, and apparatus that may be applied in a voice communications and/or storage application to remove, enhance, and/or replace the existing context. Particularly, certain embodiments contemplate suppressing the context component from the digital audio signal to obtain a context-suppressed signal; generating an audio context signal that is based on a first filter and a first plurality of sequences, each of the first plurality of sequences having a different time resolution and mixing a first signal that is based on the generated audio context signal with a second signal that is based on the context-suppressed signal to obtain a context-enhanced signal, wherein generating an audio context signal includes applying the first filter to each of the first plurality of sequences.

    摘要翻译: 本文公开的配置包括可以应用于语音通信和/或存储应用中以去除,增强和/或替代现有上下文的系统,方法和装置。 具体地,某些实施例考虑从数字音频信号抑制上下文成分以获得上下文抑制信号; 生成基于第一滤波器和第一多个序列的音频上下文信号,所述第一多个序列中的每一个具有不同的时间分辨率,并且将基于所生成的音频上下文信号的第一信号与第二信号混合,所述第二信号 基于所述上下文抑制信号以获得上下文增强信号,其中生成音频上下文信号包括将所述第一滤波器应用于所述第一多个序列中的每一个。

    Systems, methods, and apparatus for context processing using multiple microphones
    10.
    发明授权
    Systems, methods, and apparatus for context processing using multiple microphones 有权
    使用多个麦克风进行上下文处理的系统,方法和装置

    公开(公告)号:US08483854B2

    公开(公告)日:2013-07-09

    申请号:US12129421

    申请日:2008-05-29

    IPC分类号: G06F17/00

    摘要: Configurations disclosed herein include systems, methods, and apparatus that may be applied in a voice communications and/or storage application to remove, enhance, and/or replace the existing context. In one aspect, a method of processing a digital audio signal that includes a first audio context is disclosed. The method comprises based on a first audio signal that is produced by a first microphone, suppressing the first audio context from the digital audio signal to obtain a context-suppressed signal. The method may further comprise selecting a second context based on the first audio context, and mixing the second audio context with a signal that is based on the context-suppressed signal to obtain a context-enhanced signal.

    摘要翻译: 本文公开的配置包括可以应用于语音通信和/或存储应用中以去除,增强和/或替代现有上下文的系统,方法和装置。 一方面,公开了一种处理包括第一音频环境的数字音频信号的方法。 该方法包括基于由第一麦克风产生的第一音频信号,从数字音频信号抑制第一音频上下文以获得上下文抑制信号。 该方法还可以包括基于第一音频上下文选择第二上下文,并且将第二音频上下文与基于上下文抑制信号的信号进行混合以获得上下文增强信号。