Multi-mode region-of-interest video object segmentation
    11.
    Patent application
    Status: in force

    Publication No.: US20070183661A1

    Publication date: 2007-08-09

    Application No.: US11349659

    Filing date: 2006-02-07

    IPC classification: G06K9/34 G06K9/00

    Abstract: The disclosure is directed to techniques for automatic segmentation of a region-of-interest (ROI) video object from a video sequence. ROI object segmentation enables selected ROI or “foreground” objects of a video sequence that may be of interest to a viewer to be extracted from non-ROI or “background” areas of the video sequence. Examples of a ROI object are a human face or a head and shoulder area of a human body. The disclosed techniques include a hybrid technique that combines ROI feature detection, region segmentation, and background subtraction. In this way, the disclosed techniques may provide accurate foreground object generation and low-complexity extraction of the foreground object from the video sequence. A ROI object segmentation system may implement the techniques described herein. In addition, ROI object segmentation may be useful in a wide range of multimedia applications that utilize video sequences, such as video telephony applications and video surveillance applications.


    Adaptive filtering to enhance video bit-rate control performance
    12.
    Patent application
    Status: in force

    Publication No.: US20070172211A1

    Publication date: 2007-07-26

    Application No.: US11378567

    Filing date: 2006-03-16

    IPC classification: H04N7/26

    Abstract: This disclosure describes adaptive filtering techniques to improve the quality of captured imagery, such as video or still images. In particular, this disclosure describes adaptive filtering techniques that filter each pixel as a function of a set of surrounding pixels. An adaptive image filter may compare image information associated with a pixel of interest to image information associated with a set of surrounding pixels by, for example, computing differences between the image information associated with the pixel of interest and each of the surrounding pixels of the set. The computed differences can be used in a variety of ways to filter image information of the pixel of interest. In some embodiments, for example, the adaptive image filter may include both a low pass component and a high pass component that adjust as a function of the computed differences.
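The idea of filtering a pixel as a function of its differences from surrounding pixels can be illustrated with a minimal Python sketch. It is not the method claimed in the application: the 3x3 neighbourhood, the linear weight fall-off, the `threshold` parameter, and the function name `adaptive_filter` are all assumptions made for illustration, and only a difference-driven low pass component is shown.

```python
def adaptive_filter(image, threshold=20.0):
    """Smooth each pixel using its 3x3 neighbourhood.

    Assumption (not from the source): a neighbour's weight falls off
    linearly with its absolute difference from the pixel of interest
    and reaches zero at `threshold`, so strong edges are left alone.
    """
    height, width = len(image), len(image[0])
    output = [row[:] for row in image]
    for y in range(height):
        for x in range(width):
            centre = image[y][x]
            weighted_sum = float(centre)  # the centre pixel always has weight 1
            weight_total = 1.0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    ny, nx = y + dy, x + dx
                    if dy == 0 and dx == 0:
                        continue  # centre already counted
                    if not (0 <= ny < height and 0 <= nx < width):
                        continue  # neighbour lies outside the image
                    diff = abs(image[ny][nx] - centre)
                    weight = max(0.0, 1.0 - diff / threshold)
                    weighted_sum += weight * image[ny][nx]
                    weight_total += weight
            output[y][x] = weighted_sum / weight_total
    return output
```

With this weighting, a small deviation from the neighbourhood is pulled toward its neighbours, while a step larger than `threshold` passes through unchanged because the pixels across the edge receive zero weight.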


    Video frame motion-based automatic region-of-interest detection
    13.
    Patent application
    Status: in force

    Publication No.: US20070076957A1

    Publication date: 2007-04-05

    Application No.: US11364285

    Filing date: 2006-02-28

    IPC classification: G06K9/46 G06K9/00 G06K9/34

    Abstract: The disclosure is directed to techniques for region-of-interest (ROI) video processing based on low-complexity automatic ROI detection within video frames of video sequences. The low-complexity automatic ROI detection may be based on characteristics of video sensors within video communication devices. In other cases, the low-complexity automatic ROI detection may be based on motion information for a video frame and a different video frame of the video sequence. The disclosed techniques include a video processing technique capable of tuning and enhancing video sensor calibration, camera processing, ROI detection, and ROI video processing within a video communication device based on characteristics of a specific video sensor. The disclosed techniques also include a sensor-based ROI detection technique that uses video sensor statistics and camera processing side-information to improve ROI detection accuracy. The disclosed techniques also include a motion-based ROI detection technique that uses motion information obtained during motion estimation in video processing.


    Method and apparatus for interoperability between voice transmission systems during speech inactivity
    14.
    Granted patent
    Status: in force

    Publication No.: US07061934B2

    Publication date: 2006-06-13

    Application No.: US10622661

    Filing date: 2003-07-17

    IPC classification: H04J3/16 H04J3/22

    CPC classification: G10L19/173

    Abstract: The disclosed embodiments provide a method and apparatus for interoperability between CTX and DTX communications systems during transmissions of silence or background noise [FIG. 2]. Continuous eighth rate encoded noise frames are translated to discontinuous SID frames for transmission to DTX systems (402–410). Discontinuous SID frames are translated to continuous eighth rate encoded noise frames for decoding by a CTX system (602–606). Applications of CTX to DTX interoperability comprise CDMA and GSM interoperability (narrowband voice transmission systems), CDMA next generation vocoder (the Selectable Mode Vocoder) interoperability with the new ITU-T 4 kbps vocoder operating in DTX mode for Voice over IP applications, future voice transmission systems that have a common speech encoder/decoder but operate in differing CTX or DTX modes during speech non-activity, and CDMA wideband voice transmission system interoperability with other wideband voice transmission systems with common wideband vocoders but with different modes of operation (DTX or CTX) during voice non-activity.
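The two translation directions described in this abstract can be sketched in a few lines of Python. This is a toy model rather than the patented procedure: each noise frame is reduced to a single energy value, a SID frame simply carries the mean energy of the frames it summarises, and the names `ctx_to_dtx`, `dtx_to_ctx`, `sid_interval`, and `default_energy` are hypothetical.

```python
from statistics import mean


def ctx_to_dtx(noise_frames, sid_interval=8):
    """Collapse continuous eighth-rate noise frames into sparse SID frames.

    Assumption: each frame is one noise-energy value. The last slot of
    every `sid_interval`-frame block carries a SID frame holding the
    block's mean energy; the other slots carry None (nothing is sent).
    """
    out = []
    for start in range(0, len(noise_frames), sid_interval):
        block = noise_frames[start:start + sid_interval]
        out.extend([None] * (len(block) - 1))  # silent slots
        out.append(mean(block))                # SID update closing the block
    return out


def dtx_to_ctx(dtx_frames, default_energy=0.0):
    """Expand sparse SID frames into one noise frame per slot.

    The most recent SID energy is held until the next SID arrives;
    `default_energy` is used before the first SID is seen.
    """
    out = []
    last_energy = default_energy
    for frame in dtx_frames:
        if frame is not None:
            last_energy = frame  # a new SID frame updates the noise level
        out.append(last_energy)
    return out
```

Placing the SID at the end of each block keeps the translator causal, at the cost of the receiving side using `default_energy` until the first SID arrives.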


    Tandem-free intersystem voice communication
    15.
    Granted patent
    Status: in force

    Publication No.: US08432935B2

    Publication date: 2013-04-30

    Application No.: US12181972

    Filing date: 2008-07-29

    IPC classification: H04J3/16 G10L11/06

    CPC classification: G10L19/173 H04W88/181

    Abstract: Techniques are presented herein to provide tandem-free operation between two wireless terminals through two otherwise incompatible wireless networks. Specifically, embodiments provide tandem-free operation between a wireless terminal communicating through a continuous transmission (CTX) wireless channel and a wireless terminal communicating through a discontinuous transmission (DTX) wireless channel. In a first aspect, inactive speech frames are translated between DTX and CTX formats. In a second aspect, each wireless terminal includes an active speech decoder that is compatible with the active speech encoder on the opposite end of the mobile-to-mobile connection.


    Quality metric-biased region-of-interest coding for video telephony
    16.
    Patent application
    Status: in force

    Publication No.: US20060238444A1

    Publication date: 2006-10-26

    Application No.: US11199773

    Filing date: 2005-08-09

    IPC classification: G09G3/20

    Abstract: The disclosure is directed to techniques for region-of-interest (ROI) coding for video telephony (VT). The disclosed techniques include a technique for generation of a quality metric for ROI video, which jointly considers a user's degree of interest in the ROI, ROI video fidelity, and ROI perceptual quality in evaluating the quality of an encoded video sequence. The quality metric may be used to bias ROI coding and, in particular, the allocation of coding bits between ROI and non-ROI areas of a video frame.


    Content-adaptive background skipping for region-of-interest video coding
    17.
    Patent application
    Status: in force

    Publication No.: US20060204113A1

    Publication date: 2006-09-14

    Application No.: US11200407

    Filing date: 2005-08-09

    IPC classification: G06K9/36 H04N11/02

    Abstract: The disclosure is directed to techniques for content-adaptive background skipping for region-of-interest (ROI) video coding. The techniques may be useful in video telephony (VT) applications such as video streaming and videoconferencing, and especially useful in low bit-rate wireless communication applications, such as mobile VT. The disclosed techniques analyze content information of a video frame to dynamically determine whether to skip a non-ROI area within the frame. For example, the skipping determination may be based on content activity, such as ROI shape deformation, ROI motion, non-ROI motion, non-ROI texture complexity, and accumulated distortion due to non-ROI skipping. The skip determination may operate in conjunction with either frame-level or macroblock-level bit allocation.


    Adaptive frame skipping techniques for rate controlled video encoding

    Publication No.: US20060198443A1

    Publication date: 2006-09-07

    Application No.: US11193249

    Filing date: 2005-07-29

    Abstract: The disclosure is directed to adaptive frame skipping techniques for rate controlled video encoding of a video sequence. According to the disclosed techniques, an encoder performs frame skipping in an intelligent manner that can improve video quality of the encoded sequence relative to encoding using conventional frame skipping. In particular, the disclosed frame skipping scheme is adaptive and considers motion activity of the video frames in order to identify certain frames that can be skipped without sacrificing significant video quality. The described frame skipping techniques may take into account the tradeoff between spatial and temporal quality of different video frames. In this manner, the techniques can allocate limited resources between the spatial and temporal quality in a way that can improve the visual appearance of a video sequence.

    Method and apparatus for improved detection of rate errors in variable rate receivers
    19.
    Patent application
    Status: in force

    Publication No.: US20050050407A1

    Publication date: 2005-03-03

    Application No.: US10938445

    Filing date: 2004-09-09

    CPC classification: H04L1/08 H04L1/0046 H04L1/201

    Abstract: A system and method for detection of rate determination algorithm errors in variable rate communications system receivers. The disclosed embodiments prevent rate determination algorithm errors from causing audible artifacts such as screeches or beeps. The disclosed system and method detect frames with incorrectly determined data rates and perform frame erasure processing and/or memory state clean up to prevent propagation of distortion across multiple frames. Frames with incorrectly determined data rates are detected by checking for illegal rate transitions and reserved bits, validating unused filter type bit combinations, and analyzing relationships between fixed code-book gains and linear prediction coefficient gains.
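Two of the checks named in this abstract, illegal rate transitions and reserved bits, can be sketched as follows. The sketch is illustrative only: the transition table `ILLEGAL_TRANSITIONS`, the `(rate, reserved_bits)` frame representation, and the function name `find_bad_frames` are hypothetical and are not taken from the application or from any vocoder standard.

```python
# Hypothetical table of (previous_rate, current_rate) pairs treated as illegal.
# A real receiver would derive this table from its vocoder's specification.
ILLEGAL_TRANSITIONS = {("full", "eighth")}


def find_bad_frames(frames, illegal=ILLEGAL_TRANSITIONS):
    """Return indices of frames whose determined rate looks wrong.

    Each frame is a (rate, reserved_bits) tuple. A frame is flagged when
    its rate forms an illegal transition from the last trusted frame or
    when its reserved bits are not all zero.
    """
    bad = []
    previous_rate = None
    for index, (rate, reserved_bits) in enumerate(frames):
        illegal_transition = (previous_rate, rate) in illegal
        if illegal_transition or reserved_bits != 0:
            bad.append(index)  # candidate for frame erasure processing
        else:
            previous_rate = rate  # only trusted frames update the history
    return bad
```

Flagged frames do not update `previous_rate`, so one misdetected rate does not cause the following, correctly detected frame to be flagged as well.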


    Systems, methods, and apparatus for context suppression using receivers
    20.
    Granted patent
    Status: in force

    Publication No.: US08560307B2

    Publication date: 2013-10-15

    Application No.: US12129455

    Filing date: 2008-05-29

    IPC classification: G10L21/00

    Abstract: Configurations disclosed herein include systems, methods and apparatus that may be applied in a voice communications and/or storage application to remove, enhance, and/or replace the existing context. Example embodiments may decode two sets of encoded frames from an encoded audio signal. The two frame sets may be encoded using different encoding schemes. For example, the bit rate or coding mode may differ between the two encoded frame sets. Based on information from one of the decoded sets of frames, a context component included in a signal represented by the other frame set may be suppressed. Other embodiments may generate an audio context signal within the mobile user terminal, and mix the generated audio signal with another decoded audio signal.
