Method and apparatus for interoperability between voice transmission systems during speech inactivity

    公开(公告)号:US06631139B2

    公开(公告)日:2003-10-07

    申请号:US09774440

    申请日:2001-01-31

    IPC分类号: H04L1246

    CPC分类号: G10L19/173

    摘要: The disclosed embodiments provide a method and apparatus for interoperability between CTX and DTX communications systems during transmissions of silence or background noise. Continuous eighth rate encoded noise frames are translated to discontinuous SID frames for transmission to DTX systems. Discontinuous SID frames are translated to continuous eighth rate encoded noise frames for decoding by a CTX system. Applications of CTX to DTX interoperability comprise CDMA and GSM interoperability (narrowband voice transmission systems), CDMA next generation vocoder (The Selectable Mode Vocoder) interoperability with the new ITU-T 4 kbps vocoder operating in DTX-mode for Voice Over IP applications, future voice transmission systems that have a common speech encoder/decoder but operate in differing CTX or DTX modes during speech non-activity, and CDMA wideband voice transmission system interoperability with other wideband voice transmission systems with common wideband vocoders but with different modes of operation (DTX or CTX) during voice non-activity.

    Method and apparatus for interoperability between voice transmission systems during speech inactivity
    2.
    发明授权
    Method and apparatus for interoperability between voice transmission systems during speech inactivity 有权
    在语音不活动期间语音传输系统之间的互操作性的方法和装置

    公开(公告)号:US07061934B2

    公开(公告)日:2006-06-13

    申请号:US10622661

    申请日:2003-07-17

    IPC分类号: H04J3/16 H04J3/22

    CPC分类号: G10L19/173

    摘要: The disclosed embodiments provide a method and apparatus for interoperability between CTX and DTX communications systems during transmissions of silence or background noise [FIG. 2]. Continuous eighth rate encoded noise frames are translated to discontinuous SID frames for transmission to DTX systems (402–410). Discontinuous SID frames are translated to continuous eighth rate encoded noise frames for decoding by a CTX system (602–606). Applications of CTX to DTX interoperability comprise CDMA and GSM interoperability (narrowband voice transmission systems), CDMA next generation vocoder (The Selectable Mode Vocoder) interoperability with the new ITU-T 4 kbps vocoder operating in DTX-mode for Voice Over IP applications, future voice transmission systems that have a common speech encoder/decoder but operate in differing CTX or DTX modes during speech non-activity, and CDMA wideband voice transmission system interoperability with other wideband voice transmission systems with common wideband vocoders but with different modes of operation (DTX or CTX) during voice non-activity.

    摘要翻译: 所公开的实施例提供了用于在静音或背景噪声的传输期间CTX和DTX通信系统之间的互操作性的方法和装置。 2]。 连续的第八速率编码噪声帧被转换为不连续的SID帧以传输到DTX系统(402-410)。 不连续SID帧被转换为连续的第八速率编码噪声帧,以便由CTX系统(602-606)进行解码。 CTX到DTX互操作性的应用包括CDMA和GSM互操作性(窄带语音传输系统),CDMA下一代声码器(可选模式声码器)与用于IP语音应用的DTX模式下运行的新ITU-T 4 kbps声码器的互操作性,未来 语音传输系统具有通用语音编码器/解码器,但在语音非活动期间以不同的CTX或DTX模式工作,以及CDMA宽带语音传输系统与具有普通宽带声码器但具有不同操作模式(DTX)的其他宽带语音传输系统的互操作性 或CTX)在语音非活动期间。

    Communications using wideband terminals
    3.
    发明授权
    Communications using wideband terminals 有权
    通信使用宽带终端

    公开(公告)号:US07289461B2

    公开(公告)日:2007-10-30

    申请号:US09811056

    申请日:2001-03-15

    IPC分类号: H04Q7/00 H04L12/66 G10L19/00

    CPC分类号: H04W88/181

    摘要: A call setup procedure is presented to permit vocoder bypass, which will allow the transmission of wideband speech packets between wideband terminals over narrowband transmission constraints. In addition, methods and apparatus are presented that allow the conversion between a wideband tandem-free operation, a narrowband tandem-free operation, and a standard tandem operation.

    摘要翻译: 提出了呼叫建立过程以允许声码器旁路,这将允许通过窄带传输约束在宽带终端之间传输宽带语音分组。 另外,提出了允许在宽带无串联操作,窄带无串联操作和标准串联操作之间进行转换的方法和装置。

    Systems, methods and apparatus for context descriptor transmission
    4.
    发明授权
    Systems, methods and apparatus for context descriptor transmission 有权
    用于上下文描述符传输的系统,方法和装置

    公开(公告)号:US08600740B2

    公开(公告)日:2013-12-03

    申请号:US12129525

    申请日:2008-05-29

    IPC分类号: G10L21/00 G10L21/02

    摘要: Configurations disclosed herein include systems, methods and apparatus that may be applied in a voice communications and/or storage application to remove, enhance, and/or replace the existing context. Example embodiments may first remove any existing context from a digital audio signal to obtain a context suppressed signal. The context suppressed signal may then be encoded. An audio context may be selected from among a plurality of audio contexts, with the selected audio context inserted into a signal based on the encoded context suppressed signal.

    摘要翻译: 本文公开的配置包括可以应用于语音通信和/或存储应用中以去除,增强和/或替换现有上下文的系统,方法和装置。 示例性实施例可以首先从数字音频信号中去除任何现有上下文以获得上下文抑制信号。 然后可以编码上下文抑制信号。 可以从多个音频上下文中选择音频上下文,其中所选择的音频上下文基于编码的上下文抑制信号插入到信号中。

    Systems, methods, and apparatus for context processing using multi resolution analysis
    5.
    发明授权
    Systems, methods, and apparatus for context processing using multi resolution analysis 失效
    使用多分辨率分析的上下文处理的系统,方法和装置

    公开(公告)号:US08554550B2

    公开(公告)日:2013-10-08

    申请号:US12129466

    申请日:2008-05-29

    IPC分类号: G10L21/02 G10L19/00 G10L11/00

    摘要: Configurations disclosed herein include systems, methods, and apparatus that may be applied in a voice communications and/or storage application to remove, enhance, and/or replace the existing context. Particularly, certain embodiments contemplate suppressing the context component from the digital audio signal to obtain a context-suppressed signal; generating an audio context signal that is based on a first filter and a first plurality of sequences, each of the first plurality of sequences having a different time resolution and mixing a first signal that is based on the generated audio context signal with a second signal that is based on the context-suppressed signal to obtain a context-enhanced signal, wherein generating an audio context signal includes applying the first filter to each of the first plurality of sequences.

    摘要翻译: 本文公开的配置包括可以应用于语音通信和/或存储应用中以去除,增强和/或替代现有上下文的系统,方法和装置。 具体地,某些实施例考虑从数字音频信号抑制上下文成分以获得上下文抑制信号; 生成基于第一滤波器和第一多个序列的音频上下文信号,所述第一多个序列中的每一个具有不同的时间分辨率,并且将基于所生成的音频上下文信号的第一信号与第二信号混合,所述第二信号 基于所述上下文抑制信号以获得上下文增强信号,其中生成音频上下文信号包括将所述第一滤波器应用于所述第一多个序列中的每一个。

    Systems, methods, and apparatus for context processing using multiple microphones
    6.
    发明授权
    Systems, methods, and apparatus for context processing using multiple microphones 有权
    使用多个麦克风进行上下文处理的系统,方法和装置

    公开(公告)号:US08483854B2

    公开(公告)日:2013-07-09

    申请号:US12129421

    申请日:2008-05-29

    IPC分类号: G06F17/00

    摘要: Configurations disclosed herein include systems, methods, and apparatus that may be applied in a voice communications and/or storage application to remove, enhance, and/or replace the existing context. In one aspect, a method of processing a digital audio signal that includes a first audio context is disclosed. The method comprises based on a first audio signal that is produced by a first microphone, suppressing the first audio context from the digital audio signal to obtain a context-suppressed signal. The method may further comprise selecting a second context based on the first audio context, and mixing the second audio context with a signal that is based on the context-suppressed signal to obtain a context-enhanced signal.

    摘要翻译: 本文公开的配置包括可以应用于语音通信和/或存储应用中以去除,增强和/或替代现有上下文的系统,方法和装置。 一方面,公开了一种处理包括第一音频环境的数字音频信号的方法。 该方法包括基于由第一麦克风产生的第一音频信号,从数字音频信号抑制第一音频上下文以获得上下文抑制信号。 该方法还可以包括基于第一音频上下文选择第二上下文,并且将第二音频上下文与基于上下文抑制信号的信号进行混合以获得上下文增强信号。

    Video demultiplexer and decoder with efficient data recovery
    7.
    发明申请
    Video demultiplexer and decoder with efficient data recovery 审中-公开
    视频解复用器和解码器,具有高效的数据恢复功能

    公开(公告)号:US20060062312A1

    公开(公告)日:2006-03-23

    申请号:US10947981

    申请日:2004-09-22

    摘要: A video demultiplexer and video decoder include features for efficient video data recovery in the event of channel error. The demultiplexer detects a boundary between physical layer data units and adds boundary information to the bitstream produced by the demultiplexer. The demultiplexer produces adaptation layer data units, which are processed by the adaptation layer to produce an application layer bitstream. When the video decoder encounters an error in the bitstream, it uses the boundary information to limit the amount of data that must be concealed. In particular, the boundary information permits the error to be associated with a small segment of data. The video decoder conceals data from the beginning of the segment of data, rather than an entire slice or frame in which the segment resides. In this manner, the video decoder provides efficient data recovery, limiting the loss of useful data that otherwise would be purposely discarded for concealment purposes.

    摘要翻译: 视频解复用器和视频解码器包括在频道错误的情况下有效的视频数据恢复的特征。 解复用器检测物理层数据单元之间的边界,并将边界信息添加到由解复用器产生的位流。 解复用器产生适配层数据单元,其由适配层处理以产生应用层比特流。 当视频解码器在比特流中遇到错误时,它使用边界信息来限制必须隐藏的数据量。 特别地,边界信息允许错误与一小段数据相关联。 视频解码器从数据段的开头隐藏数据,而不是段所在的整个片或帧。 以这种方式,视频解码器提供有效的数据恢复,限制为了隐藏目的而有意丢弃的有用数据的丢失。

    Multi-mode region-of-interest video object segmentation
    8.
    发明申请
    Multi-mode region-of-interest video object segmentation 有权
    多模式感兴趣区域视频对象分割

    公开(公告)号:US20070183661A1

    公开(公告)日:2007-08-09

    申请号:US11349659

    申请日:2006-02-07

    IPC分类号: G06K9/34 G06K9/00

    摘要: The disclosure is directed to techniques for automatic segmentation of a region-of-interest (ROI) video object from a video sequence. ROI object segmentation enables selected ROI or “foreground” objects of a video sequence that may be of interest to a viewer to be extracted from non-ROI or “background” areas of the video sequence. Examples of a ROI object are a human face or a head and shoulder area of a human body. The disclosed techniques include a hybrid technique that combines ROI feature detection, region segmentation, and background subtraction. In this way, the disclosed techniques may provide accurate foreground object generation and low-complexity extraction of the foreground object from the video sequence. A ROI object segmentation system may implement the techniques described herein. In addition, ROI object segmentation may be useful in a wide range of multimedia applications that utilize video sequences, such as video telephony applications and video surveillance applications.

    摘要翻译: 本公开涉及从视频序列自动分割感兴趣区域(ROI)视频对象的技术。 ROI对象分割使得可以从视频序列的非ROI或“背景”区域中提取观看者感兴趣的视频序列的所选ROI或“前景”对象。 ROI对象的示例是人体的人脸或头肩部区域。 所公开的技术包括组合ROI特征检测,区域分割和背景减除的混合技术。 以这种方式,所公开的技术可以从视频序列提供前景对象生成和前景对象的低复杂度提取的准确性。 ROI对象分割系统可以实现本文描述的技术。 此外,ROI对象分割可能在使用诸如视频电话应用和视频监控应用之类的视频序列的多种多媒体应用中是有用的。

    Adaptive filtering to enhance video bit-rate control performance
    9.
    发明申请
    Adaptive filtering to enhance video bit-rate control performance 有权
    自适应滤波以增强视频比特率控制性能

    公开(公告)号:US20070172211A1

    公开(公告)日:2007-07-26

    申请号:US11378567

    申请日:2006-03-16

    IPC分类号: H04N7/26

    摘要: This disclosure describes adaptive filtering techniques to improve the quality of captured imagery, such as video or still images. In particular, this disclosure describes adaptive filtering techniques that filter each pixel as a function of a set of surrounding pixels. An adaptive image filter may compare image information associated with a pixel of interest to image information associated with a set of surrounding pixels by, for example, computing differences between the image formation associated with the pixel of interest and each of the surrounding pixels of the set. The computed differences can be used in a variety of ways to filter image information of the pixel of interest. In some embodiments, for example, the adaptive image filter may include both a low pass component and high pass component that adjust as a function of the computed differences.

    摘要翻译: 本公开描述了自适应滤波技术,以改善所捕获图像的质量,例如视频或静止图像。 特别地,本公开描述了根据一组周围像素来滤除每个像素的自适应滤波技术。 自适应图像滤波器可以通过例如计算与感兴趣像素相关联的图像形成与该组的每个周围像素之间的差异来比较与感兴趣像素相关联的图像信息与与一组周围像素相关联的图像信息 。 计算的差异可以以各种方式用于过滤感兴趣像素的图像信息。 在一些实施例中,例如,自适应图像滤波器可以包括作为所计算的差异的函数调整的低通分量和高通分量两者。

    Video frame motion-based automatic region-of-interest detection
    10.
    发明申请
    Video frame motion-based automatic region-of-interest detection 有权
    基于视频帧运动的自动感兴趣区域检测

    公开(公告)号:US20070076957A1

    公开(公告)日:2007-04-05

    申请号:US11364285

    申请日:2006-02-28

    IPC分类号: G06K9/46 G06K9/00 G06K9/34

    摘要: The disclosure is directed to techniques for region-of-interest (ROI) video processing based on low-complexity automatic ROI detection within video frames of video sequences. The low-complexity automatic ROI detection may be based on characteristics of video sensors within video communication devices. In other cases, the low-complexity automatic ROI detection may be based on motion information for a video frame and a different video frame of the video sequence. The disclosed techniques include a video processing technique capable of tuning and enhancing video sensor calibration, camera processing, ROI detection, and ROI video processing within a video communication device based on characteristics of a specific video sensor. The disclosed techniques also include a sensor-based ROI detection technique that uses video sensor statistics and camera processing side-information to improve ROI detection accuracy. The disclosed techniques also include a motion-based ROI detection technique that uses motion information obtained during motion estimation in video processing.

    摘要翻译: 本公开涉及基于视频序列的视频帧内的低复杂度自动ROI检测的感兴趣区域(ROI)视频处理技术。 低复杂度的自动ROI检测可以基于视频通信设备内的视频传感器的特性。 在其他情况下,低复杂度自动ROI检测可以基于视频帧的运动信息和视频序列的不同视频帧。 所公开的技术包括基于特定视频传感器的特性,能够在视频通信设备内调整和增强视频传感器校准,相机处理,ROI检测和ROI视频处理的视频处理技术。 所公开的技术还包括基于传感器的ROI检测技术,其使用视频传感器统计和相机处理侧信息来提高ROI检测精度。 所公开的技术还包括基于运动的ROI检测技术,其使用在视频处理中的运动估计期间获得的运动信息。