摘要:
The disclosed embodiments provide a method and apparatus for interoperability between CTX and DTX communications systems during transmissions of silence or background noise. Continuous eighth rate encoded noise frames are translated to discontinuous SID frames for transmission to DTX systems. Discontinuous SID frames are translated to continuous eighth rate encoded noise frames for decoding by a CTX system. Applications of CTX to DTX interoperability comprise CDMA and GSM interoperability (narrowband voice transmission systems), CDMA next generation vocoder (The Selectable Mode Vocoder) interoperability with the new ITU-T 4 kbps vocoder operating in DTX-mode for Voice Over IP applications, future voice transmission systems that have a common speech encoder/decoder but operate in differing CTX or DTX modes during speech non-activity, and CDMA wideband voice transmission system interoperability with other wideband voice transmission systems with common wideband vocoders but with different modes of operation (DTX or CTX) during voice non-activity.
摘要:
The disclosed embodiments provide a method and apparatus for interoperability between CTX and DTX communications systems during transmissions of silence or background noise [FIG. 2]. Continuous eighth rate encoded noise frames are translated to discontinuous SID frames for transmission to DTX systems (402–410). Discontinuous SID frames are translated to continuous eighth rate encoded noise frames for decoding by a CTX system (602–606). Applications of CTX to DTX interoperability comprise CDMA and GSM interoperability (narrowband voice transmission systems), CDMA next generation vocoder (The Selectable Mode Vocoder) interoperability with the new ITU-T 4 kbps vocoder operating in DTX-mode for Voice Over IP applications, future voice transmission systems that have a common speech encoder/decoder but operate in differing CTX or DTX modes during speech non-activity, and CDMA wideband voice transmission system interoperability with other wideband voice transmission systems with common wideband vocoders but with different modes of operation (DTX or CTX) during voice non-activity.
摘要:
A call setup procedure is presented to permit vocoder bypass, which will allow the transmission of wideband speech packets between wideband terminals over narrowband transmission constraints. In addition, methods and apparatus are presented that allow the conversion between a wideband tandem-free operation, a narrowband tandem-free operation, and a standard tandem operation.
摘要:
Configurations disclosed herein include systems, methods and apparatus that may be applied in a voice communications and/or storage application to remove, enhance, and/or replace the existing context. Example embodiments may first remove any existing context from a digital audio signal to obtain a context suppressed signal. The context suppressed signal may then be encoded. An audio context may be selected from among a plurality of audio contexts, with the selected audio context inserted into a signal based on the encoded context suppressed signal.
摘要:
Configurations disclosed herein include systems, methods, and apparatus that may be applied in a voice communications and/or storage application to remove, enhance, and/or replace the existing context. Particularly, certain embodiments contemplate suppressing the context component from the digital audio signal to obtain a context-suppressed signal; generating an audio context signal that is based on a first filter and a first plurality of sequences, each of the first plurality of sequences having a different time resolution and mixing a first signal that is based on the generated audio context signal with a second signal that is based on the context-suppressed signal to obtain a context-enhanced signal, wherein generating an audio context signal includes applying the first filter to each of the first plurality of sequences.
摘要:
Configurations disclosed herein include systems, methods, and apparatus that may be applied in a voice communications and/or storage application to remove, enhance, and/or replace the existing context. In one aspect, a method of processing a digital audio signal that includes a first audio context is disclosed. The method comprises based on a first audio signal that is produced by a first microphone, suppressing the first audio context from the digital audio signal to obtain a context-suppressed signal. The method may further comprise selecting a second context based on the first audio context, and mixing the second audio context with a signal that is based on the context-suppressed signal to obtain a context-enhanced signal.
摘要:
A video demultiplexer and video decoder include features for efficient video data recovery in the event of channel error. The demultiplexer detects a boundary between physical layer data units and adds boundary information to the bitstream produced by the demultiplexer. The demultiplexer produces adaptation layer data units, which are processed by the adaptation layer to produce an application layer bitstream. When the video decoder encounters an error in the bitstream, it uses the boundary information to limit the amount of data that must be concealed. In particular, the boundary information permits the error to be associated with a small segment of data. The video decoder conceals data from the beginning of the segment of data, rather than an entire slice or frame in which the segment resides. In this manner, the video decoder provides efficient data recovery, limiting the loss of useful data that otherwise would be purposely discarded for concealment purposes.
摘要:
The disclosure is directed to techniques for automatic segmentation of a region-of-interest (ROI) video object from a video sequence. ROI object segmentation enables selected ROI or “foreground” objects of a video sequence that may be of interest to a viewer to be extracted from non-ROI or “background” areas of the video sequence. Examples of a ROI object are a human face or a head and shoulder area of a human body. The disclosed techniques include a hybrid technique that combines ROI feature detection, region segmentation, and background subtraction. In this way, the disclosed techniques may provide accurate foreground object generation and low-complexity extraction of the foreground object from the video sequence. A ROI object segmentation system may implement the techniques described herein. In addition, ROI object segmentation may be useful in a wide range of multimedia applications that utilize video sequences, such as video telephony applications and video surveillance applications.
摘要:
This disclosure describes adaptive filtering techniques to improve the quality of captured imagery, such as video or still images. In particular, this disclosure describes adaptive filtering techniques that filter each pixel as a function of a set of surrounding pixels. An adaptive image filter may compare image information associated with a pixel of interest to image information associated with a set of surrounding pixels by, for example, computing differences between the image formation associated with the pixel of interest and each of the surrounding pixels of the set. The computed differences can be used in a variety of ways to filter image information of the pixel of interest. In some embodiments, for example, the adaptive image filter may include both a low pass component and high pass component that adjust as a function of the computed differences.
摘要:
The disclosure is directed to techniques for region-of-interest (ROI) video processing based on low-complexity automatic ROI detection within video frames of video sequences. The low-complexity automatic ROI detection may be based on characteristics of video sensors within video communication devices. In other cases, the low-complexity automatic ROI detection may be based on motion information for a video frame and a different video frame of the video sequence. The disclosed techniques include a video processing technique capable of tuning and enhancing video sensor calibration, camera processing, ROI detection, and ROI video processing within a video communication device based on characteristics of a specific video sensor. The disclosed techniques also include a sensor-based ROI detection technique that uses video sensor statistics and camera processing side-information to improve ROI detection accuracy. The disclosed techniques also include a motion-based ROI detection technique that uses motion information obtained during motion estimation in video processing.