摘要:
Described techniques and tools include techniques and tools for mapping digital media data (e.g., audio, video, still images, and/or text, among others) in a given format to a transport or file container format useful for encoding the data on optical disks such as digital video disks (DVDs). A digital media universal elementary stream can be used to map digital media streams (e.g., an audio stream, video stream or an image) into any arbitrary transport or file container, including optical disk formats, and other transports, such as broadcast streams, wireless transmissions, etc. The information to decode any given frame of the digital media in the stream can be carried in each coded frame. A digital media universal elementary stream includes stream components called chunks. An implementation of a digital media universal elementary stream arranges data for a media stream in frames, the frames having one or more chunks.
摘要:
Described techniques and tools include techniques and tools for mapping digital media data (e.g., audio, video, still images, and/or text, among others) in a given format to a transport or file container format useful for encoding the data on optical disks such as digital video disks (DVDs). A digital media universal elementary stream can be used to map digital media streams (e.g., an audio stream, video stream or an image) into any arbitrary transport or file container, including optical disk formats, and other transports, such as broadcast streams, wireless transmissions, etc. The information to decode any given frame of the digital media in the stream can be carried in each coded frame. A digital media universal elementary stream includes stream components called chunks. An implementation of a digital media universal elementary stream arranges data for a media stream in frames, the frames having one or more chunks.
摘要:
Described techniques and tools include techniques and tools for mapping digital media data (e.g., audio, video, still images, and/or text, among others) in a given format to a transport or file container format useful for encoding the data on optical disks such as digital video disks (DVDs). A digital media universal elementary stream can be used to map digital media streams (e.g., an audio stream, video stream or an image) into any arbitrary transport or file container, including optical disk formats, and other transports, such as broadcast streams, wireless transmissions, etc. The information to decode any given frame of the digital media in the stream can be carried in each coded frame. A digital media universal elementary stream includes stream components called chunks. An implementation of a digital media universal elementary stream arranges data for a media stream in frames, the frames having one or more chunks.
摘要:
A compressed digital audio signal is transmitted from an audio source along a connection wire to an audio receiver. The digital audio signal can encode digital audio data having different sampling frequencies, frames sizes, and other information. The audio receiver that receives the digital audio signal can decode and convert the compressed digital audio signal into multiple synchronized analog signals, which are used to drive multiple speakers. The audio receiver may also synchronize the audio data with associated video data so that the audio playback and video playback are “in sync”, despite delay introduced by the audio signal decoding at the audio receiver.
摘要:
A compressed digital audio signal is transmitted from an audio source along a connection wire to an audio receiver. The digital audio signal can encode digital audio data having different sampling frequencies, frames sizes, and other information. The audio receiver that receives the digital audio signal can decode and convert the compressed digital audio signal into multiple synchronized analog signals, which are used to drive multiple speakers. The audio receiver may also synchronize the audio data with associated video data so that the audio playback and video playback are “in sync”, despite delay introduced by the audio signal decoding at the audio receiver.
摘要:
Speech quality estimation technique embodiments are described which generally involve estimating the human speech quality of an audio frame in a single-channel audio signal. A representation of a harmonic component of the frame is synthesized and used to compute a non-harmonic component of the frame. The synthesized harmonic component representation and the non-harmonic component are then used to compute a harmonic to non-harmonic ratio (HnHR). This HnHR is indicative of the quality of a user's speech and is designated as an estimate of the speech quality of the frame. In one implementation, the HnHR is used to establish a minimum speech quality threshold below which the quality of the user's speech is considered unacceptable. Feedback to the user is then provided based on whether the HnHR falls below the threshold.
摘要:
A communication end device of a two-way communication system is shown. The device includes an audio signal capture device for capturing local audio to be transmitted to another end device, an audio signal rendering device for playing remote audio received from the other end device, and buffers for buffering the captured and rendered audio signals. The device also includes an audio echo canceller operating to predict echo from the rendered audio signal at a calculated relative offset in the captured audio signal based on an adaptive filter, and subtract the predicted echo from the signal transmitted to the other end device The calculated relative offset that is used by the audio echo canceller for a current signal sample is adjusted if a difference between it and an adjusted relative offset of a preceding sample exceeds a threshold value.
摘要:
A spatial element is added to communications, including over telephone conference calls heard through headphones or a stereo speaker setup. Functions are created to modify signals from different callers to create the illusion that the callers are speaking from different parts of the room.
摘要:
An audio encoder performs adaptive entropy encoding of audio data. For example, an audio encoder switches between variable dimension vector Huffman coding of direct levels of quantized audio data and run-level coding of run lengths and levels of quantized audio data. The encoder can use, for example, context-based arithmetic coding for coding run lengths and levels. The encoder can determine when to switch between coding modes by counting consecutive coefficients having a predominant value (e.g., zero). An audio decoder performs corresponding adaptive entropy decoding.
摘要:
A transformation method provides a multi-dimensional affine transformation for representing motion between corresponding image components of successive video image frames. The multi-dimensional affine transformations can represent complex motion that includes any or all of translation, rotation, magnification, and shear. The transformation method of this invention includes determining motion transformations between corresponding pixels in the image components of the first and second video image frames. From the motion transformations between corresponding pixels, multi-dimensional affine motion transformations between the corresponding image components are determined. This transformation method increases the accuracy with which complex motion is represented and results in fewer compression or encoding errors in comparison to conventional methods, thereby increasing compression efficiency.