摘要:
In a communications system, multi-rate coding in accordance with the invention is implemented to generate multiple representations of an audio signal at different rates. These representations contain equivalent and/or various amounts of audio information. In an illustrative embodiment, at least one of the representations is a core representation containing core audio information. The remaining representations are enhancement representations containing enhancement audio information. The core representation is necessary for recovering the audio signal with minimal acceptable quality. Such quality is enhanced when the core representation, together with one or more of the enhancement representations, is used to recover the audio signal.
摘要:
An apparatus for providing at least first and second representations of an audio signal for use in a communications system is described. The apparatus comprises a first quantizer for quantizing at least a portion of the signal in accordance with a first multidimensional lattice to generate a first representation. The apparatus further comprises a second quantizer for quantizing at least a portion of the signal in accordance with a second, different multidimensional lattice to generate a second representation. In an illustrative embodiment, the first representation is a core representation containing core audio information. The second representation is an enhancement representation containing enhancement audio information. The core representation is necessary for recovering the audio signal with minimal acceptable quality. Audio quality is enhanced when the core representation, together with the enhancement representation, is used to recover the audio signal. A method for use in such an apparatus is also described.
摘要:
An envelope for a window function establishes (a) a power complementary condition, and (b) at least a first derivative (or higher order) that is continuous. The window function is developed from samples of the envelope. This discrete window function has a predetermined domain. The window function can be embodied as a windowing device receiving signals from a discrete signal source. This windowing device can adjust values from the discrete signal source according to the window function. A utilization device can use the adjusted values from said windowing device.
摘要:
An efficient coding scheme with higher audio bandwidth and/or better audio quality at lower bitrates, wherein the scheme eliminates long-term and short-term frequency domain correlation in a signal via frequency domain predictors. The coding scheme compresses information consisting of coded low frequency components as well as a parametric representation for the high frequency components based on a non-linear model. Additionally, by working on the frequency domain representations of the signal (such as the MDCT representation which is naturally available to a PAC encoder and decoder), low pass and high pass signal components are easily obtained by windowing the appropriate ranges of frequencies in the signal. Furthermore, the power functions of the signal are replaced by corresponding convolution functions of the same order.
摘要:
Audio tracks or other portions of a particular type of audio material to be encoded are analyzed to determine a value of at least one coding-related parameter suitable for providing optimal encoding of the particular type of audio material. When a given portion of the audio material is to be encoded for transmission in a perceptual audio coder of a communication system, the value of the coding-related parameter is identified and then utilized in conjunction with the encoding of the given portion. The determined value of the coding-related parameter may be at least a portion of a psychoacoustic model utilized in encoding the given portion of the particular type of audio material in the perceptual audio coder. As another example, the value of the coding-related parameter may be a setting of an audio processor utilized to process the given portion of the particular type of audio material prior to encoding the given portion in the perceptual audio coder.
摘要:
An audio information bit stream including audio control bits and audio data bits is processed for transmission in a communication system. The audio data bits are first separated into n classes based on error sensitivity, that is, the impact of errors in particular audio data bits on perceived quality of an audio signal reconstructed from the transmission. Each of the n different classes of audio data bits is then provided with a corresponding one of n different levels of error protection, where n is greater than or equal to two. The invention thereby matches error protection for the audio data bits to source and channel error sensitivity. The audio control bits may be transmitted independently of the audio data bits, using an additional level of error protection higher than that used for any of the n classes of the audio data bits. Alternatively, the control bits may be combined with one of the n classes of audio data bits and provided with the highest of the n levels of error protection. Further protection may be provided for the control bits by repeating at least a portion of the control bits from a current packet of the audio information bit stream in a subsequent packet of the audio information bit stream. Moreover, the classification of audio data bits into n different classes can be implemented on a fixed packet-by-packet basis, or in a more flexible, adaptive implementation in which different multipacket error protection profiles are used for different multipacket segments of a source-coded audio signal.
摘要:
In a digital audio broadcast system, to utilize transmission bandwidth efficiently, representations of a stereo audio signal for transmission are generated in accordance with an inventive multidescriptive coding technique. The representations, as generated, are then transmitted through multiple communication channels, respectively. The transmitted representations are received by a receiver where one or more of the representations are selected for recovery of the stereo audio signal. Because of the design of the multidescriptive coding used, the more representations are selected to recover the stereo audio signal, the higher the quality of the recovered signal.
摘要:
Improved channel code configurations for use in transmission of digital audio or other types of information in a digital communication system. The channel code may include an outer channel code, e.g., a cyclic redundancy code (CRC), and an inner channel code, e.g., a complementary punctured pair convolutional (CPPC) code. In accordance with the invention, multiple code words of the outer code are associated with a given packet of the digital information, in accordance with a particular outer code configuration, so as to provide partial error flagging for different portions of the given packet. An information encoder, e.g., a PAC encoder, interacts with an outer code encoder to determine a bit allocation for transmission of packets at a particular bit rate, based at least in part on the outer code configuration. The invention is particularly well-suited for use in the transmission of digital audio information in a hybrid in-band on-channel (HIBOC) digital audio broadcasting (DAB) system, but is also applicable to other types of communication systems.
摘要:
Methods and apparatus for processing information, e.g., audio, video or image information, for transmission in a communication system are disclosed. In an illustrative embodiment, a multiple program coder encodes a bitstream for each of the programs, in accordance with a specified bit allocation. An outer code encoder receives the encoded bitstreams from the multiple program coder, and jointly applies an outer code, e.g., a CRC code, RS code, BCH code or other type of linear block code, to subblocks of a subset of the resulting encoded bitstreams. The outer code is applied such that subblocks of each of the encoded bitstreams in the subset of the encoded bitstrcams are associated with a single outer code block. The number of subblocks associated with a given outer code block may be an integer or non-integer number of code blocks. The invention allows a designated number of program subblocks to share a given outer code block, such that outer code error flags can be generated for smaller groups of bits per program than would otherwise be possible. An inner code encoder may be used to apply an inner code, e.g., a convolutional code, turbo code or trellis coded modulation, to the resulting outer code blocks.
摘要:
The invention provides methods and apparatus for processing information, e.g., audio, video or image information, for transmission in a communication system. In an illustrative embodiment, interference characteristics are determined for a set of n channels to be used to transmit audio information bits, where n is greater than or equal to two. The audio information bits are separated into n classes based on error sensitivity, for example, the impact of errors in particular audio data bits on perceived quality of an audio signal reconstructed from the transmission. The classes of bits are then assigned to the n channels such that the classes of bits having the greatest error sensitivity are transmitted over the channels which are the least susceptible to interference. The interference characteristics associated with the n channels can be determined by, for example, measuring interference levels for one or more of the channels, or obtaining information regarding known interference levels for one or more of the channels. The channels may correspond to different frequency bands, time slots, code division slots or any other type of channels. The invention can provide UEP for different classes of audio information bits even in cases in which the same convolutional code, or the same complementary punctured pair convolutional (CPPC) code pair, is used to encode the classes. The assignment of the classes of bits to the channels, as well as the characteristics of the classes and the channels, may be fixed or dynamic.