摘要:
A parametric representation of a multi-channel audio signal describes the spatial properties of the audio signal well with compact side information when a coherence information, describing the coherence between a first and a second channel, is derived within a hierarchical encoding process only for channel pairs including a first channel having only information of a left side with respect to a listening position and including a second channel having only information from a right side with respect to a listening position. As within the hierarchical process the multiple audio channels of the audio signal are downmixed iteratively into monophonic channels, one can pick the relevant parameters from an encoding step involving only channel pairs carrying the information needed to describe the spatial properties of the multi-channel audio signal.
摘要:
The present invention is based on the finding that an efficient code for encoding information values can be derived, when two or more information values are grouped in a tuple in a tuple order and when an encoding rule is used, that assigns the same code word to tuples having identical information values in different orders and that does derive an order information, indicating the tuple order, and when the code word is output in association with the order information.
摘要:
For synthesizing at least three output channels using two stereo input channels, the stereo input channels are analyzed to detect signal components occurring in both input channels. A signal generator is operative to introduce at least a part of the detected signal components into the second channel associated with a second speaker in an intended speaker scheme, which is positioned between a first and a third speaker in the speaker scheme. When, however, feeding of the complete detected signal components would result in a clipping situation, then only a part of the detected signal components is fed into the second channel as a real center channel and the remainder is located in the first and third channels as a phantom center channel.
摘要:
In a method for characterizing a signal representing an audio content a measure is determined for a tonality of the signal, whereupon a statement is made about the audio content of the signal on the basis of the measure for the tonality of the signal. The measure for the tonality is derived from a quotient whose numerator is the mean of the summed values of spectral components of the signal exponentiated with a first power and whose denominator is the mean of the summed values of spectral components exponentiated with a second power, the first and second powers differing from each other. The measure for the tonality of the signal for the content analysis is robust in relation to a signal distortion, due e.g. to MP3 coding, and has a high correlation with the content of the analyzed signal.
摘要:
A method of coding stereo audio spectral values first carries out grouping of those values in scale factor bands, with which scale factors are associated. Sections are formed next, each comprising at least one scale factor band. The spectral values are coded within at least one section with a code book assigned to the section, out of a plurality of code books each with a code book number assigned to it, the number of the code book used being transmitted as side information to the coded stereo audio spectral values. At least one additional code book number is provided, which does not refer to a code book but shows information relevant to the section to which it is assigned. A method of decoding stereo audio spectral values which are partly coded by the intensity stereo process and which have side information uses the relevant information, showing the additional code book numbers, to cancel the existing coding of the stereo audio spectral values.
摘要:
The present invention permits a combination of a scalable audio coder with the TNS technique. In a method for coding time signals sampled in a first sampling rate, second time signals are first generated whose sampling rate is smaller than the first sampling rate. The second time signals are then coded according to a first coding algorithm and written into a bit stream. The coded second time signals are, however, decoded again, and, like the first time signals, transformed into the frequency domain. From a spectral representation of the first time signals, TNS prediction coefficients are calculated. The transformed output signal of the coder/decoder with the first coding algorithm, like the spectral representation of the first time signal, undergoes a prediction over the frequency to obtain residual spectral values for both signals, though only the prediction coefficients calculated on the basis of the first time signals are used. These two signals are evaluated against each other. The evaluated residual spectral values are then coded by means of a second coding algorithm to obtain coded evaluated residual spectral values, which, together with the side information containing the calculated prediction coefficients, are written into the bit stream.
摘要:
A method for detecting a transient in a discrete-time audio signal is performed completely in the time domain and includes the step of segmenting the discrete-time audio signal as to generate consecutive segments of the same length with unfiltered discrete-time audio signals. The discrete-time audio signal in a current segment is filtered. Either the energy of the filtered discrete-time audio signal in the current segment is compared with the energy of the filtered discrete-time audio signal in a preceding segment or a current relationship between the energy of the filtered discrete-time audio signal in the current segment and the energy of the unfiltered discrete-time audio signal in the current segment is formed and this current relationship compared with a preceding corresponding relationship. Whether a transient is present in the discrete-time audio signal is detected using one and/or the other of these comparisons.
摘要:
In a method for concealing errors in an audio data stream the occurrence of an error is detected in the audio data stream, audio data prior to the occurrence of the fault being intact audio data. Thereafter a spectral energy of a subgroup of the intact audio data is calculated. After forming a pattern for substitute data on the basis of the spectral energy calculated for the subgroup of the intact audio data, substitute data for erroneous or missing audio data which correspond to the subgroup are created on the basis of the pattern.
摘要:
In the coding and decoding of stereo audio spectral values both the intensity stereo process and prediction are used in order to achieve high data compression. If intensity stereo coding is active in one section of scale factor bands, the prediction for the right channel in that range is deactivated, whereby the results of the prediction are not used to form the coded stereo audio spectral values. To allow further adaptation of the prediction for the right channel, the predictor of the right channel is fed with stereo audio spectral values for the channel, which again are intensity stereo decoded.