摘要:
In the method of coding the audio signal, the values of first parameters (P1,1), which represent aspects of the audio signal at a first instant (ti), are calcul ated to obtain first calculated values (Al,i). The values of second parameters P2,i), which represent the aspects of the audio signal at a second, later, instant (t2), are calculated to obtain the second calculated values (A2,i). The number of the first parameters (Pl,i) and the number of the second parameters (P2,i) differ. A subset (SUS2,i) of the second parameters (P2,i) is associated with a particular portion (SFRAi) of a frequency range (FR) of the audio signal This frequency range (FR) of the audio signal is preferably selected to cover all the f requencies present in the audio signal. The values (A2,i) of the subset (SUS2,i) of the second parameters (P2,i) are coded based on a difference of this subset (SUS2,i) and a subset (SUS1,i) of the first calculated value(s) (Al,i) associate d with substantially this same particular portion (SFRAi) of the frequency range (FR). Thus the differentially coded values (7) of the second parameters (P2,i) are obtained by coding the difference of the values of second parameters (P2,i and first parameters (P1,i) which are associated with substantially the same frequency subrange (SFRAi). This allows to differential code the parameters (Pl,I P2,i) even if the number of the parameters changes in time.
摘要:
The invention relates to a linking unit (100), a parametric encoder (400) and a method for generating linking information L indicating components of consecutive extended segments sp and sc which may be linked together in order to form a sinusoidal track. The segments sp and sc approximate consecutive segments of a sinusoidal audio or speech signal s. The linking unit comprises a calculating unit (120) for generating a similarity matrix S(m,n) in response to received sinusoidal code data and an evaluating unit (140) for receiving and evaluating said similarity matrix S in order to generate said linking information by selecting those pairs of components m,n the similarity of which is maximal. According to the invention the calculating unit (120) is adapted to calculate the similarity matrix S by additionally considering information about the phase consistency between the components of the extended previous segment sp and the extended current segment sc. In that way the selection of components suitable for being linked together is improved resulting in the definition of correct tracks.
摘要:
A parametric stereo upmix apparatus generates left and right signals from a mono downmix signal based on spatial parameters. The parametric stereo upmix includes a predictor configured to predict a difference signal including a difference between the left and right signals based on the mono downmix signal scaled with a prediction coefficient. The prediction coefficient is derived from the spatial parameters. The parametric stereo upmix apparatus further includes an arithmetic unit configured to derive the left and right signals based on a sum and a difference of the mono downmix signal and the difference signal.
摘要:
A method and a device for processing a stereo signal obtained from an encoder, which codes an N-channel audio signal into spatial parameters (P) and a stereo down-mix comprising first and second stereo signals (L 0 , R 0 ). A first signal and a third signal are added in order to obtain a first output signal (L 0w ), wherein the first signal QL 0wL ) comprises the first stereo signal (L 0 ) modified by a first complex function (g 1 ), and the third signal (L 0wR ) comprises the second stereo signal (R 0 ) modified by a third complex function (g 3 ). A second signal and a fourth signal are added to obtain a second output signal (R 0w ). The fourth signal (R 0wR ) comprises the second stereo signal (R 0 ) modified by a fourth complex function (g 4 ), and the second signal (R 0wL ) comprises the first stereo signal (L 0 ) modified by a second complex function (g 2 ). The complex functions (g 1 ,g 2 ,g 3 ,g 4 ) are functions of the spatial parameters (P) and are chosen such that an energy value of the difference (L 0wL -P 0wL ) between the first signal and the second signal is larger than or equal to the energy value of the sum (L 0wL +R 0wL ) of the first and the second signal and the energy value of the difference (R 0wR -L 0wR ) between the fourth signal and the third signal is larger than or equal to the energy value of the sum (R 0wR +L 0wR ) of the fourth signal and the third signal.
摘要:
A method of generating a monaural signal (S) comprising a combination of at least two input audio channels (L, R) is disclosed. Corresponding frequency components from respective frequency spectrum representations for each audio channel (L(k), R(k)) are summed (46) to provide a set of summed frequency components (S(k)) for each sequential segment. For each frequency band (i) of each of sequential segment, a correction factor (m(i)) is calculated (45) as function of a sum of energy of the frequency components of the summed signal in the band formula (I) and a sum of the energy of said frequency components of the input audio channels in the band formula (II). Each summed frequency component is corrected (47) as a function of the correction factor (m(i)) for the frequency band of said component.
摘要:
An audio decoder comprises a receiver (801) for receiving input data comprising an N-channel signal corresponding to a down-mixed signal of an M-channel audio signal, M>N, having complex valued subband encoding matrices applied in frequency subbands and parametric multi-channel data. A subband filter bank (805) generates real-valued frequency subbands for the N-channel signal. A matrix processor (809) determines real-valued subband decoding matrices for compensating the application of the encoding matrices in response to the parametric multi-channel data. A compensation processor (807) generates down-mix data corresponding to the down-mixed signal by a matrix multiplication of the real-valued subband decoding matrices and data of the N-channel signal in the at least some real-valued frequency subbands. The down-mix data can be used to regenerate the down-mixed signal and the M-channel audio signal. The decoder may compensate for MPEG Matrix Surround Compatibility operations performed at the encoder using real-valued frequency subbands.
摘要:
A method of encoding input signals (1, r) to generate encoded data (100) is provided. The method involves processing the input signals (1, r) to determine first parameters (Õ 1 , Õ 2 ) describing relative phase difference and temporal difference between the signals (1, r), and applying these first parameters (Õ 1 , Õ 2 ) to process the input signals to generate intermediate signals. The method involves processing the intermediate signals to determine second parameters (±; IID, Á) describing angular rotation of the first intermediate signals to generate a dominant signal (m) and a residual signal (s), the dominant signal (m) having a magnitude or energy greater than that of the residual signal (s). These second parameters are applicable to process the intermediate signals to generate the dominant (m) and residual (s) signals. The method also involves quantizing the first parameters, the second parameters, and dominant and residual signals (m, s) to generate corresponding quantized data for subsequent multiplexing to generate the encoded data (100).
摘要:
An output audio signal (L, R) is generated based on an input audio signal, the input audio signal comprising a plurality of input subband signals (N). The input subband signals are delayed in a plurality of delay units (76) to obtain a plurality of delayed subband signals, wherein at least one input subband signal is delayed more than a further input subband signal of higher frequency, and wherein the output audio signal is derived (77) from a combination of the input audio signal and the plurality of delayed subband signals.
摘要:
Multi-channel audio signals are coded into a monaural audio signal and information allowing to recover the multi-channel audio signal from the monaural audio signal and the information. The information is generated by determining a first portion of the information for a first frequency region of the multi-channel audio signal, and by determining a second portion of the information for a second frequency region of the multi-channel audio signal. The second frequency region is a portion of the first frequency region and thus is a sub-range of the first frequency region. The information is multi-layered enabling a scaling of the decoding quality versus bit rate.
摘要:
Coding a signal is provided, wherein a first set of values is provided related to subsequent times in a first time interval of the signal, a second set of values is provided related to subsequent times in a second time interval of the signal, the first time interval having an overlap with the second time interval, the overlap including at least two subsequent times of the second interval, wherein at least one of the values of the second set related to the at least two subsequent times in the overlap is encoded with reference to a value of the first set which is closer in time to the at least one value of the second set than any other value in the second set.