摘要:
A decoder (115) generates a multi channel audio signal, such as a surround sound signal, from a received first signal. The multi-channel signal comprises a second set of audio channels and the first signal comprises a first set of audio channels. The decoder (115) comprises a receiver (401) which receives the first signal. The receiver (401) is coupled to an estimate processor (405) which generates estimated parametric data for the second set of audio channels in response to characteristics of the first set of audio channels. The estimated parametric data relates characteristics of the second set of audio channels to characteristics of the first set of audio channels. The decoder (115) furthermore comprises a spatial audio decoder (403) which decodes the first signal in response to the estimated parametric data to generate the multi-channel signal comprising the second set of channels. The invention allows use of spatial audio decoding with signals that are not encoded by a spatial audio encoder.
摘要:
Encoding an audio signal is provided wherein the audio signal includes a first audio channel and a second audio channel, the encoding comprising subband filtering each of the first audio channel and the second audio channel in a complex modulated filterbank to provide a first plurality of subband signals for the first audio channel and a second plurality of subband signals for the second audio channel, downsampling each of the subband signals to provide a first plurality of downsampled subband signals and a second plurality of downsampled subband signals, further subband filtering at least one of the downsampled subband signals in a further filterbank in order to provide a plurality of sub-subband signals, deriving spatial parameters from the sub-subband signals and from those downsampled subband signals that are not further subband filtered, and deriving a single channel audio signal comprising derived subband signals derived from the first plurality of downsampled subband signals and the second plurality of downsampled subband signals. Further, decoding is provided wherein an encoded audio signal comprising an encoded single channel audio signal and a set of spatial parameters is decoded by decoding the encoded single channel audio channel to obtain a plurality of downsampled subband signals, further subband filtering at least one of the downsampled subband signals in a further filterbank in order to provide a plurality of sub-subband signals, and deriving two audio channels from the spatial parameters, the sub-subband signals and those downsampled subband signals that are not further subband filtered.
摘要:
An encoder (109) comprises a receiver (201) which receives a time domain audio signal. A filter bank (203) generates a first subband signal from the time domain audio signal where the first subband signal corresponds to a non-critically sampled complex subband domain representation of the time domain signal. A conversion processor (205) generates a second subband signal from the first subband signal by subband processing. The second subband signal corresponds to a critically sampled complex subband domain representation of the time domain audio signals. An encode processor (207) then generates a waveform encoded data stream by encoding data values of the second subband signal. The conversion processor (205) generates the second subband signal by direct subband conversion without converting back to the time domain. The invention allows an oversampled subband signal typically generated in parametric encoding to be waveform encoded with reduced complexity. A decoder performs the inverse operation.
摘要:
Encoding an audio signal is provided wherein the audio signal includes a first audio channel and a second audio channel, the encoding comprising subband filtering each of the first audio channel and the second audio channel in a complex modulated filterbank to provide a first plurality of subband signals for the first audio channel and a second plurality of subband signals for the second audio channel, downsampling each of the subband signals to provide a first plurality of downsampled subband signals and a second plurality of downsampled subband signals, further subband filtering at least one of the downsampled subband signals in a further filterbank in order to provide a plurality of sub-subband signals, deriving spatial parameters from the sub-subband signals and from those downsampled subband signals that are not further subband filtered, and deriving a single channel audio signal comprising derived subband signals derived from the first plurality of downsampled subband signals and the second plurality of downsampled subband signals. Further, decoding is provided wherein an encoded audio signal comprising an encoded single channel audio signal and a set of spatial parameters is decoded by decoding the encoded single channel audio channel to obtain a plurality of downsampled subband signals, further subband filtering at least one of the downsampled subband signals in a further filterbank in order to provide a plurality of sub-subband signals, and deriving two audio channels from the spatial parameters, the sub-subband signals and those downsampled subband signals that are not further subband filtered.
摘要:
A decoder (115) generates a multi channel audio signal, such as a surround sound signal, from a received first signal. The multi-channel signal comprises a second set of audio channels and the first signal comprises a first set of audio channels. The decoder (115) comprises a receiver (401) which receives the first signal. The receiver (401) is coupled to an estimate processor (405) which generates estimated parametric data for the second set of audio channels in response to characteristics of the first set of audio channels. The estimated parametric data relates characteristics of the second set of audio channels to characteristics of the first set of audio channels. The decoder (115) furthermore comprises a spatial audio decoder (403) which decodes the first signal in response to the estimated parametric data to generate the multi-channel signal comprising the second set of channels. The invention allows use of spatial audio decoding with signals that are not encoded by a spatial audio encoder.
摘要:
A multi-channel audio encoder (10) for encoding a multi-channel audio signal (101), e.g. a 5.1 channel audio signal, into a spatial down-mix (102), e.g. a stereo signal, and associated parameters (104, 105). The encoder (10) comprises first and second units (110, 120). The first unit (110) encodes the multi-channel audio signal (101) into the spatial down-mix (102) and parameters (104). These parameters (104) enable a multi-channel decoder (20) to reconstruct the multi-channel audio signal (203) from the spatial down-mix (102). The second unit (120) generates, from the spatial down-mix (102), parameters (105) that enable the decoder to reconstruct the spatial down-mix (202) from an alternative down-mix (103), e.g. a so-called artistic down-mix that has been manually mixed in a sound studio. In this way, the decoder (20) can efficiently deal with a situation in which an alternative down-mix (103) is received instead of the regular spatial, down-mix (102). In the decoder (20), first the spatial down-mix (202) is reconstructed from the alternative down-mix (103) and the parameters (105). Next, the spatial down-mix (202) is decoded into the multi-channel audio signal (203).
摘要:
An audio decoder comprises a receiver (801) for receiving input data comprising an N-channel signal corresponding to a down-mixed signal of an M-channel audio signal, M>N, having complex valued subband encoding matrices applied in frequency subbands and parametric multi-channel data. A subband filter bank (805) generates real-valued frequency subbands for the N-channel signal. A matrix processor (809) determines real-valued subband decoding matrices for compensating the application of the encoding matrices in response to the parametric multi-channel data. A compensation processor (807) generates down-mix data corresponding to the down-mixed signal by a matrix multiplication of the real-valued subband decoding matrices and data of the N-channel signal in the at least some real-valued frequency subbands. The down-mix data can be used to regenerate the down-mixed signal and the M-channel audio signal. The decoder may compensate for MPEG Matrix Surround Compatibility operations performed at the encoder using real-valued frequency subbands.
摘要:
A multi-channel audio encoder (10) for encoding a multi-channel audio signal (101), e.g. a 5.1 channel audio signal, into a spatial down-mix (102), e.g. a stereo signal, and associated parameters (104, 105). The encoder (10) comprises first and second units (110, 120). The first unit (110) encodes the multi-channel audio signal (101) into the spatial down-mix (102) and parameters (104). These parameters (104) enable a multi-channel decoder (20) to reconstruct the multi-channel audio signal (203) from the spatial down-mix (102). The second unit (120) generates, from the spatial down-mix (102), parameters (105) that enable the decoder to reconstruct the spatial down-mix (202) from an alternative down-mix (103), e.g. a so-called artistic down-mix that has been manually mixed in a sound studio. In this way, the decoder (20) can efficiently deal with a situation in which an alternative down-mix (103) is received instead of the regular spatial, down-mix (102). In the decoder (20), first the spatial down-mix (202) is reconstructed from the alternative down-mix (103) and the parameters (105). Next, the spatial down-mix (202) is decoded into the multi-channel audio signal (203).