摘要:
An audio encoding device (100) comprises first encoding means (101, 111) for encoding transient signal components and/or sinusoidal signal components of an audio signal (x(n)) and producing a residual signal (z(n)), and second encoding means for encoding the residual signal. The second encoding means comprise filter means (122) for selecting at least two frequency bands of the residual signal. The selected frequency bands (LF, HF) of the residual signal (z(n)) are encoded by a first encoding unit (123) and a second encoding unit (124) respectively. The first encoding unit (123) may comprise a waveform encoder, such as a time-domain encoder, while the second encoding unit (124) may comprise a noise encoder.
摘要:
In a method of encoding input signals (CH1 to CH3; 400 to 450) in a multi-channel encoder (5; 15) to generate corresponding output data having down-mix output signals (610, 620) together with complementary parametric data (600), the method includes a first step of down-mixing input signals (CH1 to CH3; 400 to 450) to generate the corresponding down-mix output signals (610, 620), and a second step of processing the input signals (CH1 to CH3; 400 to 450) during down-mixing to generate the parametric data (600) complementary to the down-mix output signals (610, 620). Processing of the input signals (CH1 to CH3; 400 to 450) involves including information in the down-mix signals (610, 620) which is useable during subsequent decoding of the down-mix output signals (610, 620) and the parametric data (600) to determine at least some parameter data and thereby enabling representations of the input signals (CH1 to CH3; 400 to 450) to be subsequently regenerated.
摘要:
An inverse filtering method, comprising: generating a first filtered signal based on an input signal; and combining the first filtered signal with the input signal for obtaining a residual signal. The generating comprises: generating at least two second filtered signals, each of said second filtered signals not significantly delayed in time relative to each other, the generating being stable and causal; and amplifying at least one of the second filtered signals with a prediction coefficient.
摘要:
An encoder (100) for encoding a multi-channel audio signal comprises a prediction processor (101) for generating two residual signals for two signal components of the multi-channel signal by linear prediction which is associated with psycho-acoustic characteristics and which specifically uses psycho-acoustic prediction filters; a rotation processor (105) for rotating the combined signal of the two residual signals to generate a main signal and a side signal, in which the energy of the main signal is maximized and the energy of the side signal is minimized; an encoding processor (109) for encoding the main and preferably the side signal; and an output processor (111) for generating an output signal data, prediction parameters and rotation parameters.
摘要:
A device (2) for changing the pitch of an audio signal (r), such as a speech signal, comprises a sinusoidal analysis unit (21) for determining sinusoidal parameters of the audio signal (r), a parameter production unit (22) for predicting the phase of a sinusoidal component, and a sinusoidal synthesis unit (23) for synthesizing the parameters to produce a reconstructed signal (r′). The parameter production unit (22) receives, for each time segment of the audio signal, the phase of the previous time segment to predict the phase of the current time segment.
摘要:
A hybrid sinusoidal/pulse excitation encoder has been recently proposed for constructing a scalable audio encoder The base layer consisting of data supplied by the sinusoidal encoder retains the main features of the input signal achieving medium to high quality audio at a very low bit rate. Quality can be further enhanced by adding excitation signal layers associated with a decreasing decimation that increasingly model more subtle aspects of the original signal. The invention provides a method of mixing the different excitation signal layers so that the full concept of scalability is realised without compromising the quality of the encoded signals. The mixing is controlled via a quality parameter that weights the significance of previous layers when constructing a new higher layer.
摘要:
An audio apparatus comprises a processor (101) for providing a set of audio channels. A prediction circuit (103) generates a predicted signal for a first channel by adaptive filtering of a second channel by an adaptive filter. An adaptation processor (105) adapts the adaptive filter to minimize a cost function indicative of a difference between the predicted signal and the first channel. A compensation processor (107) then generates a non-predicted signal by compensating the first signal for the predicted signal and a distribution processor (109) generates an output set of audio channels by distributing at least the predicted signal and the non-predicted signal over the output set of audio signals where the distribution is different for the predicted signal and the non-predicted signal. The cross-channel predictive filtering provides signal components that represent different spatial characteristics of the originating sound and which are therefore advantageously distributed differently for the output channels.
摘要:
In a sinusoidal audio encoder a number of sinusoids are estimated per audio segment. A sinusoid is represented y frequency, amplitude and phase. Normally, phase is quantised independent of frequency The invention uses a frequency dependent quantisation of phase, and in particular the low frequencies are quantised using smaller quantisation intervals than at higher frequencies. Thus, the unwrapped phases of the lower frequencies are quantised more accurately, possibly with a smaller quantisation range, than the phases of the higher frequencies. The invention gives a significant improvement in decoded signal quality, especially for low bit-rate quantisers.
摘要:
The method creates an audio stream comprising tracks of sinusoidal components linked across a plurality of sequential time segments. Segments in each track are weighted with a normal window (WI, W2, W3), and consecutive segments have a normal period of overlap (0) of their trailing edges and leading edges. Segments in which a transient5 component is determined are weighted with a first modified window (WIm) having a modified trailing edge, and the following segment in the track is weighted with a second modified window (W2m) having a modified leading edge, so that the modified trailing edge and the modified leading edge have a modified period of overlap (0m) that comprises the transient component and that is shorter than the normal period of overlap (0), and wherein the audio stream includes sinusoidal codes representing the frequency and the transient. According to the invention, the modified period of overlap (0m) depends on the frequency value (f).
摘要:
There is described a method of encoding a signal (s(n)) in a coder (400) to generate a corresponding encoded bit-stream (x(n); STP). The method comprises steps of: (a) processing the signal (s(n)) to determine main sinusoidal components and transient components thereof to generate corresponding component parameters; (b) processing the signal (s(n)) by removing the sinusoidal and transient components therefrom to generate a residual signal (r(n)); (c) processing the residual signal (r(n)) to determine a spectral representation (PSD) and determining therefrom a spectral broadening measure (SBM); (d) determining from the residual signal (r(n)) spectral envelope parameters by linear prediction; and (e) combining the components parameters together with the spectral envelope parameters and the spectral broadening measure to generate the encoded bit-stream. The method is capable of reducing noise that would otherwise arise were the bitstream to be decoded not subjected to such spectral broadening.