摘要:
In a method and device for post-processing a decoded sound signal in view of enhancing a perceived quality of this decoded sound signal, the decoded sound signal is divided into a plurality of frequency sub-band signals, and post-processing is applied to at least one of the frequency sub-band signal. After post-processing of this at least one frequency sub-band signal, the frequency sub-band signals may be added to produce an output post-processed decoded sound signal. In this manner, the post-processing can be localized to a desired sub-band or sub-bands with leaving other sub-bands virtually unaltered.
摘要:
In a method and device for post-processing a decoded sound signal in view of enhancing a perceived quality of this decoded sound signal, the decoded sound signal is divided into a plurality of frequency sub-band signals, and post-processing is applied to at least one of the frequency sub-band signal. After post-processing of this at least one frequency sub-band signal, the frequency sub-band signals may be added to produce an output post-processed decoded sound signal. In this manner, the post-processing can be localized to a desired sub-band or sub-bands with leaving other sub-bands virtually unaltered.
摘要:
A pitch search method and device for digitally encoding a wideband signal, in particular but not exclusively a speech signal, in view of transmitting, or storing, and synthesizing this wideband sound signal. The new method and device which achieve efficient modeling of the harmonic structure of the speech spectrum uses several forms of low pass filters applied to a pitch codevector, the one yielding higher prediction gain (i.e. the lowest pitch prediction error) is selected and the associated pitch codebook parameters are forwarded.
摘要:
A device and method for shaping noise during encoding of an input sound signal comprise pre-emphasizing the input signal or a decoded signal from a given sound signal codec to produce a pre-emphasized signal, computing a filter transfer function based on the pre-emphasized signal, and shaping the noise by filtering the noise through the transfer function to produce a shaped noise signal, wherein the noise shaping comprises producing a noise feedback. A device and method for noise shaping in a multilayer codec, including at least Layer 1 and 2, comprise: at an encoder, producing an encoded sound signal in Layer 1 including Layer 1 noise shaping, and producing a Layer 2 enhancement signal; at a decoder, decoding the Layer 1 encoded sound signal to produce a synthesis signal, decoding the enhancement signal, computing a filter transfer function based on the synthesis signal, filtering the enhancement signal through the transfer function to produce a Layer 2 filtered enhancement signal, and adding the filtered enhancement signal to the synthesis signal to produce an output signal including contributions from Layer 1 and 2.
摘要:
An audio encoder has a first information sink oriented encoding branch, a second information source or SNR oriented encoding branch, and a switch for switching between the first encoding branch and the second encoding branch, wherein the second encoding branch has a converter into a specific domain different from the spectral domain, and wherein the second encoding branch furthermore has a specific domain coding branch, and a specific spectral domain coding branch, and an additional switch for switching between the specific domain coding branch and the specific spectral domain coding branch. An audio decoder has a first domain decoder, a second domain decoder for decoding a signal, and a third domain decoder and two cascaded switches for switching between the decoders.
摘要:
A perceptual weighting device for producing a perceptually weighted signal in response to a wideband signal comprises a signal pre-emphasis filter, a synthesis filter calculator, and a perceptual weighting filter. The signal pre-emphasis filter enhances the high frequency content of the wideband signal to thereby produce a pre-emphasized signal. The signal pre-emphasis filter has a transfer function of the form: P(z)=1−μz−1, wherein μ is a pre-emphasis factor having a value located between 0 and 1. The synthesis filter calculator is responsive to the pre-emphasized signal for producing synthesis filter coefficients. Finally, the perceptual weighting filter processes the pre-emphasized signal in relation to the synthesis filter coefficients to produce the perceptually weighted signal. The perceptual weighting filter has a transfer function, with fixed denominator, of the form: W(z)=A(z/γ1)/(1−γ2z−1) where 0
摘要:
An improved pitch search method and device for digitally encoding a wideband signal, in particular but not exclusively a speech signal, in view of transmitting, or storing, and synthesizing this wideband sound signal. The new method and device which achieve efficient modeling of the harmonic structure of the speech spectrum uses several forms of low pass filters applied to a pitch codevector, the one yielding higher prediction gain (i.e. the lowest pitch prediction error) is selected and the associated pitch codebook parameters are forwarded.
摘要:
In a method and device for recovering the high frequency content of a wideband signal previously down-sampled, and for injecting this high frequency content in an over-sampled synthesized version of the wideband signal to produce a fill-spectrum synthesized wideband signal, a random noise generator produces a noise sequence having a given spectrum. A spectral shaping unit spectrally shapes the noise sequence in relation to linear prediction filter coefficients related to the down-sampled wideband signal. A signal injection circuit finally injects the spectrally-shaped noise sequence in the over-sampled synthesized signal version to thereby produce the full-spectrum synthesized wideband signal.
摘要:
An audio signal decoder includes a transform domain path configured to obtain a time-domain representation of a portion of an audio content on the basis of a first set of spectral coefficients, a representation of an aliasing-cancellation stimulus signal and a plurality of linear-prediction-domain parameters. The transform domain path applies a spectrum shaping to the first set of spectral coefficients to obtain a spectrally-shaped version thereof. The transform domain path obtains a time-domain representation of the audio content on the basis of the spectrally-shaped version of the first set of spectral coefficients. The transform domain path includes an aliasing-cancellation stimulus filter to filter the aliasing-cancellation stimulus signal in dependence on at least a subset of the linear-prediction-domain parameters. The transform domain path also includes a combiner configured to combine the time-domain representation of the audio content with an aliasing-cancellation synthesis signal to obtain an aliasing reduced time-domain signal.
摘要:
A pitch search method and device for digitally encoding a wideband signal, in particular but not exclusively a speech signal, in view of transmitting, or storing, and synthesizing this wideband sound signal. The new method and device which achieve efficient modeling of the harmonic structure of the speech spectrum uses several forms of low pass filters applied to a pitch codevector, the one yielding higher prediction gain (i.e. the lowest pitch prediction error) is selected and the associated pitch codebook parameters are forwarded.