Abstract:
The invention is aimed at improving the quality of the filtering by transfer functions of HRTF type of signals (L, R) compressed in a transformed domain, for binaural playing on two channels (L-BIN, R-BIN), using a combination of HRTF filters (hL,L, hL,R) including a decorrelated version (HRTF-C*, HRTF-E*) of a few of these filters. For this purpose, a decorrelation cue is given with spatialization parameters (SPAT) accompanying the compressed signals (L, R). The Decorrelation comprises applying a different phase shift to each subband of the input signal combined with addition of an overall delay. The invention makes it possible to improve the broadening in the binaural rendition of audio scenes initially in a multi-channel format.
Abstract:
The invention is aimed at improving the quality of the filtering by transfer functions of HRTF type of signals (L, R) compressed in a transformed domain, for binaural playing on two channels (L-BIN, R-BIN), using a combination of HRTF filters (hL,L, hL,R) including a decorrelated version (HRTF-C*, HRTF-E*) of a few of these filters. For this purpose, a decorrelation cue is given with spatialization parameters (SPAT) accompanying the compressed signals (L, R). The invention makes it possible to improve the broadening in the binaural rendition of audio scenes initially in a multi-channel format.
Abstract:
A method for processing a signal in the form of consecutive sample blocks, the method comprising filtering in a transformed domain of sub-bands, and particularly equalization processing, applied to a current block in the transformed domain, and filtering-adjustment processing that is applied in the transformed domain to at least one block adjacent to the current block.
Abstract:
A method for updating the processing capacity of an encoder or decoder to use a modulated transform having a size greater than a predetermined initial size is provided, particularly, where the encoders or decoders are for storing an initial prototype filter defined by an ordered set of initial size coefficients. A step is provided for constructing a prototype filter of a size greater than the initial size to implement the modulated transform of the greater size by inserting at least one coefficient between two consecutive coefficients of the initial prototype filter.
Abstract:
A method of hierarchical coding of a digital audio frequency input signal into several frequency sub-bands, including a core coding of the input signal according to a first throughput and at least one enhancement coding of higher throughput, of a residual signal. The core coding uses a binary allocation according to an energy criterion. The method includes for the enhancement coding: calculating a frequency-based masking threshold for at least part of the frequency bands processed by the enhancement coding; determining a perceptual importance per frequency sub-band as a function of the masking threshold and as a function of the number of bits allocated for the core coding; binary allocation of bits in the frequency sub-bands processed by the enhancement coding, as a function of the perceptual importance determined; and coding the residual signal according to the bit allocation. Also provided are a decoding method, a coder and a decoder.
Abstract:
A method for processing sound data is provided for the reconstruction of multi-channel audio data on the basis at least of data on a reduced number of channels and of spatialization data. A test is carried out to determine whether the spatialization data received are valid. If the test is positive, a spatialization value is predicted according to a per respective model of a plurality of models. A prediction model is chosen on the basis of the spatialization values thus predicted and on the basis of the spatialization data received, to permit, in case of subsequent reception of defective spatialization data, a prediction according to this chosen model of a spatialization value and to use this predicted spatialization value for the reconstruction of the multi-channel audio data.
Abstract:
A system for coding a hierarchical audio signal, comprising, at least, a core layer using parametric coding by analysis by synthesis in a first frequency band, a band extension layer for widening said first frequency band into a second frequency band, or wideband. The system also comprises a wideband audio coding quality enhancement layer based on transform coding using a spectral parameter obtained from said band extension layer. Application to transmitting speech and/or audio signals over packet networks.
Abstract:
A method of bitrate switching on decoding an audio signal coded by a audio coding system, said decoding comprising a post-processing step depending on the bitrate. On switching from an initial bitrate to a final bitrate, said method includes a transition step of continuous change from a signal at the initial bitrate to a signal at the final bitrate, one or both of said signals being post-processed. Application to transmission of VoIP speech and/or audio signals in data packet networks.
Abstract:
A method for encoding and decoding a digital audio signal is provided, said method comprising the steps of: encoding a first sequence of samples of the digital signal according to a transform encoding; encoding a second sequence of samples of the digital signal according to a predictive encoding; wherein the second sequence starts before the end of the first sequence, a subsequence common to the first and second sequences being thus encoded both by predictive encoding and by transform encoding.
Abstract:
The invention proposes the synthesis of a signal consisting of consecutive blocks. It proposes more particularly, on receipt of such a signal, to replace, by synthesis, lost or erroneous blocks of this signal. To this end, it proposes an attenuation of the overvoicing during the generation of a signal synthesis. More particularly, a voiced excitation is generated on the basis of the pitch period (T) estimated or transmitted at the previous block, by optionally applying a correction of plus or minus a sample of the duration of this period (counted in terms of number of samples), by constituting groups (A′,B′,C′,D′) of at least two samples and inverting positions of samples in the groups, randomly (B′,C′) or in a forced manner. An over-harmonicity in the excitation generated is thus broken and the effect of overvoicing in the synthesis of the generated signal is thereby attenuated.