Abstract:
The present disclosure relates to reverberation generation for headphone virtualization. A method of generating one or more components of a binaural room impulse response (BRIR) for headphone virtualization is described. In the method, directionally-controlled reflections are generated, wherein directionally-controlled reflections impart a desired perceptual cue to an audio input signal corresponding to a sound source location. Then at least the generated reflections are combined to obtain the one or more components of the BRIR. Corresponding system and computer program products are described as well.
Abstract:
The present disclosure relates to the field of audio coding, in particular, it relates to a method for encoding audio signals through a masking model based on a hearing threshold of frequency intervals of the audio signal and a measured energy of the audio signal for the corresponding frequency intervals. The disclosure further relates to an encoder that is capable of carrying out the audio encoding method.
Abstract:
According to an aspect of the present invention, a method for reconstructing an audio signal having a baseband portion and a highband portion is disclosed. The method includes obtaining a decoded baseband audio signal by decoding an encoded audio signal and obtaining a plurality of subband signals by filtering the decoded baseband audio signal. The method further includes generating a high-frequency reconstructed signal by copying a number of consecutive subband signals of the plurality of subband signals and obtaining an envelope adjusted high-frequency signal. The method further includes generating a noise component based on a noise parameter. Finally, the method includes adjusting a phase of the high-frequency reconstructed signal and obtaining a time-domain reconstructed audio signal by combining the decoded baseband audio signal and the combined high-frequency signal to obtain a time-domain reconstructed audio signal.
Abstract:
According to an aspect of the present invention, a method for reconstructing an audio signal having a baseband portion and a highband portion is disclosed. The method includes obtaining a decoded baseband audio signal by decoding an encoded audio signal and obtaining a plurality of subband signals by filtering the decoded baseband audio signal. The method further includes generating a high-frequency reconstructed signal by copying a number of consecutive subband signals of the plurality of subband signals and obtaining an envelope adjusted high-frequency signal. The method further includes generating a noise component based on a noise parameter. Finally, the method includes adjusting a phase of the high-frequency reconstructed signal and obtaining a time-domain reconstructed audio signal by combining the decoded baseband audio signal and the combined high-frequency signal to obtain a time-domain reconstructed audio signal.
Abstract:
The present disclosure relates to reverberation generation for headphone virtualization. A method of generating one or more components of a binaural room impulse response (BRIR) for headphone virtualization is described. In the method, directionally-controlled reflections are generated, wherein directionally-controlled reflections impart a desired perceptual cue to an audio input signal corresponding to a sound source location. Then at least the generated reflections are combined to obtain the one or more components of the BRIR. Corresponding system and computer program products are described as well.
Abstract:
Audio processing methods may involve receiving audio data corresponding to a plurality of audio channels. The audio data may include a frequency domain representation corresponding to filterbank coefficients of an audio encoding or processing system. A decorrelation process may be performed with the same filterbank coefficients used by the audio encoding or processing system. The decorrelation process may be performed without converting coefficients of the frequency domain representation to another frequency domain or time domain representation. The decorrelation process may involve selective or signal-adaptive decorrelation of specific channels and/or specific frequency bands. The decorrelation process may involve applying a decorrelation filter to a portion of the received audio data to produce filtered audio data. The decorrelation process may involve using a non-hierarchal mixer to combine a direct portion of the received audio data with the filtered audio data according to spatial parameters.
Abstract:
A method including steps of decoding an encoded audio signal indicative of encoded audio content (e.g., audio content captured during a teleconference) to generate a decoded signal indicative of a decoded version of the audio content, and performing adaptive quantization noise filtering on the decoded signal. The filtering is performed adaptively in the frequency domain in response to data indicative of signal to noise values in turn indicative of a post-quantization signal-to-quantization noise ratio for each frequency band of each of at least one segment of the encoded audio content. In some embodiments, each signal to noise value is a bit allocation value equal to the number of mantissa bits of an encoded audio sample of a frequency band of a segment of the encoded audio content. Other aspects are decoder, or post-filter coupled to receive a decoder's output, configured to perform an embodiment of the adaptive filtering.
Abstract:
A method of encoding audio information for forward error correction reconstruction of a transmitted audio stream over a lossy packet switched network, the method including the steps of: (a) dividing the audio stream into audio frames; (b) determining a series of corresponding audio frequency bands for the audio frames; (c) determining a series of power envelopes for the frequency bands; (d) encoding the envelopes as a low bit rate version of the audio frame in a redundant transmission frame.
Abstract:
A method for generating a reconstructed audio signal having a baseband portion and a highband portion is disclosed. The method includes deformatting an encoded audio signal into a first part and a second part and extracting, from the first part, temporal envelope information and spectral components of the baseband portion. The method further includes decoding the first part to obtain a decoded baseband audio signal. The decoding includes filtering in a frequency domain at least some of the spectral components of the baseband portion with the reconstruction filter using the temporal envelope information to shape a temporal envelope of the baseband portion. The method also includes extracting, from the second part, a noise parameter and an estimated spectral envelope of the highband portion and obtaining a plurality of subband signals by filtering the decoded baseband audio signal. The method further includes generating a high-frequency reconstructed signal by copying a number of consecutive subband signals of the plurality of subband signals and obtaining an envelope adjusted high-frequency signal by adjusting, based on the estimated spectral envelope of the highband portion, a spectral envelope of the high-frequency reconstructed signal.
Abstract:
A method for generating a reconstructed audio signal having a baseband portion and a highband portion is disclosed. The method includes deformatting an encoded audio signal into a first part and a second part and obtaining a decoded baseband audio signal by decoding the first part. The method also includes extracting, from the second part, a noise parameter and an estimated spectral envelope of the highband portion and obtaining a plurality of subband signals by filtering the decoded baseband audio signal. The method further includes generating a high-frequency reconstructed signal by copying in a circular manner a number of consecutive subband signals of the plurality of subband signals and obtaining an envelope adjusted high-frequency signal by adjusting, based on the estimated spectral envelope of the highband portion, a spectral envelope of the high-frequency reconstructed signal. The method also includes generating a noise component based on the noise parameter and obtaining a combined high-frequency signal by adding the noise component to the envelope adjusted high-frequency signal. Finally, the method includes obtaining a time-domain reconstructed audio signal by combining the decoded baseband audio signal and the combined high-frequency signal. The method may be implemented by an audio decoding device comprising one or more hardware elements.