Abstract:
An audio normalization gain value is applied to an audio signal to produce a normalized signal. The normalized signal is processed to compute dynamic range control (DRC) gain values in accordance with a selected one of several pre-defined DRC characteristics. The audio signal is encoded, and the DRC gain values are provided as metadata associated with the encoded audio signal. Several other embodiments are also described and claimed.
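The encoder-side flow described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not define the DRC characteristics, so the two entries in `DRC_CHARACTERISTICS` (a threshold and a ratio each), the block-wise RMS level measure, and all function names are hypothetical.

```python
import math

# Hypothetical pre-defined DRC characteristics: (threshold in dBFS, compression ratio).
DRC_CHARACTERISTICS = {
    "light": (-20.0, 2.0),
    "heavy": (-30.0, 4.0),
}

def block_level_db(block):
    """RMS level of one block in dB relative to full scale (1.0)."""
    rms = math.sqrt(sum(x * x for x in block) / len(block))
    return 20.0 * math.log10(max(rms, 1e-9))

def drc_gain_db(level_db, threshold_db, ratio):
    """Static compressor curve (assumed): attenuate only above the threshold."""
    if level_db <= threshold_db:
        return 0.0
    return (threshold_db - level_db) * (1.0 - 1.0 / ratio)

def compute_drc_metadata(signal, normalization_gain_db, characteristic, block_size=4):
    """Apply the normalization gain, then derive one DRC gain (dB) per block."""
    g = 10.0 ** (normalization_gain_db / 20.0)
    normalized = [g * x for x in signal]
    threshold_db, ratio = DRC_CHARACTERISTICS[characteristic]
    gains = []
    for start in range(0, len(normalized), block_size):
        block = normalized[start:start + block_size]
        gains.append(drc_gain_db(block_level_db(block), threshold_db, ratio))
    return gains
```

In this sketch the returned gain list is what would travel as metadata next to the encoded audio; the audio itself is encoded without the DRC gains applied.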
Abstract:
A system for producing an encoded digital audio recording has an audio encoder that encodes a digital audio recording having a number of audio channels or audio objects. An equalization (EQ) value generator produces a sequence of EQ values which define EQ filtering that is to be applied when decoding the encoded digital audio recording, wherein the EQ filtering is to be applied to a group of one or more of the audio channels or audio objects of the recording independent of any downmix. A bitstream multiplexer combines the encoded digital audio recording with the sequence of EQ values, the latter as metadata associated with the encoded digital audio recording. Other embodiments are also described including a system for decoding the encoded audio recording.
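As a rough decoder-side illustration of EQ metadata that targets a channel group rather than a downmix, the sketch below filters only the channels named in the metadata. The `EqValues` container, the two-tap FIR filter form, and the channel names are assumptions made for the example; the abstract does not specify how EQ values are represented.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class EqValues:
    """Hypothetical EQ metadata: a two-tap FIR filter for one channel group."""
    group: tuple  # names of the channels the filter applies to
    b0: float
    b1: float

def apply_eq(channels, eq):
    """Filter only the channels in eq.group; copy the rest unchanged."""
    out = {}
    for name, samples in channels.items():
        if name in eq.group:
            prev = 0.0
            filtered = []
            for x in samples:
                # y[n] = b0 * x[n] + b1 * x[n-1]
                filtered.append(eq.b0 * x + eq.b1 * prev)
                prev = x
            out[name] = filtered
        else:
            out[name] = list(samples)
    return out
```

Because the filter is applied per channel before any mixing step, the result does not depend on whether a downmix is performed afterwards.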
Abstract:
Transfer functions can describe responses of microphones or ears to sounds at different locations on a sphere. The transfer functions can be compressed by determining, based on transfer functions, a) one or more basis transfer functions, and b) spherical harmonics coefficients that describe variations of the transfer functions with respect to spherical coordinates. Other aspects are described and claimed.
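One way to read this decomposition, offered only as an assumed notation since the abstract gives no formula, is that each transfer function is approximated by a small set of basis transfer functions whose direction-dependent weights are expanded in spherical harmonics. The symbols $H$, $B_k$, $c_{k,n}^{m}$, $K$, and $N$ are introduced here for illustration:

```latex
H(f, \theta, \phi) \;\approx\; \sum_{k=1}^{K} B_k(f)
  \sum_{n=0}^{N} \sum_{m=-n}^{n} c_{k,n}^{m} \, Y_n^{m}(\theta, \phi)
```

Here $B_k(f)$ are the basis transfer functions, $Y_n^{m}$ are spherical harmonics of order $n$ and degree $m$, and $c_{k,n}^{m}$ are the stored coefficients. Under this reading, compression comes from storing $K$ basis functions and $K(N+1)^2$ coefficients instead of one transfer function per measured direction.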
Abstract:
A first layer of data having a first set of Ambisonic audio components can be decoded, where the first set of Ambisonic audio components is generated based on ambience and one or more object-based audio signals. A second layer of data having at least one of the one or more object-based audio signals is decoded. One of the object-based audio signals is subtracted from the first set of Ambisonic audio components. The resulting Ambisonic audio components are rendered to generate a first set of audio channels. The one or more object-based audio signals are spatially rendered to generate a second set of audio channels. Other aspects are described and claimed.
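The subtraction step can be illustrated with first-order Ambisonics. This sketch assumes ACN channel order (W, Y, Z, X) with SN3D normalization and assumes the decoder knows the object's direction; neither detail is stated in the abstract, and the function names are hypothetical.

```python
import math

def encode_first_order(sample, azimuth, elevation):
    """Encode one mono sample into first-order Ambisonics (ACN order W, Y, Z, X; SN3D)."""
    return [
        sample,
        sample * math.sin(azimuth) * math.cos(elevation),
        sample * math.sin(elevation),
        sample * math.cos(azimuth) * math.cos(elevation),
    ]

def subtract_object(mixed_components, obj_sample, azimuth, elevation):
    """Remove an object's contribution from the first-layer Ambisonic components."""
    encoded = encode_first_order(obj_sample, azimuth, elevation)
    return [m - e for m, e in zip(mixed_components, encoded)]
```

After the subtraction, the remaining components carry only the ambience (and any objects not removed), so the object decoded from the second layer can be spatially rendered on its own without being heard twice.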
Abstract:
Dynamic loudness equalization of received audio content is performed in a playback system, using metadata that includes instantaneous loudness values for the audio content. A playback level is derived from a user volume setting of the playback system and is compared with a mixing level that is assigned to the audio content. Parameters that define an equalization filter are computed based on the instantaneous loudness values and on the comparison of the playback level with the assigned mixing level. The equalization filter is applied to the audio content before the filtered audio content drives a speaker. Other embodiments are also described and claimed.
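A minimal sketch of how such a filter parameter might be derived is shown below. Everything numeric here is an assumption for illustration: the linear volume-to-level mapping, the 80 dB reference, the slope that stands in for equal-loudness contours, and the choice of a single low-shelf gain as the filter parameter are not taken from the abstract.

```python
def playback_level_db(volume_setting, max_level_db=85.0, range_db=60.0):
    """Map a user volume in [0, 1] to an estimated playback level in dB SPL (assumed mapping)."""
    return max_level_db - (1.0 - volume_setting) * range_db

def bass_deficit_db(level_db, reference_db=80.0, slope=0.25):
    """Crude stand-in for equal-loudness contours: perceived bass loss below a reference level."""
    return slope * max(0.0, reference_db - level_db)

def shelf_gain_db(loudness_db, volume_setting, mixing_level_db, max_gain_db=12.0):
    """Low-shelf gain for one metadata frame.

    loudness_db is the instantaneous loudness from the metadata, in dB relative
    to full scale (0 or negative).
    """
    at_mix = mixing_level_db + loudness_db
    at_playback = playback_level_db(volume_setting) + loudness_db
    gain = bass_deficit_db(at_playback) - bass_deficit_db(at_mix)
    return max(-max_gain_db, min(max_gain_db, gain))
```

With these assumed constants, playing at the mixing level yields no correction, while playing 30 dB below it yields a bass boost that grows as the instantaneous loudness of the content drops.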
Abstract:
An audio encoder encodes a digital audio recording having a number of audio channels or audio objects. A Dynamic Range Control (DRC) processor produces a sequence of encoder DRC gain values, by applying a selected one of a number of DRC characteristics to a group of one or more of the audio channels or audio objects. The encoder DRC gain values are to be applied to adjust the group of audio channels or audio objects, upon decoding them from the encoded digital audio recording. A bitstream multiplexer combines a) the encoded digital audio recording with b) the sequence of encoder DRC gain values, an indication of the selected DRC characteristic, and an indication of an alternate DRC characteristic, the latter as metadata associated with the encoded digital audio recording. Other embodiments are also described including a system for decoding the encoded audio recording and performing DRC adjustment upon it.
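The multiplexing step can be pictured as packing the encoded audio together with a small metadata header. The byte layout below is purely hypothetical and is not the bitstream syntax of any real codec; it only shows the three metadata items named in the abstract (gain sequence, selected characteristic, alternate characteristic) traveling alongside the payload.

```python
import struct

# Hypothetical header: selected characteristic id, alternate characteristic id,
# number of DRC gains, payload length in bytes (little-endian, no padding).
HEADER = struct.Struct("<BBHI")

def mux_frame(payload, drc_gains_db, selected_id, alternate_id):
    """Combine encoded audio bytes with DRC metadata into one frame."""
    header = HEADER.pack(selected_id, alternate_id, len(drc_gains_db), len(payload))
    gains = struct.pack(f"<{len(drc_gains_db)}f", *drc_gains_db)
    return header + gains + payload

def demux_frame(frame):
    """Recover the payload and the DRC metadata from a frame built by mux_frame."""
    selected_id, alternate_id, count, size = HEADER.unpack_from(frame, 0)
    offset = HEADER.size
    gains = list(struct.unpack_from(f"<{count}f", frame, offset))
    offset += 4 * count  # each gain is stored as a 32-bit float
    payload = frame[offset:offset + size]
    return payload, gains, selected_id, alternate_id
```

A decoder built on this sketch would decode the payload, then use the gain sequence and the two characteristic identifiers to decide how to adjust the decoded channel group.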