Layered coding of audio with discrete objects

    Publication No.: US11430451B2

    Publication Date: 2022-08-30

    Application No.: US16584706

    Filing Date: 2019-09-26

    Applicant: Apple Inc.

    Abstract: A first layer of data having a first set of Ambisonic audio components can be decoded where the first set of Ambisonic audio components is generated based on ambience and one or more object-based audio signals. A second layer of data is decoded having at least one of the one or more object-based audio signals. One of the object-based audio signals is subtracted from the first set of Ambisonic audio components. The resulting Ambisonic audio components are rendered to generate a first set of audio channels. The one or more object-based audio signals are spatially rendered to generate a second set of audio channels. Other aspects are described and claimed.
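
The subtraction step in this abstract can be illustrated with a minimal sketch. It is not the patented implementation. It assumes first-order Ambisonics in ACN channel order with SN3D normalization, lossless layers, and an object direction known to the decoder; the function names `encode_foa`, `mix_base_layer`, and `remove_object` are hypothetical.

```python
import math

def encode_foa(sample, azimuth, elevation):
    """Encode one mono sample into first-order Ambisonics.

    Assumption: ACN channel order (W, Y, Z, X) with SN3D normalization,
    angles in radians.
    """
    ce = math.cos(elevation)
    return [
        sample,                           # W
        sample * math.sin(azimuth) * ce,  # Y
        sample * math.sin(elevation),     # Z
        sample * math.cos(azimuth) * ce,  # X
    ]

def mix_base_layer(ambience, obj, azimuth, elevation):
    """Encoder side: add the object's Ambisonic contribution to the ambience."""
    return [
        [a + o for a, o in zip(frame, encode_foa(s, azimuth, elevation))]
        for frame, s in zip(ambience, obj)
    ]

def remove_object(base_layer, obj, azimuth, elevation):
    """Decoder side: subtract the decoded object from the first-layer components."""
    return [
        [b - o for b, o in zip(frame, encode_foa(s, azimuth, elevation))]
        for frame, s in zip(base_layer, obj)
    ]

# Two time samples of four-channel ambience and one mono object.
ambience = [[0.1, 0.0, 0.0, 0.2], [0.0, 0.1, -0.1, 0.0]]
obj = [0.5, -0.25]
base = mix_base_layer(ambience, obj, math.pi / 4, 0.0)
residual = remove_object(base, obj, math.pi / 4, 0.0)
```

Under these assumptions `residual` matches `ambience` up to floating-point error, so the object can be spatially rendered on its own while the remaining Ambisonic components are rendered separately.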

    LAYERED CODING OF AUDIO WITH DISCRETE OBJECTS

    Publication No.: US20220262373A1

    Publication Date: 2022-08-18

    Application No.: US17739901

    Filing Date: 2022-05-09

    Applicant: Apple Inc.

    Abstract: A first layer of data having a first set of Ambisonic audio components can be decoded where the first set of Ambisonic audio components is generated based on ambience and one or more object-based audio signals. A second layer of data is decoded having at least one of the one or more object-based audio signals. One of the object-based audio signals is subtracted from the first set of Ambisonic audio components. The resulting Ambisonic audio components are rendered to generate a first set of audio channels. The one or more object-based audio signals are spatially rendered to generate a second set of audio channels. Other aspects are described and claimed.

    ENCODED AUDIO METADATA-BASED EQUALIZATION
    Invention application

    Publication No.: US20200342886A1

    Publication Date: 2020-10-29

    Application No.: US16893114

    Filing Date: 2020-06-04

    Applicant: Apple Inc.

    Inventor: Frank Baumgarte

    Abstract: A system for producing an encoded digital audio recording has an audio encoder that encodes a digital audio recording having a number of audio channels or audio objects. An equalization (EQ) value generator produces a sequence of EQ values which define EQ filtering that is to be applied when decoding the encoded digital audio recording, wherein the EQ filtering is to be applied to a group of one or more of the audio channels or audio objects of the recording independent of any downmix. A bitstream multiplexer combines the encoded digital audio recording with the sequence of EQ values, the latter as metadata associated with the encoded digital audio recording. Other embodiments are also described including a system for decoding the encoded audio recording.

    AUDIO BUFFERING FOR PROCESSING WITH VARIABLE LOOKAHEAD

    Publication No.: US20200090697A1

    Publication Date: 2020-03-19

    Application No.: US16133433

    Filing Date: 2018-09-17

    Applicant: Apple Inc.

    Inventor: Frank Baumgarte

    Abstract: An audio processing system has a buffer, a first digital signal processing module that uses a first lookahead, a second digital signal processing module that uses a second, greater lookahead, and a cross-fader. The cross-fader fades between the output of the first digital signal processing module and the output of the second digital signal processing module, based on lookahead depth of data of the audio signal in the buffer. Other aspects are also described and claimed.
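
A rough sketch of a cross-fade driven by buffered lookahead depth follows. The abstract does not specify how depth maps to a fade position, so the linear mapping here is an assumption, and `crossfade_weight` and `crossfade` are hypothetical names.

```python
def crossfade_weight(buffered, short_lookahead, long_lookahead):
    """Map buffered lookahead depth (in samples) to a weight in [0, 1].

    Assumption: a linear ramp from the first module's lookahead to the
    second module's greater lookahead.
    """
    if long_lookahead <= short_lookahead:
        raise ValueError("long_lookahead must exceed short_lookahead")
    w = (buffered - short_lookahead) / (long_lookahead - short_lookahead)
    return min(1.0, max(0.0, w))

def crossfade(out_short, out_long, weight):
    """Blend the two module outputs; weight 0 selects the short-lookahead module."""
    return [(1.0 - weight) * a + weight * b for a, b in zip(out_short, out_long)]

# Halfway between a 64-sample and a 256-sample lookahead.
w = crossfade_weight(160, 64, 256)
mixed = crossfade([1.0, 1.0], [0.0, 2.0], w)
```

With only 64 samples buffered the output comes entirely from the first module; once 256 or more are available it comes entirely from the second.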

    STEREO-BASED IMMERSIVE CODING
    Invention publication

    Publication No.: US20230274747A1

    Publication Date: 2023-08-31

    Application No.: US18019226

    Filing Date: 2021-08-20

    Applicant: Apple Inc.

    Inventor: Frank Baumgarte

    Abstract: Disclosed is an audio codec that represents an immersive signal by a two-channel stereo signal that is a stereo rendering of the immersive signal and directional parameters. The directional parameters may be based on a perceptual model describing the direction of virtual speaker pairs to recreate the perceived location of dominant sounds. Audio processing at the decoder may be performed on the stereo signal in the frequency domain for multiple channel pairs using time-frequency tiles. Spatial localization of the audio signals may use a panning approach by applying weightings to the time-frequency tiles of the stereo signal for each output channel pair. The weightings for the time-frequency tiles may be derived based on the directional parameters, an analysis of the stereo signal, and the output channel layout. The weightings may be used to adaptively process the time-frequency tiles using a decorrelator to reduce or minimize spectral distortions from spatial rendering.

    Encoded audio metadata-based loudness equalization and dynamic equalization during DRC

    Publication No.: US10341770B2

    Publication Date: 2019-07-02

    Application No.: US15275162

    Filing Date: 2016-09-23

    Applicant: Apple Inc.

    Inventor: Frank Baumgarte

    Abstract: Dynamic loudness equalization of received audio content in a playback system, using metadata that includes instantaneous loudness values for the audio content. A playback level is derived from a user volume setting of the playback system, and is compared with a mixing level that is assigned to the audio content. Parameters are computed, that define an equalization filter that is filtering the audio content before driving a speaker with the filtered audio content, based on the instantaneous loudness values and the comparing of the playback level with the assigned mixing level. Other embodiments are also described and claimed.

    Encoded audio extended metadata-based dynamic range control

    Publication No.: US10276173B2

    Publication Date: 2019-04-30

    Application No.: US15828087

    Filing Date: 2017-11-30

    Applicant: Apple Inc.

    Inventor: Frank Baumgarte

    Abstract: An audio encoder encodes a digital audio recording having a number of audio channels or audio objects. A Dynamic Range Control (DRC) processor produces a sequence of encoder DRC gain values, by applying a selected one of a number of DRC characteristics to a group of one or more of the audio channels or audio objects. The encoder DRC gain values are to be applied to adjust the group of audio channels or audio objects, upon decoding them from the encoded digital audio recording. A bitstream multiplexer combines a) the encoded digital audio recording with b) the sequence of encoder DRC gain values, an indication of the selected DRC characteristic, and an indication of an alternate DRC characteristic, the latter as metadata associated with the encoded digital audio recording. Other embodiments are also described including a system for decoding the encoded audio recording and performing DRC adjustment upon it.
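
The encoder-side gain sequence and decoder-side application described here can be sketched as below. This is an illustration only: the static compressor curve, the per-frame peak measurement, and the names `drc_gain_db`, `encoder_gains`, and `decoder_apply` are all assumptions, not details taken from the patent.

```python
import math

def drc_gain_db(level_db, threshold_db=-20.0, ratio=4.0):
    """Hypothetical DRC characteristic: a static compressor above a threshold."""
    if level_db <= threshold_db:
        return 0.0
    return (threshold_db - level_db) * (1.0 - 1.0 / ratio)

def encoder_gains(frames, threshold_db=-20.0, ratio=4.0):
    """Encoder side: one DRC gain (dB) per frame, to be carried as metadata."""
    gains = []
    for frame in frames:
        peak = max(abs(x) for x in frame)
        level_db = 20.0 * math.log10(peak) if peak > 0.0 else -math.inf
        gains.append(drc_gain_db(level_db, threshold_db, ratio))
    return gains

def decoder_apply(frames, gains_db):
    """Decoder side: scale each decoded frame by its transmitted gain."""
    return [
        [x * 10.0 ** (g / 20.0) for x in frame]
        for frame, g in zip(frames, gains_db)
    ]

frames = [[1.0, -0.5], [0.0, 0.0], [0.01, 0.0]]
gains = encoder_gains(frames)
out = decoder_apply(frames, gains)
```

Only the first frame exceeds the assumed -20 dB threshold, so it is attenuated by 15 dB while the other two pass through unchanged. Carrying the gains as metadata lets the decoder apply or skip the adjustment without re-analyzing the audio.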

    Encoded audio extended metadata-based dynamic range control

    Publication No.: US09837086B2

    Publication Date: 2017-12-05

    Application No.: US15217632

    Filing Date: 2016-07-22

    Applicant: Apple Inc.

    Inventor: Frank Baumgarte

    Abstract: An audio encoder encodes a digital audio recording having a number of audio channels or audio objects. A Dynamic Range Control (DRC) processor produces a sequence of encoder DRC gain values, by applying a selected one of a number of DRC characteristics to a group of one or more of the audio channels or audio objects. The encoder DRC gain values are to be applied to adjust the group of audio channels or audio objects, upon decoding them from the encoded digital audio recording. A bitstream multiplexer combines a) the encoded digital audio recording with b) the sequence of encoder DRC gain values, an indication of the selected DRC characteristic, and an indication of an alternate DRC characteristic, the latter as metadata associated with the encoded digital audio recording. Other embodiments are also described including a system for decoding the encoded audio recording and performing DRC adjustment upon it.
