Abstract:
Methods for generating or decoding an encoded audio bitstream that includes audio data and supplementary data (e.g., metadata and/or unrelated audio data), where at least some of the supplementary data is included as least significant bits (LSBs) of audio segments, and/or at least some of the supplementary data is included in guard bands. Typical embodiments provide a scalable, video-synchronous format compatible with real-time and file-based infrastructure components that support the SMPTE 337 format for carrying data in AES3 serial bitstreams, and/or provide a framework for extending distribution codecs beyond an 8-channel limit to support multiples of 8 channels synchronously across multiple AES3 interfaces. Another aspect is an audio processing unit that is configured to perform any embodiment of the method, or that includes a buffer memory storing at least one segment of an audio bitstream generated in accordance with any embodiment of the method.
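The idea of carrying supplementary data as LSBs of audio segments can be illustrated with a minimal sketch. It assumes one payload bit per sample, written into bit 0 of integer sample words; the abstract does not state how many LSBs are used or the sample width, and the function names `embed_lsb` and `extract_lsb` are hypothetical. Guard bands and SMPTE 337 framing are not modeled.

```python
def embed_lsb(samples, payload_bits):
    """Return a copy of `samples` with bit 0 of each leading sample
    replaced by one payload bit (assumption: one bit per sample)."""
    if len(payload_bits) > len(samples):
        raise ValueError("payload does not fit in this audio segment")
    out = list(samples)
    for i, bit in enumerate(payload_bits):
        # Clear the least significant bit, then set it to the payload bit.
        out[i] = (out[i] & ~1) | (bit & 1)
    return out


def extract_lsb(samples, n_bits):
    """Read back the first `n_bits` payload bits from the sample LSBs."""
    return [s & 1 for s in samples[:n_bits]]
```

Only the lowest bit of each sample changes in this sketch, so the upper bits of the audio words pass through untouched; a decoder that knows the payload length can recover the bits with `extract_lsb`.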
Abstract:
Audio stems are generated to contain audio content to be mixed by recipient devices. Multiple sets of mixing instructions, one for each of multiple audio channel configurations, are determined, for example, based on input from audio producers. Each set of mixing instructions is to be used for mixing the audio stems for rendering in its corresponding audio channel configuration. A bitstream is generated to carry both the audio stems and the sets of mixing instructions. A recipient device receives the bitstream as input and determines a specific audio channel configuration to be used for rendering the audio stems. Based on that determination, the specific set of mixing instructions for that configuration is retrieved from the bitstream and used to mix the audio stems.
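The recipient-side selection and mixing step can be sketched as follows. This is an illustration under stated assumptions, not the format described in the abstract: each set of mixing instructions is modeled as per-output-channel gains applied to named stems, the bitstream is assumed to be already parsed into plain Python dictionaries, and the name `mix_stems` is hypothetical.

```python
def mix_stems(stems, instruction_sets, channel_config):
    """Mix stems for one channel configuration.

    stems:            {stem_name: [sample, ...]}, all stems equal length
    instruction_sets: {config_name: {output_channel: {stem_name: gain}}}
                      (assumed representation of the mixing instructions)
    channel_config:   configuration chosen by the recipient device
    """
    # Retrieve the set of instructions that matches the chosen configuration.
    instructions = instruction_sets[channel_config]
    n_samples = len(next(iter(stems.values())))

    output = {}
    for channel, gains in instructions.items():
        mixed = [0.0] * n_samples
        # Each output channel is a gain-weighted sum of the listed stems.
        for stem_name, gain in gains.items():
            for i, sample in enumerate(stems[stem_name]):
                mixed[i] += gain * sample
        output[channel] = mixed
    return output
```

With instruction sets for, say, "stereo" and "mono", a device would call `mix_stems(stems, instruction_sets, "stereo")` or `mix_stems(stems, instruction_sets, "mono")` on the same stems; only the selected set of instructions differs.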