Abstract:
A computer-implemented method of encoding audio includes accessing a plurality of independent audio source streams, each of which includes a sequence of source frames. Respective source frames of each sequence include respective pluralities of pulse-code modulated audio samples. Each of the plurality of independent audio source streams is separately encoded to generate a plurality of independent encoded streams, each of which corresponds to a respective independent audio source stream. The encoding includes, for respective source frames, converting respective pluralities of pulse-code modulated audio samples to respective pluralities of floating-point frequency samples that are divided into a plurality of frequency bands. An instruction to mix the plurality of independent encoded streams is received; in response, respective floating-point frequency samples of the independent encoded streams are combined. An output bitstream is generated that includes the combined respective floating-point frequency samples.
Abstract:
A method and related system of encoding audio is disclosed. In the method, data representing a plurality of independent audio signals is accessed. The data representing each respective audio signal comprises a sequence of source frames. Each frame in the sequence of sources frames comprises a plurality of audio data copies. Each audio data copy has an associated quality level that is a member of a predefined range of quality levels, ranging from a highest quality level to a lowest quality level. The plurality of source frame sequences is merged into a sequence of target frames that comprise a plurality of target channels. Merging corresponding source frames into a respective target frame includes selecting a quality level and assigning the audio data copy at the selected quality level of each corresponding source frame to at least one respective target channel.