Abstract:
An encoder and an encoding method for a multi-channel signal, and a decoder and a decoding method for a multi-channel signal are disclosed. A multi-channel signal may be efficiently processed by consecutive downmixing or upmixing.
Abstract:
Disclosed is a unified speech and audio coding (USAC) audio signal encoding/decoding apparatus and method for digital radio services. An audio signal encoding method may include receiving an audio signal, determining a coding method for the received audio signal, encoding the audio signal based on the determined coding method, and configuring, as an audio superframe of a fixed size, an audio stream generated as a result of encoding the audio signal, wherein the coding method may include a first coding method associated with extended high-efficiency advanced audio coding (xHE-AAC) and a second coding method associated with existing advanced audio coding (AAC).
Abstract:
An encoding/decoding apparatus and method for controlling a channel signal is disclosed, wherein the encoding apparatus may include an encoder to encode an object signal, a channel signal, and rendering information for the channel signal, and a bit stream generator to generate, as a bit stream, the encoded object signal, the encoded channel signal, and the encoded rendering information for the channel signal.
Abstract:
An audio encoding apparatus and method that encodes hybrid contents including an object sound, a background sound, and metadata, and an audio decoding apparatus and method that decodes the encoded hybrid contents are provided. The audio encoding apparatus may include a mixing unit to generate an intermediate channel signal by mixing a background sound and an object sound, a matrix information encoding unit to encode matrix information used for the mixing, an audio encoding unit to encode the intermediate channel signal, and a metadata encoding unit to encode metadata including control information of the object sound.
Abstract:
A method of encoding/decoding a speech signal and a device for performing the same are provided. The method includes outputting, based on a first input speech signal of a previous timepoint and a second input speech signal of a current timepoint, a predicted signal that predicts the second input speech signal from the first input speech signal and obtaining, based on the second input speech signal and the predicted signal, a residual signal by removing a correlation between the first input speech signal and the second input speech signal from the second input speech signal.
Abstract:
Provided is an encoding apparatus including a memory configured to store instructions and a processor electrically connected to the memory and configured to execute the instructions, wherein the processor may be configured to perform a plurality of operations, when the instructions are executed by the processor, wherein the plurality of operations may include obtaining an input audio signal, generating an embedded audio signal by embedding signal components of a second frequency band of the input audio signal in a first frequency band of the input audio signal, generating additional information associated with the first frequency band and the second frequency band, generating an encoded audio signal by encoding the embedded audio signal, and formatting the encoded audio signal and the additional information into a bitstream.
Abstract:
An audio signal processing method, which is executed by a processor electronically communicating with a deep neural network within a computing system, may comprise: acquiring, by the processor, an input signal before encoding and an output signal after quantization and decoding; calculating, by the processor, a perceptual global loss for a frame corresponding to the input and the output signals; acquiring, by the processor, a plurality of subframes corresponding to the input and output signals by applying a windowing function to the frame of the input and output signals; calculating, by the processor, perceptual local losses for the plurality of subframes corresponding to the input and output signals; and acquiring, by the processor, multi-time scale perceptual loss based on the perceptual global and local losses.
Abstract:
Provided are an apparatus for encoding an audio signal and a method of an operation thereof. An audio signal encoding method includes obtaining quantized linear prediction (LP) coefficients by performing a linear predictive coding (LPC) analysis and quantization on an input audio signal, generating a reference signal by applying discrete Fourier transform (DFT) to the input audio signal, obtaining LP residual coefficients from the reference signal, scaling magnitudes of the LP residual coefficients using the quantized LP coefficients and the reference signal, and quantizing phases of the LP residual coefficients and the scaled magnitudes of the LP residual coefficients.
Abstract:
A method of encoding an audio signal and an encoder and a method of decoding an audio signal and a decoder are provided. The method of encoding an audio signal includes outputting a decoded signal by using a bitstream that encodes an audio signal, separating the decoded signal into a low-band signal and a high-band signal by using a sound source separator, upsampling the low-band signal, upsampling the high-band signal, and restoring the audio signal by synthesizing the upsampled low-band signal with the upsampled high-band signal, wherein the bitstream is generated by encoding a superimposed signal in which a signal in a high frequency band of the audio signal is superimposed on a low frequency band of the audio signal.
Abstract:
Disclosed is an apparatus and method for processing a multichannel audio signal. A multichannel audio signal processing method may include: generating an N-channel audio signal of N channels by down-mixing an M-channel audio signal of M channels; and generating a stereo audio signal by performing binaural rendering of the N-channel audio signal.