Abstract:
An encoder and an encoding method for a multi-channel signal, and a decoder and a decoding method for a multi-channel signal are disclosed. A multi-channel signal may be efficiently processed by consecutive downmixing or upmixing.
Abstract:
Disclosed is an apparatus and method for controlling a sound object based on an additional image object. A sound object controlling method includes displaying image objects synchronized with a plurality of sound objects, respectively, on a display; and controlling a sound object synchronized with an image object selected by a user from among the image objects displayed on the display. The sound object includes metadata that includes playback location information of the sound object on a specific space, sound level information of the sound object, and display location information of the image object synchronized with the sound object on the display.
Abstract:
Disclosed is a method and apparatus for controlling a sound to be provided to a user based on a multipole sound object, the method including setting a multipole sound object including at least one sound source, and controlling an attribute of a sound source included in the multipole sound object based on a direction and a relative distance between the multipole sound object and the user.
Abstract:
An encoding/decoding apparatus and method for controlling a channel signal is disclosed, wherein the encoding apparatus may include an encoder to encode an object signal, a channel signal, and rendering information for the channel signal, and a bit stream generator to generate, as a bit stream, the encoded object signal, the encoded channel signal, and the encoded rendering information for the channel signal.
Abstract:
An audio encoding apparatus and method that encodes hybrid contents including an object sound, a background sound, and metadata, and an audio decoding apparatus and method that decodes the encoded hybrid contents are provided. The audio encoding apparatus may include a mixing unit to generate an intermediate channel signal by mixing a background sound and an object sound, a matrix information encoding unit to encode matrix information used for the mixing, an audio encoding unit to encode the intermediate channel signal, and a metadata encoding unit to encode metadata including control information of the object sound.
Abstract:
Provided is an encoding apparatus for integrally encoding and decoding a speech signal and a audio signal, and may include: an input signal analyzer to analyze a characteristic of an input signal; a stereo encoder to down mix the input signal to a mono signal when the input signal is a stereo signal, and to extract stereo sound image information; a frequency band expander to expand a frequency band of the input signal; a sampling rate converter to convert a sampling rate; a speech signal encoder to encode the input signal using a speech encoding module when the input signal is a speech characteristics signal; a audio signal encoder to encode the input signal using a audio encoding module when the input signal is a audio characteristic signal; and a bitstream generator to generate a bitstream.
Abstract:
A method and apparatus for generating late reverberation are provided. The method includes generating a reverberation parameter required to generate late reverberation based on an early room impulse response, outputting late reverberation based on the reverberation parameter, and outputting a room impulse response based on the early room impulse response and the late reverberation.
Abstract:
Disclosed are a method for spatial audio reproduction based on D-RIRs includes: selecting measurement points around a listener based on the location of the listener; calculating a D-RIR for the location of the listener based on D-RIRs for the measurement points around the listener; and reproducing spatial audio at the location of the listener based on the D-RIR at the location of the listener.
Abstract:
An audio signal processing apparatus and an audio signal processing method are disclosed. The audio signal processing method performed by the audio signal processing apparatus includes determining whether a line of sight between a render item (RI) corresponding to an audio element and a listener is visible, based on a bitstream, in response to a case where the line of sight is invisible, generating an audio signal by rendering a diffraction-type RI corresponding to the RI, and outputting the audio signal.
Abstract:
Provided is a method of processing mesh data for processing an audio signal for audio rendering in a virtual reality (VR) space. The method includes receiving mesh data defining geometry of a certain space and the mesh data includes data regarding three-dimensional (3D) coordinates of points for configuring spatial information of the certain space and processing the mesh data by identifying outermost points among the points.