Abstract:
An encoding/decoding apparatus and method for controlling a channel signal is disclosed, wherein the encoding apparatus may include an encoder to encode an object signal, a channel signal, and rendering information for the channel signal, and a bit stream generator to generate, as a bit stream, the encoded object signal, the encoded channel signal, and the encoded rendering information for the channel signal.
Abstract:
Disclosed is an apparatus and method for extracting a sound source from a multi-channel audio signal. A sound source extracting method includes transforming a multi-channel audio signal into two-dimensional (2D) data, extracting a plurality of feature maps by inputting the 2D data into a convolutional neural network (CNN) including at least one layer, and extracting a sound source from the multi-channel audio signal using the feature maps.
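The first two steps above can be illustrated with a minimal sketch. It assumes the 2D data is simply the channels stacked as rows (channels x time) and that one convolutional layer is a valid-mode 2D cross-correlation followed by ReLU. The function names and the kernels are hypothetical, and the abstract does not specify the transform, the network architecture, or how the sound source is finally extracted from the feature maps.

```python
def to_2d(channels):
    """Stack equal-length channels as rows of a (channels x time) grid (an assumed layout)."""
    length = len(channels[0])
    if any(len(ch) != length for ch in channels):
        raise ValueError("all channels must have the same length")
    return [list(ch) for ch in channels]


def conv2d_relu(data, kernel):
    """Valid-mode 2D cross-correlation followed by ReLU, giving one feature map."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(data) - kh + 1
    out_w = len(data[0]) - kw + 1
    feature_map = []
    for r in range(out_h):
        row = []
        for c in range(out_w):
            acc = sum(
                kernel[i][j] * data[r + i][c + j]
                for i in range(kh)
                for j in range(kw)
            )
            row.append(max(0.0, acc))  # ReLU activation
        feature_map.append(row)
    return feature_map


def extract_feature_maps(channels, kernels):
    """Apply each (hypothetical) kernel to the 2D data, one feature map per kernel."""
    data = to_2d(channels)
    return [conv2d_relu(data, kernel) for kernel in kernels]
```

In a trained network the kernels would be learned weights; here they are placeholders supplied by the caller.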
Abstract:
Disclosed is a content processing method including receiving content including broadcast data and advertisement data into which additional information is inserted, extracting the additional information from the advertisement data, identifying the advertisement data from the content based on the extracted additional information, and extracting the broadcast data excluding the advertisement data identified from the content, wherein the additional information is inserted at one or more optimal intervals determined based on test additional information inserted at a plurality of analysis intervals of an audio signal associated with the advertisement data.
Abstract:
An audio signal identification method and apparatus are provided. The audio signal identification method includes generating an amplitude map from an input audio signal, determining whether a portion of the amplitude map is a target portion corresponding to a target signal, using a pre-trained model, extracting feature data from the target portion, and identifying the audio signal based on the feature data.
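As an illustration of the first step only, the sketch below builds an amplitude map as a frame-by-frequency grid of DFT magnitudes. This is an assumption: the abstract does not define the amplitude map, and `frame_size` and `hop` are hypothetical parameters. The pre-trained model and the feature extraction are not shown.

```python
import cmath


def amplitude_map(samples, frame_size, hop):
    """Return one row of DFT magnitudes (bins 0..frame_size/2) per frame."""
    rows = []
    for start in range(0, len(samples) - frame_size + 1, hop):
        frame = samples[start:start + frame_size]
        row = []
        for k in range(frame_size // 2 + 1):
            # Naive DFT for bin k; a real system would use an FFT.
            acc = sum(
                x * cmath.exp(-2j * cmath.pi * k * n / frame_size)
                for n, x in enumerate(frame)
            )
            row.append(abs(acc))
        rows.append(row)
    return rows
```

Each row could then be passed to a classifier that decides whether that portion of the map belongs to the target signal.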
Abstract:
Disclosed is a method and an apparatus for embedding data in an audio signal based on a time domain, and a method and an apparatus for extracting data from an audio signal based on a time domain. The method for embedding data in an audio signal based on a time domain may include generating a time-domain insertion sequence from original data based on a weighting element, embedding the insertion sequence in a host audio signal, and transmitting the host audio signal in which the insertion sequence is embedded. The method for extracting data from an audio signal based on a time domain may include receiving a time-domain audio signal in which data is embedded, extracting a codeword from the audio signal, and synchronizing the audio signal based on the codeword.
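A minimal sketch of time-domain embedding follows. It swaps in a simple spread-spectrum scheme that the abstract does not name: each data bit is mapped to a signed copy of a chip pattern, scaled by a weight (standing in for the weighting element), and added to the host samples. The chip pattern, the function names, and the correlation detector are all assumptions, and the codeword-based synchronization step is not shown.

```python
def make_insertion_sequence(bits, chips, weight):
    """Map each bit to +chips or -chips, scaled by the (assumed) weight."""
    sequence = []
    for bit in bits:
        sign = 1.0 if bit else -1.0
        sequence.extend(weight * sign * c for c in chips)
    return sequence


def embed(host, insertion):
    """Add the insertion sequence to the start of the host signal."""
    if len(insertion) > len(host):
        raise ValueError("host signal is shorter than the insertion sequence")
    marked = list(host)
    for i, value in enumerate(insertion):
        marked[i] += value
    return marked


def extract_bits(signal, chips, n_bits):
    """Recover bits by correlating each segment with the chip pattern."""
    bits = []
    for k in range(n_bits):
        segment = signal[k * len(chips):(k + 1) * len(chips)]
        correlation = sum(s * c for s, c in zip(segment, chips))
        bits.append(1 if correlation > 0 else 0)
    return bits
```

A smaller weight makes the embedded data less audible but also easier for the host signal to mask, so a practical system would tune it against the host content.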
Abstract:
A system and method for synchronizing an audio signal and a video signal are provided. A decoding method in the system may include decoding an audio signal and a video signal received from an encoding apparatus, extracting first unique information of the audio signal from the decoded video signal, generating second unique information of the audio signal based on the decoded audio signal, determining a delay between the audio signal and the video signal by comparing the first unique information to the second unique information, and synchronizing the audio signal and the video signal based on the delay. The first unique information may be generated based on an audio signal that is not encoded by the encoding apparatus, and may be inserted into the video signal.
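The delay determination can be sketched as follows, assuming the unique information is a sequence of per-frame energies and that the delay is the lag minimizing the mean squared difference between the two sequences. Both choices are illustrative assumptions. The abstract does not say what the unique information is or how the comparison is performed.

```python
def frame_energies(samples, frame_size):
    """Hypothetical 'unique information': energy of each non-overlapping frame."""
    return [
        sum(x * x for x in samples[i:i + frame_size])
        for i in range(0, len(samples) - frame_size + 1, frame_size)
    ]


def estimate_delay(first_info, second_info, max_lag):
    """Return the lag (in frames) at which second_info best matches first_info.

    A positive lag means second_info[i + lag] lines up with first_info[i].
    """
    best_lag, best_cost = 0, float("inf")
    for lag in range(-max_lag, max_lag + 1):
        pairs = [
            (first_info[i], second_info[i + lag])
            for i in range(len(first_info))
            if 0 <= i + lag < len(second_info)
        ]
        if not pairs:
            continue
        cost = sum((a - b) ** 2 for a, b in pairs) / len(pairs)
        if cost < best_cost:
            best_lag, best_cost = lag, cost
    return best_lag
```

The returned lag, multiplied by the frame duration, gives the offset by which one stream would be shifted to restore synchronization.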
Abstract:
A method and an apparatus for transmitting a watermark robust to an acoustic channel distortion are disclosed. The method of transmitting the watermark may include extracting a watermark from a first audio signal including the watermark; modifying the extracted watermark based on a state of an acoustic channel; and embedding the modified watermark into the first audio signal to output a second audio signal.
Abstract:
Disclosed is an apparatus and method for processing a multichannel audio signal. A multichannel audio signal processing method may include: generating an N-channel audio signal by down-mixing an M-channel audio signal; and generating a stereo audio signal by performing binaural rendering of the N-channel audio signal.
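The two steps can be sketched as below, assuming the down-mix is a fixed gain matrix and binaural rendering is convolution of each channel with a left/right impulse-response pair followed by summation. The matrix values, the filters, and the function names are hypothetical. The abstract does not specify them.

```python
def downmix(channels, matrix):
    """Mix M input channels into N outputs; matrix[n][m] is the gain from input m to output n."""
    length = len(channels[0])
    return [
        [sum(row[m] * channels[m][t] for m in range(len(channels))) for t in range(length)]
        for row in matrix
    ]


def convolve(signal, impulse):
    """Direct-form linear convolution."""
    out = [0.0] * (len(signal) + len(impulse) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse):
            out[i + j] += s * h
    return out


def binaural_render(channels, filters):
    """Render to stereo; filters[n] is a (left_ir, right_ir) pair, all of equal length."""
    out_len = len(channels[0]) + len(filters[0][0]) - 1
    left = [0.0] * out_len
    right = [0.0] * out_len
    for channel, (left_ir, right_ir) in zip(channels, filters):
        for t, value in enumerate(convolve(channel, left_ir)):
            left[t] += value
        for t, value in enumerate(convolve(channel, right_ir)):
            right[t] += value
    return left, right
```

Down-mixing first means only N filter pairs are convolved instead of M, which is the usual motivation for this ordering.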
Abstract:
Disclosed is a binaural rendering method and apparatus for decoding a multichannel audio signal. The binaural rendering method may include: extracting an early reflection component and a late reverberation component from a binaural filter; generating a stereo audio signal by performing binaural rendering of a multichannel audio signal based on the early reflection component; and applying the late reverberation component to the generated stereo audio signal.
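The extraction step can be illustrated with a minimal sketch that splits a binaural impulse response at a fixed sample index. The split index and the function name are assumptions, and the abstract does not say how the boundary between the two components is chosen or how the late part is applied.

```python
def split_binaural_filter(impulse_response, split_index):
    """Split an impulse response into early and late parts at split_index.

    The late part is zero-padded at the front so it keeps its original delay.
    """
    if not 0 <= split_index <= len(impulse_response):
        raise ValueError("split_index is outside the impulse response")
    early = list(impulse_response[:split_index])
    late = [0.0] * split_index + list(impulse_response[split_index:])
    return early, late
```

Because the late part keeps its delay, filtering with the early and late parts separately and summing the results is equivalent to filtering with the full response.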
Abstract:
An apparatus and method for generating and consuming a three-dimensional (3D) data format to generate a realistic panoramic image are provided. The apparatus may include an image preprocessing unit to search for a matching point between images captured by a plurality of cameras, and to extract, as image information, at least one of a depth value, a texture value and object division information from each of the captured images, an image information structuring unit to structure 3D data to use the extracted image information to generate a realistic image, a 3D data format storage unit to store format information of the structured 3D data in a database (DB), a realistic image generating unit to generate a realistic panoramic image using the stored format information of the 3D data, and a realistic image rendering unit to perform rendering on the generated realistic panoramic image.