摘要:
Disclosed is a content processing method including receiving content including broadcast data and advertisement data into which additional information is inserted, extracting the additional information from the advertisement data, identifying the advertisement data from the content based on the extracted additional information, and extracting the broadcast data excluding the advertisement data identified from the content, wherein the additional information is inserted at at least one of optimal intervals determined based on test additional information inserted at a plurality of analysis intervals of an audio signal associated with the advertisement data.
摘要:
An audio signal identification method and apparatus are provided. The audio signal identification method includes generating an amplitude map from an input audio signal, determining whether a portion of the amplitude map is a target portion corresponding to a target signal, using a pre-trained model, extracting feature data from the target portion, and identifying the audio signal based on the feature data.
摘要:
Disclosed is a method and an apparatus for embedding data in an audio signal based on a time domain, and a method and an apparatus for extracting data from an audio signal based on a time domain. The method for embedding data in an audio signal based on a time domain may include generating a time-domain insertion sequence from original data based on a weighting element, embedding the insertion sequence in a host audio signal, and transmitting the host audio signal in which the insertion sequence is embedded. The method for extracting data from an audio signal based on a time domain may include receiving a time-domain audio signal in which data is embedded, extracting a codeword from the audio signal, and synchronizing the audio signal based on the codeword.
摘要:
A system and method for synchronizing an audio signal and a video signal are provided. A decoding method in the system may include decoding an audio signal and a video signal received from an encoding apparatus, extracting first unique information of the audio signal from the decoded video signal, generating second unique information of the audio signal based on the decoded audio signal, determining a delay between the audio signal and the video signal by comparing the first unique information to the second unique information, and synchronizing the audio signal and the video signal based on the delay. The first unique information may be generated based on an audio signal that is not encoded by the encoding apparatus, and may be inserted into the video signal.
摘要:
A method and an apparatus for transmitting a watermark robust to an acoustic channel distortion are disclosed. The method of transmitting the watermark may include extracting a watermark from a first audio signal including the watermark; modifying the extracted watermark based on a state of an acoustic channel; and embedding the modified watermark into the first audio signal to output a second audio signal.
摘要:
Disclosed is an apparatus and method for processing a multichannel audio signal. A multichannel audio signal processing method may include: generating an N-channel audio signal of N channels by down-mixing an M-channel audio signal of M channels; and generating a stereo audio signal by performing binaural rendering of the N-channel audio signal.
摘要:
Disclosed is a binaural rendering method and apparatus for decoding a multichannel audio signal. The binaural rendering method may include: extracting an early reflection component and a late reverberation component from a binaural filter; generating a stereo audio signal by performing binaural rendering of a multichannel audio signal base on the early reflection component; and applying the late reverberation component to the generated stereo audio signal.
摘要:
An apparatus and method for generating and consuming a three-dimensional (3D) data format to generate a realistic panoramic image are provided. The apparatus may include an image preprocessing unit to search for a matching point between images captured by a plurality of cameras, and to extract, as image information, at least one of a depth value, a texture value and object division information from each of the captured images, an image information structuring unit to structure 3D data to use the extracted image information to generate a realistic image, a 3D data format storage unit to store format information of the to structured 3D data in a database (DB), realistic image generating unit to generate a realistic panoramic image using the stored format information of the 3D data, and a realistic image rendering unit to perform rendering on the generated realistic panoramic image.
摘要:
An audio signal encoding/decoding method and an apparatus for performing the same are disclosed. The audio signal encoding method includes obtaining a full-band input signal, extracting a first feature vector corresponding to a first sub-band signal and a second feature vector corresponding to a second sub-band signal using an encoder neural network including a plurality of encoding layers, generating a first code vector corresponding to the first feature vector and a second code vector corresponding to the second feature vector by compressing the first feature vector and the second feature vector, and generating a bitstream by quantizing the first code vector and the second code vector.
摘要:
Disclosed is an apparatus and method for audio encoding/decoding that is robust against coding distortion in a transition section. An audio encoding method includes outputting a frequency domain signal by time-to-frequency (T/F) transform of an input signal, outputting a frequency domain residual signal in which a frequency axis envelope is removed from the frequency domain signal by applying frequency domain noise shaping (FDNS) encoding to the frequency domain signal, outputting a time domain residual signal in which a time axis envelope is removed by performing linear prediction coefficient (LPC) analysis based on the frequency domain residual signal, and quantizing and transmitting the time domain residual signal.