Abstract:
Disclosed are an apparatus and a method for separating sound sources capable of learning distributions of corresponding sound sources based on the assumption that specific sound sources have specific distributions based on interchannel correlation parameter in audio signals providing space perception through a plurality of channels to separate an amount corresponding to energy contribution of the corresponding sound sources from mixture signals. Exemplary embodiments of the present invention can more precisely predict the channel distributions of the specific sound sources included in the input mixture signals and more accurately separate sound sources than a method for separating a sound source based on the channel according to the related art, under conditions that general channel distribution information of the specific sound sources are approximately modeled.
Abstract:
Disclosed are an apparatus and a method for separating sound sources capable of learning distributions of corresponding sound sources based on the assumption that specific sound sources have specific distributions based on interchannel correlation parameter in audio signals providing space perception through a plurality of channels to separate an amount corresponding to energy contribution of the corresponding sound sources from mixture signals. Exemplary embodiments of the present invention can more precisely predict the channel distributions of the specific sound sources included in the input mixture signals and more accurately separate sound sources than a method for separating a sound source based on the channel according to the related art, under conditions that general channel distribution information of the specific sound sources are approximately modeled.
Abstract:
The present invention discloses an encoding apparatus using a Discrete Cosine Transform (DCT) scanning, which includes: a mode selection means for selecting an optimal mode for intra prediction; an intra prediction means for performing intra prediction onto video inputted based on the mode selected in the mode selection means; a DCT and quantization means for performing DCT and quantization onto residual coefficients of a block outputted from the intra prediction means; and an entropy encoding means for performing entropy encoding onto DCT coefficients acquired from the DCT and quantization by using a scanning mode decided based on pixel similarity of the residual coefficients.
Abstract:
Disclosed are an apparatus and a method of providing contents. The apparatus of providing the contents may include a receiving unit to receive, from a contents provider, contents and information about a contents providing location, a local group setting unit to search for at least one cell based on the information about the contents providing location and to set the retrieved cell as a content providing location group of the contents, and a transmitting unit to transmit the contents to the set content providing location group. The object based audio contents may be consecutively replayed based on an identical audio preset.
Abstract:
A method and apparatus for separating a multi-channel mixed signal are provided. The method includes the steps of: a) transforming a temporal domain to a frequency domain by performing a discrete Fourier transform onto at least one of mixed signals inputted from an external device through multi-channel; b) estimating multi-decorrelation by calculating a plurality of cross power spectra for the mixed signal in the transformed frequency domain; c) estimating a separation coefficient of the mixed signal based on relative optimization in order to decorrelate the calculated cross power spectra, where the separation coefficient is serially updated; d) transforming the frequency domain to the temporal domain by performing an inverse discrete Fourier transform on the estimated separation coefficient in the temporal domain; and e) separating an original signal from the mixed signal by filtering the mixed signal using the separation coefficient of the transformed temporal domain.
Abstract:
Provided are a method for creating, editing and reproducing a multi-object audio content file for an object-based audio service and a method for creating audio presets. The multi-object audio content file creating method includes creating a plurality of frames for each audio object forming an audio content; and creating a multi-object audio content file by grouping and storing the frames according to each reproduction time. This invention can enhance functions of the object-based audio service and make it easy to access to each audio object of an audio content file.
Abstract:
Provided are an object-based three dimensional (3-D) audio service system using preset audio scenes and a method thereof. The system and the method are suggested for enabling a user to easily and conveniently watch and listen an object based 3-D audio service by eliminating inconvenience that requires a user to control each of object audio signals of sound sources. The system includes: audio input means for inputting an audio signal; preset audio scene generating means for extracting object audio signals from the audio signal inputted through the audio input means and generating more than one of 3-D audio scene information by arranging the extracted object audio signals in a 3-D space and editing features of each object; and encoding means for encoding and multiplexing the audio signal and the 3-D audio scene information for each object audio signal.
Abstract:
A method for compressing and decompressing a multi-channel signal using virtual source location information (VSLI) on a semicircular plane is provided. VSLI, rather than inter channel level difference (ICLD), is used as spatial cue information, thereby minimizing loss caused by quantization of spatial cue information, improving sound quality of a decompressed audio signal, and reproducing an excellent audio signal by reducing distortion upon decompression of an original signal at a decoder spectrum.
Abstract:
Provided is a method and apparatus for encoding/decoding a multi-channel audio signal. The apparatus for encoding a multi-channel audio signal includes a frame converter for converting the multi-channel audio signal into a framed audio signal; means for downmixing the framed audio signal; means for encoding the downmixed audio signal; a source location information estimator for estimating source location information from the framed multi-channel audio signal; means for quantizing the estimated source location information; and means for multiplexing the encoded audio signal and the quantized source location information, to generate an encoded multi-channel audio signal.
Abstract:
A method and apparatus for separating a multi-channel mixed signal are provided. The method includes the steps of: a) transforming a temporal domain to a frequency domain by performing a discrete Fourier transform onto at least one of mixed signals inputted from an external device through multi-channel; b) estimating multi-decorrelation by calculating a plurality of cross power spectra for the mixed signal in the transformed frequency domain; c) estimating a separation coefficient of the mixed signal based on relative optimization in order to decorrelate the calculated cross power spectra, where the separation coefficient is serially updated; d) transforming the frequency domain to the temporal domain by performing an inverse discrete Fourier transform on the estimated separation coefficient in the temporal domain; and e) separating an original signal from the mixed signal by filtering the mixed signal using the separation coefficient of the transformed temporal domain.