摘要:
A method and apparatus for separating and extracting main sound sources from a mixed musical sound signal are provided. A musical sound source separation apparatus may include an prior information signal compressor to compress an prior information signal including a characteristic of a predetermined sound source, a mixed signal divider to divide a mixed signal including a plurality of sound sources into a plurality of segments, a Nonnegative Matrix Partial Co-Factorization (NMPCF) analyzer to acquire common information shared by the plurality of segments, by applying an NMPCF algorithm to the prior information signal, and a target musical instrument signal separator to separate a target musical instrument signal corresponding to the predetermined sound source from the mixed signal, based on the common information.
摘要:
Disclosed are an apparatus and a method of providing contents. The apparatus of providing the contents may include a receiving unit to receive, from a contents provider, contents and information about a contents providing location, a local group setting unit to search for at least one cell based on the information about the contents providing location and to set the retrieved cell as a content providing location group of the contents, and a transmitting unit to transmit the contents to the set content providing location group. The object based audio contents may be consecutively replayed based on an identical audio preset.
摘要:
Disclosed are an apparatus and a method of providing contents. The apparatus of providing the contents may include a receiving unit to receive, from a contents provider, contents and information about a contents providing location, a local group setting unit to search for at least one cell based on the information about the contents providing location and to set the retrieved cell as a content providing location group of the contents, and a transmitting unit to transmit the contents to the set content providing location group. The object based audio contents may be consecutively replayed based on an identical audio preset.
摘要:
Provided are an apparatus and method for providing an object based audio file, and an apparatus and method for playing back an object based audio file. The object based audio file producing apparatus may include a bitstream generator to generate a bitstream about an object based audio file including a plurality of audio object frames and a file header for an object based audio service; and a bitstream transmitter to transmit the bitstream to the object based audio file playback apparatus. The plurality of audio object frames may include a frame storing a audio source in which all of a plurality of audio frames is mixed and a frame storing each of the audio objects.
摘要:
Disclosed are an apparatus and a method for separating sound sources capable of learning distributions of corresponding sound sources based on the assumption that specific sound sources have specific distributions based on interchannel correlation parameter in audio signals providing space perception through a plurality of channels to separate an amount corresponding to energy contribution of the corresponding sound sources from mixture signals. Exemplary embodiments of the present invention can more precisely predict the channel distributions of the specific sound sources included in the input mixture signals and more accurately separate sound sources than a method for separating a sound source based on the channel according to the related art, under conditions that general channel distribution information of the specific sound sources are approximately modeled.
摘要:
A Unified Speech and Audio Codec (USAC) for adjusting an overlap area of a window based on a transition is provided. To increase an encoding efficiency, encoding may be performed by overlapping relatively long windows. Additionally, when a transition is generated between frames, an overlap area of a window may be reduced based on the transition, thereby preventing a noise from occurring due to the transition.
摘要:
Provided are an apparatus and a method for integrally encoding and decoding a speech signal and a audio signal. The encoding apparatus may include: an input signal analyzer to analyze a characteristic of an input signal; a first conversion encoder to convert the input signal to a frequency domain signal, and to encode the input signal when the input signal is a audio characteristic signal; a Linear Predictive Coding (LPC) encoder to perform LPC encoding of the input signal when the input signal is a speech characteristic signal; a frequency band expander for expanding a frequency band of the input signal whose output is transmitted to either the first conversion encoder or the LPC encoder based on the input characteristic; and a bitstream generator to generate a bitstream using an output signal of the first conversion encoder and an output signal of the LPC encoder.
摘要:
A Unified Speech and Audio Codec (USAC) that may process a window sequence based on mode switching is provided. The USAC may perform encoding or decoding by overlapping between frames based on a folding point when mode switching occurs. The USAC may process different window sequences for each situation to perform encoding or decoding, and thereby may improve a coding efficiency.
摘要:
Provided are an apparatus and method of separating, from a mixed signal, a sound source generated using a rhythm musical instrument based on characteristics of the rhythm musical instrument repeated in an aspect of time. The apparatus may include a separation unit to separate a plurality of mixed signals into a plurality of segments, a Nonnegative Matrix Partial Co-Factorization (NMPCF) analysis unit to perform an NMPCF analysis on the plurality of segments, and to obtain a plurality of entity matrices based on the analysis result, a target instrument signal separating unit to separate, from the mixed signals, a target instrument signal, by calculating an inner product between the plurality of entity matrices, and a signal association unit to associate the target instrument signals separated from each of the plurality of segments.
摘要:
Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encode the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).