摘要:
Provided are an apparatus and method of separating, from a mixed signal, a sound source generated using a rhythm musical instrument based on characteristics of the rhythm musical instrument repeated in an aspect of time. The apparatus may include a separation unit to separate a plurality of mixed signals into a plurality of segments, a Nonnegative Matrix Partial Co-Factorization (NMPCF) analysis unit to perform an NMPCF analysis on the plurality of segments, and to obtain a plurality of entity matrices based on the analysis result, a target instrument signal separating unit to separate, from the mixed signals, a target instrument signal, by calculating an inner product between the plurality of entity matrices, and a signal association unit to associate the target instrument signals separated from each of the plurality of segments.
摘要:
Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encode the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).
摘要:
An encoding apparatus and a decoding apparatus in a transform between a Modified Discrete Cosine Transform (MDCT)-based coder and a hetero coder are provided. The encoding apparatus may encode additional information to restore an input signal encoded according to the MDCT-based coding scheme, when switching occurs between the MDCT-based coder and the hetero coder. Accordingly, an unnecessary bitstream may be prevented from being generated, and minimum additional information may be encoded.
摘要:
An encoding apparatus and a decoding apparatus in a transform between a Modified Discrete Cosine Transform (MDCT)-based coder and a different coder are provided. The encoding apparatus may encode additional information to restore an input signal encoded according to the MDCT-based coding scheme, when switching occurs between the MDCT-based coder and the different coder. Accordingly, an unnecessary bitstream may be prevented from being generated, and minimum additional information may be encoded.
摘要:
Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encode the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).
摘要:
Provided is a method and apparatus for generating a side information bitstream of a multi-object audio signal. The apparatus for generating a side information bitstream of a multi-object audio signal includes a spatial cue information input unit configured to receive spatial cue information generated in an encoder of the multi-object audio signal, a preset information input unit configured to receive preset information for the multi-object audio signal, and a side information bitstream generator configured to generate the side information bitstream based on the spatial cue information and the preset information. The side information bitstream includes a header region and a frame region, and the preset information is included in the frame region.
摘要:
Provided is an apparatus for integrally encoding and decoding a speech signal and an audio signal. An encoding apparatus for integrally encoding a speech signal and an audio signal, may include: a module selection unit to analyze a characteristic of an input signal and to select a first encoding module for encoding a first frame of the input signal; a speech encoding unit to encode the input signal according to a selection of the module selection unit and to generate a speech bitstream; an audio encoding unit to encode the input signal according to the selection of the module selection unit and to generate an audio bitstream; and a bitstream generation unit to generate an output bitstream from the speech encoding unit or the audio encoding unit according to the selection of the module selection unit.
摘要:
Provided is an apparatus for integrally encoding and decoding a speech signal and an audio signal. An encoding apparatus for integrally encoding a speech signal and an audio signal, may include: a module selection unit to analyze a characteristic of an input signal and to select a first encoding module for encoding a first frame of the input signal; a speech encoding unit to encode the input signal according to a selection of the module selection unit and to generate a speech bitstream; an audio encoding unit to encode the input signal according to the selection of the module selection unit and to generate an audio bitstream; and a bitstream generation unit to generate an output bitstream from the speech encoding unit or the audio encoding unit according to the selection of the module selection unit.
摘要:
A module capable of appropriately selecting a linear predictive coding (LPC)-based or a code excitation linear prediction (CELP)-based speech or audio encoder and a transform-based audio encoder according to a feature of an input signal is a module that performs as a bridge for overcoming a performance barrier between a conventional LPC-based encoder and an audio encoder. Also, an integral audio encoder that provides consistent audio quality regardless of a type of the input audio signal can be designed based on the module.
摘要:
Provided is a method and apparatus for generating a side information bitstream of a multi-object audio signal. The apparatus for generating a side information bitstream of a multi-object audio signal includes a spatial cue information input unit configured to receive spatial cue information generated in an encoder of the multi-object audio signal, a preset information input unit configured to receive preset information for the multi-object audio signal, and a side information bitstream generator configured to generate the side information bitstream based on the spatial cue information and the preset information. The side information bitstream includes a header region and a frame region, and the preset information is included in the frame region.