摘要:
A method and apparatus for eliminating cross-channel interference, and a multi-channel source separation method and a multi-channel source separation apparatus using the same. The multi-channel source signal separation apparatus includes: a source separation unit separating multi-channel source signals from a mixture including the multi-channel source signals; and a post-processing unit eliminating cross-channel interference from an arbitrary channel output of the separated multi-channel source signals by using an interference elimination coefficient determined based on a degree of interference between the arbitrary channel output and a different channel output of the separated multi-channel source signals.
摘要:
Cross-channel interference is eliminated and multi-channel sources are separated by estimating a source absence probability for a current frame of a first channel output, and determining an interference elimination coefficient for matching a secondary signal of the first channel output with a primary signal of a second channel output by using the source absence probability, generating an interference signal by multiplying the second channel output by an over-subtraction factor and the interference elimination coefficient, wherein a partial differentiation is performed for a v-norm value of a spectral amplitude difference, between the first channel output and the second channel output multiplied by the interference elimination coefficient and a result of multiplication of the source absence probability, by using the interference elimination coefficient to determine an update amount of the interference elimination coefficient for a next frame.
摘要:
A speech enhancement apparatus and method and a computer-readable recording medium having a program recorded thereon execute a speech enhancement method. The speech enhancement apparatus includes a spectrum subtraction unit generating a subtracted spectrum by subtracting an estimated noise spectrum from a received speech spectrum, a correction function modeling unit generating a correction function to minimize a noise spectrum using variation of a noise spectrum included in training data, and a spectrum correction unit generating a corrected spectrum by correcting the subtracted spectrum using the correction function.
摘要:
A speech recognition method, medium, and system. The method includes detecting an energy change of each frame making up signals including speech and non-speech signals, and identifying a speech segment corresponding to frames that include only speech signals from among the frames based on the detected energy change.
摘要:
A speech enhancement apparatus and method and a computer-readable recording medium having a program recorded thereon execute a speech enhancement method. The speech enhancement apparatus includes a spectrum subtraction unit generating a subtracted spectrum by subtracting an estimated noise spectrum from a received speech spectrum, a correction function modeling unit generating a correction function to minimize a noise spectrum using variation of a noise spectrum included in training data, and a spectrum correction unit generating a corrected spectrum by correcting the subtracted spectrum using the correction function.
摘要:
A speech recognition method, medium, and system. The method includes detecting an energy change of each frame making up signals including speech and non-speech signals, and identifying a speech segment corresponding to frames that include only speech signals from among the frames based inclusive of the detected energy change.