Abstract:
An audio processing apparatus generates a suppression coefficient sequence that is composed of coefficient values corresponding to frequency components of an audio signal, the frequency components being multiplied by the corresponding coefficient values to suppress noise components of the audio signal. In the audio processing apparatus, a characteristic value calculation unit calculates a noise characteristic value depending on a shape of a magnitude distribution of the audio signal. An intensity setting unit variably sets a suppression intensity of the noise components based on the noise characteristic value. A coefficient sequence generation unit generates the suppression coefficient sequence based on the audio signal and the suppression intensity.
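The way a suppression coefficient sequence acts on the frequency components can be illustrated with a minimal Python sketch. The spectral-subtraction-style gain rule, the `floor` parameter, the noise estimate, and all function names are assumptions made for illustration. The abstract does not specify how the coefficients are derived from the suppression intensity, nor how the intensity is derived from the noise characteristic value, so the intensity is simply passed in.

```python
def suppression_coefficients(magnitudes, noise_magnitudes, intensity, floor=0.05):
    """Return one gain per frequency component (hypothetical gain rule).

    A larger intensity removes more of the estimated noise; gains are
    clamped to the range [floor, 1.0].
    """
    gains = []
    for x, n in zip(magnitudes, noise_magnitudes):
        if x <= 0.0:
            gains.append(floor)
            continue
        gain = 1.0 - intensity * (n / x)
        gains.append(max(floor, min(1.0, gain)))
    return gains


def apply_coefficients(magnitudes, gains):
    """Multiply each frequency component by its coefficient value."""
    return [x * g for x, g in zip(magnitudes, gains)]


spectrum = [1.0, 2.0, 0.5]   # magnitudes of three frequency components
noise = [0.5, 0.5, 0.5]      # assumed noise estimate per component
mild = suppression_coefficients(spectrum, noise, intensity=1.0)
strong = suppression_coefficients(spectrum, noise, intensity=2.0)
```

Under these assumptions, raising the intensity lowers the gain of every component whose noise estimate is nonzero, until the floor is reached.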
Abstract:
A noise suppression estimation device calculates a noise index value which varies according to the kurtosis of a frequency distribution of magnitude of a sound signal before or after suppression of the noise component, the noise index value indicating a degree of occurrence of musical noise after suppression of the noise component in a frequency domain. For example, the noise suppression estimation device calculates a first kurtosis of a frequency distribution of magnitude of the sound signal before suppression of the noise component, calculates a second kurtosis of a frequency distribution of magnitude of the sound signal after suppression of the noise component, and calculates the noise index value from the first kurtosis and the second kurtosis.
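A rough Python sketch of the two-kurtosis calculation follows. The abstract states only that the index is calculated from the first and second kurtosis, so the ratio used here is an assumption, as are the function names and the choice of the population (non-excess) kurtosis.

```python
def kurtosis(values):
    """Population kurtosis: fourth central moment over squared variance."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    if m2 == 0.0:
        raise ValueError("kurtosis is undefined for constant data")
    return m4 / (m2 * m2)


def noise_index(magnitudes_before, magnitudes_after):
    """Hypothetical index: second kurtosis divided by first kurtosis."""
    return kurtosis(magnitudes_after) / kurtosis(magnitudes_before)


before = [1.0, 2.0, 3.0, 4.0, 5.0]   # magnitudes before suppression
after = [0.0, 0.0, 0.0, 0.0, 5.0]    # isolated peak left after suppression
index = noise_index(before, after)
```

In this toy data the suppressed signal keeps a single isolated peak, which makes its magnitude distribution more heavy-tailed and pushes the ratio above 1.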
Abstract:
In a sound processing device, an index value calculation unit calculates a first index value that follows changes in a sound signal at a first following degree and a second index value that follows the changes in the sound signal at a second following degree which is lower than the first following degree. An adjustment value calculation unit calculates, based on the difference between the first index value and the second index value, an adjustment value for adjusting a reverberation component of the sound signal. A reverberation adjustment unit applies the adjustment value to the sound signal.
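The two index values can be pictured as a fast and a slow smoother tracking the same signal level. The sketch below is only one possible reading: the one-pole smoother, the coefficient values, and the function names are assumptions, and the mapping from the difference to the final adjustment value is left out because the abstract does not describe it.

```python
def follow(levels, coeff):
    """One-pole smoother; a larger coeff follows changes more quickly."""
    state = 0.0
    out = []
    for level in levels:
        state += coeff * (level - state)
        out.append(state)
    return out


def index_difference(levels, fast=0.5, slow=0.05):
    """Difference between the fast-following and slow-following index values."""
    first = follow(levels, fast)    # first index value (higher following degree)
    second = follow(levels, slow)   # second index value (lower following degree)
    return [a - b for a, b in zip(first, second)]


# A burst of sound followed by silence (signal level per frame).
levels = [1.0] * 5 + [0.0] * 5
diff = index_difference(levels)
```

With this input the difference is positive at the onset, where the fast follower leads, and turns negative a few frames after the sound stops, where the slow follower lags behind. That decaying tail is where a reverberation component would be expected.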
Abstract:
A signal processing section (100) of a terminal converts acquired audio signals of a plurality of channels into a set of frequency spectra, calculates sound image positions corresponding to individual frequency components, and displays the calculated sound image positions on a display screen using a coordinate system whose coordinate axes are frequency component and sound image position. A user-designated partial region of the coordinate system is set as a designated region, and an amplitude-level adjusting amount is set for the designated region. The signal processing section then adjusts the amplitude levels of the frequency components that are included in the frequency spectra and fall within the designated region, converts the adjusted frequency components into audio signals, and outputs the converted audio signals.
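For a two-channel signal, the region-based adjustment might look like the following Python sketch. The position formula (a normalized left/right level difference), the rectangular region, and the function names are assumptions. The abstract does not say how sound image positions are calculated or what shape the designated region takes.

```python
def sound_image_position(left_mag, right_mag):
    """Hypothetical position: -1.0 is fully left, +1.0 is fully right."""
    total = left_mag + right_mag
    if total == 0.0:
        return 0.0
    return (right_mag - left_mag) / total


def adjust_region(left, right, bin_range, position_range, gain):
    """Scale components whose (bin, position) lies in the designated region."""
    lo_bin, hi_bin = bin_range
    lo_pos, hi_pos = position_range
    out_left, out_right = list(left), list(right)
    for k in range(len(left)):
        pos = sound_image_position(abs(left[k]), abs(right[k]))
        if lo_bin <= k <= hi_bin and lo_pos <= pos <= hi_pos:
            out_left[k] *= gain
            out_right[k] *= gain
    return out_left, out_right


# Three frequency components: centered, hard right, hard left.
left = [1.0, 0.0, 1.0]
right = [1.0, 1.0, 0.0]
new_left, new_right = adjust_region(left, right, (0, 2), (-0.5, 0.5), gain=0.0)
```

Here the designated region covers all three bins but only central positions, so only the centered component is silenced. The spectra could equally hold complex values, since only their magnitudes are used to find the position.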
Abstract:
A signal processing device processes a plurality of observed signals at a plurality of frequencies. The observed signals are produced by a plurality of sound receiving devices which receive a mixture of a plurality of sounds. In the signal processing device, a storage stores observed data of the observed signals. The observed data represents a time series of the magnitude of each frequency in each of the observed signals. An index calculator calculates an index value from the observed data for each of the frequencies. The index value indicates the significance of learning a separation matrix using the observed data of each frequency. The separation matrix is used to separate the plurality of sounds from each other at each frequency. A frequency selector selects one or more frequencies according to the index value of each frequency. A learning processor determines the separation matrix by learning with a given initial separation matrix using the observed data of the selected frequencies.
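The frequency selection step can be sketched in a few lines of Python. The sketch assumes that a larger index value means learning at that frequency is more significant and that a fixed number of frequencies is selected. Both are assumptions, as is the function name, and the index calculation itself is not shown because the abstract does not define it.

```python
def select_frequencies(index_values, count):
    """Return the bin numbers of the `count` largest index values, ascending."""
    ranked = sorted(range(len(index_values)),
                    key=lambda k: index_values[k],
                    reverse=True)
    return sorted(ranked[:count])


# One hypothetical index value per frequency bin.
index_values = [0.1, 0.9, 0.4, 0.7]
selected = select_frequencies(index_values, count=2)
```

Only the observed data of the selected bins would then be passed to the learning of the separation matrix, which is the step that reduces the amount of computation.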
Abstract:
A frequency detection section (62) identifies candidate frequencies (Fc(1) - Fc(N)) per unit segment (Tu) of an audio signal (x). A first processing section (71) identifies an estimated train (RA) that is a time series of candidate frequencies (Fc(n)), each selected for a different one of the segments, arranged over a plurality of the unit segments and that has a high likelihood of corresponding to a time series of fundamental frequencies (Ftar) of a target component. A second processing section (72) identifies a state train (RB) of states, each indicative of one of sound-generating and non-sound-generating states of the target component in a different one of the segments, arranged over the unit segments. An information generation section (68) generates frequency information (DF) per unit segment (Tu). The frequency information generated for each unit segment corresponding to the sound-generating state designates, as a fundamental frequency (Ftar) of the target component, the candidate frequency (Fc(n)) corresponding to that unit segment in the estimated train (RA), and the frequency information generated for each unit segment corresponding to the non-sound-generating state indicates no sound generation.
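One common way to pick a high-likelihood path through per-segment candidates is a Viterbi-style dynamic-programming search, sketched below in Python. The abstract does not name the search method, so the scoring (log-likelihood minus a penalty on pitch jumps measured in octaves), the `jump_penalty` parameter, and the function names are all assumptions. The state train is taken as given input rather than estimated.

```python
import math


def estimated_train(candidates, jump_penalty=1.0):
    """candidates[t] is a list of (frequency_hz, likelihood) for segment t."""
    scores = [math.log(p) for _, p in candidates[0]]
    backpointers = []
    for t in range(1, len(candidates)):
        new_scores, pointers = [], []
        for freq, prob in candidates[t]:
            best_n, best = 0, float("-inf")
            for n, (prev_freq, _) in enumerate(candidates[t - 1]):
                # Penalize large jumps between consecutive segments.
                s = scores[n] - jump_penalty * abs(math.log2(freq / prev_freq))
                if s > best:
                    best, best_n = s, n
            new_scores.append(best + math.log(prob))
            pointers.append(best_n)
        backpointers.append(pointers)
        scores = new_scores
    # Trace the best path back from the final segment.
    n = max(range(len(scores)), key=lambda i: scores[i])
    path = [n]
    for pointers in reversed(backpointers):
        n = pointers[n]
        path.append(n)
    path.reverse()
    return [candidates[t][n][0] for t, n in enumerate(path)]


def frequency_information(train, sounding_states):
    """Keep the frequency in sounding segments; None means no sound."""
    return [f if on else None for f, on in zip(train, sounding_states)]


candidates = [
    [(220.0, 0.6), (440.0, 0.4)],
    [(222.0, 0.5), (445.0, 0.5)],
    [(221.0, 0.4), (880.0, 0.6)],
]
train = estimated_train(candidates)
info = frequency_information(train, [True, False, True])
```

In the last segment the 880 Hz candidate has the higher individual likelihood, but the continuity penalty keeps the train near 220 Hz, which illustrates why the train is chosen over a plurality of segments rather than segment by segment.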
Abstract:
A noise suppressing apparatus suppresses a noise component of a sound signal which contains the noise component and a signal component. In the apparatus, a frequency analyzing section divides the sound signal into a plurality of frames such that adjacent frames overlap with each other along a time axis, and computes a first spectrum of each frame. A noise suppressing section suppresses a noise component of the first spectrum so as to provide a second spectrum of each frame in which the noise component is suppressed. A frequency specifying section specifies a frequency of a noise component of each frame. A phase controlling section varies a phase of the noise component corresponding to the specified frequency in the second spectrum by a variation amount that differs from frame to frame. A signal synthesizing section combines the frames after the second spectrum of each frame is processed by the phase controlling section, such that adjacent frames overlap with each other along the time axis, so as to output the sound signal.
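The phase control step can be sketched in Python as a rotation of the complex spectrum values at the specified noise frequencies. The abstract only requires that the variation amount differ from frame to frame, so the linear schedule (frame number times a fixed `step`) and the function names are assumptions. Framing, suppression, and overlap-add synthesis are omitted.

```python
import cmath
import math


def vary_noise_phase(spectrum, noise_bins, variation):
    """Rotate the phase of the given bins by `variation` radians."""
    out = list(spectrum)
    rotation = cmath.exp(1j * variation)
    for k in noise_bins:
        out[k] = out[k] * rotation   # magnitude is unchanged
    return out


def control_phases(second_spectra, noise_bins_per_frame, step=math.pi / 3):
    """Apply a different variation amount to each frame (hypothetical schedule)."""
    return [
        vary_noise_phase(spectrum, bins, frame_number * step)
        for frame_number, (spectrum, bins)
        in enumerate(zip(second_spectra, noise_bins_per_frame))
    ]


# Two frames with the same residual noise component in bin 1.
frames = [[1 + 0j, 0 + 2j], [1 + 0j, 0 + 2j]]
processed = control_phases(frames, [[1], [1]])
```

Because the rotation differs between frames, a residual noise component at the same frequency no longer adds coherently when the overlapping frames are combined, while the other bins pass through unchanged.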