摘要:
There is provided a noise suppressing device, for suppressing a noise component contained in a sound, including: at least two sound receiving parts receiving sounds from a plurality of directions containing a sound from a direction of a given sound source and converting the sounds to digital sound signals in a time domain, respectively; an estimating part acquiring both direction information on a direction of the given sound source and distance information on a distance from the given sound source based upon the digital sound signals converted by the sound receiving parts, and estimating a component value of a noise component contained in the signal by use of the direction information and the distance information; and a controlling part acquiring a control value of a suppression amount for controlling a range of a direction of the digital sound signals.
摘要:
An echo canceling system and an echo canceling method are provided, which can deal with the case where there are a plurality of echo paths and respond to the variation in echo arrival times. An echo canceling method to be applied to a full-duplex communication system includes detecting a respective echo arrival time of one or plural echo paths based on a reference signal and an echo signal, calculating as many pseudo-echo signals as the detected arrival times, overlapping the calculated pseudo-echo signals to obtain an overall pseudo-echo signal, and subtracting the overall pseudo-echo signal from the echo signal. A FFT processing is performed with respect to the reference signal and the echo signal, and a similar canceling processing is carried out using an amplitude spectrum alone.
摘要:
To provide a three-dimensional acoustic effect to a listener in a reproduction field, via a headphone in particular, a three-dimensional acoustic apparatus is formed by a linear synthesis filter having filter coefficients that are the linear predictive coefficients obtained by performing a linear predictive analysis on an impulse response which represents the acoustic characteristics to be added to the original signal to achieve this effect. By passing the signal through this acoustic characteristics adding filter, the desired acoustic characteristics are added to the original signal, and by dividing the power spectrum of the impulse response of these acoustic characteristics into critical bandwidths and performing this linear predictive analysis based on impulse signal determined based from power spectrum signals representing the signal sound of each of these critical bandwidths, the filter coefficients of the linear synthesis filter are determined.
摘要:
To provide a three-dimensional acoustic effect to a listener in a reproduction field, via a headphone in particular, a three-dimensional acoustic apparatus is formed by a linear synthesis filter having filter coefficients that are the linear predictive coefficients obtained by performing a linear predictive analysis on an impulse response which represents the acoustic characteristics to be added to the original signal to achieve this effect. By passing the signal through this acoustic characteristics adding filter, the desired acoustic characteristics are added to the original signal, and by dividing the power spectrum of the impulse response of these acoustic characteristics into critical bandwidths and performing this linear predictive analysis based on impulse signal determined based from power spectrum signals representing the signal sound of each of these critical bandwidths, the filter coefficients of the linear synthesis filter are determined.
摘要:
To provide a three-dimensional acoustic effect to a listener in a reproduction field, via a headphone in particular, a three-dimensional acoustic apparatus is formed by a linear synthesis filter having filter coefficients that are the linear predictive coefficients obtained by performing a linear predictive analysis on an impulse response which represents the acoustic characteristics to be added to the original signal to achieve this effect. By passing the signal through this acoustic characteristics adding filter, the desired acoustic characteristics are added to the original signal, and by dividing the power spectrum of the impulse response of these acoustic characteristics into critical bandwidths and performing this linear predictive analysis based on impulse signal determined based from power spectrum signals representing the signal sound of each of these critical bandwidths, the filter coefficients of the linear synthesis filter are determined.
摘要:
An utterance state detection device includes an user voice stream data input unit that gets user voice stream data of an user, a frequency element extraction unit that extracts high frequency elements by frequency-analyzing the user voice stream data, a fluctuation degree calculation unit that calculates a fluctuation degree of the high frequency elements thus extracted every unit time, a statistic calculation unit that calculates a statistic every certain interval based on a plurality of the fluctuation degrees in a certain period of time, and an utterance state detection unit that detects an utterance state of a specified user based on the statistic obtained from user voice stream data of the specified user.
摘要:
A state detecting apparatus includes: a processor to execute acquiring utterance data related to uttered speech, computing a plurality of statistical quantities for feature parameters regarding features of the utterance data, creating, on the basis of the plurality of statistical quantities regarding the utterance data and another plurality of statistical quantities regarding reference utterance data based on other uttered speech, pseudo-utterance data having at least one statistical quantity equal to a statistical quantity in the other plurality of statistical quantities, computing a plurality of statistical quantities for synthetic utterance data synthesized on the basis of the pseudo-utterance data and the utterance data, and determining, on the basis of a comparison between statistical quantities of the synthetic utterance data and statistical quantities of the reference utterance data, whether the speaker who produced the uttered speech is in a first state or a second state; and a memory.
摘要:
A microphone array apparatus includes: an acquisition unit configured to acquire samples from a sound signal inputted from each of a plurality of microphones, at predetermined time intervals; an operation unit configured to calculate a value based on volumes of the sound signal possessed by a plurality of the samples for each of the sound signals inputted from the plurality of microphones; a correlation coefficient calculator configured to calculate a coefficient of correlation between the sound signals, on the basis of the values calculated for the respective sound signals; and a gain calculator configured to calculate reduction gain for the sound signals inputted from the plurality of microphones, on the basis of the coefficient of correlation.
摘要:
Accepting the speech having the noise superimposed thereon and converting it into a signal on a time axis of the speech, an amplitude component of a speech for each predetermined frequency band of the converted signal on the frequency axis is calculated. Calculating a noise reduction coefficient, the noise component is reduced by multiplying the signal on the frequency axis of the original signal by the calculated noise reduction coefficient. By estimating the target value of the remaining noise for each frequency band, a signal on a frequency axis in which a signal corresponding to a frequency band of which target value estimated by the noise target value is larger than the value of the amplitude component of the signal on the frequency axis of which noise component is reduced is corrected to a signal corresponding to the target value is restored, into a signal on a time axis.
摘要:
A sound processing device includes: a plurality of sound input units; a detecting unit for detecting a frequency component of each sound input to the plurality of sound signal unit, the each sound arriving from a direction approximately perpendicular to a line determined by arrangement positions of two sound input units among the plurality of sound input units; a correction coefficient unit for obtaining a correction coefficient for correcting a level of at least one of the sound signals generated from the input sounds by the two sound input units so as to match the levels of the sound signals with each other based on the sound of the detected frequency component; a correcting unit for correcting the level of at least one of the sound signals using the obtained correction coefficient; and a processing unit for performing a sound process based on the sound signal with the corrected level.