Abstract:
A noise suppressing device receives sound signals through a plurality of sound-receiving units and suppresses noise components included in the input sound signals. The noise suppressing device includes: a detecting unit which detects a usage pattern of the noise suppressing device from among a plurality of usage patterns that differ from each other in the positional relationships of the plurality of sound-receiving units and/or the positional relationships between the plurality of sound-receiving units and a target sound source; a converting unit which converts the usage environment information used in the noise suppressing process applied to each of the sound signals input through the plurality of sound-receiving units into usage environment information in accordance with the usage pattern detected by the detecting unit; and a suppressing unit which applies the noise suppressing process to the sound signals using the usage environment information converted by the converting unit.
Abstract:
There is provided a noise suppressing device, for suppressing a noise component contained in a sound, including: at least two sound receiving parts receiving sounds from a plurality of directions containing a sound from a direction of a given sound source and converting the sounds to digital sound signals in a time domain, respectively; an estimating part acquiring both direction information on a direction of the given sound source and distance information on a distance from the given sound source based upon the digital sound signals converted by the sound receiving parts, and estimating a component value of a noise component contained in the signal by use of the direction information and the distance information; and a controlling part acquiring a control value of a suppression amount for controlling a range of a direction of the digital sound signals.
Abstract:
A state detecting device includes an input unit that receives an input voice sound; an analyzer that calculates a feature parameter of each of a plurality of frames extracted from the voice sound; a calculator that calculates the average of the feature parameters of the frames, determines a threshold on the basis of the average and statistical data representing relationships between other averages of other feature parameters obtained from a plurality of speakers and cumulative frequencies of the other feature parameters, and calculates an appearance frequency of a frame that is among the plurality of frames and whose feature parameter is larger than the threshold; a determining unit that determines, on the basis of the appearance frequency, a strained state of a vocal cord that has made the voice sound; and an output unit that outputs a result of the determination.
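The calculator and determining unit described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not say how the multi-speaker statistical data maps an average to a threshold, so the hypothetical `threshold_from_average` simply scales the average, and the names `scale` and `freq_limit`, as well as the rule that a high appearance frequency indicates strain, are placeholders rather than the claimed method.

```python
from statistics import mean


def threshold_from_average(average, scale=1.5):
    """Hypothetical stand-in for the multi-speaker statistical data.

    The abstract derives the threshold from the average and from
    statistics gathered over many speakers; here we only scale the
    average (assumption).
    """
    return average * scale


def appearance_frequency(features, threshold):
    """Fraction of frames whose feature parameter exceeds the threshold."""
    if not features:
        raise ValueError("at least one frame is required")
    return sum(1 for f in features if f > threshold) / len(features)


def is_strained(features, scale=1.5, freq_limit=0.2):
    """Decide the strained state from the appearance frequency.

    Assumption: a frequency above freq_limit is treated as strained.
    """
    threshold = threshold_from_average(mean(features), scale)
    return appearance_frequency(features, threshold) > freq_limit
```

In a real system the per-frame feature parameters would come from the analyzer, and the threshold lookup would use the stored statistical data instead of a fixed scale factor.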
Abstract:
A state detecting apparatus includes: a processor to execute acquiring utterance data related to uttered speech, computing a plurality of statistical quantities for feature parameters regarding features of the utterance data, creating, on the basis of the plurality of statistical quantities regarding the utterance data and another plurality of statistical quantities regarding reference utterance data based on other uttered speech, pseudo-utterance data having at least one statistical quantity equal to a statistical quantity in the other plurality of statistical quantities, computing a plurality of statistical quantities for synthetic utterance data synthesized on the basis of the pseudo-utterance data and the utterance data, and determining, on the basis of a comparison between statistical quantities of the synthetic utterance data and statistical quantities of the reference utterance data, whether the speaker who produced the uttered speech is in a first state or a second state; and a memory.
Abstract:
A state detection device includes: a first model generation unit to generate a first specific speaker model obtained by modeling speech features of a specific speaker in an undepressed state; a second model generation unit to generate a second specific speaker model obtained by modeling speech features of the specific speaker in a depressed state; a likelihood calculation unit to calculate a first likelihood as a likelihood of the first specific speaker model with respect to input voice, and a second likelihood as a likelihood of the second specific speaker model with respect to the input voice; and a state determination unit to determine a state of the speaker of the input voice using the first likelihood and the second likelihood.
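The two-model likelihood comparison can be illustrated with a minimal sketch. The abstract does not name the model family, so a one-dimensional Gaussian per state is assumed here purely for illustration; `GaussianModel` and `determine_state` are hypothetical names, and choosing the state with the larger log-likelihood is one plausible reading of "using the first likelihood and the second likelihood".

```python
import math
from statistics import mean, pvariance


class GaussianModel:
    """Toy speaker model: a single 1-D Gaussian over a scalar feature (assumption)."""

    def __init__(self, samples):
        self.mu = mean(samples)
        # Floor the variance so the log-likelihood stays finite.
        self.var = max(pvariance(samples), 1e-6)

    def log_likelihood(self, frames):
        """Sum of per-frame Gaussian log-densities for the input voice."""
        return sum(
            -0.5 * (math.log(2 * math.pi * self.var) + (x - self.mu) ** 2 / self.var)
            for x in frames
        )


def determine_state(undepressed_model, depressed_model, frames):
    """Pick the state whose model explains the input frames better."""
    first = undepressed_model.log_likelihood(frames)
    second = depressed_model.log_likelihood(frames)
    return "depressed" if second > first else "undepressed"
```

A practical system would more likely use multi-dimensional features and a richer model, but the decision step would keep the same shape: score the input under both models and compare.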
Abstract:
A directivity control apparatus is capable of acquiring tilt information indicating a tilt angle of the directivity control apparatus; acquiring sound source direction information; storing mapping data indicating a relationship between the tilt angle and the direction; determining whether the sound information indicates a target sound; updating the mapping data based on the sound source direction information and the tilt information, if the sound information indicates the target sound; estimating a direction of sound responsive to the tilt information, based on the mapping data, if the sound information does not indicate a target sound; and adjusting a directivity of a microphone based on the sound source direction information if the sound information indicates the target sound, or adjusting the directivity of the microphone based on the estimated direction if the sound information does not indicate the target sound.
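The update-or-estimate logic above might look like the following sketch. Everything concrete here is an assumption made for illustration: the mapping data is modeled as a dictionary keyed by quantized tilt angle, the hypothetical `bin_width_deg` sets the quantization, and the estimate is taken from the nearest stored tilt bin. The abstract does not specify any of these details.

```python
class DirectivityController:
    """Toy tilt-to-direction mapping with update and fallback estimation."""

    def __init__(self, bin_width_deg=10.0):
        self.bin_width = bin_width_deg
        self.mapping = {}  # tilt bin index -> last observed source direction (degrees)

    def _bin(self, tilt_deg):
        # Quantize the tilt angle so nearby tilts share one mapping entry.
        return round(tilt_deg / self.bin_width)

    def step(self, tilt_deg, source_direction_deg, is_target_sound):
        """Return the direction the microphone directivity should point at.

        Target sound: store the observed direction and steer to it.
        Otherwise: steer to the direction stored for the nearest tilt bin,
        or return None when no mapping data exists yet.
        """
        key = self._bin(tilt_deg)
        if is_target_sound:
            self.mapping[key] = source_direction_deg
            return source_direction_deg
        if not self.mapping:
            return None
        nearest = min(self.mapping, key=lambda k: abs(k - key))
        return self.mapping[nearest]
```

The returned angle would then be handed to whatever beamforming stage actually adjusts the microphone directivity, which is outside the scope of this sketch.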
Abstract:
A speech recognition apparatus predicts, based on the occurrence cycle and duration time of impulse noise that occurs periodically, a segment in which impulse noise occurs, and executes speech recognition processing based on the feature components of the remaining frames excluding a feature component of a frame corresponding to the predicted segment, or the feature components extracted from frames created from sound data excluding a part corresponding to the predicted segment.
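A minimal sketch of the frame-exclusion variant follows. It assumes all times are in samples and that the first noise onset is already known; the hypothetical `first_onset` parameter is not mentioned in the abstract, which only refers to the occurrence cycle and duration. The function names are illustrative.

```python
def predicted_noise_segments(first_onset, cycle, duration, total_length):
    """Predict [start, end) sample ranges where periodic impulse noise occurs."""
    if cycle <= 0:
        raise ValueError("cycle must be positive")
    segments = []
    t = first_onset
    while t < total_length:
        segments.append((t, min(t + duration, total_length)))
        t += cycle
    return segments


def frames_to_keep(num_frames, frame_len, hop, segments):
    """Indices of frames that do not overlap any predicted noise segment."""
    keep = []
    for i in range(num_frames):
        start = i * hop
        end = start + frame_len
        overlaps = any(start < s_end and s_start < end for s_start, s_end in segments)
        if not overlaps:
            keep.append(i)
    return keep
```

Only the feature components of the kept frames would be passed to the recognizer. The other variant in the abstract, cutting the predicted part out of the sound data before framing, is not shown here.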
Abstract:
A noise estimation apparatus includes a correlation calculator configured to calculate a correlation value of a spectrum between a plurality of frames in sound information obtained using one or more microphones, a power calculator configured to calculate a power value indicating a sound level of one target frame among the plurality of frames, an update determiner configured to determine an update degree indicating a degree to which the sound information of the target frame is to be reflected in a noise model stored in a storage, or determine whether or not the noise model is to be updated to another noise model, based on the power value of the target frame and the correlation value, and an updater configured to generate the other noise model based on a determined result, the sound information of the target frame, and the noise model.
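The interplay of the correlation calculator, power calculator, update determiner, and updater can be sketched as below. The specific choices are assumptions for illustration only: Pearson correlation between two frame spectra, mean squared magnitude as the power value, hypothetical limits `power_limit` and `corr_limit`, fixed update degrees, and exponential smoothing as the update rule. The abstract does not fix any of these.

```python
import math


def spectral_correlation(a, b):
    """Pearson correlation between two magnitude spectra of equal length."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    if va == 0 or vb == 0:
        return 0.0
    return cov / math.sqrt(va * vb)


def frame_power(spectrum):
    """Mean squared magnitude of the target frame."""
    return sum(x * x for x in spectrum) / len(spectrum)


def update_degree(power, corr, power_limit=1.0, corr_limit=0.9):
    """Assumed rule: quiet, stationary frames look like noise and update strongly."""
    if power >= power_limit:
        return 0.0  # loud frame, treated as non-noise: leave the model alone
    return 0.5 if corr >= corr_limit else 0.1


def update_noise_model(model, spectrum, degree):
    """Blend the target frame into the noise model by the update degree."""
    return [(1 - degree) * m + degree * s for m, s in zip(model, spectrum)]
```

A degree of 0.0 leaves the stored noise model unchanged, which corresponds to the "whether or not the noise model is to be updated" branch of the abstract.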