摘要:
A state detecting apparatus includes: a processor to execute acquiring utterance data related to uttered speech, computing a plurality of statistical quantities for feature parameters regarding features of the utterance data, creating, on the basis of the plurality of statistical quantities regarding the utterance data and another plurality of statistical quantities regarding reference utterance data based on other uttered speech, pseudo-utterance data having at least one statistical quantity equal to a statistical quantity in the other plurality of statistical quantities, computing a plurality of statistical quantities for synthetic utterance data synthesized on the basis of the pseudo-utterance data and the utterance data, and determining, on the basis of a comparison between statistical quantities of the synthetic utterance data and statistical quantities of the reference utterance data, whether the speaker who produced the uttered speech is in a first state or a second state; and a memory.
摘要:
A state detecting device includes an input unit that receives an input voice sound; an analyzer that calculates a feature parameter of each of plurality of frames extracted from the voice sound; a calculator that calculates the average of the feature parameters of the frames, determines a threshold on the basis of the average and statistical data representing relationships between other averages of other feature parameters obtained from a plurality of speakers and cumulative frequencies of the other feature parameters, and calculates an appearance frequency of a frame that is among the plurality of frames and whose feature parameter is larger than the threshold; a determining unit that determines, on the basis of the appearance frequency, a strained state of a vocal cord that has made the voice sound; and an output unit that outputs a result of the determination.
摘要:
A state detection device includes: a first model generation unit to generate a first specific speaker model obtained by modeling speech features of a specific speaker in an undepressed state; a second model generation unit to generate a second specific speaker model obtained by modeling speech features of the specific speaker in the depressed state; a likelihood calculation unit to calculate a first likelihood as a likelihood of the first specific speaker model with respect to input voice, and a second likelihood as a likelihood of the second specific speaker model with respect to the input voice; and a state determination unit to determine a state of the speaker of the input voice using the first likelihood and the second likelihood.
摘要:
A directivity control apparatus is capable of acquiring tilt information indicating a tilt angle of the directivity control apparatus; acquiring sound source direction information; storing mapping data indicating a relationship between the tilt angle and the direction; determining whether the sound information indicates a target sound; updating the mapping data based on the sound source direction information and the tilt information, if the sound information indicates the target sound; estimating a direction of sound responsive to the tilt information, based on the mapping data if the sound information doesn't indicate a target sound; and adjusting a directivity of a microphone based on the sound source direction information if the sound information indicates the target sound, or adjusting the directivity of the microphone based on the estimated direction if the sound information doesn't indicate the target sound.
摘要:
A noise estimation apparatus includes a correlation calculator configured to calculate a correlation value of a spectrum between a plurality of frames in sound information obtained using one or more microphones, a power calculator configured to calculate a power value indicating a sound level of one target frame among the plurality of frames, an update determiner configured to determine an update degree indicating a degree to which the sound information of the target frame is to be reflected in a noise model stored in a storage, or determine whether or not the noise model is to be updated to another noise model, based on the power value of the target frame and the correlation value, and an updater configured to generate the other noise model based on a determined result, the sound information of the target frame, and the noise model.
摘要:
A sound receiving device 1 having a housing 10 in which a plurality of sound receiving units which can receive sounds arriving from a plurality of directions are arranged, includes an omni-directional main sound receiving unit 11 and a sub-sound receiving unit 12 arranged at a position to receive a sound, arriving from a direction other than a given direction, earlier by a given time than the time at which the main sound receiving unit 11 receives the sound. With respect to the received sounds, the sound receiving device calculates a time difference, as a delay time, between the sound receiving time of the sub-sound receiving unit 11 and the sound receiving time of the main sound receiving unit 12.
摘要:
There is provided a sound processing apparatus for processing received sounds. A plurality of sound receiving units which are included in the apparatus output individually a sound signal corresponding to a received sound, then the sound signals in a time domain are converted into respective converted signal in a frequency domain, and a spectral ratio between the two converted signals is calculated for driving a phase correction value which corrects a phase of the sound signal.