Abstract:
A robotics visual and auditory system is provided that can accurately localize the sound source of a target by associating visual and auditory information about the target. The system comprises an audition module (20), a face module (30), a stereo module (37), a motor control module (40), an association module (50) that generates streams by associating events from each of the modules (20, 30, 37, and 40), and an attention control module (57) that conducts attention control based on the streams generated by the association module (50). The association module (50) generates an auditory stream (55) and a visual stream (56) from an auditory event (28) from the audition module (20), a face event (39) from the face module (30), a stereo event (39a) from the stereo module (37), and a motor event (48) from the motor control module (40), and also generates an association stream (57) that associates these streams. Based on accurate sound source direction information from the association module (50), the audition module (20) uses an active direction pass filter (23a) to collect sub-bands whose interaural phase difference (IPD) or interaural intensity difference (IID) falls within a preset range. In accordance with auditory characteristics, the pass range of the filter is narrowest in the frontal direction and widens as the angle increases to the left and right. The audition module (20) then separates the sound source by reconstructing its waveform.
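The sub-band selection performed by the active direction pass filter can be illustrated with a minimal sketch. Everything numeric here is an assumption made for illustration and does not come from the abstract: the linear widening of the pass range (`min_width`, `max_width`), the free-field IPD model in `expected_ipd`, the microphone spacing, and the restriction to low frequencies where the phase difference does not wrap. The IID branch and the waveform reconstruction step are not shown.

```python
import math


def pass_range_deg(theta_deg, min_width=10.0, max_width=30.0):
    """Hypothetical pass range (half-width, degrees): narrowest at the
    front (0 degrees) and widening linearly toward +/-90 degrees."""
    frac = min(abs(theta_deg), 90.0) / 90.0
    return min_width + (max_width - min_width) * frac


def expected_ipd(theta_deg, freq_hz, mic_distance_m=0.18, speed_of_sound=343.0):
    """Assumed free-field IPD in radians for a source at theta_deg.
    Only unambiguous below roughly 950 Hz for this microphone spacing."""
    theta = max(-90.0, min(90.0, theta_deg))  # keep sin() monotonic
    delay = mic_distance_m * math.sin(math.radians(theta)) / speed_of_sound
    return 2.0 * math.pi * freq_hz * delay


def select_subbands(observed_ipd, freqs_hz, target_deg):
    """Return indices of sub-bands whose observed IPD lies inside the
    pass range centred on the target direction."""
    width = pass_range_deg(target_deg)
    selected = []
    for i, (ipd, freq) in enumerate(zip(observed_ipd, freqs_hz)):
        lo = expected_ipd(target_deg - width, freq)
        hi = expected_ipd(target_deg + width, freq)
        if min(lo, hi) <= ipd <= max(lo, hi):
            selected.append(i)
    return selected
```

In a full system the target direction would come from the association module, and the selected sub-bands would be passed to an inverse transform to rebuild the separated waveform.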
Abstract:
An apparatus and method for extracting a predetermined non-harmonic structured spectral component contained in an audio signal and then increasing or decreasing the extracted component. The spectrum of the audio signal is calculated by frequency analysis, and the spectral component corresponding to the predetermined non-harmonic structured spectral component is extracted and then increased or decreased. The extraction is performed with reference to a spectral component of a template stored in advance. In this process, the spectral component of the template is adapted so that the difference between the extracted spectral component and the spectral component of the template becomes less than or equal to a predetermined value. This allows the predetermined non-harmonic structured spectral component contained in the audio signal to be increased or decreased independently, without influencing other spectral components.
Abstract:
A voice recognition system (10) for improving the robustness of voice recognition for a voice input for which a deteriorated feature amount cannot be completely identified. The system comprises at least two sound detecting means (16a, 16b) for detecting a sound signal, a sound source localizing unit (21) for determining the direction of a sound source based on the sound signal, a sound source separating unit (23) for separating the sound of the sound source from the sound signal based on the sound source direction, a mask producing unit (25) for producing a mask value according to the reliability of the separation results, a feature extracting unit (27) for extracting the feature amount of the sound signal, and a voice recognizing unit (29) for applying the mask to the feature amount to recognize a voice from the sound signal.
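The idea of applying a reliability mask to the feature amount before recognition can be sketched as follows. This is an illustration under stated assumptions, not the recognizer of the abstract: the binary threshold in `make_mask`, the template-matching recognizer, and all names are hypothetical, since the abstract does not say how mask values are computed or how the recognizer consumes them.

```python
def make_mask(reliability, threshold=0.5):
    """Hypothetical binary mask: 1.0 for feature components whose
    separation reliability reaches the threshold, 0.0 otherwise."""
    return [1.0 if r >= threshold else 0.0 for r in reliability]


def masked_distance(feature, template, mask):
    """Mean squared distance over the components the mask keeps."""
    kept = sum(mask)
    if kept == 0:
        return float("inf")
    total = sum(m * (x - t) ** 2 for x, t, m in zip(feature, template, mask))
    return total / kept


def recognize(feature, mask, templates):
    """Pick the label whose template is closest under the mask.
    Template matching stands in for a real recognizer here."""
    return min(templates, key=lambda label: masked_distance(feature, templates[label], mask))


templates = {"yes": [1.0, 2.0, 3.0], "no": [3.0, 2.0, 1.0]}
feature = [1.1, 2.0, 0.0]       # third component corrupted by separation
reliability = [0.9, 0.8, 0.1]   # low reliability on the corrupted component
label = recognize(feature, make_mask(reliability), templates)
```

With the unreliable third component masked out, the corrupted input is matched to "yes"; with an all-ones mask the same input is matched to "no", which is the failure the mask is meant to avoid.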
Abstract:
A system capable of reducing the influence of sound reverberation or reflection to improve sound-source separation accuracy. An original signal X(ω,f) is separated from an observed signal Y(ω,f) according to a first model and a second model to extract an unknown signal E(ω,f). According to the first model, the original signal X(ω,f) of the current frame f is represented as a combined signal of known signals S(ω,f−m+1) (m=1 to M) that span a certain number M of current and previous frames. This enables extraction of the unknown signal E(ω,f) without changing the window length while reducing the influence of reverberation or reflection of the known signal S(ω,f) on the observed signal Y(ω,f).
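The first model can be illustrated for a single frequency bin ω with a minimal sketch. Two things are assumptions not stated in the abstract: the combination is taken to be linear with hypothetical coefficients `coeffs`, and the second model is taken to be additive, Y(ω,f) = X(ω,f) + E(ω,f). Frames before the start of the recording are treated as silence.

```python
def combine_known(S, coeffs, f):
    """Assumed first model for one frequency bin:
    X(f) = sum over m = 1..M of coeffs[m-1] * S[f-m+1]."""
    total = 0j
    for m, c in enumerate(coeffs, start=1):
        idx = f - m + 1
        if idx >= 0:  # frames before the first one count as silence
            total += c * S[idx]
    return total


def extract_unknown(Y, S, coeffs):
    """Assumed additive second model: E(f) = Y(f) - X(f)."""
    return [Y[f] - combine_known(S, coeffs, f) for f in range(len(Y))]
```

Because the reverberant tail of the known signal is modeled across M frames, the analysis window itself does not need to be lengthened to cover it, which matches the benefit the abstract describes.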
Abstract:
A musical piece recommendation system is provided that allows instantaneous registration of a new user and a new musical piece without retraining in a basic training section. A first incremental training section 21 monitors a rating history storage section 3 and, each time a rating history is changed or a new user is added, updates or adds the topic selection probability for the affected user or the new user such that the likelihood determined by a basic training section 17 is kept maximized. A second incremental training section 21 monitors an acoustic feature storage section 5 and, each time acoustic features are added for a new musical piece, adds the musical piece selection probability for the added musical piece such that the likelihood determined by the basic training section 17 is kept maximized.
Abstract:
A robot includes: a sound collecting unit collecting and converting a musical sound into a musical acoustic signal; a voice signal generating unit generating a self-vocalized voice signal; a sound outputting unit converting the self-vocalized voice signal into a sound and outputting the sound; a self-vocalized voice regulating unit receiving the musical acoustic signal and the self-vocalized voice signal; a filtering unit performing a filtering process; a beat interval reliability calculating unit performing a time-frequency pattern matching process and calculating a beat interval reliability; a beat interval estimating unit estimating a beat interval; a beat time reliability calculating unit calculating a beat time reliability; a beat time estimating unit estimating a beat time on the basis of the calculated beat time reliability; a beat time predicting unit predicting a beat time before the current time; and a synchronization unit synchronizing the self-vocalized voice signal.
Abstract:
A first domain satisfying a first condition concerning a current utterance understanding result and a second domain satisfying a second condition concerning a selection history are specified. For each of the first and second domains, indices representing reliability in consideration of the utterance understanding history, selection history, and utterance generation history are evaluated. Based on the evaluation results, one of the first, second, and third domains is selected as a current domain according to a selection rule.
Abstract:
The invention is directed to a robot auditory apparatus for a human-like or animal-like robot, e.g., a human-like robot (10) having a noise generating source such as a driving system in its interior. The apparatus includes a sound insulating cover (14) that covers at least a head part (13) of the robot; a pair of outer microphones (16; 16a and 16b) installed outside the cover, spaced apart at the positions where a pair of ears would be provided on the robot, for primarily collecting external sound; at least one inner microphone (17; 17a and 17b) installed inside the cover for primarily collecting noise from the noise generating source in the robot interior; and a processing module (18) that, on the basis of signals from the outer and inner microphones, removes the noise signal of the internal noise generating source from the sound signals of the outer microphones (16a and 16b). The robot auditory apparatus of the invention can thus perform active perception, because external sound from a target is collected unaffected by noise inside the robot, such as that from the driving system.
Abstract:
An illumination device has a diffusive sheet that diffuses light. The sheet is fitted to a main body 3 of the illumination device by being sandwiched, in the direction of the thickness of the sheet, between the main body 3 and a frame 4 fitted thereto. The sheet has a cut formed in a portion thereof sandwiched between the frame 4 and the main body 3 to prevent bends that develop with a variation in temperature.