摘要:
An illumination device has a diffusive sheet that diffuses light. The sheet is fitted to a main body 3 of the illumination device by being sandwiched, in the direction of the thickness of the sheet, between the main body 3 and a frame 4 fitted thereto. The sheet has a cut formed in a portion thereof sandwiched between the frame 4 and the main body 3 to prevent bends that develop with a variation in temperature.
摘要:
A reverberation suppressing apparatus, includes: a sound acquiring unit which acquires a sound signal; a reverberation data computing unit which computes reverberation data from the acquired sound signal; a reverberation characteristics estimating unit which estimates reverberation characteristics based on the computed reverberation data; a filter length estimating unit which estimates a filter length of a filter which is used to suppress a reverberation based on the estimated reverberation characteristics; and a reverberation suppressing unit which suppresses the reverberation based on the estimated filter length.
摘要:
A first domain satisfying a first condition concerning a current utterance understanding result and a second domain satisfying a second condition concerning a selection history are specified. For each of the first and second domains, indices representing reliability in consideration of the utterance understanding history, selection history, and utterance generation history are evaluated. Based on the evaluation results, one of the first, second, and third domains is selected as a current domain according to a selection rule.
摘要:
A thereminist robot has a characteristic model of theremin and is capable of performing in response to an environment of theremin performance by calibrating the characteristic model before the performance. A robot 10 has a first arm 12, a second arm 11, and a pitch model for indicating an arm position corresponding to a pitch of the theremin The robot 10 plays the theremin by moving the first arm 12 to the arm position corresponding to a musical note based on the target note and the pitch model. The robot further has a parameter adjustment unit for adjusting parameters of the pitch model that change depending on environments surrounding the theremin.
摘要:
A speech recognition system according to the present invention includes a sound source separating section which separates mixed speeches from multiple sound sources from one another; a mask generating section which generates a soft mask which can take continuous values between 0 and 1 for each frequency spectral component of a separated speech signal using distributions of speech signal and noise against separation reliability of the separated speech signal; and a speech recognizing section which recognizes speeches separated by the sound source separating section using soft masks generated by the mask generating section.
摘要:
A speech understanding apparatus includes a speech recognition unit which performs speech recognition of an utterance using multiple language models, and outputs multiple speech recognition results obtained by the speech recognition, a language understanding unit which uses multiple language understanding models to perform language understanding for each of the multiple speech recognition results output from the speech recognition unit, and outputs multiple speech understanding results obtained from the language understanding, and an integrating unit which calculates, based on values representing features of the speech understanding results, utterance batch confidences that numerically express accuracy of the speech understanding results for each of the multiple speech understanding results output from the language understanding unit, and selects one of the speech understanding results with a highest utterance batch confidence among the calculated utterance batch confidences.
摘要:
The speech recognition system of the present invention includes: a sound source separating section which separates mixed speeches from multiple sound sources; a mask generating section which generates a soft mask which can take continuous values between 0 and 1 for each separated speech according to reliability of separation in separating operation of the sound source separating section; and a speech recognizing section which recognizes speeches separated by the sound source separating section using soft masks generated by the mask generating section.
摘要:
A speech recognition system according to the present invention includes a sound source separating section which separates mixed speeches from multiple sound sources from one another; a mask generating section which generates a soft mask which can take continuous values between 0 and 1 for each frequency spectral component of a separated speech signal using distributions of speech signal and noise against separation reliability of the separated speech signal; and a speech recognizing section which recognizes speeches separated by the sound source separating section using soft masks generated by the mask generating section.
摘要:
Robotics visual and auditory system is provided which is made capable of accurately conducting the sound source localization of a target by associating a visual and an auditory information with respect to a target. It is provided with an audition module (20), a face module (30), a stereo module (37), a motor control module (40), an association module (50) for generating streams by associating events from said each module (20, 30, 37, and 40), and an attention control module (57) for conducting attention control based on the streams generated by the association module (50), and said association module (50) generates an auditory stream (55) and a visual stream (56) from a auditory event (28) from the auditory module (20), a face event (39) from the face module (30), a stereo event (39a) from the stereo module (37), and a motor event (48) from the motor control module (40), and an association stream (57) which associates said streams, as well as said audition module (20) collects sub-bands having the interaural phase difference (IPD) or the interaural intensity difference (IID) within the preset range by an active direction pass filter (23a) having a pass range which, according to auditory characteristics, becomes minimum in the frontal direction, and larger as the angle becomes wider to the left and right, based on an accurate sound source directional information from the association module (50), and conducts sound source separation by restructuring the wave shape of the sound source.