Abstract:
A timbre-changing system capable of changing the timbres contained in an existing music audio signal to arbitrary timbres. A separated audio signal analyzing and storing section 3 stores harmonic peak parameters that indicate the relative amplitudes of the n-th order harmonic components of each tone generated by a musical instrument of a first kind. A replacement parameter storing section 6 stores harmonic peak parameters that indicate the relative amplitudes of the n-th order harmonic components of each tone generated by a musical instrument of a second kind, corresponding to each tone generated by the musical instrument of the first kind. Replaced harmonic peak parameters are created by replacing the plurality of harmonic peaks stored in section 3 with the harmonic peaks stored in section 6. A synthesized separated audio signal generating section 7 then generates a synthesized separated audio signal for each tone from the replaced harmonic peak parameters together with the parameters other than the harmonic peak parameters.
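The replacement step can be illustrated with a minimal Python sketch. It is not the patented implementation: the `ToneParams` dataclass, its field names, and the use of plain additive sine synthesis are all assumptions made for illustration. Only the idea of swapping the harmonic peaks while keeping every other parameter comes from the abstract.

```python
import math
from dataclasses import dataclass, replace
from typing import List


@dataclass(frozen=True)
class ToneParams:
    """Hypothetical per-tone parameter set (field names are assumptions)."""
    f0_hz: float                 # fundamental frequency of the tone
    duration_s: float            # tone length in seconds
    gain: float                  # overall amplitude of the tone
    harmonic_peaks: List[float]  # relative amplitude of the n-th harmonic


def replace_harmonic_peaks(source: ToneParams, replacement: ToneParams) -> ToneParams:
    """Keep every parameter of `source` except the harmonic peaks,
    which are taken from the corresponding tone of the second instrument."""
    return replace(source, harmonic_peaks=list(replacement.harmonic_peaks))


def synthesize(tone: ToneParams, sample_rate: int = 8000) -> List[float]:
    """Assumed synthesis model: a sum of harmonically related sinusoids."""
    n_samples = int(tone.duration_s * sample_rate)
    samples = []
    for i in range(n_samples):
        t = i / sample_rate
        value = sum(
            amp * math.sin(2.0 * math.pi * tone.f0_hz * (k + 1) * t)
            for k, amp in enumerate(tone.harmonic_peaks)
        )
        samples.append(tone.gain * value)
    return samples
```

Calling `synthesize(replace_harmonic_peaks(first_tone, second_tone))` would then yield a tone with the pitch, length, and gain of the first instrument but the harmonic profile of the second.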
Abstract:
An audio signal produced by playing a plurality of musical instruments is separated into sound sources, one for each instrument sound. Each time a separation process is performed, an updated model parameter estimation/storage section 114 estimates the parameters contained in the updated model parameters such that the updated power spectrograms gradually change from a state close to the initial power spectrograms to a state close to the plurality of power spectrograms most recently stored in a power spectrogram separation/storage section 112. The sections, including the power spectrogram separation/storage section 112 and an updated distribution function computation/storage section 118, repeat their processing until the updated power spectrograms have changed from the state close to the initial power spectrograms to the state close to the power spectrograms most recently stored in the power spectrogram separation/storage section 112. The final updated power spectrograms, which are formed to contain harmonic and inharmonic models, are close to the power spectrograms of single tones of one musical instrument contained in the input audio signal.
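The iteration described above can be pictured with a toy Python sketch. It is a strong simplification under stated assumptions: a single spectral frame stands in for a spectrogram, the distribution functions are simple power ratios, and the updated model is a linear blend whose weight `alpha` ramps from the initial model toward the most recently separated result. The function name, the blend rule, and the schedule are hypothetical, and no harmonic or inharmonic model is fitted here.

```python
from typing import List, Tuple


def separate(
    observed: List[float],
    initial_models: List[List[float]],
    n_iter: int = 10,
    eps: float = 1e-12,
) -> Tuple[List[List[float]], List[List[float]]]:
    """Toy iterative separation of one power-spectrum frame into sources.

    Returns (separated spectra, final updated models), one list per source.
    """
    n_bins = len(observed)
    models = [list(m) for m in initial_models]
    separated: List[List[float]] = []
    for it in range(n_iter):
        # Distribution functions: each model's share of the total power per bin.
        totals = [sum(m[b] for m in models) + eps for b in range(n_bins)]
        separated = [
            [observed[b] * m[b] / totals[b] for b in range(n_bins)]
            for m in models
        ]
        # Assumed schedule: start close to the initial model and end
        # close to the most recently separated spectrum.
        alpha = (it + 1) / n_iter
        models = [
            [(1.0 - alpha) * i0 + alpha * s for i0, s in zip(init, sep)]
            for init, sep in zip(initial_models, separated)
        ]
    return separated, models
```

In this sketch the per-bin sum of the separated spectra reproduces the observed spectrum, and after the last iteration the updated models coincide with the separated spectra.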
Abstract:
A language understanding device includes: a language understanding model storing unit configured to store word transition data including pre-transition states, input words, predefined outputs corresponding to the input words, word weight information, and post-transition states, and concept weighting data including concepts obtained from language understanding results for at least one word, and concept weight information corresponding to the concepts; a finite state transducer processing unit configured to output understanding result candidates including the predefined outputs, to accumulate word weights so as to obtain a cumulative word weight, and to sequentially perform state transition operations; a concept weighting processing unit configured to accumulate concept weights so as to obtain a cumulative concept weight; and an understanding result determination unit configured to determine an understanding result from the understanding result candidates by referring to the cumulative word weight and the cumulative concept weight.
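How the two cumulative weights interact can be sketched in a few lines of Python. This is an illustration only: the transition-table layout, the function names, and the rule of adding the cumulative word weight to the cumulative concept weight are assumptions. The abstract states only that the result is determined by referring to both weights.

```python
from typing import Dict, List, Optional, Tuple

# (pre-transition state, input word) -> [(output, word weight, post-transition state)]
Transitions = Dict[Tuple[int, str], List[Tuple[str, float, int]]]


def run_fst(words: List[str], transitions: Transitions,
            start: int = 0) -> List[Tuple[List[str], float]]:
    """Follow every matching transition word by word.

    Returns one (outputs, cumulative word weight) pair per surviving path.
    """
    candidates = [(start, [], 0.0)]
    for word in words:
        next_candidates = []
        for state, outputs, weight in candidates:
            for out, w, post in transitions.get((state, word), []):
                new_outputs = outputs + [out] if out else outputs
                next_candidates.append((post, new_outputs, weight + w))
        candidates = next_candidates
    return [(outputs, weight) for _, outputs, weight in candidates]


def cumulative_concept_weight(outputs: List[str],
                              concept_weights: Dict[str, float]) -> float:
    """Sum the weights of the concepts found in one candidate."""
    return sum(concept_weights.get(c, 0.0) for c in outputs)


def understand(words: List[str], transitions: Transitions,
               concept_weights: Dict[str, float]) -> Optional[List[str]]:
    """Pick the candidate with the highest combined weight (assumed rule: sum)."""
    scored = [
        (weight + cumulative_concept_weight(outputs, concept_weights), outputs)
        for outputs, weight in run_fst(words, transitions)
    ]
    if not scored:
        return None
    return max(scored, key=lambda pair: pair[0])[1]
```

With a hypothetical table in which "tokyo" can map to either `dest=tokyo` (word weight 1.0) or `origin=tokyo` (word weight 1.2), a concept weight of 0.5 on `dest=tokyo` is enough to flip the decision in its favor.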
Abstract:
An automatic speech recognition system includes: a sound source localization module for localizing the sound direction of a speaker based on acoustic signals detected by a plurality of microphones; a sound source separation module for separating a speech signal of the speaker from the acoustic signals according to the sound direction; an acoustic model memory that stores direction-dependent acoustic models adjusted to a plurality of directions at intervals; an acoustic model composition module that, based on the direction-dependent acoustic models, composes an acoustic model adjusted to the sound direction localized by the sound source localization module and stores the composed acoustic model in the acoustic model memory; and a speech recognition module that recognizes features extracted by a feature extractor as character information using the acoustic model composed by the acoustic model composition module.
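One simple way to picture the composition step is interpolation between the two stored models nearest to the localized direction. The Python sketch below assumes exactly that: each direction-dependent model is reduced to a flat parameter vector and the composition is linear interpolation. The abstract does not specify the composition rule, so both the representation and the function name `compose_model` are hypothetical.

```python
from bisect import bisect_right
from typing import Dict, List


def compose_model(direction_deg: float,
                  models: Dict[float, List[float]]) -> List[float]:
    """Compose a parameter vector for `direction_deg` from models stored
    at discrete directions (assumed rule: linear interpolation)."""
    angles = sorted(models)
    # Outside the stored range, fall back to the nearest stored model.
    if direction_deg <= angles[0]:
        return list(models[angles[0]])
    if direction_deg >= angles[-1]:
        return list(models[angles[-1]])
    upper = bisect_right(angles, direction_deg)
    a0, a1 = angles[upper - 1], angles[upper]
    t = (direction_deg - a0) / (a1 - a0)
    return [(1.0 - t) * p0 + t * p1
            for p0, p1 in zip(models[a0], models[a1])]
```

A direction that coincides with a stored one returns that stored model unchanged, so composition only matters between the sampled directions.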
Abstract:
An ultra-directional speaker having a modulator 33 for modulating an ultrasonic carrier signal with an input electric signal from an audible sound signal source, and an emitter 44 for emitting the output of the modulator 33, is mounted in a moving object 1 having a target tracking system for sensing a target in the surrounding space in real time using the emitter 44. The moving object equipped with the ultra-directional speaker can therefore transmit a voice only to a specific target through the parametric action caused by the finite-amplitude nonlinearity of ultrasonic waves.
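The modulator stage can be sketched as conventional amplitude modulation of an ultrasonic carrier. This is an assumption: the abstract does not name the modulation scheme, and the 40 kHz carrier, the modulation depth, and the function name `modulate` are illustrative values only.

```python
import math
from typing import List


def modulate(audio: List[float], sample_rate: int,
             carrier_hz: float = 40000.0, depth: float = 0.8) -> List[float]:
    """Amplitude-modulate an ultrasonic carrier with an audible signal.

    `audio` is expected in [-1, 1]; values outside are clamped to avoid
    overmodulation. The sample rate must exceed twice the carrier frequency.
    """
    if sample_rate <= 2.0 * carrier_hz:
        raise ValueError("sample_rate must exceed twice the carrier frequency")
    out = []
    for n, a in enumerate(audio):
        a = max(-1.0, min(1.0, a))
        carrier = math.sin(2.0 * math.pi * carrier_hz * n / sample_rate)
        out.append((1.0 + depth * a) * carrier)
    return out
```

The audible signal is recovered in the air itself: the nonlinearity of the finite-amplitude ultrasonic wave demodulates the envelope along the narrow beam, which is the parametric action the abstract refers to.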
Abstract:
A robot auditory apparatus and system are disclosed that can attain active perception when collecting sound from an external target, unaffected by noise generated inside the robot, such as noise emitted by the robot's driving elements. The apparatus and system are for a robot having a noise generating source in its interior, and include: a sound insulating cladding (14) that covers at least a portion of the robot; at least two outer microphones (16 and 16) disposed outside the cladding (14) for primarily collecting external sound; at least one inner microphone (17) disposed inside the cladding (14) for primarily collecting noise from the noise generating source in the robot interior; a processing section (23, 24) responsive to signals from the outer and inner microphones (16 and 16; and 17) for canceling, from the respective sound signals from the outer microphones (16 and 16), noise signals from the interior noise generating source and then issuing a left and a right sound signal; and a directional information extracting section (27) responsive to the left and right sound signals from the processing section (23, 24) for determining the direction from which the external sound is emitted. The processing section (23, 24) is adapted to detect burst noises from the noise generating source in a signal from the at least one inner microphone (17) and to remove, from the sound signals, the signal portions in the bands containing the burst noises.
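The band-removal idea in the last sentence can be sketched in Python as follows. The detection rule used here, flagging a band when the inner microphone magnitude exceeds a multiple of both outer microphone magnitudes, is an assumption chosen for illustration, as are the function name and the `ratio` parameter. The abstract says only that burst noises are detected from the inner microphone signal and the affected bands are removed.

```python
from typing import List, Tuple


def remove_burst_bands(outer_left: List[float], outer_right: List[float],
                       inner: List[float],
                       ratio: float = 1.0) -> Tuple[List[float], List[float]]:
    """Zero the frequency bands judged to contain burst noise.

    All inputs are per-band magnitudes for a single analysis frame.
    A band is flagged when the inner microphone is louder than
    `ratio` times the louder of the two outer microphones.
    """
    left, right = list(outer_left), list(outer_right)
    for band, inner_mag in enumerate(inner):
        if inner_mag > ratio * max(outer_left[band], outer_right[band]):
            left[band] = 0.0
            right[band] = 0.0
    return left, right
```

The cleaned left and right spectra would then be passed to the direction-extraction stage, which only sees bands that are not dominated by internal noise.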