摘要:
A system for reconstructing a signal waveform from a correlogram is based upon the recognition that the information in each channel of the correlogram is equivalent to the magnitude of the Fourier transform of a signal. By estimating a signal on the basis of its Short-Time Fourier Transform Magnitude, each channel of information from a cochlear model can be reconstructed. Once this information is retrieved, a signal waveform can be resynthesized through inversion of the cochlear model. The process for reconstructing the cochlear model data can be optimized with the use of techniques for improving the initial estimate of the signal from the magnitude of its Fourier Transform, and by employing information that is known apriori about the signal during the estimation process, such as the characteristics of sound signals.
摘要:
A stimulus waveform is processed using a model of the human auditory system to provide a plurality of output waveforms. Each output waveform corresponds to excitation at different locations along the basilar membrane in the cochlea, and matches the narrow frequency bandwidth, short time response, and wave propagation characteristics of the human cochlea. Primary feature detection is achieved by comparing response waveforms and their spatial and time derivatives to predetermined stereotypes. Secondary feature detection is achieved by comparing spatial and temporal patterns of primary features with patterns stereotypical of human speech elements.
摘要:
A speech unit for producing preselected words or phrases based on the orientation of a toy doll or figure. A gravity sensing means produces an output corresponding to the orientation of the sensing means with respect to gravity. The output of the sensing means is coupled to a speech synthesizer which produces an output based on transitions from one orientation of the sensing means to a second orientation. A timing circuit coupled to the sensing means establishes a time period during which the sensing means must maintain its orientation for an output to be realized. The timing means also is used to shut off power to the speech synthesizer and speaker means to conserve power of the circuit. In an alternate embodiment, the absolute position of the sensing means is used to select a speech output.