Abstract:
Voice verification is accomplished at a plurality of spaced apart facilities each having a plurality of terminals. Multiplexing structure interconnects the terminals through a communications link to a central processing station. Analog reproductions of voices transmitted from the terminals are converted into digital signals. The digital signals are transformed into the frequency domain at the central processing station. Predetermined features of the transformed signals are compared with stored predetermined features of each voice to be verified. A verify or non-verify signal is then transmitted to the particular terminal in response to the comparison of the predetermined features.
Abstract:
Speech recognition equipment is disclosed which distinguishes the sounds "OH" and "ONE" (corresponding respectively to the numerals 0 and 1). The recognition equipment takes advantage of the fact that the characteristic frequency of the second formant of "OH" increases with time, while that of the corresponding formant of "ONE" decreases with time. The sound to be recognized is fed to a frequency discriminator after a test has been made to distinguish the sound from background noise. The discriminator output is applied to two sampling circuits, one of which samples the discriminator output signal shortly after the presence of the sound is detected and the second of which samples the signal shortly after the first sample is taken. The samples are held in capacitive storage means. After both samples have been taken, the outputs of the capacitors are applied to a subtractor. Thus, the polarity of the output signal from the subtractor indicates whether the sound was "OH" or "ONE".
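The two-sample comparison described in this abstract can be sketched in software. This is only an illustrative sketch, not the disclosed circuit: the function name, the delay parameters, and the sign convention (second sample minus first) are assumptions, and the discriminator output is modeled as a list of second-formant frequency estimates.

```python
def classify_oh_or_one(discriminator_output, onset_index, first_delay, second_delay):
    """Classify a sound as "OH" or "ONE" from the trend of its second formant.

    discriminator_output: sequence of frequency estimates (hypothetical units, e.g. Hz).
    onset_index: index at which the sound was detected above background noise.
    first_delay / second_delay: assumed sample offsets for the two sampling circuits.
    """
    first_index = onset_index + first_delay
    second_index = first_index + second_delay
    if second_index >= len(discriminator_output):
        raise ValueError("sound is too short for both samples")

    first_sample = discriminator_output[first_index]    # held in the first capacitor
    second_sample = discriminator_output[second_index]  # held in the second capacitor

    difference = second_sample - first_sample  # subtractor (assumed sign convention)
    if difference > 0:
        return "OH"    # second formant rising with time
    if difference < 0:
        return "ONE"   # second formant falling with time
    return None        # no trend; the abstract does not cover this case
```

For example, a rising track such as `[800, 850, 900, 950, 1000]` sampled at offsets 1 and 2 after the onset gives a positive difference and is classified as "OH".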
Abstract:
Speech is synthesized by repeated readout of prestored basic speech waveforms. To vary the speech tone frequency, readout is performed at a fixed rate while skipping some of the sequentially stored samples.
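A minimal sketch of the sample-skipping idea follows, assuming the stored waveform holds exactly one period and that skipping is modeled by a fixed integer step. The function name and parameters are hypothetical and are not taken from the abstract.

```python
def read_with_skip(waveform, step, n_out):
    """Read n_out samples from a stored one-period waveform.

    One output sample is produced per tick of the fixed readout clock, but the
    read address advances by `step` stored samples each tick and wraps around,
    so step=2 skips every other stored sample.
    """
    if step < 1:
        raise ValueError("step must be a positive integer")
    length = len(waveform)
    return [waveform[(i * step) % length] for i in range(n_out)]


stored = list(range(8))  # stand-in for one prestored waveform period
normal = read_with_skip(stored, 1, 8)   # [0, 1, 2, 3, 4, 5, 6, 7]
raised = read_with_skip(stored, 2, 8)   # [0, 2, 4, 6, 0, 2, 4, 6]
```

With a step of 2 the waveform repeats twice in the same number of output ticks, so the tone frequency doubles while the readout rate stays fixed.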
Abstract:
An elevator system having an elevator car, and communication apparatus which includes a speech synthesizer for each car which provides audible, informative messages in its associated elevator car in response to its operation. Messages which are repeated within a predetermined period of time have one or more of their parameters varied, to relieve the monotony which would otherwise be caused by the repetition of identical messages presented in an identical manner, and/or to emphasize predetermined words or phrases.
Abstract:
A method of communicating digital speech data to a speech synthesis circuit. The data is compressed to on the order of 1000-1200 bits per second for normal human speech. The speech synthesis circuit utilizes linear predictive coding techniques for producing high quality speech or other sounds. The data is preferably stored in a memory which is coupled to the speech synthesis circuit. The data has variable frame lengths; in the disclosed embodiment, four different frame lengths are described, ranging from four bits to forty-nine bits. The memory stores the variable frame length data and communicates the same to the speech synthesis circuit in response to certain control signals.
Abstract:
This invention relates to a high-speed acoustic noise generating system that is under the control of a computer in order to produce spectrally shaped noise of any desired amplitude versus frequency shape. The foregoing is accomplished by randomly generating and serially storing binary noise in a first memory, by generating filter coefficients with a computer and serially storing them in a second memory, and by individually adding and accumulating the filter coefficients generated by the computer when the serially corresponding noise coefficient has the same polarity as the logic that triggers the apparatus of this invention.
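The accumulate-on-matching-polarity step can be read as a filter applied to one-bit noise without any multiplications. The sketch below illustrates that reading under stated assumptions: the pairing order between noise bits and coefficients (a sliding window, newest bit paired with the first coefficient) and the names `shaped_noise` and `trigger` are hypothetical, since the abstract does not specify them.

```python
import random


def shaped_noise(noise_bits, coeffs, trigger=1):
    """Produce shaped noise by accumulating coefficients gated by noise bits.

    noise_bits: binary noise read serially from the first memory (0 or 1).
    coeffs: filter coefficients read serially from the second memory.
    trigger: the bit polarity that enables accumulation (assumed to be 1).
    """
    output = []
    for n in range(len(coeffs) - 1, len(noise_bits)):
        accumulator = 0.0
        for k, coefficient in enumerate(coeffs):
            # Add the coefficient only when the corresponding noise bit
            # has the triggering polarity; otherwise skip it.
            if noise_bits[n - k] == trigger:
                accumulator += coefficient
        output.append(accumulator)
    return output


rng = random.Random(0)                            # seeded for repeatability
bits = [rng.randint(0, 1) for _ in range(64)]     # stand-in for the first memory
samples = shaped_noise(bits, [0.25, 0.5, 0.25])   # illustrative coefficients
```

Because each noise sample is a single bit, every product in the filter sum reduces to either adding a coefficient or skipping it, which is consistent with the high-speed operation the abstract emphasizes.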
Abstract:
A voice synthesizer of the type set forth in U.S. Pat. No. 3,836,717 wherein the control signals applied to the devices in the vocal tract model take the form of variable pulse width "duty cycle" waveforms. A novel system for producing the duty cycle signals is disclosed. Variable speech rate is provided.
Abstract:
A bandwidth compression system such as a digital vocoder including an analysis section employs a transducer to convert an input speech wave into an electrical signal which is then digitized by an analog to digital converter. The digitized signal is directed through a spectrum device where the magnitudes of the frequency spectrum of the input speech wave are obtained. These magnitudes are then directed to a logging circuit to obtain the logarithm of the frequency spectrum magnitudes of the input speech signal. The logged magnitudes of the frequency spectrum are then directed to a computer where the discrete Fourier transform of the logged spectrum magnitudes is obtained to form the Fourier transform of the logarithm of the frequency spectrum magnitude (FTLSM) of the input speech signal. An encoding unit selects and encodes certain ones of the FTLSM coefficients for transmission to a remote terminal for analysis. The encoded signals include pitch data and vocal tract impulse data, both of which are derived from the FTLSM signals. The synthesis section of a vocoder terminal employs a decoding device which decodes the received data and separates it into pitch data and vocal tract impulse data. Connected to the decoding device is a computing device for computing the logarithm of the spectrum envelope of the vocal tract impulse response function using the discrete Fourier transform. The logged spectrum is directed through a delogging device to a fast Fourier transform (FFT) computer where the Fourier sine transform of the received spectrum signals (the impulse response) is obtained. A convolution unit then convolves the pitch data with the impulse response data to yield the desired synthesized speech signal.
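The analysis chain in this abstract (spectrum magnitudes, logarithm, then a second Fourier transform) can be sketched as follows. This is a rough illustration, not the disclosed hardware: it uses a direct O(N^2) DFT for clarity, and the function names, the `floor` guard against taking the logarithm of zero, and the 1/N scaling are assumptions.

```python
import cmath
import math


def dft(values):
    """Direct discrete Fourier transform of a sequence of numbers."""
    n = len(values)
    return [
        sum(values[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def ftlsm(frame, floor=1e-12):
    """Fourier transform of the logarithm of the spectrum magnitude of a frame."""
    magnitudes = [abs(c) for c in dft(frame)]                # spectrum device
    logged = [math.log(max(m, floor)) for m in magnitudes]   # logging circuit
    # The logged magnitudes of a real frame are real and symmetric, so their
    # transform is real up to rounding error; keep only the real part.
    return [c.real / len(frame) for c in dft(logged)]


frame = [math.e] + [0.0] * 7   # scaled unit impulse: flat spectrum of magnitude e
coefficients = ftlsm(frame)    # approximately [1, 0, 0, 0, 0, 0, 0, 0]
```

A flat spectrum concentrates everything in the zeroth coefficient. For voiced speech, one would expect the low-order coefficients to follow the smooth spectral envelope and a peak at a higher index to reflect the pitch period, which would match the abstract's statement that both pitch data and vocal tract impulse data are derived from the FTLSM signals.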