Abstract:
A radio communication system includes a voice recognition system (221) for converting (400) a caller's voice message to a textual speech message. The textual speech message is then transmitted to an intended selective call radio (122). To perform these functions, the radio communication system includes a caller interface circuit (218), a transmitter (116), and a processor (222). To perform voice-to-text conversion, the processor is adapted to cause the caller interface circuit to sample a voice signal generated by the caller during a plurality of frame intervals, and to apply a Fourier transform thereto, thereby generating spectral data. The spectral data is subdivided into a plurality of bands. The spectral envelope of the spectral data is then filtered out to generate filtered spectral data. A Fourier transform is applied thereto to generate an autocorrelation function for each band. From the autocorrelation function of each band, a magnitude is determined, which is representative of the degree of voiceness of each band. The degree of voiceness for each band is then applied to a corresponding plurality of phoneme models, which are used to derive a textual equivalent of speech from the voice signal. The textual equivalent of speech is then transmitted to the selective call radio by way of the transmitter.
Abstract:
A method of accessing a dial-up service involves the following steps: (a) dialing a service number (172); (b) speaking a number of digits to form a first utterance (174); (c) recognizing the digits using speaker independent speaker recognition (176); (d) when a user has used the dial-up service previously, verifying the user based on the first utterance using a speaker verification system (178); (e) when the user cannot be verified, requesting the user enter a personal identification number (182); and (f) when the personal identification number is valid (184), providing access the dial-up service (186).
Abstract:
Systems and methods of the present invention provide for at least one processor executing program code instructions on a server computer coupled to a network. The program code instructions cause the server computer to receive from a user client an assessment audio file. The instructions also cause the computer to extract a plurality of audio features from the assessment audio file using a voice profile module. In addition, the instructions cause the computer to store the assessment audio file and extracted features in a database. Further, the instructions cause the computer to calculate a candidate confidence score indicating the probability that the assessment audio file is from a common speaker as a previously stored audio file within the database. Lastly, the instructions cause the computer to generate a based on the candidate confidence score.
Abstract:
A processing system that is accessible via a number of speech transmission media. Access to the processing system may be made via a mobile radiotelephone, land line telephone (20), acoustic link (22), or datalink (10). Access to programs, files and data is based upon the communication media and authentification of the user.