Abstract:
A speech recognition apparatus includes a memory that stores, for each feature specific to a particular phoneme, the name and the procedure of a process performed to search for whether a feature specific to a certain type of speech is present in a feature vector series. The memory also stores a table listing the names of the processes to be performed for all the categories of speech to be recognized. The information stored in the memory is used to discriminate between two categories and provides ways of interpreting the results of the process. Recognition, including the discrimination, is performed in accordance with the information stored in the table.
Abstract:
A noise reduction system used for transmission and/or recognition of speech includes a speech analyzer for analyzing a noisy speech input signal thereby converting the speech signal into feature vectors such as autocorrelation coefficients, and a neural network for receiving the feature vectors of the noisy speech signal as its input. The neural network extracts from a codebook an index of prototype vectors corresponding to a noise-free equivalent to the noisy speech input signal. Feature vectors of speech are read out from the codebook on the basis of the index delivered as an output from the neural network, thereby causing the speech input to be reproduced on the basis of the feature vectors of speech read out from the codebook.
Abstract:
A speech recognition apparatus has a speech input unit for inputting speech; a speech analysis unit for analyzing the input speech and outputting a time series of feature vectors; a candidate selection unit that receives the time series of feature vectors from the speech analysis unit and selects a plurality of recognition candidates from the speech categories; and a discrimination processing unit for discriminating among the selected candidates to obtain a final recognition result. The discrimination processing unit includes three components: a pair generation unit for generating all two-candidate combinations of the n candidates selected by the candidate selection unit; a pair discrimination unit for deciding, for each of the nC2 combinations (pairs), which candidate is more certain on the basis of the extracted acoustic features intrinsic to each candidate speech; and a final decision unit for collecting the pair discrimination results for all nC2 pairs to decide the final result. The pair discrimination unit handles the extracted acoustic features intrinsic to each candidate speech as fuzzy information and performs the discrimination on the basis of fuzzy logic algorithms, and the final decision unit likewise collects the results on the basis of fuzzy logic algorithms.
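The pair generation, pair discrimination, and final decision steps described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not say which fuzzy operators are used, so the sketch assumes the fuzzy complement (1 - mu) for the losing side of a pair and the minimum operator (fuzzy AND) for the final collection. The names `decide`, `pair_membership`, and `TOY` are hypothetical.

```python
from itertools import combinations


def decide(candidates, pair_membership):
    """Pick a final candidate from pairwise fuzzy discrimination results.

    pair_membership(a, b) is assumed to return a value in [0, 1]: the degree
    to which candidate a is more certain than candidate b.
    """
    if len(candidates) < 2:
        raise ValueError("need at least two candidates to form a pair")

    support = {c: [] for c in candidates}
    # Pair generation: all nC2 two-candidate combinations.
    for a, b in combinations(candidates, 2):
        mu = pair_membership(a, b)       # pair discrimination result
        support[a].append(mu)
        support[b].append(1.0 - mu)      # assumed fuzzy complement for the other side

    # Final decision: fuzzy AND (minimum) over each candidate's pair results,
    # then choose the candidate with the largest combined membership.
    overall = {c: min(values) for c, values in support.items()}
    best = max(overall, key=overall.get)
    return best, overall


# Toy membership values, invented purely for illustration.
TOY = {("p", "t"): 0.8, ("p", "k"): 0.6, ("t", "k"): 0.3}
best, overall = decide(["p", "t", "k"], lambda a, b: TOY[(a, b)])
```

With the minimum operator, a candidate must win every one of its pairs reasonably clearly to score well; other fuzzy aggregation operators would give a softer decision.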
Abstract:
A speech recognition apparatus has a speech input unit for inputting speech; a speech analysis unit for analyzing the input speech and outputting a time series of feature vectors; a candidate selection unit that receives the time series of feature vectors from the speech analysis unit and selects a plurality of recognition candidates from the speech categories; and a discrimination processing unit for discriminating among the selected candidates to obtain a final recognition result. The discrimination processing unit includes three components: a pair generation unit for generating all two-candidate combinations of the n candidates selected by the candidate selection unit; a pair discrimination unit for deciding, for each of the nC2 combinations (pairs), which candidate is more certain on the basis of the extracted acoustic features intrinsic to each candidate speech; and a final decision unit for collecting the pair discrimination results for all nC2 pairs to decide the final result. The pair discrimination unit handles the extracted acoustic features intrinsic to each candidate speech as fuzzy information and performs the discrimination on the basis of fuzzy logic algorithms, and the final decision unit likewise collects the results on the basis of fuzzy logic algorithms.
Abstract:
In this speech recognition system, a set of templates for each phoneme includes clusters of speech patterns based on two speech features: "physical" features (formant spectra of men versus women) and "utterance" features (unvoiced vowels and nasalization), derived from a plurality of reference speakers.
Abstract:
Speech sound recognition is performed using a reduced number of speech parameter elements, e.g., five correlation coefficients rather than sixteen spectral coefficients. The five correlation coefficients are derived by comparing the spectral coefficients of an unknown or standard sound against the spectral coefficients of five highly separable vowel-like sounds. The correlation coefficients of the unknown sound are then compared with those of the standard sounds for recognition.
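A minimal sketch of this kind of parameter reduction is given below. It assumes the Pearson correlation coefficient as the correlation measure and a squared Euclidean distance for the final comparison; the abstract specifies neither, and the function names (`pearson`, `reduce_features`, `recognize`) are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length spectra (0.0 if either is flat)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def reduce_features(spectrum, references):
    """Replace a spectrum by its correlations with each reference spectrum."""
    return [pearson(spectrum, ref) for ref in references]


def recognize(unknown, standards, references):
    """Return the label of the standard sound closest in correlation space.

    standards maps a label to a spectrum; the squared Euclidean distance
    used here is an assumption, not taken from the abstract.
    """
    u = reduce_features(unknown, references)

    def distance(label):
        s = reduce_features(standards[label], references)
        return sum((a - b) ** 2 for a, b in zip(u, s))

    return min(standards, key=distance)
```

With five reference spectra of sixteen coefficients each, `reduce_features` maps a sixteen-element spectrum to five values, which is the reduction the abstract describes.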
Abstract:
A character voice communication system organically integrates a high-efficiency voice coding system, which encodes and transmits speech information at high efficiency, with a voice character input/output system, which converts speech information into character information or receives character information and transmits speech or character information. A speech analyzer and a speech synthesizer are shared by the voice coding system and the voice character input/output system. A communication apparatus is also provided that allows mutual conversion between speech signals and character codes.
Abstract:
Speech signal presence is decided if total signal power is above a first threshold, and if either low or high frequency components exceed thresholds as a large fraction of the total power. Total power is calculated as the zero-order auto-correlation coefficient, and fractional power of frequency components is calculated as the first-order partial auto-correlation coefficient.
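The decision rule can be sketched as below, under several assumptions: the frame is a plain list of samples, the autocorrelation is left unnormalized, and the first-order partial autocorrelation coefficient is taken as k1 = R(1)/R(0), with k1 near +1 read as low-frequency dominance and k1 near -1 as high-frequency dominance (sign conventions differ between texts). The function names and threshold parameters are hypothetical.

```python
def autocorr(frame, lag):
    """Unnormalized autocorrelation of a frame at the given lag."""
    return sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))


def is_speech(frame, power_threshold, low_threshold, high_threshold):
    """Decide speech presence from total power and spectral balance."""
    r0 = autocorr(frame, 0)            # total power (zero-order coefficient)
    if r0 == 0 or r0 <= power_threshold:
        return False
    k1 = autocorr(frame, 1) / r0       # first-order partial autocorrelation
    low_dominant = k1 >= low_threshold       # energy concentrated at low frequencies
    high_dominant = k1 <= -high_threshold    # energy concentrated at high frequencies
    return low_dominant or high_dominant
```

For example, `is_speech(frame, power_threshold=1.0, low_threshold=0.5, high_threshold=0.5)` rejects both quiet frames and loud frames whose energy is spread evenly across the band.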
Abstract:
A speech recognition method makes it possible to improve the accuracy of recognition of input speech and is capable of operating on a real time basis. This is accomplished by generating from the input speech signal a difference signal which indicates whether the speech power of the input speech is increasing or decreasing for each frame. The similarity between the input speech and a standard pattern is then calculated for each frame, and this is then followed by correcting the similarity calculation on the basis of the generated difference signal and a difference signal relating to the standard pattern obtained from storage. The matching of the input speech and the standard pattern is then effected by using the corrected similarity, and the input speech is then recognized from the result of this matching. Thus, a spectrum matching distance weighted by power information of speech can be obtained in real time.
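One way to picture the difference signal and the similarity correction is sketched below. The abstract does not give the correction formula, so the rule used here (scaling the frame distance up by a fixed penalty when the input and the standard pattern disagree on whether power is rising or falling) is purely an assumption for illustration, as are the names `power_trend` and `corrected_distance` and the `penalty` parameter.

```python
def power_trend(powers):
    """Per-frame difference signal: +1 if power rose, -1 if it fell, 0 otherwise.

    The first frame has no predecessor and is assigned 0.
    """
    trend = [0]
    for prev, cur in zip(powers, powers[1:]):
        trend.append((cur > prev) - (cur < prev))
    return trend


def corrected_distance(distance, input_trend, standard_trend, penalty=0.5):
    """Penalize a frame distance when the two power trends are opposite.

    The multiplicative penalty is an assumed correction rule, not the
    patent's formula.
    """
    if input_trend * standard_trend < 0:
        return distance * (1.0 + penalty)
    return distance
```

The corrected per-frame distances would then feed whatever matching procedure is used (for instance, dynamic time warping) in place of the raw distances.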
Abstract:
A method for detecting a malfunction of a cooling system of an internal combustion engine, comprising: detecting a coolant temperature on a side of the engine; determining a malfunction of the cooling system with the coolant temperature and the value only after a cold start of the engine.