摘要:
A telephone speech recognition system is provided which has a high level of speech recognition without affecting from various conditions of a telephone line. The system comprises speech analyzers (4) and (5), and reference speech model storages (7) to (9) corresponding to line connection data. A telephone line interface (1) having a line connection data acquisition function analyzes a call received from the telephone line for identifying the country, route, and other information of the call and transmits those line connection data to a line connection data processor (2). The line connection data processor (2) selects one of the acoustic analyzers (4) and (5) in response to the line connection data from the interface (1) and also one of the speech model storages (7) to (9). A speech pattern matcher (11) compares an acoustic vector train output of the selected acoustic analyzer with the speech models given from the selected reference speech model storage for speech recognition.
摘要:
A speech endpoint detection unit determines that a first condition is met when a matching score of a partial sentence accepted by a grammatical rule unit is the highest of all partial sentences, and that a second condition is met when a duration time of input speech determined to coincide with a silent standard pattern is longer than a predetermined time. A speech endpoint is determined when the first and second conditions are both met and a speech endpoint detection signal is sent to a word prediction unit and a recognition result output unit. By requiring that both first and second conditions be met, a speech endpoint can correctly be detected even when a long silent period is present in the course of a sentence.
摘要:
This invention relates to a method of recognizing input speech of many unspecific people. Feature parameters representing both a short-time average spectrum envelope characteristic of the input speech, and regression coefficients obtained from the outputs from a wide-band filter bank. The regression coefficients represent the rough directionality of the characteristic of change in the spectrum of the speech signal. Distance is measured between the feature parameters and standard patterns stored in a storage means. The distance between the feature parameters and the stored pattern which is smallest of all the patterns is found to recognize said input speech.
摘要:
This invention is characterized by providing an apparatus for automatically discriminating service users which can prevent troubles from occurring between the service provider and users because of difference in language. When the service provider has a call from a user, the service provider makes an announcement to the user to request the voicing of a specific keyword. If an answer by the keyword is made from the user, it is determined whether or not it is an answer by the correct keyword. If correct, the call is connected to the service provider. If not correct, how to use the service is announced in another language. Then, the line is disconnected. As a result, a user who is weak in the language used by the service provider can be prevented from being connected to the service provider.