Abstract:
The present invention relates to a method and arrangement for determining stresses in a spoken sequence. From a sequence recognized in the spoken speech, a model of the speech is created. By comparing the spoken sequence with the modelled speech, a difference between them is obtained. The difference is utilized partly for correcting the modelled speech and partly for determining stresses in the spoken sequence. After having determined the stresses in the speech, it is possible to determine the meaning of individual words and sentences in an unambiguous manner. This is then utilized in different contexts, for example when translating a first language to a second language whilst retaining meaning and intonation. The invention can also be used in verbal man-to-machine communication.
Abstract:
The present invention relates to a process for evaluating speech quality in speech synthesizers with the aid of a speech recognition system. The recognition system is trained using speech from a number of persons. It then receives synthetic speech from speech synthesizers and natural speech from persons, each displaying a different speech quality. The speech recognition system determines a level of recognition for each received speech quality. To evaluate the speech quality of a speech synthesizer, the speech recognition system receives speech from the synthesizer, allocates it a level of recognition, and ranks it against the levels of recognition for previously received speech.
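The ranking step described above can be illustrated with a minimal sketch. It assumes that each level of recognition is a single number where higher means better recognized; the function name, the labels, and the example values are hypothetical and do not come from the abstract.

```python
def rank_recognition_level(reference_levels, new_level):
    """Return the 1-based rank of new_level among the reference levels.

    Assumption: a higher level of recognition is better, so rank 1 means
    the new speech was recognized better than all previously received speech.
    """
    # Count how many previously received levels are strictly better.
    better = sum(1 for level in reference_levels if level > new_level)
    return better + 1


# Hypothetical levels for previously received natural and synthetic speech.
reference_levels = {"speaker_a": 0.95, "speaker_b": 0.90, "synth_x": 0.70}

# Rank a synthesizer under test against them.
rank = rank_recognition_level(reference_levels.values(), 0.80)
```

Here the synthesizer under test would be placed below both human speakers and above the hypothetical `synth_x`.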
Abstract:
The present invention relates to a method and device for speech-to-text conversion. The fundamental tone is extracted from a given speech. A model of the speech is also created from the speech. The model yields a reproduction of the durations in words and sentences. These modelled durations are compared with the segment durations in the speech. The comparison yields information that determines which type of accent is present, and a text with sentence accent information is then produced.
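The duration comparison can be sketched roughly as follows. The abstract does not say how a deviation is mapped to an accent type, so this example only computes the per-segment deviation and flags segments that are noticeably longer than the model predicts. The function names, the threshold, and the sample durations (in seconds) are assumptions made for illustration.

```python
def duration_deviations(modelled, measured):
    """Return measured minus modelled duration for each segment."""
    if len(modelled) != len(measured):
        raise ValueError("modelled and measured must have the same length")
    return [m - e for e, m in zip(modelled, measured)]


def lengthened_segments(modelled, measured, threshold):
    """Return indices of segments whose measured duration exceeds the
    modelled duration by more than threshold (a hypothetical criterion)."""
    deviations = duration_deviations(modelled, measured)
    return [i for i, d in enumerate(deviations) if d > threshold]


# Hypothetical durations in seconds for three segments.
modelled = [0.25, 0.5, 0.25]
measured = [0.25, 0.75, 0.25]

deviations = duration_deviations(modelled, measured)
candidates = lengthened_segments(modelled, measured, threshold=0.125)
```

In this sketch only the second segment is flagged as a candidate for carrying an accent; deciding the accent type would need the additional information the abstract refers to, such as the fundamental tone.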
Abstract:
The present invention relates to a method and device for determining the quality of speech. The speech to be evaluated is listened to by a person who reproduces it. The stops of vowel sounds in the produced speech and in the reproduced speech are identified. The difference between the stops of the vowel sounds is registered. An average value is formed from the differences obtained. This average value indicates the quality of the produced speech. The invention can be used to evaluate different speech-producing sources, such as equipment and/or machines, and people's ability to comprehend the speech.
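The averaging step can be sketched as below. The sketch assumes that the vowel stops are available as time stamps in seconds, that the produced and reproduced vowels are paired one to one in order, and that the absolute difference is used; none of these details is stated in the abstract, and the function name is hypothetical.

```python
def mean_vowel_stop_difference(produced_stops, reproduced_stops):
    """Average absolute difference between paired vowel stop times."""
    if not produced_stops or len(produced_stops) != len(reproduced_stops):
        raise ValueError("need two non-empty lists of equal length")
    # One difference per vowel: reproduced stop time minus produced stop time.
    differences = [abs(r - p) for p, r in zip(produced_stops, reproduced_stops)]
    return sum(differences) / len(differences)


# Hypothetical vowel stop times in seconds.
produced = [0.25, 0.75, 1.5]
reproduced = [0.5, 1.0, 1.75]

average_difference = mean_vowel_stop_difference(produced, reproduced)
```

How the average maps to a quality judgement is left open here, since the abstract says only that the average value indicates the quality of the produced speech.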
Abstract:
The present invention relates to a device and method for speech synthesis. Speech is registered and polyphones are stored. When the polyphones are registered, the movement pattern of the face is registered as well. This is done by registering a number of measuring points in the face at the same time as the polyphones are registered. When a person's speech is translated from one language into another, the polyphones and the corresponding facial movement patterns are linked to a movement model of the face. The face of the real person is then pasted over the model, so that a movement pattern corresponding to the language is obtained. The invention consequently gives the impression that the person really speaks the language in question.