摘要:
Aspects of this disclosure provide techniques for generating a viseme and corresponding intensity pair. In some embodiments, the method includes generating, by a server, a viseme and corresponding intensity pair based at least on one of a clean vocal track or corresponding transcription. The method may include generating, by the server, a compressed audio file based at least on one of the viseme, the corresponding intensity, music, or visual offset. The method may further include generating, by the server or a client end application, a buffer of raw pulse-code modulated (PCM) data based on decoding at least a part of the compressed audio file, where the viseme is scheduled to align with a corresponding phoneme.
摘要:
A main computer runs a primary program performing an ongoing task, the primary program being optimized for performance on a desktop computer. A computerized device remote from the main computer runs an adjunct program which is a modified version of the primary program and is optimized for performance in a hand free mode. Communication means provides communication between the main computer and computerized device, and the main computer and computerized device interact through the communication means so that each influences the operation of the other.
摘要:
A non-transitory processor-readable medium storing code representing instructions to be executed by a processor includes code to cause the processor to receive acoustic data representing an utterance spoken by a language learner in a non-native language in response to prompting the language learner to recite a word in the non-native language and receive a pronunciation lexicon of the word in the non-native language. The pronunciation lexicon includes at least one alternative pronunciation of the word based on a pronunciation lexicon of a native language of the language learner. The code causes the processor to generate an acoustic model of the at least one alternative pronunciation in the non-native language and identify a mispronunciation of the word in the utterance based on a comparison of the acoustic data with the acoustic model. The code causes the processor to send feedback related to the mispronunciation of the word to the language learner.
摘要:
A non-transitory processor-readable medium storing code representing instructions to be executed by a processor includes code to cause the processor to receive acoustic data representing an utterance spoken by a language learner in a non-native language in response to prompting the language learner to recite a word in the non-native language and receive a pronunciation lexicon of the word in the non-native language. The pronunciation lexicon includes at least one alternative pronunciation of the word based on a pronunciation lexicon of a native language of the language learner. The code causes the processor to generate an acoustic model of the at least one alternative pronunciation in the non-native language and identify a mispronunciation of the word in the utterance based on a comparison of the acoustic data with the acoustic model. The code causes the processor to send feedback related to the mispronunciation of the word to the language learner.
摘要:
A main computer runs a primary program performing an ongoing task, the primary program being optimized for performance on a desktop computer. A computerized device remote from the main computer runs an adjunct program which is a modified version of the primary program and is optimized for performance in a hands free mode. Communication mean provides communication between the main computer and computerized device, and the main computer and computerized device interact through the communication means so that each influences the operation of the other.
摘要:
A main computer runs a primary program performing an ongoing task, the primary program being optimized for performance on a desktop computer. A computerized device remote from the main computer runs an adjunct program which is a modified version of the primary program and is optimized for performance in a hands free mode. Communication means provides communication between the main computer and computerized device, and the main computer and computerized device interact through the communication means so that each influences the operation of the other.
摘要:
A method and system for teaching a user a target language includes developing and constructing variable potential paths of nodes representing an exchange between two participants in a dialogue, prompting and selecting a path of nodes through a conversation graph of the target language, the path of nodes defining a dialogue; and determining whether the user is ready to perform the dialogue that has been constructed and defined by the path of nodes, the determination being based on a user model which represents the user's current ability in and current knowledge of, the target language. If the user is ready to perform the dialogue, the path of nodes is executed to allow the user to perform the dialogue defined thereby; and if the user is not ready to perform the dialogue, training the user on one or more nodes of the path of nodes.
摘要:
The invention relates to a system and methods for detecting when a user is representing he is not fluent in a language in which he is fluent. The present system and methods are adapted to be used in conjunction with conventional and novel computer systems and methods and provides detection of concealment of language skills by a user.
摘要:
Herein, a method is disclosed, which may include delexicalizing a first speech segment to provide a first prosodic speech signal; storing data indicative of the first prosodic speech signal in a computer memory; audibly playing the first speech segment to a language student; prompting the student to recite the speech segment; and recording audible speech uttered by the student in response to the prompt.
摘要:
In a communication between individuals having different levels of skill in a language, communication by the more skilled individual is controlled so as to keep it at a level understandable by the lesser skilled individual. For example, a native speaker's communication with a student learning his language (the target language) is monitored by an interface and compared with a stored model representing the student's knowledge and ability in the language. Should the native speaker communicate in a way that would not be understood by the student, for example, by using vocabulary or a sentence structure beyond the student's ability, the interface will notify the native speaker. The interface might then suggest an alternate word or sentence structure to the native speaker, inviting him to use the alternate communication. The native speaker can then substitute and send the alternate communication.