System and method for talking avatar

    公开(公告)号:US11600290B2

    公开(公告)日:2023-03-07

    申请号:US17015902

    申请日:2020-09-09

    摘要: Aspects of this disclosure provide techniques for generating a viseme and corresponding intensity pair. In some embodiments, the method includes generating, by a server, a viseme and corresponding intensity pair based at least on one of a clean vocal track or corresponding transcription. The method may include generating, by the server, a compressed audio file based at least on one of the viseme, the corresponding intensity, music, or visual offset. The method may further include generating, by the server or a client end application, a buffer of raw pulse-code modulated (PCM) data based on decoding at least a part of the compressed audio file, where the viseme is scheduled to align with a corresponding phoneme.

    Generating acoustic models of alternative pronunciations for utterances spoken by a language learner in a non-native language

    公开(公告)号:US10068569B2

    公开(公告)日:2018-09-04

    申请号:US13932506

    申请日:2013-07-01

    摘要: A non-transitory processor-readable medium storing code representing instructions to be executed by a processor includes code to cause the processor to receive acoustic data representing an utterance spoken by a language learner in a non-native language in response to prompting the language learner to recite a word in the non-native language and receive a pronunciation lexicon of the word in the non-native language. The pronunciation lexicon includes at least one alternative pronunciation of the word based on a pronunciation lexicon of a native language of the language learner. The code causes the processor to generate an acoustic model of the at least one alternative pronunciation in the non-native language and identify a mispronunciation of the word in the utterance based on a comparison of the acoustic data with the acoustic model. The code causes the processor to send feedback related to the mispronunciation of the word to the language learner.

    Generating acoustic models of alternative pronunciations for utterances spoken by a language learner in a non-native language

    公开(公告)号:US10679616B2

    公开(公告)日:2020-06-09

    申请号:US16023303

    申请日:2018-06-29

    摘要: A non-transitory processor-readable medium storing code representing instructions to be executed by a processor includes code to cause the processor to receive acoustic data representing an utterance spoken by a language learner in a non-native language in response to prompting the language learner to recite a word in the non-native language and receive a pronunciation lexicon of the word in the non-native language. The pronunciation lexicon includes at least one alternative pronunciation of the word based on a pronunciation lexicon of a native language of the language learner. The code causes the processor to generate an acoustic model of the at least one alternative pronunciation in the non-native language and identify a mispronunciation of the word in the utterance based on a comparison of the acoustic data with the acoustic model. The code causes the processor to send feedback related to the mispronunciation of the word to the language learner.

    Performing a computerized language teaching lesson using a main computer and a mobile device
    6.
    发明授权
    Performing a computerized language teaching lesson using a main computer and a mobile device 有权
    使用主计算机和移动设备执行计算机化语言教学课程

    公开(公告)号:US09135086B2

    公开(公告)日:2015-09-15

    申请号:US12887613

    申请日:2010-09-22

    IPC分类号: G06F9/54

    摘要: A main computer runs a primary program performing an ongoing task, the primary program being optimized for performance on a desktop computer. A computerized device remote from the main computer runs an adjunct program which is a modified version of the primary program and is optimized for performance in a hands free mode. Communication means provides communication between the main computer and computerized device, and the main computer and computerized device interact through the communication means so that each influences the operation of the other.

    摘要翻译: 主计算机运行执行正在进行的任务的主程序,主程序针对桌面计算机上的性能进行了优化。 远离主计算机的计算机化设备运行辅助程序,该辅助程序是主程序的修改版本,并针对免提模式进行了性能优化。 通信装置提供主计算机和计算机化设备之间的通信,主计算机和计算机化设备通过通信装置相互作用,从而影响另一个的操作。

    METHOD AND SYSTEM FOR CREATING CONTROLLED VARIATIONS IN DIALOGUES
    7.
    发明申请
    METHOD AND SYSTEM FOR CREATING CONTROLLED VARIATIONS IN DIALOGUES 审中-公开
    创造对话中控制变异的方法和系统

    公开(公告)号:US20140170610A1

    公开(公告)日:2014-06-19

    申请号:US14101079

    申请日:2013-12-09

    IPC分类号: G09B19/06

    CPC分类号: G09B19/06 G06F17/279 G09B7/02

    摘要: A method and system for teaching a user a target language includes developing and constructing variable potential paths of nodes representing an exchange between two participants in a dialogue, prompting and selecting a path of nodes through a conversation graph of the target language, the path of nodes defining a dialogue; and determining whether the user is ready to perform the dialogue that has been constructed and defined by the path of nodes, the determination being based on a user model which represents the user's current ability in and current knowledge of, the target language. If the user is ready to perform the dialogue, the path of nodes is executed to allow the user to perform the dialogue defined thereby; and if the user is not ready to perform the dialogue, training the user on one or more nodes of the path of nodes.

    摘要翻译: 用于向用户教授目标语言的方法和系统包括开发和构建表示对话中的两个参与者之间的交换的节点的可变潜在路径,通过目标语言的会话图来提示和选择节点的路径,节点的路径 定义对话 并且确定用户是否准备好执行由节点的路径构建和定义的对话,该确定基于表示用户目前语言的当前能力和当前知识的用户模型。 如果用户准备好进行对话,则执行节点的路径以允许用户执行由此定义的对话; 并且如果用户未准备好进行对话,则在节点路径的一个或多个节点上训练用户。

    System and Method for Teaching Non-Lexical Speech Effects
    9.
    发明申请
    System and Method for Teaching Non-Lexical Speech Effects 有权
    非词汇语音效果教学系统与方法

    公开(公告)号:US20120065977A1

    公开(公告)日:2012-03-15

    申请号:US12878402

    申请日:2010-09-09

    IPC分类号: G10L13/08

    摘要: Herein, a method is disclosed, which may include delexicalizing a first speech segment to provide a first prosodic speech signal; storing data indicative of the first prosodic speech signal in a computer memory; audibly playing the first speech segment to a language student; prompting the student to recite the speech segment; and recording audible speech uttered by the student in response to the prompt.

    摘要翻译: 这里,公开了一种方法,其可以包括对第一语音段进行去自动化以提供第一韵律语音信号; 将指示所述第一韵律语音信号的数据存储在计算机存储器中; 向语言学生播放第一个语音段; 促使学生背诵言语片段; 并记录学生响应提示语音发出的声音。

    Method and apparatus for improving language communication
    10.
    发明授权
    Method and apparatus for improving language communication 有权
    改善语言沟通的方法和装置

    公开(公告)号:US08840400B2

    公开(公告)日:2014-09-23

    申请号:US12488778

    申请日:2009-06-22

    IPC分类号: G09B19/08 G09B29/06

    CPC分类号: G09B29/06

    摘要: In a communication between individuals having different levels of skill in a language, communication by the more skilled individual is controlled so as to keep it at a level understandable by the lesser skilled individual. For example, a native speaker's communication with a student learning his language (the target language) is monitored by an interface and compared with a stored model representing the student's knowledge and ability in the language. Should the native speaker communicate in a way that would not be understood by the student, for example, by using vocabulary or a sentence structure beyond the student's ability, the interface will notify the native speaker. The interface might then suggest an alternate word or sentence structure to the native speaker, inviting him to use the alternate communication. The native speaker can then substitute and send the alternate communication.

    摘要翻译: 在具有不同语言技能水平的个体之间的通信中,由技术熟练的个人进行的通信被控制以便将其保持在较低技术人员能够理解的水平。 例如,母语者与学习其语言(目标语言)的学生的沟通由界面监控,并与代表学生的语言知识和能力的存储模型进行比较。 如果母语者以不能被学生理解的方式进行交流,例如,通过使用词汇或句子结构超出学生的能力,接口将通知母语者。 界面可能会向母语者提出一个替代单词或句子结构,邀请他使用备用通信。 母语者可以替代并发送备用通信。