TEXT TO SPEECH
    1.
    发明申请
    TEXT TO SPEECH 审中-公开
    文字转语音

    公开(公告)号:WO03065349B1

    公开(公告)日:2004-02-26

    申请号:PCT/US0302561

    申请日:2003-01-28

    Abstract: A preferred embodiment of the method for converting text to speech using a computing device having a memory is disclosed. The inventive method comprises examining a text to be spoken to an audience for a specific communications purpose, followed by marking-up the text according to a phonetic markup systems such as the Lessac System pronunciation rules notations. A set of rules to control a speech to text generator based on speech principles, such as Lessac principles. Such rules are of the tide normally implemented on prior art text-to-speech engines, and control the operation of the software and the characteristics of the speech generated by a computer using the software. A computer is used to speak the marked-up text expressively is used to speak the marked-up text expressively. The step of using a computer to speak the marked-up text expressively is repeated using alternative pronunciations of the selected style of expression where each of the tonal, structural, and consonant energies, have a different balance in the speech, are also spoken to a trained speech practitioners that listened to the spoken speech generated by the computer. The spoken speech generated by the computer is then evaluated for consistency with style criteria and/or expressiveness. And audience is then assembled and the spoken speech generated by the computer is played back to the audience. Audience comprehension of spoken speech generated by the computer is evaluated and correlated to a particular implemented rule or rules, and those rules which resulted relatively high audience comprehension are selected.

    Abstract translation: 公开了使用具有存储器的计算装置将文本转换成语音的方法的优选实施例。 本发明的方法包括检查用于特定通信目的的待观众的文本,然后根据诸如Lessac System发音规则符号的语音标记系统标记文本。 一套基于言语原则(如Lessac原则)控制语音到文本生成器的规则。 这样的规则通常在现有技术的文本到语音引擎上实现,并且控制软件的操作和使用该软件由计算机产生的语音的特性。 一个电脑用来说明标记的文字,用来表达地用来表达标记的文字。 使用计算机表达地说出标记的文本的步骤被重复使用选择的表达形式的替代发音,其中每个音调,结构和辅音能量在语音中具有不同的平衡,也被称为 训练有素的讲话从业人员聆听了计算机产生的口语演讲。 然后评估由计算机产生的口语语音与风格标准和/或表现力的一致性。 然后组合观众,并将计算机产生的口语演讲播放给观众。 对计算机产生的口语表达的听众理解进行评估,并与特定实施的规则或规则相关联,并且选择导致相对较高的受众理解的规则。

    SPEECH TRAINING METHOD WITH COLOR INSTRUCTION
    2.
    发明申请
    SPEECH TRAINING METHOD WITH COLOR INSTRUCTION 审中-公开
    语音训练方法与彩色指导

    公开(公告)号:WO2004063902A3

    公开(公告)日:2004-11-04

    申请号:PCT/US2004000769

    申请日:2004-01-09

    CPC classification number: G09B5/04 G09B19/04

    Abstract: In accordance with a present invention speech training system is disclosed. It uses a microphone to receive audible sounds input by a user (506) into a first computing device having a program with a database consisting of (i) digital representations of known audible sounds and associated alphanumeric representations of the known audible sounds, and (ii) digital representations of known audible sounds corresponding to mispronunciations resulting from known classes of mispronounced words and phrases (116). The method is performed by receiving the audible sounds in the form of the electrical output of the microphone. A particular audible sound to be recognized is converted into a digital representation of the audible sound. The digital representation of the particular sound is then compared to the digital representations of the known audible sounds to determine which of those known audible sounds is most likely to be the particular audible sound being compared to the sounds in the database. In response to a determination of error (138) corresponding to a known type or instance of mispronunciation, the system presents an interactive training program from the computer to the user to enable the user to correct such mispronunciation (144).

    Abstract translation: 根据本发明,公开了语音训练系统。 它使用麦克风来接收由用户输入的声音(506)到具有数据库的程序的第一计算设备中,该数据库包括(i)已知可听见的声音的数字表示和已知可听见的声音的相关联的字母数字表示,以及 )已知的听觉声音的数字表示,其对应于由已知类型的错误的词和短语产生的误导(116)。 该方法通过以麦克风的电输出的形式接收可听见的声音来执行。 要识别的特定可听见的声音被转换成可听见的声音的数字表示。 然后将特定声音的数字表示与已知可听见的声音的数字表示进行比较,以确定哪些已知的可听见的声音最可能是与数据库中的声音进行比较的特定可听见的声音。 响应于对应于已知类型或错误发生的实例的错误(138)的确定,系统从计算机向用户呈现交互式训练程序,以使用户能够纠正这种错误发音(144)。

    TEXT TO SPEECH
    3.
    发明申请
    TEXT TO SPEECH 审中-公开

    公开(公告)号:WO2003065349A3

    公开(公告)日:2003-08-07

    申请号:PCT/US2003/002561

    申请日:2003-01-28

    Abstract: A preferred embodiment of the method for converting text to speech using a computing device having a memory is disclosed. The inventive method comprises examining a text to be spoken to an audience for a specific communications purpose, followed by marking-up the text according to a phonetic markup systems such as the Lessac System pronunciation rules notations. A set of rules to control a speech to text generator based on speech principles, such as Lessac principles. Such rules are of the tide normally implemented on prior art text-to-speech engines, and control the operation of the software and the characteristics of the speech generated by a computer using the software. A computer is used to speak the marked-up text expressively is used to speak the marked-up text expressively. The step of using a computer to speak the marked-up text expressively is repeated using alternative pronunciations of the selected style of expression where each of the tonal, structural, and consonant energies, have a different balance in the speech, are also spoken to a trained speech practitioners that listened to the spoken speech generated by the computer. The spoken speech generated by the computer is then evaluated for consistency with style criteria and/or expressiveness. And audience is then assembled and the spoken speech generated by the computer is played back to the audience. Audience comprehension of spoken speech generated by the computer is evaluated and correlated to a particular implemented rule or rules, and those rules which resulted relatively high audience comprehension are selected.

    PROSODIC SPEECH TEXT CODES AND THEIR USE IN COMPUTERIZED SPEECH SYSTEMS
    4.
    发明申请
    PROSODIC SPEECH TEXT CODES AND THEIR USE IN COMPUTERIZED SPEECH SYSTEMS 审中-公开
    PROSODIC SPEECH TEXT CODES及其在计算机语音系统中的使用

    公开(公告)号:WO2005088606B1

    公开(公告)日:2005-11-03

    申请号:PCT/US2005007232

    申请日:2005-03-07

    CPC classification number: G09B5/04 G10L13/033 G10L13/10

    Abstract: Disclosed are a method of, and system for, acoustically coding text for use in the synthesis of speech from the text, the method comprising marking the text to be spoken with one or more graphic symbols to indicate to a speaker a desired prosody to impart to the spoken text to convey expressive meaning. The markups can comprise grapheme-phoneme pairs each comprising a visible prosodic-indicating grapheme employable with written text and a corresponding digital phoneme functional in the digital domain. The invention is useful in the generation of appealing, humanized machine speech for a wide range of applications, including voice mail systems, electronically enabled appliances, automobiles, computers, robotic assistants, games and the like, in spoken books and magazines, drama and other entertainment.

    Abstract translation: 公开了一种用于语音编码文本的方法和系统,用于从文本的语音合成中使用,该方法包括使用一个或多个图形符号标记要被说出的文本,以向演讲者指示要赋予的所需韵律 口头文字表达意义。 该标记可以包括每个包括可以用书面文本使用的可见韵律指示字形和在数字域中功能相应的数字音素的字形对音素对。 本发明可用于在口语书籍和杂志,戏剧和其他广泛的应用中产生有吸引力的人性化机器语音,包括语音邮件系统,电子使用的设备,汽车,计算机,机器人助理,游戏等 娱乐。

    PROSODIC SPEECH TEXT CODES AND THEIR USE IN COMPUTERIZED SPEECH SYSTEMS
    5.
    发明申请
    PROSODIC SPEECH TEXT CODES AND THEIR USE IN COMPUTERIZED SPEECH SYSTEMS 审中-公开
    长篇演讲文本代码及其在计算机语音系统中的应用

    公开(公告)号:WO2005088606A1

    公开(公告)日:2005-09-22

    申请号:PCT/US2005/007232

    申请日:2005-03-07

    CPC classification number: G09B5/04 G10L13/033 G10L13/10

    Abstract: Disclosed are a method of, and system for, acoustically coding text for use in the synthesis of speech from the text, the method comprising marking the text to be spoken with one or more graphic symbols to indicate to a speaker a desired prosody to impart to the spoken text to convey expressive meaning. The markups can comprise grapheme-phoneme pairs each comprising a visible prosodic-indicating grapheme employable with written text and a corresponding digital phoneme functional in the digital domain. The invention is useful in the generation of appealing, humanized machine speech for a wide range of applications, including voice mail systems, electronically enabled appliances, automobiles, computers, robotic assistants, games and the like, in spoken books and magazines, drama and other entertainment.

    Abstract translation: 公开了一种用于对来自文本的语音合成中使用的文本进行声学编码的方法和系统,所述方法包括用一个或多个图形符号标记要讲话的文本以指示 说话者是一种理想的韵律,赋予口语文本表达意义。 标记可以包括字形 - 音素对,每个字形对包括可用于书写文本的可见韵律指示字形以及在数字域中起作用的对应数字音素。 本发明可用于在口语书籍和杂志,戏剧和其他广泛应用中产生吸引人的机器语音,包括语音邮件系统,电子启用的电器,汽车,计算机,机器人助理,游戏等等。 娱乐。

    SPEECH TRAINING METHOD WITH COLOR INSTRUCTION
    6.
    发明申请
    SPEECH TRAINING METHOD WITH COLOR INSTRUCTION 审中-公开
    颜色教学的语言训练方法

    公开(公告)号:WO2004063902B1

    公开(公告)日:2004-12-23

    申请号:PCT/US2004000769

    申请日:2004-01-09

    CPC classification number: G09B5/04 G09B19/04

    Abstract: In accordance with a present invention speech training system is disclosed. It uses a microphone to receive audible sounds input by a user (506) into a first computing device having a program with a database consisting of (i) digital representations of known audible sounds and associated alphanumeric representations of the known audible sounds, and (ii) digital representations of known audible sounds corresponding to mispronunciations resulting from known classes of mispronounced words and phrases (116). The method is performed by receiving the audible sounds in the form of the electrical output of the microphone. A particular audible sound to be recognized is converted into a digital representation of the audible sound. The digital representation of the particular sound is then compared to the digital representations of the known audible sounds to determine which of those known audible sounds is most likely to be the particular audible sound being compared to the sounds in the database. In response to a determination of error (138) corresponding to a known type or instance of mispronunciation, the system presents an interactive training program from the computer to the user to enable the user to correct such mispronunciation (144).

    Abstract translation: 根据本发明,公开了语音训练系统。 它使用麦克风来接收由用户(506)输入的具有数据库的程序的第一计算设备中的可听声音,所述数据库包括(i)已知可听声音的数字表示和已知可听声音的相关字母数字表示,以及(ii )已知可听声音的数字表示,其对应于由已知类别的错误发音的单词和短语(116)产生的错误发音。 该方法通过接收麦克风的电输出形式的可听声音来执行。 要识别的特定可听声音被转换为可听声音的数字表示。 然后将特定声音的数字表示与已知可听声音的数字表示进行比较,以确定哪些已知可听声音最有可能是与数据库中的声音进行比较的特定可听声音。 响应于与错误发音的已知类型或实例相对应的错误(138)的确定,系统从计算机向用户呈现交互式训练程序以使用户能够纠正这种错误发音(144)。

    SPEECH TRAINING METHOD WITH COLOR INSTRUCTION
    7.
    发明申请
    SPEECH TRAINING METHOD WITH COLOR INSTRUCTION 审中-公开
    语音训练方法与彩色指导

    公开(公告)号:WO2004063902A2

    公开(公告)日:2004-07-29

    申请号:PCT/US2004/000769

    申请日:2004-01-09

    IPC: G06F

    CPC classification number: G09B5/04 G09B19/04

    Abstract: In accordance with a present invention speech training system is disclosed. It uses a microphone to receive audible sounds input by a user into a first computing device having a program with a database consisting of (i) digital representations of known audible sounds and associated alphanumeric representations of the known audible sounds, and (ii) digital representations of known audible sounds corresponding to mispronunciations resulting from known classes of mispronounced words and phrases. The method is performed by receiving the audible sounds in the form of the electrical output of the microphone. A particular audible sound to be recognized is converted into a digital representation of the audible sound. The digital representation of the particular audible sound is then compared to the digital representations of the known audible sounds to determine which of those known audible sounds is most likely to be the particular audible sound being compared to the sounds in the database. In response to a determination of error corresponding to a known type or instance of mispronunciation, the system presents an interactive training program from the computer to the user to enable the user to correct such mispronunciation.

    Abstract translation: 根据本发明,公开了语音训练系统。 它使用麦克风接收由用户输入的音频声音到具有数据库的程序的第一计算设备,该数据库包括(i)已知可听见的声音的数字表示和已知可听见的声音的相关联的字母数字表示,以及(ii)数字表示 已知的可听见的声音对应于由已知类型的错误的单词和短语产生的错误的声音。 该方法通过以麦克风的电输出的形式接收可听见的声音来执行。 要识别的特定可听见的声音被转换为可听见的声音的数字表示。 然后将特定可听见的声音的数字表示与已知可听见的声音的数字表示进行比较,以确定哪些已知的可听见的声音最可能是与数据库中的声音进行比较的特定可听见的声音。 响应于对于已知类型或错误发音实例的错误的确定,系统从计算机向用户呈现交互式训练程序,以使用户能够纠正这种错误发音。

    SPEECH RECOGNITION METHOD
    8.
    发明申请
    SPEECH RECOGNITION METHOD 审中-公开
    语音识别方法

    公开(公告)号:WO2004061822A1

    公开(公告)日:2004-07-22

    申请号:PCT/US2003/041697

    申请日:2003-12-31

    Abstract: In accordance with a present invention speech recognition is disclosed (10). It uses a microphone to receive audible sounds input by a user into a first computing device (28) having a program with a database (16) comprising (i) digital responses of known audible sounds and associated alphanumeric representations of the known audible sounds and for the first time (ii) digital representations of known audible sounds corresponding to mispronunciations resulting from known class of mispronounced words and phrases. The method is performed by receiving the audible sounds in the form of the electrical output of the microphone (28). A particular audible sound to be recognized is converted into a digital representation of the audible sound (30). The digital representation of the particular audible sound is then compared to the digital representations of the known audible sounds to determine which of those known audible sounds is most likely to be the particular audible sounds in the database (30).

    Abstract translation: 根据本发明,公开了语音识别(10)。 它使用麦克风来接收由用户输入到具有数据库(16)的程序的第一计算设备(28)的可听见的声音,所述数据库包括(i)已知可听见的声音和所述已知可听见的声音的相关联的字母数字表示的数字响应,并且 第一次(ii)已知的可听见的声音的数字表示对应于由已知类型的错误的单词和短语产生的误导。 通过以麦克风(28)的电输出的形式接收可听见的声音来执行该方法。 要识别的特定可听见的声音被转换成可听见的声音的数字表示(30)。 然后将特定可听见的声音的数字表示与已知可听见的声音的数字表示进行比较,以确定哪些已知的可听见的声音最有可能是数据库中的特定可听见的声音(30)。

    TEXT TO SPEECH
    9.
    发明申请
    TEXT TO SPEECH 审中-公开
    文字转语音

    公开(公告)号:WO03065349A2

    公开(公告)日:2003-08-07

    申请号:PCT/US0302561

    申请日:2003-01-28

    Abstract: A preferred embodiment of the method for converting text to speech using a computing device having a memory is disclosed. The inventive method comprises examining a text to be spoken to an audience for a specific communications purpose, followed by marking-up the text according to a phonetic markup systems such as the Lessac System pronunciation rules notations. A set of rules to control a speech to text generator based on speech principles, such as Lessac principles. Such rules are of the tide normally implemented on prior art text-to-speech engines, and control the operation of the software and the characteristics of the speech generated by a computer using the software. A computer is used to speak the marked-up text expressively. The step of using a computer to speak the marked-up text expressively is repeated using alternative pronunciations of the selected style of expression where each of the tonal, structural, and consonant energies, have a different balance in the speech, are also spoken to a trained speech practitioners that listened to the spoken speech generated by the computer. The spoken speech generated by the computer is then evaluated for consistency with style criteria and/or expressiveness. And audience is then assembled and the spoken speech generated by the computer is played back to the audience. Audience comprehension of spoken speech generated by the computer is evaluated and correlated to a particular implemented rule or rules, and those rules which resulted relatively high audience comprehension are selected.

    Abstract translation: 公开了使用具有存储器的计算装置将文本转换成语音的方法的优选实施例。 本发明的方法包括检查用于特定通信目的的待观众的文本,然后根据诸如Lessac System发音规则符号的语音标记系统标记文本。 一套基于言语原则(如Lessac原则)控制语音到文本生成器的规则。 这样的规则通常在现有技术的文本到语音引擎上实现,并且控制软件的操作和使用该软件由计算机产生的语音的特性。 一台电脑用来表达出标记的文字。 使用计算机表达地说出标记的文本的步骤被重复使用选择的表达形式的替代发音,其中每个音调,结构和辅音能量在语音中具有不同的平衡,也被称为 训练有素的讲话从业人员聆听了计算机产生的口语演讲。 然后评估由计算机产生的口语语音与风格标准和/或表现力的一致性。 然后组合观众,并将计算机产生的口语演讲播放给观众。 对计算机产生的口语表达的听众理解进行评估,并与特定实施的规则或规则相关联,并且选择导致相对较高的受众理解的规则。

Patent Agency Ranking