Speech synthesis apparatus, control method therefor and computer-readable memory
    81.
    Type: Granted patent
    Status: Lapsed

    Publication No.: US07139712B1

    Publication date: 2006-11-21

    Application No.: US09263262

    Filing date: 1999-03-05

    Applicant: Masayuki Yamada

    Inventor: Masayuki Yamada

    IPC classification: G10L13/06

    CPC classification: G10L13/07

    Abstract: A second phoneme is generated by taking into account the phonemic context of a first phoneme that is the search target. Phonemic piece data corresponding to the second phoneme is searched for in a database. A third phoneme is generated by changing the phonemic context on the basis of the search result, and phonemic piece data corresponding to the third phoneme is then searched for in the database again. The result of the search or re-search is registered in a table in correspondence with the second or third phoneme.
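
    The search and re-search described in this abstract can be sketched as a context back-off lookup. This is a minimal illustration, not the patented method: the tuple key layout, the order in which context is relaxed, and the names `find_unit` and `build_table` are all assumptions made for the example.

```python
def find_unit(database, phoneme, left, right):
    """Return (key, data) for the most specific context found, or None.

    Keys are (left, phoneme, right) tuples; None means "any context".
    The relaxation order below is an assumption for illustration.
    """
    candidates = [
        (left, phoneme, right),  # full context (the "second phoneme")
        (left, phoneme, None),   # relaxed context (a "third phoneme")
        (None, phoneme, right),
        (None, phoneme, None),   # context-free fallback
    ]
    for key in candidates:
        if key in database:
            return key, database[key]
    return None


def build_table(database, targets):
    """Register each search or re-search result under the key that matched."""
    table = {}
    for left, phoneme, right in targets:
        hit = find_unit(database, phoneme, left, right)
        if hit is not None:
            key, data = hit
            table[key] = data
    return table
```

    For example, with a database that holds `("a", "k", "i")` and `(None, "t", None)`, a request for "t" between "a" and "i" falls back to the context-free entry and is registered under that relaxed key.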

    Memory usage in a text-to-speech system
    82.
    Type: Patent application
    Status: Under examination (published)

    Publication No.: US20060229877A1

    Publication date: 2006-10-12

    Application No.: US11100001

    Filing date: 2005-04-06

    IPC classification: G10L13/06

    CPC classification: G10L13/06

    Abstract: In a concatenative text-to-speech system, a high compression rate for the duration data in the prosodic template is achieved by extracting statistical parameters that describe the behavior of the actual duration values of instances of each given syllable, phoneme, half-phoneme, diphone, triphone or any other basic speech unit employed, and storing only the extracted statistical parameters instead of the original duration values. The entries of each given basic unit in the prosodic template are sorted and indexed in order of increasing duration value. Consequently, the amount of duration data can be significantly reduced while keeping the error within a statistically acceptable range.
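
    One way to picture the idea is sketched below. The abstract does not say which statistical parameters are stored, so the normal-distribution model (mean, standard deviation, count) and the function names `compress` and `approximate_duration` are assumptions made purely for illustration. Because entries are indexed in order of increasing duration, the rank of an entry can be mapped to a quantile of the fitted distribution.

```python
from statistics import NormalDist, mean, pstdev


def compress(durations):
    """Keep only summary statistics instead of every duration value."""
    return {
        "mean": mean(durations),
        "std": pstdev(durations),
        "count": len(durations),
    }


def approximate_duration(params, rank):
    """Estimate the duration of the entry at a given rank (0 = shortest).

    Assumes durations are roughly normally distributed, which is a
    modelling choice of this sketch and not a claim from the source.
    """
    if params["std"] == 0:
        return params["mean"]
    # Map the rank to the midpoint of its quantile bin, strictly inside (0, 1).
    p = (rank + 0.5) / params["count"]
    return NormalDist(params["mean"], params["std"]).inv_cdf(p)
```

    With five instances of a unit, three numbers replace five stored durations; the saving grows with the number of instances, at the cost of an approximation error for each entry.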

    Time warping frames inside the vocoder by modifying the residual
    83.
    Type: Patent application
    Status: In force

    Publication No.: US20060206334A1

    Publication date: 2006-09-14

    Application No.: US11123467

    Filing date: 2005-05-05

    IPC classification: G10L13/06

    CPC classification: G10L19/20 G10L21/01

    Abstract: In one embodiment, the present invention comprises a vocoder having at least one input and at least one output; an encoder comprising a filter having at least one input operably connected to the input of the vocoder and at least one output; and a decoder comprising a synthesizer having at least one input operably connected to the at least one output of the encoder and at least one output operably connected to the at least one output of the vocoder. The encoder comprises a memory and is adapted to execute instructions stored in the memory comprising classifying speech segments and encoding speech segments. The decoder comprises a memory and is adapted to execute instructions stored in the memory comprising time-warping a residual speech signal to an expanded or compressed version of the residual speech signal.
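
    As a rough illustration of what expanding or compressing a residual signal means, the sketch below resamples a residual frame by linear interpolation. This is deliberately the crudest possible form and is not taken from the application: the function name `warp_residual` and the `factor` parameter are assumptions, and plain resampling shifts the pitch, which a practical vocoder would presumably need to avoid.

```python
import numpy as np


def warp_residual(residual, factor):
    """Stretch (factor > 1) or compress (factor < 1) a residual frame.

    Uses linear interpolation over the sample positions. Illustrative
    only: it does not preserve pitch.
    """
    samples = np.asarray(residual, dtype=float)
    if samples.size == 0 or factor <= 0:
        raise ValueError("need a non-empty residual and a positive factor")
    new_length = max(1, int(round(samples.size * factor)))
    old_positions = np.arange(samples.size)
    # Spread the new sample positions evenly over the original time span.
    new_positions = np.linspace(0, samples.size - 1, num=new_length)
    return np.interp(new_positions, old_positions, samples)
```

    Calling `warp_residual(frame, 2.0)` doubles the number of samples while keeping the first and last values of the frame; `warp_residual(frame, 0.5)` halves it.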

    Enhancing speech intelligibility using variable-rate time-scale modification
    84.
    Type: Granted patent
    Status: Lapsed

    Publication No.: US07065485B1

    Publication date: 2006-06-20

    Application No.: US10042880

    Filing date: 2002-01-09

    CPC classification: G10L21/04 G10L21/0364

    Abstract: The method and preprocessor enhance the intelligibility of narrowband speech without essentially lengthening the overall duration of the signal. Both spectral enhancements and variable-rate time-scaling procedures are implemented to improve the salience of initial consonants, particularly the perceptually important formant transitions. Emphasis is transferred from the dominating vowel to the preceding consonant through adaptation of the phoneme timing structure. In a further embodiment, the technique is applied as a preprocessor to a speech coder.

    Voice synthesis apparatus and method

    Publication No.: US20060015344A1

    Publication date: 2006-01-19

    Application No.: US11180108

    Filing date: 2005-07-13

    Applicant: Hideki Kemmochi

    Inventor: Hideki Kemmochi

    IPC classification: G10L13/06

    Abstract: A plurality of voice segments, each including one or more phonemes, are acquired in a time-serial manner, in correspondence with desired singing or speaking words. As necessary, a boundary is designated between the start and end points of a vowel phoneme included in any one of the acquired voice segments. Voice is synthesized for a region of the vowel phoneme that precedes the designated boundary, or for a region of the vowel phoneme that succeeds the designated boundary. By synthesizing a voice for the region preceding the designated boundary, it is possible to synthesize a voice imitating a vowel sound that a person utters and then stops sounding while keeping his or her mouth open. Further, by synthesizing a voice for the region succeeding the designated boundary, it is possible to synthesize a voice imitating a vowel sound that starts sounding with the mouth already open.

    Synthesis unit selection apparatus and method, and storage medium
    86.
    Type: Granted patent
    Status: Lapsed

    Publication No.: US06980955B2

    Publication date: 2005-12-27

    Application No.: US09818581

    Filing date: 2001-03-28

    CPC classification: G10L13/10 G10L13/04 G10L13/06

    Abstract: Input text data undergoes language analysis to generate prosody, and a speech database is searched for a synthesis unit on the basis of the prosody. A modification distortion of the found synthesis unit, and the concatenation distortions that arise when that synthesis unit is connected to those in the preceding phoneme, are computed, and a distortion determination unit weights the modification and concatenation distortions to determine the total distortion. An N-best determination unit obtains the N best paths that minimize the distortion using the A* search algorithm, and a registration unit determination unit selects synthesis units to be registered in a synthesis unit inventory on the basis of the N best paths, in order of frequency of occurrence, and registers them in the synthesis unit inventory.
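
    Two steps of this abstract lend themselves to a short sketch: the weighted total distortion, and the selection of units by how often they occur in the N best paths. The A* search itself is omitted. The weights, the representation of a path as a list of unit identifiers, and the names `total_distortion` and `units_to_register` are assumptions made for the example.

```python
from collections import Counter


def total_distortion(modification, concatenation, w_mod=0.5, w_con=0.5):
    """Weighted sum of modification and concatenation distortion.

    The default weights are placeholders, not values from the source.
    """
    return w_mod * modification + w_con * concatenation


def units_to_register(n_best_paths, inventory_size):
    """Pick the units that occur most often across the N best paths.

    Each path is a list of unit identifiers; the paths are assumed to
    have been produced already by some N-best search.
    """
    counts = Counter(unit for path in n_best_paths for unit in path)
    return [unit for unit, _ in counts.most_common(inventory_size)]
```

    Units that recur in many low-distortion paths are the ones most worth keeping when the inventory has limited room, which is the intuition behind ranking by frequency of occurrence.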

    Speech signal processing apparatus and method, and storage medium
    87.
    Type: Patent application
    Status: Under examination (published)

    Publication No.: US20050209855A1

    Publication date: 2005-09-22

    Application No.: US11126372

    Filing date: 2005-05-11

    CPC classification: G09B7/02 G09B19/04

    Abstract: A speech segment search unit searches a speech database for speech segments that satisfy a phonetic environment, and an HMM learning unit computes the HMMs of phonemes on the basis of the search result. A segment recognition unit performs segment recognition of speech segments on the basis of the computed HMMs of the phonemes, and when the phoneme of the segment recognition result is equal to a phoneme of the source speech segment, that speech segment is registered in a segment dictionary.

    Method for synthesizing speech
    88.
    Type: Patent application
    Status: In force

    Publication No.: US20050131679A1

    Publication date: 2005-06-16

    Application No.: US10511369

    Filing date: 2003-04-01

    Applicant: Ercan Gigi

    Inventor: Ercan Gigi

    CPC classification: G10L13/07 G10L25/00

    Abstract: The present invention relates to a method for analyzing speech, the method comprising the steps of: a) inputting a speech signal, b) obtaining the first harmonic of the speech signal, c) determining the phase difference Δφ between the speech signal and the first harmonic.
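
    The three steps can be illustrated with the sketch below, which is only one plausible reading of the abstract. It assumes the fundamental frequency `f0` is already known, estimates the first harmonic by projecting the frame onto a complex exponential at `f0`, and takes the phase difference to be the phase of that harmonic at the sample where the frame reaches its maximum. The function name and this definition of the difference are assumptions, not details from the source.

```python
import numpy as np


def first_harmonic_phase_at_peak(signal, f0, sample_rate):
    """Phase of the first harmonic at the signal's peak, in (-pi, pi].

    Works best when the frame spans a whole number of pitch periods.
    """
    samples = np.asarray(signal, dtype=float)
    n = np.arange(samples.size)
    omega = 2.0 * np.pi * f0 / sample_rate
    # Step b: complex amplitude of the component at f0 (single-bin DFT).
    coeff = 2.0 * np.sum(samples * np.exp(-1j * omega * n)) / samples.size
    # Step c: evaluate the harmonic's phase where the signal peaks.
    peak_index = int(np.argmax(samples))
    phase = omega * peak_index + np.angle(coeff)
    return float(np.angle(np.exp(1j * phase)))  # wrap into (-pi, pi]
```

    For a pure cosine the peak of the signal coincides with the peak of its first harmonic, so the returned value is close to zero; added higher harmonics shift the peak and produce a non-zero difference.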

    Rule based speech synthesis method and apparatus
    89.
    Type: Patent application
    Status: Lapsed

    Publication No.: US20050119889A1

    Publication date: 2005-06-02

    Application No.: US10864130

    Filing date: 2004-06-09

    Applicant: Nobuhide Yamazaki

    Inventor: Nobuhide Yamazaki

    CPC classification: G10L13/07

    Abstract: A rule-based speech synthesis apparatus by which concatenation distortion may be kept below a preset value independently of the utterance. A parameter correction unit reads out a target parameter for a vowel from a target parameter storage, responsive to the phonemes at the leading end and at the trailing end of a speech element and to the acoustic feature parameters output from a speech element selector, and corrects the acoustic feature parameters of the speech element accordingly. The parameter correction unit corrects the parameters so that the parameters ahead of and behind the speech element are equal to the target parameter for the vowel of the corresponding phoneme, and outputs the corrected parameters.

    Waveform speech synthesis
    90.
    Type: Granted patent
    Status: Lapsed

    Publication No.: US6067519A

    Publication date: 2000-05-23

    Application No.: US737206

    Filing date: 1996-11-07

    Applicant: Andrew Lowry

    Inventor: Andrew Lowry

    CPC classification: G10L13/07

    Abstract: Portions of speech waveform are joined by forming extrapolations at the end of one portion and the beginning of the next to create an overlap region with synchronous pitchmarks, and then forming a weighted sum across the overlap to provide a smooth transition.
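
    The final step, the weighted sum across the overlap, can be sketched as a linear crossfade. The sketch assumes the two portions have already been extrapolated and aligned so that their last and first `overlap` samples cover the same pitch-synchronous region; the extrapolation and pitchmark alignment are not shown. The linear weights and the name `crossfade_join` are assumptions for illustration.

```python
import numpy as np


def crossfade_join(left, right, overlap):
    """Join two waveform portions with a linear weighted sum over the overlap.

    The last `overlap` samples of `left` and the first `overlap` samples
    of `right` are assumed to cover the same time region.
    """
    left = np.asarray(left, dtype=float)
    right = np.asarray(right, dtype=float)
    if overlap <= 0 or overlap > min(left.size, right.size):
        raise ValueError("overlap must be between 1 and the shorter portion")
    fade_out = np.linspace(1.0, 0.0, overlap)  # weight applied to the left portion
    fade_in = 1.0 - fade_out                   # weight applied to the right portion
    blended = left[-overlap:] * fade_out + right[:overlap] * fade_in
    return np.concatenate([left[:-overlap], blended, right[overlap:]])
```

    Because the two weights always add up to one, a region where both portions agree passes through unchanged, and any mismatch is spread gradually across the overlap instead of appearing as a discontinuity.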

    PCT information: PCT No. PCT/GB96/00817; Sec. 371 date: 1996-11-07; Sec. 102(e) date: 1996-11-07; PCT filed: 1996-04-03; PCT Pub. No. WO96/32711; PCT Pub. date: 1996-10-17