Speech synthesizer, speech synthesis method and computer program product
    1.
    Granted invention patent
    Legal status: in force

    Publication No.: US09058807B2

    Publication date: 2015-06-16

    Application No.: US13051541

    Filing date: 2011-03-18

    IPC classification: G10L13/00 G10L13/04 G10L25/18

    CPC classification: G10L13/04 G10L25/18

    Abstract: According to one embodiment, a first storage unit stores n band noise signals obtained by applying n band-pass filters to a noise signal. A second storage unit stores n band pulse signals. A parameter input unit inputs a fundamental frequency, n band noise intensities, and a spectrum parameter. An extraction unit extracts, for each pitch mark, the n band noise signals while shifting. An amplitude control unit changes amplitudes of the extracted band noise signals and band pulse signals in accordance with the band noise intensities. A generation unit generates a mixed sound source signal by adding the n band noise signals and the n band pulse signals. A generation unit generates the mixed sound source signal generated based on the pitch mark. A vocal tract filter unit generates a speech waveform by applying a vocal tract filter using the spectrum parameter to the generated mixed sound source signal.
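
    The mixing step in this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name mixed_excitation, the complementary pulse weight (1 - intensity), the wrap-around read of the stored noise, and the use of one fixed set of intensities for all pitch marks are assumptions made for the example. Band-pass filtering and the vocal tract filter are left out.

```python
def mixed_excitation(band_noise, band_pulse, pitch_marks, intensities, total_len):
    """Overlap-add a pulse/noise mixture at every pitch mark.

    band_noise[b]  : stored noise samples for band b (already band-passed)
    band_pulse[b]  : stored pulse samples for band b
    pitch_marks    : sample indices where each pitch cycle starts
    intensities[b] : noise intensity of band b, in [0, 1]
    """
    out = [0.0] * total_len
    offset = 0  # read position in the stored noise, shifted per pitch mark
    for mark in pitch_marks:
        for noise, pulse, g_noise in zip(band_noise, band_pulse, intensities):
            g_pulse = 1.0 - g_noise  # assumed complementary weighting
            for i, p in enumerate(pulse):
                t = mark + i
                if t >= total_len:
                    break
                n = noise[(offset + i) % len(noise)]
                out[t] += g_noise * n + g_pulse * p
        offset += len(band_pulse[0])
    return out
```

    With an intensity of 0 a band contributes only its pulse, and with an intensity of 1 only its noise; the returned list would then be passed to a vocal tract filter driven by the spectrum parameter.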

    Speech synthesis apparatus and method wherein more than one speech unit is acquired from continuous memory region by one access
    2.
    Granted invention patent
    Legal status: in force

    Publication No.: US08468020B2

    Publication date: 2013-06-18

    Application No.: US11745785

    Filing date: 2007-05-08

    IPC classification: G10L13/04 G10L13/06 G10L13/00

    Abstract: An apparatus for synthesizing speech includes a waveform memory that stores a plurality of speech unit waveforms, an information memory that stores speech unit information in correspondence with an address of each of the speech unit waveforms, a selector that selects a speech unit sequence corresponding to an input phoneme sequence by referring to the speech unit information, a speech unit waveform acquisition unit that acquires a speech unit waveform corresponding to each speech unit of the speech unit sequence from the waveform memory by referring to the address, and a speech unit concatenation unit that generates the speech by concatenating the acquired speech unit waveforms.
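
    The title of this patent mentions acquiring more than one speech unit from a continuous memory region in one access. One way to picture that idea is a read planner that merges back-to-back address ranges. The sketch below is an assumption-laden illustration: the name plan_reads and the (address, size) tuple layout are hypothetical and do not come from the patent text.

```python
def plan_reads(units):
    """Merge consecutive speech units stored back to back into single reads.

    units: list of (address, size) pairs in synthesis order.
    Returns a list of (start, length, unit_count) read requests.
    """
    reads = []
    for addr, size in units:
        if reads and reads[-1][0] + reads[-1][1] == addr:
            # This unit starts exactly where the previous read ends: extend it.
            start, length, count = reads[-1]
            reads[-1] = (start, length + size, count + 1)
        else:
            reads.append((addr, size, 1))
    return reads
```

    Five selected units may therefore need only three memory accesses when some of them are adjacent in the waveform memory.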

    Voice conversion apparatus and method and speech synthesis apparatus and method
    3.
    Granted invention patent
    Legal status: in force

    Publication No.: US08438033B2

    Publication date: 2013-05-07

    Application No.: US12505684

    Filing date: 2009-07-20

    IPC classification: G10L13/06 G10L13/00 G10L21/00

    CPC classification: G10L13/033 G10L2021/0135

    Abstract: A voice conversion apparatus stores, in a parameter memory, target speech spectral parameters of target speech; stores, in a voice conversion rule memory, a voice conversion rule for converting the voice quality of source speech into the voice quality of the target speech; extracts, from an input source speech, a source speech spectral parameter; converts the extracted source speech spectral parameter into a first conversion spectral parameter by using the voice conversion rule; selects, from the parameter memory, a target speech spectral parameter similar to the first conversion spectral parameter; generates an aperiodic component spectral parameter from the selected target speech spectral parameter; mixes a periodic component spectral parameter included in the first conversion spectral parameter with the aperiodic component spectral parameter to obtain a second conversion spectral parameter; and generates a speech waveform from the second conversion spectral parameter.

    Speech processing apparatus and program
    4.
    Granted invention patent
    Legal status: in force

    Publication No.: US08170876B2

    Publication date: 2012-05-01

    Application No.: US12210338

    Filing date: 2008-09-15

    IPC classification: G10L13/00 G10L21/00 G10L13/08

    CPC classification: G10L13/08

    Abstract: A word dictionary is referenced that includes sets of a character string constituting a word, a phoneme sequence constituting the pronunciation of the word, and a part of speech of the word. An entered text is analyzed and divided into one or more subtexts, and a phoneme sequence and a part-of-speech sequence are generated for each subtext. The part-of-speech sequence of each subtext is collated with a list of part-of-speech sequences to determine whether the phonetic sounds of the subtext are to be converted, and the phonetic sounds of the phoneme sequence in each subtext so determined are converted.
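
    The collation step can be illustrated with a short sketch. It assumes each subtext is already available as a (phoneme sequence, part-of-speech sequence) pair and that the list of part-of-speech sequences is an exact-match set; the function name convert_subtexts and the convert callback are hypothetical stand-ins for the conversion the patent describes.

```python
def convert_subtexts(subtexts, convert_pos_sequences, convert):
    """Convert only the subtexts whose part-of-speech sequence is listed.

    subtexts              : list of (phonemes, pos_sequence) pairs
    convert_pos_sequences : set of part-of-speech tuples that trigger conversion
    convert               : function applied to the phonemes of matching subtexts
    """
    result = []
    for phonemes, pos_sequence in subtexts:
        if tuple(pos_sequence) in convert_pos_sequences:
            result.append(convert(phonemes))
        else:
            result.append(phonemes)  # left unchanged
    return result
```

    For example, passing str.upper as convert would uppercase the phoneme strings of matching subtexts and leave the rest untouched.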

    METHOD AND APPARATUS FOR EDITING SPEECH, AND METHOD FOR SYNTHESIZING SPEECH
    5.
    Invention patent application
    Legal status: in force

    Publication No.: US20110238420A1

    Publication date: 2011-09-29

    Application No.: US12880796

    Filing date: 2010-09-13

    IPC classification: G10L13/00

    Abstract: According to one embodiment, a method for editing speech is disclosed. The method can generate speech information from a text. The speech information includes phonologic information and prosody information. The method can divide the speech information into a plurality of speech units, based on at least one of the phonologic information and the prosody information. The method can search the plurality of speech units for at least two speech units in which at least one of the phonologic information and the prosody information is identical or similar. In addition, the method can store a speech unit waveform corresponding to one of the at least two speech units as a representative speech unit into a memory.
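
    A minimal sketch of the representative-unit idea follows. It treats two units as matching only when their phonologic and prosody labels are exactly equal, which is a simplification of the abstract's "identical or similar"; the name pick_representatives and the tuple layout are assumptions for the example.

```python
def pick_representatives(units):
    """Store one waveform per (phonologic, prosody) key.

    units: list of (phonologic, prosody, waveform) tuples in text order.
    Returns (memory, index): memory maps each key to the first waveform seen,
    and index lists the key used for every unit in the original order.
    """
    memory = {}
    index = []
    for phonologic, prosody, waveform in units:
        key = (phonologic, prosody)
        if key not in memory:
            memory[key] = waveform  # first occurrence becomes the representative
        index.append(key)
    return memory, index
```

    Repeated units then share one stored waveform, so editing the representative affects every position that refers to it.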

    SPEECH SYNTHESIZING APPARATUS AND METHOD THEREOF
    6.
    Invention patent application
    Legal status: under examination (published)

    Publication No.: US20090326951A1

    Publication date: 2009-12-31

    Application No.: US12423233

    Filing date: 2009-04-14

    IPC classification: G10L13/06 G10L13/00 G10L11/04

    CPC classification: G10L13/06

    Abstract: Ratios between the powers at the peaks of the respective formants of the spectrum of a pitch-cycle waveform and the powers at the boundaries between the formants are obtained. When the ratios are large, the bandwidths of the window functions are widened, and formant waveforms are generated by multiplying sinusoidal waveforms, generated from the formant parameter sets on the basis of pitch-cycle waveform generating data, by the window functions of the widened bandwidth. A pitch-cycle waveform is then generated as the sum of these formant waveforms.
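
    The sum-of-windowed-sinusoids construction can be sketched as below. This is only an illustration under stated assumptions: a Hann window is used, "widening the bandwidth" is modeled as shortening the window in time (a shorter window has a wider spectrum), and the threshold and shrink factor are arbitrary example values. The function names and the (frequency, amplitude, window length, ratio) tuple are hypothetical.

```python
import math

def hann(length):
    """Symmetric Hann window of the given length."""
    if length == 1:
        return [1.0]
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / (length - 1)) for i in range(length)]

def pitch_cycle_waveform(formants, fs, cycle_len, ratio_threshold=10.0, shrink=0.5):
    """Sum windowed sinusoids, one per formant.

    formants: list of (freq_hz, amplitude, window_len, peak_to_boundary_ratio).
    A ratio above ratio_threshold shortens the window (wider bandwidth).
    """
    out = [0.0] * cycle_len
    for freq, amp, win_len, ratio in formants:
        if ratio > ratio_threshold:
            win_len = max(2, int(win_len * shrink))
        win_len = min(win_len, cycle_len)
        window = hann(win_len)
        for i in range(win_len):
            out[i] += amp * math.cos(2 * math.pi * freq * i / fs) * window[i]
    return out
```

    Each formant contributes one windowed sinusoid, and the pitch-cycle waveform is their sample-wise sum.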

    SPEECH PROCESSING APPARATUS AND PROGRAM
    7.
    Invention patent application
    Legal status: lapsed

    Publication No.: US20090177474A1

    Publication date: 2009-07-09

    Application No.: US12212759

    Filing date: 2008-09-18

    IPC classification: G10L13/08 G10L13/00

    CPC classification: G10L13/07

    Abstract: A speech synthesizer includes a periodic component fusing unit and an aperiodic component fusing unit, which respectively fuse the periodic components and the aperiodic components of a plurality of speech units selected by a unit selector for each segment. The speech synthesizer is further provided with an adder, which adds, edits, and concatenates the periodic components and the aperiodic components of the fused speech units to generate a speech waveform.

    SPEECH PROCESSING APPARATUS AND PROGRAM
    8.
    Invention patent application
    Legal status: in force

    Publication No.: US20090150157A1

    Publication date: 2009-06-11

    Application No.: US12210338

    Filing date: 2008-09-15

    IPC classification: G10L13/08

    CPC classification: G10L13/08

    Abstract: A word dictionary is referenced that includes sets of a character string constituting a word, a phoneme sequence constituting the pronunciation of the word, and a part of speech of the word. An entered text is analyzed and divided into one or more subtexts, and a phoneme sequence and a part-of-speech sequence are generated for each subtext. The part-of-speech sequence of each subtext is collated with a list of part-of-speech sequences to determine whether the phonetic sounds of the subtext are to be converted, and the phonetic sounds of the phoneme sequence in each subtext so determined are converted.

    PITCH PATTERN GENERATION METHOD AND APPARATUS THEREOF
    9.
    Invention patent application
    Legal status: under examination (published)

    Publication No.: US20090055188A1

    Publication date: 2009-02-26

    Application No.: US12035965

    Filing date: 2008-02-22

    IPC classification: G10L13/08 G10L13/00

    CPC classification: G10L13/10 G10L13/04

    Abstract: A prosody control unit pattern generation module generates pitch patterns in respective prosody control units based on language attribute information, phoneme duration, and emphasis degree information. A modification method decision module decides, based on at least the emphasis degree information, a method of modifying by smoothing the pitch pattern in a connection portion between a prosody control unit and at least one of the previous and next prosody control units, and generates modification method information. A pattern connection module modifies the pitch patterns generated in the respective prosody control units by smoothing according to the modification method information and connects them to generate a sentence pitch pattern corresponding to a text that is the target of speech synthesis.
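
    The connection step can be pictured with the sketch below. The decision rule used here (skip smoothing at a boundary when either neighboring unit is emphasized, otherwise apply a moving average) is an assumption chosen for illustration; the abstract only states that the modification method depends on emphasis degree. The name connect_pitch_patterns and the half_width parameter are hypothetical.

```python
def connect_pitch_patterns(patterns, emphasis, half_width=2):
    """Concatenate per-unit pitch patterns and smooth the joins.

    patterns : list of pitch value lists, one per prosody control unit
    emphasis : list of booleans, True when the unit is emphasized
    """
    sentence = []
    boundaries = []  # index in `sentence` where each later unit starts
    for pattern in patterns:
        if sentence:
            boundaries.append(len(sentence))
        sentence.extend(pattern)

    smoothed = list(sentence)
    for k, b in enumerate(boundaries):
        # Assumed rule: keep the join sharp next to an emphasized unit.
        if emphasis[k] or emphasis[k + 1]:
            continue
        for i in range(max(0, b - half_width), min(len(sentence), b + half_width)):
            lo = max(0, i - half_width)
            hi = min(len(sentence), i + half_width + 1)
            smoothed[i] = sum(sentence[lo:hi]) / (hi - lo)
    return smoothed
```

    The moving average always reads from the unsmoothed sentence, so the result at one boundary does not depend on the order in which boundaries are processed.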

    Speech translation apparatus and method
    10.
    Invention patent application
    Legal status: under examination (published)

    Publication No.: US20090055158A1

    Publication date: 2009-02-26

    Application No.: US12230036

    Filing date: 2008-08-21

    IPC classification: G06F17/28 G10L11/00

    Abstract: A speech translation apparatus includes a speech recognition unit configured to recognize input speech of a first language to generate a first text of the first language, an extraction unit configured to compare original prosody information of the input speech with first synthesized prosody information based on the first text to extract paralinguistic information about each of first words of the first text, a machine translation unit configured to translate the first text to a second text of a second language, a mapping unit configured to allocate the paralinguistic information about each of the first words to each of second words of the second text in accordance with synonymity, a generating unit configured to generate second synthesized prosody information based on the paralinguistic information allocated to each of the second words, and a speech synthesis unit configured to synthesize output speech based on the second synthesized prosody information.
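
    The extract, map, and regenerate steps can be sketched with per-word pitch values. Everything concrete here is an assumption for illustration: paralinguistic information is reduced to a single per-word pitch difference, the synonymity-based mapping is represented by a given list of (source word index, target word index) pairs, and the three function names are hypothetical.

```python
def extract_emphasis(original_f0, synthesized_f0):
    """Per-word difference between observed and neutral synthesized pitch."""
    return [o - s for o, s in zip(original_f0, synthesized_f0)]

def map_to_target(emphasis, alignment, n_target):
    """Carry each source word's emphasis to its aligned target word.

    alignment: list of (source_index, target_index) pairs, assumed given.
    """
    target = [0.0] * n_target
    for src, tgt in alignment:
        target[tgt] = emphasis[src]
    return target

def target_prosody(neutral_f0, target_emphasis):
    """Add the mapped emphasis to the neutral prosody of the translated text."""
    return [f + e for f, e in zip(neutral_f0, target_emphasis)]
```

    A word spoken well above its neutral pitch in the source sentence thus raises the pitch of its counterpart in the translated sentence, even when the word order changes.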
