Concatenation of speech segments by use of a speech synthesizer
    1.
    发明授权
    Concatenation of speech segments by use of a speech synthesizer 有权
    使用语音合成器连接语音段

    公开(公告)号:US06366883B1

    公开(公告)日:2002-04-02

    申请号:US09250405

    申请日:1999-02-16

    IPC分类号: G10L1308

    摘要: In a speech synthesizer apparatus, a weighting coefficient training controller calculates acoustic distances in second acoustic feature parameters between one target phoneme from the same phoneme and the phoneme candidates other than the target phoneme based on first acoustic feature parameters and prosodic feature parameters, and determines weighting coefficient vectors for respective target phonemes defining degrees of contribution to the second acoustic feature parameters for respective phoneme candidates by executing a predetermined statistical analysis therefor. Then, a speech unit selector searches for a combination of phoneme candidates which correspond to a phoneme sequence of an input sentence and which minimizes a cost including a target cost representing approximate costs between a target phoneme and the phoneme candidates and a concatenation cost representing approximate costs between two phoneme candidates to be adjacently concatenated, and outputs index information on the searched out combination of phoneme candidates. Further, a speech synthesizer synthesizes a speech signal corresponding to the input phoneme sequence by sequentially reading out speech segments of speech waveform signals corresponding to the index information and concatenating the read speech segments of the speech waveform signals.

    摘要翻译: 在语音合成器装置中,加权系数训练控制器基于第一声学特征参数和韵律特征参数来计算来自相同音素的一个目标音素和除了目标音素之外的音素候选者之间的第二声学特征参数中的声学距离,并且确定加权 通过对其进行预定的统计分析来确定各个音素候选的第二声学特征参数的贡献度的各个目标音素的系数矢量。 然后,语音单元选择器搜索对应于输入句子的音素序列的音素候选的组合,并且使包括目标音素和音素候选者之间的近似成本的目标成本的成本最小化,以及代表近似成本的级联成本 在两个音素候选者之间相邻连接,并输出关于所搜索出的音素候选组合的索引信息。 此外,语音合成器通过顺序地读出对应于索引信息的语音波形信号的语音段并连接语音波形信号的读出的语音段,来合成对应于输入音素序列的语音信号。

    Action Agenda Determining Apparatus
    3.
    发明申请
    Action Agenda Determining Apparatus 失效
    行动议程确定装置

    公开(公告)号:US20100138380A1

    公开(公告)日:2010-06-03

    申请号:US11990191

    申请日:2005-10-18

    申请人: Nick Campbell

    发明人: Nick Campbell

    IPC分类号: G06N5/02 G06F13/42 H04N7/18

    CPC分类号: G06N99/005 G06K9/00664

    摘要: In one embodiment of the present invention, an action agenda determining apparatus for determining an agenda of action to be taken with reference to surrounding situation is provided. An action agenda determining apparatus includes a matching model storage unit for storing an action agenda determining model that has learned in advance relation between time-sequence of prescribed feature information related to human motion extracted from surrounding images and action agenda to be taken, and a model reference unit for forming the time-sequence of prescribed feature information from the surrounding motion images and referring to the action agenda determining model stored in the matching model storage unit, for determining the action agenda to be taken. Sound may be included as part of the feature information.

    摘要翻译: 在本发明的一个实施例中,提供了一种动作议程确定装置,用于确定参照周围情况采取的动作的议程。 动作议程确定装置包括:匹配模型存储单元,用于存储从周围图像提取的与人体动画相关的规定特征信息的时间序列和要采取的动作议程之间预先学习的动作议程确定模型;以及模型 参考单元,用于从周围运动图像形成规定特征信息的时间序列,并参考存储在匹配模型存储单元中的动作议程确定模型,以确定要采取的动作议程。 声音可能包含在功能信息的一部分中。

    Action agenda determining apparatus
    4.
    发明授权
    Action agenda determining apparatus 失效
    行动议程确定装置

    公开(公告)号:US07984010B2

    公开(公告)日:2011-07-19

    申请号:US11990191

    申请日:2005-10-18

    申请人: Nick Campbell

    发明人: Nick Campbell

    IPC分类号: G10L11/00 G10L21/00 G06K9/00

    CPC分类号: G06N99/005 G06K9/00664

    摘要: In one embodiment of the present invention, an action agenda determining apparatus for determining an agenda of action to be taken with reference to surrounding situation is provided. An action agenda determining apparatus includes a matching model storage unit for storing an action agenda determining model that has learned in advance relation between time-sequence of prescribed feature information related to human motion extracted from surrounding images and action agenda to be taken, and a model reference unit for forming the time-sequence of prescribed feature information from the surrounding motion images and referring to the action agenda determining model stored in the matching model storage unit, for determining the action agenda to be taken. Sound may be included as part of the feature information.

    摘要翻译: 在本发明的一个实施例中,提供了一种动作议程确定装置,用于确定参照周围情况采取的动作的议程。 动作议程确定装置包括:匹配模型存储单元,用于存储从周围图像提取的与人体动画相关的规定特征信息的时间序列和要采取的动作议程之间预先学习的动作议程确定模型;以及模型 参考单元,用于从周围运动图像形成规定特征信息的时间序列,并参考存储在匹配模型存储单元中的动作议程确定模型,以确定要采取的动作议程。 声音可能包含在功能信息的一部分中。

    Syllabic kernel extraction apparatus and program product thereof
    6.
    发明申请
    Syllabic kernel extraction apparatus and program product thereof 失效
    音节提取仪器及其程序产品

    公开(公告)号:US20050246168A1

    公开(公告)日:2005-11-03

    申请号:US10514413

    申请日:2003-02-21

    CPC分类号: G10L25/00 G10L21/06

    摘要: An apparatus enabling automatic determination of a portion that reliably represents a feature of a speech waveform includes: an acoustic/prosodic analysis unit (92) calculating, from data, distribution of an energy of a prescribed frequency range of the speech waveform on a time axis, and for extracting, among various syllables of the speech waveform, a range that is generated stably, based on the distribution and the pitch of the speech waveform; cepstral analysis unit (94) estimating, based on the spectral distribution of the speech waveform on the time axis, a range of the speech waveform of which change is well controlled by a speaker; and a pseudo-syllabic center extracting unit (96) extracting, as a portion of high reliability of the speech waveform, that range which has been estimated to be the stably generated range and of which change is estimated to be well controlled by the speaker.

    摘要翻译: 一种能够自动确定可靠地表示语音波形特征的部分的装置,包括:声/韵律分析部(92),从时间轴上计算语音波形的规定频率范围的能量的分布 并且基于语音波形的分布和音调,在语音波形的各个音节中提取稳定生成的范围; 倒谱分析单元(94)基于时间轴上的语音波形的频谱分布来估计由扬声器很好地控制变化的语音波形的范围; 以及伪音节中心提取单元(96)作为语音波形的高可靠性的一部分提取已经被估计为稳定产生的范围并且其改变被该扬声器良好地控制的范围。

    Apparatus and method for extracting syllabic nuclei
    7.
    发明授权
    Apparatus and method for extracting syllabic nuclei 失效
    提取音节核的装置和方法

    公开(公告)号:US07627468B2

    公开(公告)日:2009-12-01

    申请号:US10514413

    申请日:2003-02-21

    CPC分类号: G10L25/00 G10L21/06

    摘要: An apparatus enabling automatic determination of a portion that reliably represents a feature of a speech waveform includes: an acoustic/prosodic analysis unit calculating, from data, distribution of an energy of a prescribed frequency range of the speech waveform on a time axis, and for extracting, among various syllables of the speech waveform, a range that is generated stably, based on the distribution and the pitch of the speech waveform; cepstral analysis unit estimating, based on the spectral distribution of the speech waveform on the time axis, a range of the speech waveform of which change is well controlled by a speaker; and a pseudo-syllabic center extracting unit extracting, as a portion of high reliability of the speech waveform, that range which has been estimated to be the stably generated range and of which change is estimated to be well controlled by the speaker.

    摘要翻译: 能够自动确定可靠地表示语音波形特征的部分的装置包括:声/韵律分析单元,从时间轴上计算语音波形的规定频率范围的能量的分布, 在语音波形的各个音节中,基于语音波形的分布和音调提取稳定地生成的范围; 倒谱分析单元基于时间轴上的语音波形的频谱分布来估计由扬声器很好地控制变化的语音波形的范围; 以及伪音节中心提取单元,作为语音波形的高可靠性的一部分,提取已经被估计为稳定产生的范围并且其改变被该扬声器良好地控制的范围。

    Apparatus and method for speech processing using paralinguistic information in vector form
    8.
    发明申请
    Apparatus and method for speech processing using paralinguistic information in vector form 审中-公开
    使用向量形式的辅助信息进行语音处理的装置和方法

    公开(公告)号:US20060080098A1

    公开(公告)日:2006-04-13

    申请号:US11238044

    申请日:2005-09-29

    申请人: Nick Campbell

    发明人: Nick Campbell

    IPC分类号: G10L15/06

    摘要: A speech processing apparatus includes a statistics collecting module operable to collect, for each of a prescribed utterance units of a speech in a training speech corpus, a prescribed type of acoustic feature and statistic information on a plurality of paralinguistic information labels being selected by a plurality of listeners to a speech corresponding to the utterance unit; and a training apparatus trained by supervised machine training using said prescribed acoustic feature as input data and using the statistic information as answer data, to output probability of allocation of the label to a given acoustic feature, for each of said plurality of paralinguistic information labels, forming a paralinguistic information vector.

    摘要翻译: 一种语音处理装置,包括:统计收集模块,用于针对训练语音语料库中的语音的规定说话单元中的每一个,针对由多个选择的多个旁路信息标签收集规定类型的声学特征和统计信息 听众对讲话单位的演讲; 以及培训装置,通过使用所述规定的声学特征作为输入数据并使用统计信息作为答复数据,通过监督机器训练训练,将所述标签的分配概率输出到给定的声学特征,对于所述多个旁路信息标签中的每一个, 形成一个协同信息向量。