Method and apparatus for producing natural sounding pitch contours in a speech synthesizer
    11.
    发明授权
    Method and apparatus for producing natural sounding pitch contours in a speech synthesizer 有权
    用于在语音合成器中产生自然声音俯仰轮廓的方法和装置

    公开(公告)号:US07280969B2

    公开(公告)日:2007-10-09

    申请号:US09732122

    申请日:2000-12-07

    IPC分类号: G10L13/06

    CPC分类号: G10L13/033 G10L13/0335

    摘要: A speech synthesis system is disclosed that utilizes a pitch contour resulting in a more natural-sounding speech. The present invention modifies the predicted pitch, b(t), for synthesized speech using a low frequency energy booster. The low frequency energy booster interpolates the discrete pitch values, if necessary, and increase the amount of energy of the pitch contour associated with low frequency values, such as all frequency values below 10 Hertz. The amount of energy of the pitch contour associated with low frequency values can be increased, for example, by adding band-limited noise (a carrier signal) to the pitch contour, b(t), or by filtering the pitch values with an impulse response filter having a pole at the desired low frequency value. The present invention serves to add vibrato to the to the original pitch contour, b(t), and thereby improves the naturalness of the synthetic waveform.

    摘要翻译: 公开了一种语音合成系统,其利用音调轮廓导致更自然的语音。 本发明使用低频能量增强器来修改用于合成语音的预测音调b(t)。 如果需要,低频能量增强器内插离散音调值,并增加与低频值相关联的音高轮廓的能量的量,例如低于10赫兹的所有频率值。 与低频值相关联的音高轮廓的能量的量可以增加,例如通过将频带限制噪声(载波信号)添加到音调轮廓b(t),或者通过用脉冲对频率值进行滤波 响应滤波器具有所需低频值的极点。 本发明用于将颤音添加到原始音调轮廓b(t),从而提高合成波形的自然度。

    Speech and signal digitization by using recognition metrics to select from multiple techniques
    12.
    发明授权
    Speech and signal digitization by using recognition metrics to select from multiple techniques 有权
    通过使用识别度量来选择多种技术的语音和信号数字化

    公开(公告)号:US07016835B2

    公开(公告)日:2006-03-21

    申请号:US10323549

    申请日:2002-12-19

    IPC分类号: G10L15/00

    CPC分类号: G10L15/32 G10L17/26

    摘要: A characteristic-specific digitization method and apparatus are disclosed that reduces the error rate in converting input information into a computer-readable format. The input information is analyzed and subsets of the input information are classified according to whether the input information exhibits a specific physical parameter affecting recognition accuracy. If the input information exhibits the specific physical parameter affecting recognition accuracy, the characteristic-specific digitization system recognizes the input information using a characteristic-specific recognizer that demonstrates improved performance for the given physical parameter. If the input information does not exhibit the specific physical parameter affecting recognition accuracy, the characteristic-specific digitization system recognizes the input information using a general recognizer that performs well for typical input information. In one implementation, input speech having very low recognition accuracy as a result of a physical speech characteristic is automatically identified and recognized using a characteristic-specific speech recognizer.

    摘要翻译: 公开了特征数字化方法和装置,其减少将输入信息转换为计算机可读格式的错误率。 分析输入信息,并根据输入信息是否表现出影响识别精度的特定物理参数对输入信息的子集进行分类。 如果输入信息表现出影响识别精度的特定物理参数,则特征特定数字化系统使用特征识别器识别输入信息,该识别器演示了给定物理参数的改进性能。 如果输入信息不具有影响识别精度的特定物理参数,则特征数字化系统使用对典型输入信息执行良好的一般识别器来识别输入信息。 在一个实现中,作为物理语音特征的结果具有非常低的识别精度的输入语音被使用特征语音识别器自动识别和识别。

    Methods and apparatus for conveying synthetic speech style from a text-to-speech system
    13.
    发明授权
    Methods and apparatus for conveying synthetic speech style from a text-to-speech system 有权
    从文字到语音系统传达合成语音风格的方法和设备

    公开(公告)号:US07747440B2

    公开(公告)日:2010-06-29

    申请号:US12165937

    申请日:2008-07-01

    IPC分类号: G10L13/00

    CPC分类号: G10L13/033

    摘要: A technique for producing speech output in a text-to-speech system is provided. A message is created for communication to a user in a natural language generator of the text-to-speech system. The message is annotated in the natural language generator with a synthetic speech output style. The message is conveyed to the user through a speech synthesis system in communication with the natural language generator, wherein the message is conveyed in accordance with the synthetic speech output style.

    摘要翻译: 提供了一种用于在文本到语音系统中产生语音输出的技术。 创建用于与文本到语音系统的自然语言生成器中的用户通信的消息。 消息在自然语言生成器中用合成语音输出样式注释。 通过与自然语言生成器通信的语音合成系统将消息传送给用户,其中根据合成语音输出方式传送消息。

    Methods and apparatus for adapting output speech in accordance with context of communication
    14.
    发明授权
    Methods and apparatus for adapting output speech in accordance with context of communication 有权
    根据通信背景调整输出语音的方法和装置

    公开(公告)号:US07490042B2

    公开(公告)日:2009-02-10

    申请号:US11092057

    申请日:2005-03-29

    IPC分类号: G10L15/00

    CPC分类号: G10L13/027 G10L15/22

    摘要: A technique for producing speech output in an automatic dialog system in accordance with a detected context is provided. Communication is received from a user at the automatic dialog system. A context of the communication from the user is detected in a context detector of the automatic dialog system. A message is created in a natural language generator of the automatic dialog system in communication with the context detector. The message is conveyed to the user through a speech synthesis system of the automatic dialog system, in communication with the natural language generator and the context detector. Responsive to a detected level of ambient noise, the context detector provides at least one command in a markup language to cause the natural language generator to create the message using maximally intelligible words and to cause the speech synthesis system to convey the message with increased volume and decreased speed.

    摘要翻译: 提供了一种根据检测到的上下文在自动对话系统中产生语音输出的技术。 在自动对话系统中从用户接收通信。 在自动对话系统的上下文检测器中检测来自用户的通信的上下文。 在与上下文检测器通信的自动对话系统的自然语言生成器中创建消息。 该消息通过与自然语言生成器和上下文检测器通信的自动对话系统的语音合成系统传送给用户。 响应于检测到的环境噪声水平,上下文检测器以标记语言提供至少一个命令,以使自然语言生成器使用最大可理解的单词来创建消息,并且使得语音合成系统以增加的音量传达消息,并且 降低速度

    Fast vocabulary independent method and apparatus for spotting words in
speech
    15.
    发明授权
    Fast vocabulary independent method and apparatus for spotting words in speech 失效
    快速词汇独立的方法和设备,用于在言语中发现单词

    公开(公告)号:US6073095A

    公开(公告)日:2000-06-06

    申请号:US950621

    申请日:1997-10-15

    摘要: A fast vocabulary independent method for spotting words in speech utilizes a preprocessing step and a coarse-to-detailed search strategy for spotting a word/phone sequence in speech. The preprocessing includes a Viterbi-beam phone level decoding using a tree-based phone language model. The coarse search matches phone-ngrams to identify regions of speech as putative word hits, and the detailed search performs an acoustic match at the putative hits with a model of the given word included in the vocabulary of the recognizer.

    摘要翻译: 用于在语音中发现单词的快速词汇独立方法利用预处理步骤和用于在语音中发现单词/电话序列的粗略到详细的搜索策略。 预处理包括使用基于树的手机语言模型的维特比波束电话级解码。 粗略搜索匹配电话号码以将语音区域识别为假定词命中,并且详细搜索在推定命中与在识别器的词汇表中包括的给定单词的模型进行声匹配。