Methods and apparatus for masking latency in text-to-speech systems
    1.
    发明授权
    Methods and apparatus for masking latency in text-to-speech systems 有权
    用于在文本到语音系统中屏蔽延迟的方法和装置

    公开(公告)号:US08355484B2

    公开(公告)日:2013-01-15

    申请号:US11620842

    申请日:2007-01-08

    IPC分类号: H04M1/64

    CPC分类号: G10L15/22

    摘要: A technique for masking latency in an automatic dialog system is provided. A communication is received from a user at the automatic dialog system. The communication is processed in the automatic dialog system to provide a response. At least one transitional message is provided to the user from the automatic dialog system while processing the communication. A response is provided to the user from the automatic dialog system in accordance with the received communication from the user.

    摘要翻译: 提供了一种用于在自动对话系统中屏蔽等待时间的技术。 在自动对话系统中从用户接收到通信。 在自动对话系统中处理通信以提供响应。 在处理通信时,从自动对话系统向用户提供至少一个过渡消息。 根据接收到的来自用户的通信,从自动对话系统向用户提供响应。

    Methods and Apparatus for Conveying Synthetic Speech Style from a Text-to-Speech System
    2.
    发明申请
    Methods and Apparatus for Conveying Synthetic Speech Style from a Text-to-Speech System 有权
    从文本到语音系统输入合成语音方式的方法和装置

    公开(公告)号:US20080300882A1

    公开(公告)日:2008-12-04

    申请号:US12165937

    申请日:2008-07-01

    IPC分类号: G10L13/08

    CPC分类号: G10L13/033

    摘要: A technique for producing speech output in a text-to-speech system is provided. A message is created for communication to a user in a natural language generator of the text-to-speech system. The message is annotated in the natural language generator with a synthetic speech output style. The message is conveyed to the user through a speech synthesis system in communication with the natural language generator, wherein the message is conveyed in accordance with the synthetic speech output style.

    摘要翻译: 提供了一种用于在文本到语音系统中产生语音输出的技术。 创建用于与文本到语音系统的自然语言生成器中的用户通信的消息。 消息在自然语言生成器中用合成语音输出样式注释。 通过与自然语言生成器通信的语音合成系统将消息传送给用户,其中根据合成语音输出方式传送消息。

    Methods and Apparatus for Masking Latency in Text-to-Speech Systems
    3.
    发明申请
    Methods and Apparatus for Masking Latency in Text-to-Speech Systems 有权
    用于在文本到语音系统中屏蔽延迟的方法和装置

    公开(公告)号:US20080167874A1

    公开(公告)日:2008-07-10

    申请号:US11620842

    申请日:2007-01-08

    IPC分类号: G10L15/18

    CPC分类号: G10L15/22

    摘要: A technique for masking latency in an automatic dialog system is provided. A communication is received from a user at the automatic dialog system. The communication is processed in the automatic dialog system to provide a response. At least one transitional message is provided to the user from the automatic dialog system while processing the communication. A response is provided to the user from the automatic dialog system in accordance with the received communication from the user.

    摘要翻译: 提供了一种用于在自动对话系统中屏蔽等待时间的技术。 在自动对话系统中从用户接收到通信。 在自动对话系统中处理通信以提供响应。 在处理通信时,从自动对话系统向用户提供至少一个过渡消息。 根据接收到的来自用户的通信,从自动对话系统向用户提供响应。

    Methods and apparatus for conveying synthetic speech style from a text-to-speech system
    4.
    发明授权
    Methods and apparatus for conveying synthetic speech style from a text-to-speech system 有权
    从文字到语音系统传达合成语音风格的方法和设备

    公开(公告)号:US07747440B2

    公开(公告)日:2010-06-29

    申请号:US12165937

    申请日:2008-07-01

    IPC分类号: G10L13/00

    CPC分类号: G10L13/033

    摘要: A technique for producing speech output in a text-to-speech system is provided. A message is created for communication to a user in a natural language generator of the text-to-speech system. The message is annotated in the natural language generator with a synthetic speech output style. The message is conveyed to the user through a speech synthesis system in communication with the natural language generator, wherein the message is conveyed in accordance with the synthetic speech output style.

    摘要翻译: 提供了一种用于在文本到语音系统中产生语音输出的技术。 创建用于与文本到语音系统的自然语言生成器中的用户通信的消息。 消息在自然语言生成器中用合成语音输出样式注释。 通过与自然语言生成器通信的语音合成系统将消息传送给用户,其中根据合成语音输出方式传送消息。

    Methods and apparatus for adapting output speech in accordance with context of communication
    5.
    发明授权
    Methods and apparatus for adapting output speech in accordance with context of communication 有权
    根据通信背景调整输出语音的方法和装置

    公开(公告)号:US07490042B2

    公开(公告)日:2009-02-10

    申请号:US11092057

    申请日:2005-03-29

    IPC分类号: G10L15/00

    CPC分类号: G10L13/027 G10L15/22

    摘要: A technique for producing speech output in an automatic dialog system in accordance with a detected context is provided. Communication is received from a user at the automatic dialog system. A context of the communication from the user is detected in a context detector of the automatic dialog system. A message is created in a natural language generator of the automatic dialog system in communication with the context detector. The message is conveyed to the user through a speech synthesis system of the automatic dialog system, in communication with the natural language generator and the context detector. Responsive to a detected level of ambient noise, the context detector provides at least one command in a markup language to cause the natural language generator to create the message using maximally intelligible words and to cause the speech synthesis system to convey the message with increased volume and decreased speed.

    摘要翻译: 提供了一种根据检测到的上下文在自动对话系统中产生语音输出的技术。 在自动对话系统中从用户接收通信。 在自动对话系统的上下文检测器中检测来自用户的通信的上下文。 在与上下文检测器通信的自动对话系统的自然语言生成器中创建消息。 该消息通过与自然语言生成器和上下文检测器通信的自动对话系统的语音合成系统传送给用户。 响应于检测到的环境噪声水平,上下文检测器以标记语言提供至少一个命令,以使自然语言生成器使用最大可理解的单词来创建消息,并且使得语音合成系统以增加的音量传达消息,并且 降低速度

    On demand TTS vocabulary for a telematics system
    6.
    发明授权
    On demand TTS vocabulary for a telematics system 有权
    远程信息处理系统的按需TTS词汇表

    公开(公告)号:US08311804B2

    公开(公告)日:2012-11-13

    申请号:US13279626

    申请日:2011-10-24

    IPC分类号: G06F17/20 G10L21/00 G01C21/30

    CPC分类号: G10L13/04 G01C21/3629

    摘要: A driving directions system loads into memory a limited subset of prerecorded, spoken utterances of geographic names from a mass media storage. The subset of spoken utterances may be limited, for example, to the geographic names within a predetermined radius (e.g., a few miles) of the driver's present location. The present location of the driver may be manually entered into the driving directions system by the driver, or automatically determined using a global positioning system (“GPS”) receiver. As the vehicle moves from its present location, the driving directions system loads into memory new names from the mass media storage and overwrites, if necessary, those which are now geographically out of range. Based on the current location of the driving, the driving directions system can audibly output geographic names from the run-time memory.

    摘要翻译: 驾驶方向系统将来自大众媒体存储器的地理名称的预先记录的讲话话语的有限子集加载到记忆体中。 讲话语音的子集可以例如限于驾驶员现在位置的预定半径(例如几英里)内的地理名称。 驾驶员的当前位置可以由驾驶员手动输入驾驶方向系统,或者使用全球定位系统(GPS)接收机自动确定。 随着车辆从现在的位置移动,驾驶方向系统从大容量媒体存储器中加载新名称,并且如果需要,覆盖现在地理上超出范围的那些。 根据目前驾驶的位置,驾驶方向系统可以从运行时记忆体中可听见地输出地名。

    Methods for conveying synthetic speech style from a text-to-speech system
    7.
    发明授权
    Methods for conveying synthetic speech style from a text-to-speech system 有权
    从文字到语音系统传达合成语音风格的方法

    公开(公告)号:US07415413B2

    公开(公告)日:2008-08-19

    申请号:US11092008

    申请日:2005-03-29

    IPC分类号: G10L13/00

    CPC分类号: G10L13/033

    摘要: A technique for producing speech output in a text-to-speech system is provided. A message is created for communication to a user in a natural language generator of the text-to-speech system. The message is annotated in the natural language generator with a synthetic speech output style. The message is conveyed to the user through a speech synthesis system in communication with the natural language generator, wherein the message is conveyed in accordance with the synthetic speech output style.

    摘要翻译: 提供了一种用于在文本到语音系统中产生语音输出的技术。 创建用于与文本到语音系统的自然语言生成器中的用户通信的消息。 消息在自然语言生成器中用合成语音输出样式注释。 通过与自然语言生成器通信的语音合成系统将消息传送给用户,其中根据合成语音输出方式传送消息。

    Method and apparatus for a time-synchronous tree-based search strategy
    8.
    发明授权
    Method and apparatus for a time-synchronous tree-based search strategy 失效
    一种基于时间同步树的搜索策略的方法和装置

    公开(公告)号:US5884259A

    公开(公告)日:1999-03-16

    申请号:US798011

    申请日:1997-02-12

    IPC分类号: G10L15/08 G10L9/06

    CPC分类号: G10L15/08

    摘要: A method and apparatus for using a tree structure to constrain a time-synchronous, fast search for candidate words in an acoustic stream is described. A minimum stay of three frames in each graph node visited is imposed by allowing transitions only every third frame. This constraint enables the simplest possible Markov model for each phoneme while enforcing the desired minimum duration. The fast, time-synchronous search for likely words is done for an entire sentence/utterance. The list of hypotheses beginning at each time frame is stored for providing, on-demand, lists of contender/candidate words to the asynchronous, detailed match phase of decoding.

    摘要翻译: 描述了使用树结构约束声流中的候选词的时间同步,快速搜索的方法和装置。 在每个图形节点访问的最小停留时间为3帧,只允许每三帧进行一次转换。 这个约束使每个音素的最可能的马可夫模型成为可能,同时执行所需的最小持续时间。 快速,时间同步的搜索可能的单词是为整个句子/话语完成的。 存储在每个时间帧开始的假设列表,用于将竞争者/候选词的按需提供到解码的异步,详细匹配阶段。

    APPLYING VOCAL CHARACTERISTICS FROM A TARGET SPEAKER TO A SOURCE SPEAKER FOR SYNTHETIC SPEECH
    10.
    发明申请
    APPLYING VOCAL CHARACTERISTICS FROM A TARGET SPEAKER TO A SOURCE SPEAKER FOR SYNTHETIC SPEECH 审中-公开
    将目标声音的声像特性应用于合成音箱的声源扬声器

    公开(公告)号:US20090177473A1

    公开(公告)日:2009-07-09

    申请号:US11970282

    申请日:2008-01-07

    IPC分类号: G10L13/00

    CPC分类号: G10L13/033 G10L2021/0135

    摘要: A computer implemented method, system and computer usable program code for synthesizing speech. A computer implemented method for synthesizing speech includes providing a database of speech of a source speaker, and providing a prosody model of speech of a target speaker different from the source speaker. Text input to be synthesized is received, and the prosody model of speech of the target speaker is applied to the text input to select segments of the speech of the source speaker in the database to form synthesized speech of the text input. The synthesized speech of the text input is then output.

    摘要翻译: 一种用于合成语音的计算机实现的方法,系统和计算机可用程序代码。 一种用于合成语音的计算机实现方法包括:提供源说话者的语音数据库,以及提供不同于所述源说话者的目标说话者的语音韵律模型。 接收要合成的文本输入,并且将目标说话者的语音韵律模型应用于文本输入,以选择数据库中的来源说话者的语音段,以形成文本输入的合成语音。 然后输出文本输入的合成语音。