METHOD AND SYSTEM FOR PROMPT CONSTRUCTION FOR SELECTION FROM A LIST OF ACOUSTICALLY CONFUSABLE ITEMS IN SPOKEN DIALOG SYSTEMS
    2.
    Patent application
    Status: In force

    Publication No.: US20080281598A1

    Publication date: 2008-11-13

    Application No.: US11746087

    Filing date: 2007-05-09

    IPC classification: G10L11/00

    CPC classification: G10L15/22 G10L15/187

    Abstract: A method (and system) of determining confusable list items and resolving this confusion in a spoken dialog system includes receiving user input, processing the user input and determining if a list of items needs to be played back to the user, retrieving the list to be played back to the user, identifying acoustic confusions between items on the list, changing the items on the list as necessary to remove the acoustic confusions, and playing unambiguous list items back to the user.
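
The identify-then-reword steps in this abstract can be illustrated with a minimal Python sketch. It rests on assumptions that are not in the patent: spelling similarity from `difflib.SequenceMatcher` stands in for a real acoustic confusability measure, the 0.8 threshold is arbitrary, and the per-item descriptors (here, departments) are hypothetical extra data used to reword confusable items.

```python
from difflib import SequenceMatcher

def confusable_pairs(items, threshold=0.8):
    """Return index pairs whose spellings are similar enough to be treated
    as confusable. Spelling similarity is only a stand-in for an acoustic
    measure; the threshold is an illustrative assumption."""
    pairs = []
    for i in range(len(items)):
        for j in range(i + 1, len(items)):
            ratio = SequenceMatcher(None, items[i].lower(), items[j].lower()).ratio()
            if ratio >= threshold:
                pairs.append((i, j))
    return pairs

def disambiguate(items, descriptors, threshold=0.8):
    """Append a descriptor to every item that takes part in a confusion,
    leaving unambiguous items unchanged."""
    flagged = {k for pair in confusable_pairs(items, threshold) for k in pair}
    return [f"{item}, {descriptors[k]}" if k in flagged else item
            for k, item in enumerate(items)]

names = ["John Smith", "Jon Smith", "Mary Jones"]
depts = ["Accounting", "Legal", "Sales"]  # hypothetical disambiguating data
prompt_items = disambiguate(names, depts)
```

Only the two similar names are reworded; the third item is played back as it was retrieved.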

    Methods and apparatus for masking latency in text-to-speech systems
    3.
    Granted patent
    Status: In force

    Publication No.: US08355484B2

    Publication date: 2013-01-15

    Application No.: US11620842

    Filing date: 2007-01-08

    IPC classification: H04M1/64

    CPC classification: G10L15/22

    Abstract: A technique for masking latency in an automatic dialog system is provided. A communication is received from a user at the automatic dialog system. The communication is processed in the automatic dialog system to provide a response. At least one transitional message is provided to the user from the automatic dialog system while processing the communication. A response is provided to the user from the automatic dialog system in accordance with the received communication from the user.
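
A rough Python sketch of the idea of speaking a transitional message while the response is still being computed. Everything named here is an assumption for illustration: `slow_backend` is a hypothetical stand-in for the processing step, `speak` is any callable that outputs a message, and the timing values are arbitrary.

```python
import asyncio

async def slow_backend(query):
    """Hypothetical stand-in for the processing that causes latency."""
    await asyncio.sleep(0.2)
    return f"Result for {query!r}"

async def respond(query, speak, filler_after=0.01,
                  fillers=("One moment, please.",)):
    """Speak transitional messages while the backend is still working,
    then speak the actual response."""
    task = asyncio.create_task(slow_backend(query))
    for filler in fillers:
        # Wait briefly; if the response is not ready yet, fill the silence.
        done, _ = await asyncio.wait({task}, timeout=filler_after)
        if done:
            break
        speak(filler)
    speak(await task)

spoken = []
asyncio.run(respond("flights to Boston", spoken.append))
```

If the backend finishes before the first timeout expires, no filler is spoken and the user hears only the response.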

    Methods and Apparatus for Conveying Synthetic Speech Style from a Text-to-Speech System
    4.
    Patent application
    Status: In force

    Publication No.: US20080300882A1

    Publication date: 2008-12-04

    Application No.: US12165937

    Filing date: 2008-07-01

    IPC classification: G10L13/08

    CPC classification: G10L13/033

    Abstract: A technique for producing speech output in a text-to-speech system is provided. A message is created for communication to a user in a natural language generator of the text-to-speech system. The message is annotated in the natural language generator with a synthetic speech output style. The message is conveyed to the user through a speech synthesis system in communication with the natural language generator, wherein the message is conveyed in accordance with the synthetic speech output style.

    Methods and Apparatus for Masking Latency in Text-to-Speech Systems
    5.
    Patent application
    Status: In force

    Publication No.: US20080167874A1

    Publication date: 2008-07-10

    Application No.: US11620842

    Filing date: 2007-01-08

    IPC classification: G10L15/18

    CPC classification: G10L15/22

    Abstract: A technique for masking latency in an automatic dialog system is provided. A communication is received from a user at the automatic dialog system. The communication is processed in the automatic dialog system to provide a response. At least one transitional message is provided to the user from the automatic dialog system while processing the communication. A response is provided to the user from the automatic dialog system in accordance with the received communication from the user.

    Method and apparatus for a time-synchronous tree-based search strategy
    6.
    Granted patent
    Status: Lapsed

    Publication No.: US5884259A

    Publication date: 1999-03-16

    Application No.: US798011

    Filing date: 1997-02-12

    IPC classification: G10L15/08 G10L9/06

    CPC classification: G10L15/08

    Abstract: A method and apparatus for using a tree structure to constrain a time-synchronous, fast search for candidate words in an acoustic stream is described. A minimum stay of three frames in each graph node visited is imposed by allowing transitions only every third frame. This constraint enables the simplest possible Markov model for each phoneme while enforcing the desired minimum duration. The fast, time-synchronous search for likely words is done for an entire sentence/utterance. The list of hypotheses beginning at each time frame is stored for providing, on-demand, lists of contender/candidate words to the asynchronous, detailed match phase of decoding.
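
The minimum-duration constraint can be shown with a small Viterbi sketch in Python. This is a simplification under stated assumptions, not the patented search: it uses a single left-to-right chain of nodes instead of a tree, and `frame_scores` is a hypothetical table of per-frame log-scores. The only point illustrated is that allowing a move to the next node solely on frames where `t % 3 == 0` forces each visited node to be occupied for at least three frames.

```python
import math

def best_path_score(frame_scores, allow_every=3):
    """Viterbi over a left-to-right chain of nodes.

    frame_scores[t][n] is the log-score of node n at frame t.
    A move to the next node is allowed only when t % allow_every == 0,
    so every node that is entered and left is held for at least
    `allow_every` frames (the final node may be cut short by end of input).
    Returns the best log-score of a path that ends in the last node."""
    n_nodes = len(frame_scores[0])
    best = [-math.inf] * n_nodes
    best[0] = frame_scores[0][0]  # every path starts in the first node
    for t in range(1, len(frame_scores)):
        new = [-math.inf] * n_nodes
        for n in range(n_nodes):
            stay = best[n]
            can_move = n > 0 and t % allow_every == 0
            move = best[n - 1] if can_move else -math.inf
            prev = max(stay, move)
            if prev > -math.inf:
                new[n] = prev + frame_scores[t][n]
        best = new
    return best[-1]

# Two nodes, six frames; the data alone would favor switching at frame 2.
scores = [[0.0, -1.0], [0.0, -1.0], [-1.0, 0.0],
          [-1.0, 0.0], [-1.0, 0.0], [-1.0, 0.0]]
constrained = best_path_score(scores, allow_every=3)
unconstrained = best_path_score(scores, allow_every=1)
```

With the constraint, the switch is delayed to frame 3 and the path pays for one poorly matching frame; without it, the path switches at frame 2.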

    APPLYING VOCAL CHARACTERISTICS FROM A TARGET SPEAKER TO A SOURCE SPEAKER FOR SYNTHETIC SPEECH
    7.
    Patent application
    Status: Under examination (published)

    Publication No.: US20090177473A1

    Publication date: 2009-07-09

    Application No.: US11970282

    Filing date: 2008-01-07

    IPC classification: G10L13/00

    CPC classification: G10L13/033 G10L2021/0135

    Abstract: A computer implemented method, system and computer usable program code for synthesizing speech are provided. A computer implemented method for synthesizing speech includes providing a database of speech of a source speaker, and providing a prosody model of speech of a target speaker different from the source speaker. Text input to be synthesized is received, and the prosody model of speech of the target speaker is applied to the text input to select segments of the speech of the source speaker in the database to form synthesized speech of the text input. The synthesized speech of the text input is then output.

    Methods and apparatus for conveying synthetic speech style from a text-to-speech system
    8.
    Granted patent
    Status: In force

    Publication No.: US07747440B2

    Publication date: 2010-06-29

    Application No.: US12165937

    Filing date: 2008-07-01

    IPC classification: G10L13/00

    CPC classification: G10L13/033

    Abstract: A technique for producing speech output in a text-to-speech system is provided. A message is created for communication to a user in a natural language generator of the text-to-speech system. The message is annotated in the natural language generator with a synthetic speech output style. The message is conveyed to the user through a speech synthesis system in communication with the natural language generator, wherein the message is conveyed in accordance with the synthetic speech output style.

    Methods and apparatus for adapting output speech in accordance with context of communication
    9.
    Granted patent
    Status: In force

    Publication No.: US07490042B2

    Publication date: 2009-02-10

    Application No.: US11092057

    Filing date: 2005-03-29

    IPC classification: G10L15/00

    CPC classification: G10L13/027 G10L15/22

    Abstract: A technique for producing speech output in an automatic dialog system in accordance with a detected context is provided. Communication is received from a user at the automatic dialog system. A context of the communication from the user is detected in a context detector of the automatic dialog system. A message is created in a natural language generator of the automatic dialog system in communication with the context detector. The message is conveyed to the user through a speech synthesis system of the automatic dialog system, in communication with the natural language generator and the context detector. Responsive to a detected level of ambient noise, the context detector provides at least one command in a markup language to cause the natural language generator to create the message using maximally intelligible words and to cause the speech synthesis system to convey the message with increased volume and decreased speed.
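
As a hedged illustration of the markup-language command, the Python sketch here wraps a message in SSML `prosody` markup when ambient noise is high. The abstract does not name the markup language, so SSML is an assumption, as are the 70 dB threshold and the function name. The word-choice step ("maximally intelligible words") is not modeled.

```python
from xml.sax.saxutils import escape

NOISE_THRESHOLD_DB = 70.0  # hypothetical cutoff, not taken from the patent

def render_message(text, ambient_noise_db):
    """Return SSML for `text`, spoken louder and slower when the detected
    ambient noise level is at or above the threshold."""
    body = escape(text)  # escape &, <, > so the text is valid XML content
    if ambient_noise_db >= NOISE_THRESHOLD_DB:
        body = f'<prosody volume="loud" rate="slow">{body}</prosody>'
    return f"<speak>{body}</speak>"

noisy = render_message("Turn left", 80.0)
quiet = render_message("A & B", 40.0)
```

In a quiet context the message passes through with default prosody, so the adaptation costs nothing when it is not needed.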

    Fast vocabulary independent method and apparatus for spotting words in speech
    10.
    Granted patent
    Status: Lapsed

    Publication No.: US6073095A

    Publication date: 2000-06-06

    Application No.: US950621

    Filing date: 1997-10-15

    Abstract: A fast vocabulary independent method for spotting words in speech utilizes a preprocessing step and a coarse-to-detailed search strategy for spotting a word/phone sequence in speech. The preprocessing includes a Viterbi-beam phone level decoding using a tree-based phone language model. The coarse search matches phone n-grams to identify regions of speech as putative word hits, and the detailed search performs an acoustic match at the putative hits with a model of the given word included in the vocabulary of the recognizer.
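
The coarse phone n-gram matching stage can be sketched in Python as a voting scheme. This is an illustrative assumption about how such a match could work, not the patent's procedure: the decoded phone stream is taken as given (the Viterbi-beam decoding is not shown), each matching n-gram votes for the word start position it implies, and the `n` and `min_votes` parameters are hypothetical. The detailed acoustic match at each hit is also omitted.

```python
from collections import Counter

def putative_hits(decoded, query, n=3, min_votes=2):
    """Coarse search: return start positions in `decoded` (a list of phone
    labels) where at least `min_votes` phone n-grams of `query` line up."""
    # Map each query n-gram to the offsets at which it occurs in the query.
    query_grams = {}
    for k in range(len(query) - n + 1):
        query_grams.setdefault(tuple(query[k:k + n]), []).append(k)
    # A match at stream position p with query offset k implies the word
    # would start at position p - k; count votes per implied start.
    votes = Counter()
    for p in range(len(decoded) - n + 1):
        for k in query_grams.get(tuple(decoded[p:p + n]), []):
            votes[p - k] += 1
    return sorted(start for start, v in votes.items() if v >= min_votes)

stream = "sil DH AH K AE T S AE T sil".split()
word = "K AE T".split()
strict = putative_hits(stream, word, n=2, min_votes=2)
loose = putative_hits(stream, word, n=2, min_votes=1)
```

Lowering `min_votes` trades precision for recall: the loose setting also flags the partial match at "S AE T", which a detailed match stage would then be expected to reject.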
