Automatically training speech synthesizers
    1.
    Granted patent
    Automatically training speech synthesizers (in force)

    Publication No.: US08423366B1

    Publication date: 2013-04-16

    Application No.: US13552484

    Filing date: 2012-07-18

    IPC classification: G10L13/00

    CPC classification: G10L13/06 G10L15/26 G10L15/30

    Abstract: A method includes receiving, by a system, a voice recording associated with a user, transcribing the voice recording into text that includes a group of words, and storing an association between a portion of each respective word and a corresponding portion of the voice recording. The corresponding portion of the voice recording is the portion of the voice recording from which the portion of the respective word was transcribed. The method may also include determining a modification to a speech synthesis voice associated with the user based at least in part on the association.

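    The association described in the abstract can be pictured as a small data structure that links each transcribed word portion to the span of audio it came from. The sketch below is illustrative only and is not the patented implementation: the names `AlignedSegment`, `VoiceProfile`, and `propose_modification` are hypothetical, and the choice of segment duration as the property being adjusted is an assumption made purely to keep the example concrete.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass(frozen=True)
class AlignedSegment:
    """Links a portion of a transcribed word to the audio span it came from."""
    word: str
    word_portion: str  # hypothetical granularity, e.g. a syllable-like chunk
    start_ms: int      # start of the corresponding audio span
    end_ms: int        # end of the corresponding audio span


@dataclass
class VoiceProfile:
    """Hypothetical per-user store of word-portion/audio associations."""
    user_id: str
    segments: Dict[str, List[AlignedSegment]] = field(default_factory=dict)

    def store(self, segment: AlignedSegment) -> None:
        # Group segments by word portion so they can be looked up later.
        self.segments.setdefault(segment.word_portion, []).append(segment)

    def mean_duration_ms(self, word_portion: str) -> Optional[float]:
        spans = self.segments.get(word_portion, [])
        if not spans:
            return None
        return sum(s.end_ms - s.start_ms for s in spans) / len(spans)


def propose_modification(profile: VoiceProfile, word_portion: str,
                         default_duration_ms: float) -> float:
    """Return a duration adjustment in ms for the synthesis voice.

    Assumption: the "modification" is a simple duration offset. Returns 0.0
    when the user has no recorded examples of this word portion.
    """
    observed = profile.mean_duration_ms(word_portion)
    if observed is None:
        return 0.0
    return observed - default_duration_ms
```

    In this sketch, each stored segment keeps the time offsets of the audio it was transcribed from, so a later step can compare the user's own recordings against the synthesizer's defaults.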

    Directing dictation into input fields
    4.
    Granted patent
    Directing dictation into input fields (in force)

    Publication No.: US08255218B1

    Publication date: 2012-08-28

    Application No.: US13245698

    Filing date: 2011-09-26

    IPC classification: G10L11/00 G10L15/00 G10L17/00

    CPC classification: G10L15/22 G06F3/167

    Abstract: In general, this disclosure describes techniques to direct textual characters converted from vocal input into selected graphical user interface input fields. Vocal input may be received. Textual characters may be identified based on the vocal input. A first portion of the textual characters corresponding to a first portion of the vocal input may be graphically inputted into a first input field of a GUI. While receiving the vocal input, a selection of a second input field in the GUI may be accepted after the first portion of the vocal input has been received. After accepting the selection of the second input field, a second portion of the textual characters corresponding to a second portion of the vocal input received after the selection of the second input field may be inputted into the second input field.

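    The routing behavior in the abstract, where recognized text follows whichever input field is selected while dictation continues, can be sketched as a small state machine. This is a minimal illustration under assumptions, not the patented implementation: `DictationRouter`, `select_field`, and `on_text` are hypothetical names, and recognized text is assumed to arrive as already-transcribed string chunks.

```python
class DictationRouter:
    """Hypothetical router that sends recognized text to the active field."""

    def __init__(self, field_names, initial_field):
        if initial_field not in field_names:
            raise KeyError(initial_field)
        # Each GUI input field is modeled as a plain string buffer.
        self.fields = {name: "" for name in field_names}
        self.active = initial_field

    def select_field(self, name):
        """Accept a field selection made while dictation is still running."""
        if name not in self.fields:
            raise KeyError(name)
        self.active = name

    def on_text(self, text):
        """Append newly recognized text to whichever field is active now."""
        current = self.fields[self.active]
        self.fields[self.active] = (current + " " + text).strip()
```

    Text recognized before a call to `select_field` stays in the first field, and text recognized afterwards lands in the newly selected one, which mirrors the first-portion and second-portion split described above.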