Conversion of input text strings
    1.
    发明授权

    公开(公告)号:US10133737B2

    公开(公告)日:2018-11-20

    申请号:US13818869

    申请日:2011-08-26

    IPC分类号: G06F17/28 G06F17/22

    摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for transforming text strings. In general, one aspect of the subject matter described in this specification can be embodied in methods that include the actions of receiving input string having a plurality of terms, the input string being in a first form; transforming the input string from the first form to a second form including: applying one or more rules to the input string to identify one or more terms for translation, the one or more identified terms being fewer than the plurality of terms, translating the identified one or more terms to one or more translated terms in the second form, and transliterating the remaining terms of the plurality of terms into transliterated terms in the second form; and concatenating the translated and transliterated terms to form a hybrid output string in the second form.

    CONVERSION OF INPUT TEXT STRINGS
    2.
    发明申请
    CONVERSION OF INPUT TEXT STRINGS 审中-公开
    输入文字行的转换

    公开(公告)号:US20140163952A1

    公开(公告)日:2014-06-12

    申请号:US13818869

    申请日:2011-08-26

    IPC分类号: G06F17/28

    摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for transforming text strings. In general, one aspect of the subject matter described in this specification can be embodied in methods that include the actions of receiving input string having a plurality of terms, the input string being in a first form; transforming the input string from the first form to a second form including: applying one or more rules to the input string to identify one or more terms for translation, the one or more identified terms being fewer than the plurality of terms, translating the identified one or more terms to one or more translated terms in the second form, and transliterating the remaining terms of the plurality of terms into transliterated terms in the second form; and concatenating the translated and transliterated terms to form a hybrid output string in the second form.

    摘要翻译: 方法,系统和装置,包括编码在计算机存储介质上的用于转换文本串的计算机程序。 通常,本说明书中描述的主题的一个方面可以体现在包括接收具有多个项的输入串的动作的方法中,输入串处于第一形式; 将所述输入字符串从所述第一形式转换为第二形式,包括:将一个或多个规则应用于所述输入字符串以识别用于转换的一个或多个条款,所述一个或多个已标识的条款少于所述多个条款, 或多个术语表示第二形式的一个或多个翻译术语,并将多个术语的其余术语音译为第二形式的音译术语; 并连接翻译和音译术语以形成第二种形式的混合输出字符串。

    System and method of using meta-data in speech processing
    3.
    发明申请
    System and method of using meta-data in speech processing 有权
    在语音处理中使用元数据的系统和方法

    公开(公告)号:US20050096908A1

    公开(公告)日:2005-05-05

    申请号:US10977030

    申请日:2004-10-29

    摘要: Systems and methods relate to generating a language model for use in, for example, a spoken dialog system or some other application. The method comprises building a class-based language model, generating at least one sequence network and replacing class labels in the class-based language model with the at least one sequence network. In this manner, placeholders or tokens associated with classes can be inserted into the models at training time and word/phone networks can be built based on meta-data information at test time. Finally, the placeholder token can be replaced with the word/phone networks at run time to improve recognition of difficult words such as proper names.

    摘要翻译: 系统和方法涉及生成用于例如口语对话系统或某些其他应用的语言模型。 该方法包括构建基于类的语言模型,生成至少一个序列网络,并用至少一个序列网络替换基于类的语言模型中的类标签。 以这种方式,可以在训练时间将与课程相关的占位符或令牌插入到模型中,并且可以在测试时基于元数据信息构建单词/电话网络。 最后,占位符标记可以在运行时用单词/电话网络替换,以改善对诸如专有名称等困难词的识别。