System and method for machine learning a confidence metric for machine translation
    1.
    发明授权
    System and method for machine learning a confidence metric for machine translation 有权
    用于机器学习机器翻译的置信度量的系统和方法

    公开(公告)号:US07209875B2

    公开(公告)日:2007-04-24

    申请号:US10309950

    申请日:2002-12-04

    IPC分类号: G06F17/28 G10L11/00

    CPC分类号: G06F17/28

    摘要: A machine translation system is trained to generate confidence scores indicative of a quality of a translation result. A source string is translated with a machine translator to generate a target string. Features indicative of translation operations performed are extracted from the machine translator. A trusted entity-assigned translation score is obtained and is indicative of a trusted entity-assigned translation quality of the translated string. A relationship between a subset of the extracted features and the trusted entity-assigned translation score is identified.

    摘要翻译: 训练机器翻译系统以产生指示翻译结果的质量的置信度分数。 使用机器翻译器翻译源字符串以生成目标字符串。 从机器翻译器提取表示所执行的翻译操作的特征。 获得受信任的实体分配的翻译分数,并且指示被翻译的字符串的受信任的实体分配的翻译质量。 识别提取的特征的子集与可信实体分配的翻译分数之间的关系。

    Machine translation split between front end and back end processors
    2.
    发明授权
    Machine translation split between front end and back end processors 有权
    机器翻译分为前端和后端处理器

    公开(公告)号:US08209162B2

    公开(公告)日:2012-06-26

    申请号:US11414844

    申请日:2006-05-01

    IPC分类号: G06F17/28 G06F17/21

    CPC分类号: G06F17/289

    摘要: A method of translation includes uploading a source text portion to a back end processor. The back end processor identifies a subset of translation knowledge associated with the source text portion. The back end processor downloads the subset to a front end processor. A translation engine runs on the front end processor. The translation engine generates a translation of the source text portion as a function of the subset.

    摘要翻译: 一种翻译方法包括将源文本部分上传到后端处理器。 后端处理器识别与源文本部分相关联的翻译知识的子集。 后端处理器将子集下载到前端处理器。 翻译引擎在前端处理器上运行。 翻译引擎生成作为子集的函数的源文本部分的翻译。

    Machine translation split between front end and back end processors
    3.
    发明授权
    Machine translation split between front end and back end processors 有权
    机器翻译分为前端和后端处理器

    公开(公告)号:US08886516B2

    公开(公告)日:2014-11-11

    申请号:US13409419

    申请日:2012-03-01

    IPC分类号: G06F17/28

    CPC分类号: G06F17/289

    摘要: A method of translation includes uploading a source text portion to a back end processor. The back end processor identifies a subset of translation knowledge associated with the source text portion. The back end processor downloads the subset to a front end processor. A translation engine runs on the front end processor. The translation engine generates a translation of the source text portion as a function of the subset.

    摘要翻译: 一种翻译方法包括将源文本部分上传到后端处理器。 后端处理器识别与源文本部分相关联的翻译知识的子集。 后端处理器将子集下载到前端处理器。 翻译引擎在前端处理器上运行。 翻译引擎生成作为子集的函数的源文本部分的翻译。

    System and method for machine learning a confidence metric for machine translation
    5.
    发明授权
    System and method for machine learning a confidence metric for machine translation 有权
    用于机器学习机器翻译的置信度量的系统和方法

    公开(公告)号:US07496496B2

    公开(公告)日:2009-02-24

    申请号:US11725435

    申请日:2007-03-19

    IPC分类号: G06F17/28 G06F17/20 G10L11/00

    CPC分类号: G06F17/28

    摘要: A machine translation system is trained to generate confidence scores indicative of a quality of a translation result. A source string is translated with a machine translator to generate a target string. Features indicative of translation operations performed are extracted from the machine translator. A trusted entity-assigned translation score is obtained and is indicative of a trusted entity-assigned translation quality of the translated string. A relationship between a subset of the extracted features and the trusted entity-assigned translation score is identified.

    摘要翻译: 训练机器翻译系统以产生指示翻译结果的质量的置信度分数。 使用机器翻译器翻译源字符串以生成目标字符串。 从机器翻译器提取表示所执行的翻译操作的特征。 获得受信任的实体分配的翻译分数,并且指示被翻译的字符串的受信任的实体分配的翻译质量。 识别提取的特征的子集与可信实体分配的翻译分数之间的关系。

    MACHINE TRANSLATION SPLIT BETWEEN FRONT END AND BACK END PROCESSORS
    6.
    发明申请
    MACHINE TRANSLATION SPLIT BETWEEN FRONT END AND BACK END PROCESSORS 有权
    前端和后端处理器之间的机器翻译分割

    公开(公告)号:US20120179450A1

    公开(公告)日:2012-07-12

    申请号:US13409419

    申请日:2012-03-01

    IPC分类号: G06F17/28

    CPC分类号: G06F17/289

    摘要: A method of translation includes uploading a source text portion to a back end processor. The back end processor identifies a subset of translation knowledge associated with the source text portion. The back end processor downloads the subset to a front end processor. A translation engine runs on the front end processor. The translation engine generates a translation of the source text portion as a function of the subset.

    摘要翻译: 一种翻译方法包括将源文本部分上传到后端处理器。 后端处理器识别与源文本部分相关联的翻译知识的子集。 后端处理器将子集下载到前端处理器。 翻译引擎在前端处理器上运行。 翻译引擎生成作为子集的函数的源文本部分的翻译。

    Machine translation using language order templates
    7.
    发明授权
    Machine translation using language order templates 有权
    机器翻译使用语言订单模板

    公开(公告)号:US08150677B2

    公开(公告)日:2012-04-03

    申请号:US12146531

    申请日:2008-06-26

    IPC分类号: G06F17/28

    CPC分类号: G06F17/2872 G06F17/2827

    摘要: Many machine translation scenarios involve the generation of a language translation rule set based on parallel training corpuses (e.g., sentences in a first language and word-for-word translations into a second language.) However, the translation of a source corpus in a source language to a target corpus in a target language involves at least two aspects: selecting elements of the target language to match the elements of the source corpus, and ordering the target elements according to the semantic organization of the source corpus and the grammatic rules of the target language. The breadth of generalization of the translation rules derived from the training may be improved, while retaining contextual information, by formulating language order templates that specify orderings of small sets of target elements according to target element types. These language order templates may be represented with varying degrees of association with the alignment rules derived from the training in order to improve the scope of target elements to which the ordering rules and alignment rules may be applied.

    摘要翻译: 许多机器翻译方案涉及到基于平行训练语料库(例如,第一语言中的句子和逐字翻译成第二语言)的语言翻译规则集的生成。然而,源语料库在源中的翻译 目标语言中的目标语料库的语言至少涉及两个方面:选择目标语言的元素以匹配源语料库的元素,并根据源语料库的语义组织和语法规则对目标元素进行排序 目标语言。 可以通过根据目标元素类型制定指定小组目标元素的排序的语言顺序模板,同时保留上下文信息,从而改进从训练中导出的翻译规则的广义化。 这些语言顺序模板可以以与从训练导出的对准规则的不同程度的关联来表示,以便改进可以应用排序规则和对准规则的目标元素的范围。

    Machine translation system incorporating syntactic dependency treelets into a statistical framework
    8.
    发明授权
    Machine translation system incorporating syntactic dependency treelets into a statistical framework 有权
    机器翻译系统将句法依赖树结合到统计框架中

    公开(公告)号:US07698124B2

    公开(公告)日:2010-04-13

    申请号:US11014503

    申请日:2004-12-16

    IPC分类号: G06F17/28

    CPC分类号: G06F17/2818 G06F17/28

    摘要: In one embodiment of the present invention, a decoder receives a dependency tree as a source language input and accesses a set of statistical models that produce outputs combined in a log linear framework. The decoder also accesses a table of treelet translation pairs and returns a target dependency tree based on the source dependency tree, based on access to the table of treelet translation pairs, and based on the application of the statistical models.

    摘要翻译: 在本发明的一个实施例中,解码器接收依赖树作为源语言输入,并访问产生在对数线性框架中组合的输出的一组统计模型。 解码器还访问树形图转换对的表,并基于对依赖树的转换对的访问,并且基于统计模型的应用,基于源依赖关系树返回目标依赖关系树。

    Extracting treelet translation pairs
    9.
    发明授权
    Extracting treelet translation pairs 有权
    提取树皮翻译对

    公开(公告)号:US07577562B2

    公开(公告)日:2009-08-18

    申请号:US11014492

    申请日:2004-12-16

    IPC分类号: G06F17/28

    CPC分类号: G06F17/2818 G06F17/28

    摘要: In one embodiment of the present invention, a decoder receives a dependency tree as a source language input and accesses a set of statistical models that produce outputs combined in a log linear framework. The decoder also accesses a table of treelet translation pairs and returns a target dependency tree based on the source dependency tree, based on access to the table of treelet translation pairs, and based on the application of the statistical models.

    摘要翻译: 在本发明的一个实施例中,解码器接收依赖树作为源语言输入,并访问产生在对数线性框架中组合的输出的一组统计模型。 解码器还访问树形图转换对的表,并基于对依赖树的转换对的访问,并且基于统计模型的应用,基于源依赖关系树返回目标依赖关系树。

    Extracting treelet translation pairs
    10.
    发明授权
    Extracting treelet translation pairs 有权
    提取树皮翻译对

    公开(公告)号:US08082143B2

    公开(公告)日:2011-12-20

    申请号:US12499379

    申请日:2009-07-08

    IPC分类号: G06F17/28

    CPC分类号: G06F17/2818 G06F17/28

    摘要: In one embodiment of the present invention, a decoder receives a dependency tree as a source language input and accesses a set of statistical models that produce outputs combined in a log linear framework. The decoder also accesses a table of treelet translation pairs and returns a target dependency tree based on the source dependency tree, based on access to the table of treelet translation pairs, and based on the application of the statistical models.

    摘要翻译: 在本发明的一个实施例中,解码器接收依赖树作为源语言输入,并访问产生在对数线性框架中组合的输出的一组统计模型。 解码器还访问树形图转换对的表,并基于对依赖树的转换对的访问,并且基于统计模型的应用,基于源依赖关系树返回目标依赖关系树。