OPTIMIZING PARAMETERS FOR MACHINE TRANSLATION
    1.
    发明申请
    OPTIMIZING PARAMETERS FOR MACHINE TRANSLATION 审中-公开
    机器翻译优化参数

    公开(公告)号:US20100004919A1

    公开(公告)日:2010-01-07

    申请号:US12497169

    申请日:2009-07-02

    IPC分类号: G06F17/28 G06F17/20

    CPC分类号: G06F17/2818

    摘要: Methods, systems, and apparatus, including computer program products, for language translation are disclosed. In one implementation, a method is provided. The method includes determining, for a plurality of feature functions in a translation lattice, a corresponding plurality of error surfaces for each of one or more candidate translations represented in the translation lattice; adjusting weights for the feature functions by traversing a combination of the plurality of error surfaces for phrases in a training set; selecting weighting values that minimize error counts for the traversed combination; and applying the selected weighting values to convert a sample of text from a first language to a second language.

    摘要翻译: 公开了用于语言翻译的方法,系统和设备,包括计算机程序产品。 在一个实现中,提供了一种方法。 所述方法包括为所述平移网格中的多个特征函数确定针对在所述平移网格中表示的一个或多个候选翻译中的每一个的对应的多个误差表面; 通过遍历训练集中的短语的多个误差表面的组合来调整特征功能的权重; 选择最小化遍历组合的错误计数的加权值; 以及应用所选择的加权值以将来自第一语言的文本样本转换为第二语言。

    PARALLEL DOCUMENT MINING
    2.
    发明申请
    PARALLEL DOCUMENT MINING 审中-公开
    并行文件采矿

    公开(公告)号:US20120047172A1

    公开(公告)日:2012-02-23

    申请号:US13214941

    申请日:2011-08-22

    IPC分类号: G06F17/30

    CPC分类号: G06F17/2827 G06F16/30

    摘要: A technique includes providing a collection of documents in multiple languages, identifying, from the collection of documents, a group of candidate documents, where each candidate document in the group shares multiple corresponding rare features, evaluating pairs of candidate documents in the group using multiple common features present in the collection of documents, and determining, based on evaluating the pairs of candidate documents, whether each pair of candidate documents corresponds to a translated pair of documents.

    摘要翻译: 一种技术包括提供多种语言的文档集合,从文档的收集中识别一组候选文件,其中组中的每个候选文档共享多个对应的稀有特征,使用多个共同的方法评估该组中候选文档的对 在文件收集中存在的特征,以及基于评估候选文件对来确定每对候选文档是否对应于已翻译的一对文档。

    Identifying Languages Missing From Campaigns
    3.
    发明申请
    Identifying Languages Missing From Campaigns 审中-公开
    识别广告系列中缺少的语言

    公开(公告)号:US20120310955A1

    公开(公告)日:2012-12-06

    申请号:US13216032

    申请日:2011-08-23

    IPC分类号: G06F17/30

    摘要: A technique includes determining whether a first webpage includes content in a first language, where the first webpage is a landing page associated with an advertising campaign of a content sponsor, determining whether a second webpage includes the content in a different second language where the second webpage is not a landing page in the advertising campaign, evaluating one or more criteria in order to make a recommendation for expanding the advertising campaign to include the second webpage, identifying a recommendation for expanding the advertising campaign to include the second webpage based at least in part on the evaluating.

    摘要翻译: 一种技术包括确定第一网页是否包括第一语言的内容,其中第一网页是与内容赞助商的广告活动相关联的着陆页,确定第二网页是否包含不同的第二语言的内容,其中第二网页 不是广告活动中的目标网页,评估一个或多个标准以便推荐扩展广告活动以包括第二网页,识别用于扩展广告活动的建议以至少部分地包括第二网页 对评估。