-
公开(公告)号:US09009148B2
公开(公告)日:2015-04-14
申请号:US13329345
申请日:2011-12-19
申请人: Jianfeng Gao , Kristina Toutanova , Wen-tau Yih
发明人: Jianfeng Gao , Kristina Toutanova , Wen-tau Yih
IPC分类号: G06F17/30
CPC分类号: G06F17/30867
摘要: There is provided a computer-implemented method and system for ranking documents. The method includes identifying a number of query-document pairs based on clickthrough data for a number of documents. The method also includes building a latent semantic model based on the query-document pairs and ranking the documents for a search based on the latent semantic model.
摘要翻译: 提供了用于对文档进行排序的计算机实现的方法和系统。 该方法包括基于多个文档的点击数据来识别多个查询文档对。 该方法还包括基于查询文档对构建潜在语义模型,并根据潜在语义模型对搜索文档进行排序。
-
公开(公告)号:US20130159320A1
公开(公告)日:2013-06-20
申请号:US13329345
申请日:2011-12-19
申请人: Jianfeng Gao , Kristina Toutanova , Wen-tau Yih
发明人: Jianfeng Gao , Kristina Toutanova , Wen-tau Yih
IPC分类号: G06F17/30
CPC分类号: G06F17/30867
摘要: There is provided a computer-implemented method and system for ranking documents. The method includes identifying a number of query-document pairs based on clickthrough data for a number of documents. The method also includes building a latent semantic model based on the query-document pairs and ranking the documents for a search based on the latent semantic model.
摘要翻译: 提供了用于对文档进行排序的计算机实现的方法和系统。 该方法包括基于多个文档的点击数据来识别多个查询文档对。 该方法还包括基于查询文档对构建潜在语义模型,并根据潜在语义模型对搜索文档进行排序。
-
公开(公告)号:US20070083357A1
公开(公告)日:2007-04-12
申请号:US11485015
申请日:2006-07-12
申请人: Robert Moore , Wen-tau Yih , Galen Andrew , Kristina Toutanova
发明人: Robert Moore , Wen-tau Yih , Galen Andrew , Kristina Toutanova
IPC分类号: G06F17/28
CPC分类号: G06F17/2827 , G06F17/2836
摘要: A weighted linear word alignment model linearly combines weighted features to score a word alignment for a bilingual, aligned pair of text fragments. The features are each weighted by a feature weight. One of the features is a word association metric, which may be generated from surface statistics.
摘要翻译: 加权线性字对齐模型线性组合加权特征以对双语对齐的文本片段对进行字对齐。 特征各自由特征权重加权。 特征之一是字关联度量,其可以从表面统计量生成。
-
公开(公告)号:US20090125501A1
公开(公告)日:2009-05-14
申请号:US11938811
申请日:2007-11-13
申请人: Jianfeng Gao , Galen Andrew , Mark Johnson , Kristina Toutanova
发明人: Jianfeng Gao , Galen Andrew , Mark Johnson , Kristina Toutanova
IPC分类号: G06F7/10
CPC分类号: G06F17/2715
摘要: Systems and methods for selecting a ranker for statistical natural language processing are provided. One disclosed system includes a computer program configured to be executed on a computing device, the computer program comprising a data store including reference performance data for a plurality of candidate rankers, the reference performance data being calculated based on a processing of test data by each of the plurality of candidate rankers. The system may further include a ranker selector configured to receive a statistical natural language processing task and a performance target, and determine a selected ranker from the plurality of candidate rankers based on the statistical natural language processing task, the performance target, and the reference performance data.
摘要翻译: 提供了用于选择用于统计自然语言处理的游戏者的系统和方法。 一种公开的系统包括被配置为在计算设备上执行的计算机程序,该计算机程序包括数据存储器,该数据存储器包括用于多个候选排名者的参考演出数据,该参考演出数据是基于每个测试数据的处理来计算的 多个候选排名。 该系统可以进一步包括配置成接收统计自然语言处理任务和性能目标的排队选择器,并且基于统计自然语言处理任务,性能目标和参考性能来确定来自多个候选排名者的选定队员 数据。
-
公开(公告)号:US07844555B2
公开(公告)日:2010-11-30
申请号:US11938811
申请日:2007-11-13
申请人: Jianfeng Gao , Galen Andrew , Mark Johnson , Kristina Toutanova
发明人: Jianfeng Gao , Galen Andrew , Mark Johnson , Kristina Toutanova
CPC分类号: G06F17/2715
摘要: Systems and methods for selecting a ranker for statistical natural language processing are provided. One disclosed system includes a computer program configured to be executed on a computing device, the computer program comprising a data store including reference performance data for a plurality of candidate rankers, the reference performance data being calculated based on a processing of test data by each of the plurality of candidate rankers. The system may further include a ranker selector configured to receive a statistical natural language processing task and a performance target, and determine a selected ranker from the plurality of candidate rankers based on the statistical natural language processing task, the performance target, and the reference performance data.
摘要翻译: 提供了用于选择用于统计自然语言处理的游戏者的系统和方法。 一种公开的系统包括被配置为在计算设备上执行的计算机程序,该计算机程序包括数据存储器,该数据存储器包括用于多个候选排名者的参考演出数据,该参考演出数据是基于每个测试数据的处理来计算的 多个候选排名。 该系统可以进一步包括配置成接收统计自然语言处理任务和性能目标的排队选择器,并且基于统计自然语言处理任务,性能目标和参考性能来确定来自多个候选排名者的选定队员 数据。
-
公开(公告)号:US09201871B2
公开(公告)日:2015-12-01
申请号:US12814422
申请日:2010-06-11
申请人: Xiaodong He , Kristina Toutanova
发明人: Xiaodong He , Kristina Toutanova
CPC分类号: G06F17/2854 , G06F17/2818 , G06F17/2827
摘要: A joint optimization strategy is employed for combining translation hypotheses from multiple machine-translation systems. Decisions on word alignment, between the hypotheses, ordering, and selection of a combined translation output are made jointly in accordance with a set of features. Additional features that model alignment and ordering behavior are also provided and utilized.
摘要翻译: 联合优化策略用于组合来自多个机器翻译系统的翻译假设。 根据一组特征联合进行字对齐,组合翻译输出的假设,排序和选择之间的决定。 还提供和利用了模型对齐和排序行为的其他功能。
-
公开(公告)号:US08209163B2
公开(公告)日:2012-06-26
申请号:US11504285
申请日:2006-08-15
申请人: Hisami Suzuki , Kristina Toutanova
发明人: Hisami Suzuki , Kristina Toutanova
IPC分类号: G06F17/28
CPC分类号: G06F17/2818 , G06F17/274
摘要: Grammatical element prediction is used in the context of machine translation. Features from both the source language and the target language sentences (or other text fragments) are used in predicting the grammatical elements.
摘要翻译: 语法元素预测用于机器翻译的上下文中。 源语言和目标语言句子(或其他文本片段)的特征用于预测语法元素。
-
公开(公告)号:US07865352B2
公开(公告)日:2011-01-04
申请号:US11483943
申请日:2006-07-10
申请人: Hisami Suzuki , Kristina Toutanova
发明人: Hisami Suzuki , Kristina Toutanova
IPC分类号: G06F17/28
CPC分类号: G06F17/2775 , G06F17/2863
摘要: Grammatical element prediction is used to predict grammatical elements in text fragments (such as phrases or sentences). In one embodiment, a statistical model, using syntax features, is used to predict grammatical elements.
摘要翻译: 语法元素预测用于预测文本片段(如短语或句子)中的语法元素。 在一个实施例中,使用语法特征的统计模型用于预测语法元素。
-
公开(公告)号:US20110307244A1
公开(公告)日:2011-12-15
申请号:US12814422
申请日:2010-06-11
申请人: Xiaodong He , Kristina Toutanova
发明人: Xiaodong He , Kristina Toutanova
IPC分类号: G06F17/28
CPC分类号: G06F17/2854 , G06F17/2818 , G06F17/2827
摘要: A joint optimization strategy is employed for combining translation hypotheses from multiple machine-translation systems. Decisions on word alignment, between the hypotheses, ordering, and selection of a combined translation output are made jointly in accordance with a set of features. Additional features that model alignment and ordering behavior are also provided and utilized.
摘要翻译: 联合优化策略用于组合来自多个机器翻译系统的翻译假设。 根据一组特征联合进行字对齐,组合翻译输出的假设,排序和选择之间的决定。 还提供和利用了模型对齐和排序行为的其他功能。
-
公开(公告)号:US20070282596A1
公开(公告)日:2007-12-06
申请号:US11483943
申请日:2006-07-10
申请人: Hisami Suzuki , Kristina Toutanova
发明人: Hisami Suzuki , Kristina Toutanova
IPC分类号: G06F17/27
CPC分类号: G06F17/2775 , G06F17/2863
摘要: Grammatical element prediction is used to predict grammatical elements in text fragments (such as phrases or sentences). In one embodiment, a statistical model, using syntax features, is used to predict grammatical elements.
摘要翻译: 语法元素预测用于预测文本片段(如短语或句子)中的语法元素。 在一个实施例中,使用语法特征的统计模型用于预测语法元素。
-
-
-
-
-
-
-
-
-