-
公开(公告)号:US20110246457A1
公开(公告)日:2011-10-06
申请号:US12749972
申请日:2010-03-30
申请人: Anlei Dong , Pranam Kolari , Ruiqiang Zhang , Jing Bai , Yi Chang , Zhaohui Zheng
发明人: Anlei Dong , Pranam Kolari , Ruiqiang Zhang , Jing Bai , Yi Chang , Zhaohui Zheng
CPC分类号: G06Q10/06 , G06F17/30864 , G06Q50/01
摘要: An information retrieval system is described herein that monitors a microblog data stream that includes microblog posts to discover and index fresh resources for searching by a search engine. The information retrieval system also uses data from the microblog data stream as well as data obtained from a microblog subscription system to compute novel and effective features for ranking fresh resources which would otherwise have impoverished representations. An embodiment of the present invention advantageously enables a search engine to produce a fresher set of resources and to rank such resources for both relevancy and freshness in a more accurate manner.
摘要翻译: 本文描述了一种信息检索系统,其监视包括微博帖子的微博数据流,以发现和索引新的资源以供搜索引擎搜索。 信息检索系统还使用来自微博数据流的数据以及从微博订阅系统获得的数据来计算新颖而有效的特征来排列否则将具有贫困表示的新鲜资源。 本发明的一个实施例有利地使得搜索引擎能够产生更新鲜的资源集合,并且以更准确的方式对相关性和新鲜度进行排名。
-
公开(公告)号:US08751511B2
公开(公告)日:2014-06-10
申请号:US12749972
申请日:2010-03-30
申请人: Anlei Dong , Pranam Kolari , Ruiqiang Zhang , Jing Bai , Yi Chang , Zhaohui Zheng
发明人: Anlei Dong , Pranam Kolari , Ruiqiang Zhang , Jing Bai , Yi Chang , Zhaohui Zheng
CPC分类号: G06Q10/06 , G06F17/30864 , G06Q50/01
摘要: An information retrieval system is described herein that monitors a microblog data stream that includes microblog posts to discover and index fresh resources for searching by a search engine. The information retrieval system also uses data from the microblog data stream as well as data obtained from a microblog subscription system to compute novel and effective features for ranking fresh resources which would otherwise have impoverished representations. An embodiment of the present invention advantageously enables a search engine to produce a fresher set of resources and to rank such resources for both relevancy and freshness in a more accurate manner.
摘要翻译: 本文描述了一种信息检索系统,其监视包括微博帖子的微博数据流,以发现和索引新的资源以供搜索引擎搜索。 信息检索系统还使用来自微博数据流的数据以及从微博订阅系统获得的数据来计算新颖而有效的特征来排列否则将具有贫困表示的新鲜资源。 本发明的一个实施例有利地使得搜索引擎能够产生更新鲜的资源集合,并且以更准确的方式对相关性和新鲜度进行排名。
-
公开(公告)号:US20120042020A1
公开(公告)日:2012-02-16
申请号:US12857000
申请日:2010-08-16
申请人: Pranam Kolari , Ruiqiang Zhang , Yi Chang , Anlei Dong , Zhaohui Zheng , Lei Duan
发明人: Pranam Kolari , Ruiqiang Zhang , Yi Chang , Anlei Dong , Zhaohui Zheng , Lei Duan
IPC分类号: G06F15/16
CPC分类号: G06Q10/107
摘要: Example methods, apparatuses, or articles of manufacture are disclosed that may be implemented using one or more computing devices to provide or otherwise support micro-blog message filtering.
摘要翻译: 公开了可以使用一个或多个计算设备实现以提供或以其他方式支持微博消息过滤的示例性方法,设备或制品。
-
公开(公告)号:US20110093459A1
公开(公告)日:2011-04-21
申请号:US12579855
申请日:2009-10-15
申请人: Anlei Dong , Yi Chang , Ruiqiang Zhang , Zhaohui Zheng , Gilad Avraham Mishne , Jing Bai , Karolina Barbara Buchner , Ciya Liao , Shihao Ji , Gilbert Leung , Georges-Eric Albert Marie Robert Dupret , Ling Liu
发明人: Anlei Dong , Yi Chang , Ruiqiang Zhang , Zhaohui Zheng , Gilad Avraham Mishne , Jing Bai , Karolina Barbara Buchner , Ciya Liao , Shihao Ji , Gilbert Leung , Georges-Eric Albert Marie Robert Dupret , Ling Liu
IPC分类号: G06F17/30
CPC分类号: G06F17/30867
摘要: In one embodiment, access a set of recency ranking data comprising one or more recency search queries and one or more recency search results, each of the recency search queries being recency-sensitive with respect to a particular time period and being associated with a query timestamp representing the time at which the recency search query is received at a search engine, each of the recency search results being generated by the search engine for one of the recency search queries and comprising one or more recency network resources. Construct a plurality of recency features from the set of recency ranking data. Train a first ranking model via machine learning using at least the recency features.
摘要翻译: 在一个实施例中,访问包括一个或多个新近度搜索查询和一个或多个新近度搜索结果的一组新近度排序数据,每个新近度搜索查询相对于特定时间段是新近敏感度,并且与查询时间戳相关联 表示在搜索引擎处接收到新近度搜索查询的时间,每个新近度搜索结果由搜索引擎为新近搜索查询之一生成,并且包括一个或多个新近网络资源。 从新近度排序数据集合构建多个新近特征。 通过机器学习,至少使用新特性来训练第一个排名榜。
-
公开(公告)号:US08886641B2
公开(公告)日:2014-11-11
申请号:US12579855
申请日:2009-10-15
申请人: Anlei Dong , Yi Chang , Ruiqiang Zhang , Zhaohui Zheng , Gilad Avraham Mishne , Jing Bai , Karolina Barbara Buchner , Ciya Liao , Shihao Ji , Gilbert Leung , Georges-Eric Albert Marie Robert Dupret , Ling Liu
发明人: Anlei Dong , Yi Chang , Ruiqiang Zhang , Zhaohui Zheng , Gilad Avraham Mishne , Jing Bai , Karolina Barbara Buchner , Ciya Liao , Shihao Ji , Gilbert Leung , Georges-Eric Albert Marie Robert Dupret , Ling Liu
IPC分类号: G06F17/30
CPC分类号: G06F17/30867
摘要: In one embodiment, access a set of recency ranking data comprising one or more recency search queries and one or more recency search results, each of the recency search queries being recency-sensitive with respect to a particular time period and being associated with a query timestamp representing the time at which the recency search query is received at a search engine, each of the recency search results being generated by the search engine for one of the recency search queries and comprising one or more recency network resources. Construct a plurality of recency features from the set of recency ranking data. Train a first ranking model via machine learning using at least the recency features.
摘要翻译: 在一个实施例中,访问包括一个或多个新近度搜索查询和一个或多个新近度搜索结果的一组新近度排序数据,每个新近度搜索查询相对于特定时间段是新近敏感度,并且与查询时间戳相关联 表示在搜索引擎处接收到新近度搜索查询的时间,每个新近度搜索结果由搜索引擎为新近搜索查询之一生成,并且包括一个或多个新近网络资源。 从新近度排序数据集合构建多个新近特征。 通过机器学习,至少使用新特性来训练第一个排名榜。
-
6.
公开(公告)号:US20110087655A1
公开(公告)日:2011-04-14
申请号:US12576534
申请日:2009-10-09
申请人: Ruiqiang Zhang , Yi Chang , Anlei Dong , Zhaohui Zheng
发明人: Ruiqiang Zhang , Yi Chang , Anlei Dong , Zhaohui Zheng
IPC分类号: G06F17/30
CPC分类号: G06F16/9535
摘要: In one embodiment, a method comprises accessing a search query received at a search engine; identifying a plurality of network resources for the search query; calculating a ranking score for each of the network resources; determining whether the search query is year-qualified; and if the search query is year-qualified, then adjusting the ranking scores of selected ones of the network resources based on a difference between the ranking score of an oldest one of the network resources and the ranking score of a newest one of the network resources and a confidence score representing a likelihood that the search query is year-qualified.
摘要翻译: 在一个实施例中,一种方法包括访问在搜索引擎处接收的搜索查询; 识别搜索查询的多个网络资源; 计算每个网络资源的排名得分; 确定搜索查询是否合格; 并且如果搜索查询是合格的,则基于网络资源中最老的一个网络资源的排名分数与最新的一个网络资源的排名得分之间的差异来调整所选择的网络资源的排名得分 以及表示搜索查询符合年限的可能性的置信度分数。
-
-
-
-
-