发明申请
US20120284275A1 UTILIZING OFFLINE CLUSTERS FOR REALTIME CLUSTERING OF SEARCH RESULTS
审中-公开
利用搜索结果实时聚类的离线群集
- 专利标题: UTILIZING OFFLINE CLUSTERS FOR REALTIME CLUSTERING OF SEARCH RESULTS
- 专利标题(中): 利用搜索结果实时聚类的离线群集
-
申请号: US13099197申请日: 2011-05-02
-
公开(公告)号: US20120284275A1公开(公告)日: 2012-11-08
- 发明人: Srinivas Vadrevu , Choon Hui Teo , Suju Rajan , Kunal Punera , Byron E. Dom , Alex J. Smola
- 申请人: Srinivas Vadrevu , Choon Hui Teo , Suju Rajan , Kunal Punera , Byron E. Dom , Alex J. Smola
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
Techniques for clustering of search results are described. In an example embodiment, a plurality of first clusters is determined, in a corpus of articles, independently of user queries issued against the corpus of articles, where each first cluster represents a group of articles that relate to a news story. One or more cluster identifiers are assigned to each article in the corpus, where the one or more cluster identifiers respectively identify one or more of the plurality of first clusters to which the article belongs. A query that specifies search criteria against the corpus of articles is received. In response to receiving the query, a result for the query is generated by at least selecting, from the corpus of articles, a set of articles based on the search criteria. The selected set of articles is grouped into one or more second clusters based at least on the one or more cluster identifiers that are assigned to each article in the set of articles. In the result for the query, the set of articles is organized according to the one or more second clusters.
信息查询