-
公开(公告)号:US08150841B2
公开(公告)日:2012-04-03
申请号:US12690184
申请日:2010-01-20
申请人: Christopher Avery Meyers , Gopi Prashanth Gopal , Andrew Peter Oakley , Nitin Agrawal , Nicholas Eric Craswell , Milad Shokouhi , Derrick Leslie Connell , Sanaz Ahari , Neil Bruce Sharman , Gaurav Sareen , Hugh Evan Williams , Jay Kumar Goyal
发明人: Christopher Avery Meyers , Gopi Prashanth Gopal , Andrew Peter Oakley , Nitin Agrawal , Nicholas Eric Craswell , Milad Shokouhi , Derrick Leslie Connell , Sanaz Ahari , Neil Bruce Sharman , Gaurav Sareen , Hugh Evan Williams , Jay Kumar Goyal
CPC分类号: G06F17/30448 , G06Q30/0254
摘要: Methods, systems, and media are provided for identifying and clustering queries that are rising in popularity. Resultant clustered queries can be compared to other stored queries using textual and temporal correlations. Fresh indices containing information and results from recently crawled content sources are searched to obtain the most recent query activity. Historical indices are also searched to obtain temporally correlated information and results that match the clustered query stream. A weighted average acceleration of a spike can be calculated to distinguish between a legitimate spike and a non-legitimate spike. Legitimate clusters are combined with other stored clusters and presented as grouped content results to a user output device.
摘要翻译: 提供了方法,系统和媒体,用于识别和聚集正在日益普及的查询。 可以使用文本和时间相关性将所产生的聚类查询与其他存储的查询进行比较。 搜索包含最近爬取的内容源的信息和结果的新索引以获取最近的查询活动。 还搜索历史索引以获得与聚集查询流匹配的时间相关信息和结果。 可以计算穗的加权平均加速度,以区分合法穗和非合法穗。 合法集群与其他存储的集群组合,并以分组的内容结果呈现给用户输出设备。