发明授权
US07996349B2 Methods and apparatus for computing graph similarity via sequence similarity
有权
通过序列相似度计算图相似度的方法和装置
- 专利标题: Methods and apparatus for computing graph similarity via sequence similarity
- 专利标题(中): 通过序列相似度计算图相似度的方法和装置
-
申请号: US11951146申请日: 2007-12-05
-
公开(公告)号: US07996349B2公开(公告)日: 2011-08-09
- 发明人: Ali Dasdan , Panagiotis Papadimitriou
- 申请人: Ali Dasdan , Panagiotis Papadimitriou
- 申请人地址: US CA Sunnyvale
- 专利权人: Yahoo! Inc.
- 当前专利权人: Yahoo! Inc.
- 当前专利权人地址: US CA Sunnyvale
- 代理机构: Greenberg Traurig, LLP
- 代理商 James J. DeCarlo
- 主分类号: G06F17/00
- IPC分类号: G06F17/00 ; G06N5/00
摘要:
This disclosure describes systems and methods for identifying and correcting anomalies in web graphs. A web graph is transformed into a sequence of tokens via a walk algorithm. The sequence is fingerprinted to form a set of shingles. The singles are compared to shingles for other web graphs in order to determine similarity between web graphs. Actions are then carried out to remove anomalous web graphs and modify parameters governing web mapping in order to decrease the likelihood of future anomalous web graphs being built.
公开/授权文献
信息查询