发明申请
- 专利标题: Consecutive crawling to identify transient links
- 专利标题(中): 连续爬行以识别短暂的链接
-
申请号: US11388681申请日: 2006-03-23
-
公开(公告)号: US20070226206A1公开(公告)日: 2007-09-27
- 发明人: Dmitri Pavlovski , Vladimir Ofitserov , Alexander Arsky
- 申请人: Dmitri Pavlovski , Vladimir Ofitserov , Alexander Arsky
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
According to the approach described herein, an approach is provided for identifying transient links on a Web page by crawling a Web page consecutively after a brief interval and comparing the links from each crawl to identify transient links. The approach ensures that transient links are not crawled and archived, thereby saving resources for crawling valid links leading to useful information
信息查询