Consecutive crawling to identify transient links

Patent application

US20070226206A1 Consecutive crawling to identify transient links (pending, published)



Patent title: Consecutive crawling to identify transient links
Application number: US11388681

Filing date: 2006-03-23
Publication number: US20070226206A1

Publication date: 2007-09-27
Inventors: Dmitri Pavlovski, Vladimir Ofitserov, Alexander Arsky
Applicants: Dmitri Pavlovski, Vladimir Ofitserov, Alexander Arsky
Main classification: G06F17/30
IPC classification: G06F17/30


Abstract:

An approach is provided for identifying transient links on a Web page: the page is crawled twice in succession, with a brief interval between the crawls, and the links found in each crawl are compared. Links that differ between the two crawls are identified as transient. The approach ensures that transient links are not crawled and archived, thereby saving resources for crawling valid links that lead to useful information.
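The comparison step described in the abstract can be illustrated with a minimal Python sketch. It is not the implementation claimed in the application: the function names `extract_links` and `classify_links`, the use of anchor `href` attributes as the only link source, and the rule "a link absent from either crawl is transient" are all assumptions made for illustration. Fetching the page twice and waiting the brief interval between crawls is left to the caller.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class _LinkCollector(HTMLParser):
    """Collects absolute URLs from <a href="..."> tags (illustrative helper)."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = set()

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    # Resolve relative links against the page URL.
                    self.links.add(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Return the set of absolute link URLs found in one crawl's HTML."""
    collector = _LinkCollector(base_url)
    collector.feed(html)
    return collector.links


def classify_links(first_html, second_html, base_url):
    """Compare two consecutive crawls of the same page.

    Assumption: links present in both crawls are treated as stable,
    and links present in only one crawl are treated as transient.
    """
    first = extract_links(first_html, base_url)
    second = extract_links(second_html, base_url)
    stable = first & second
    transient = first ^ second
    return stable, transient


if __name__ == "__main__":
    # Hypothetical snapshots: the second link embeds a per-visit session id.
    crawl_1 = '<a href="/about">About</a><a href="/item?sid=111">Item</a>'
    crawl_2 = '<a href="/about">About</a><a href="/item?sid=222">Item</a>'
    stable, transient = classify_links(crawl_1, crawl_2, "http://example.com/")
    print("stable:", sorted(stable))
    print("transient:", sorted(transient))
```

Under these assumptions, a crawler would queue only the stable set for further crawling and archiving, and skip the transient set.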

