Document reuse in a search engine crawler

    公开(公告)号:US10216847B2

    公开(公告)日:2019-02-26

    申请号:US15617634

    申请日:2017-06-08

    Applicant: Google Inc.

    Abstract: Systems and method are provided for setting a respective reuse flag for a corresponding document in a plurality of documents based on a query-independent score associated with the corresponding document. A document crawling operation is performed on the plurality of documents in accordance with the reuse flag for respective documents in the plurality of documents. This document crawling operation includes reusing a previously downloaded version of a respective document in the plurality of documents instead of downloading a current version of the respective document from a host computer in accordance with a determination that the reuse flag associated with the respective document meets a predefined criterion.

    Document reuse in a search engine crawler

    公开(公告)号:US09679056B2

    公开(公告)日:2017-06-13

    申请号:US14245806

    申请日:2014-04-04

    Applicant: Google Inc.

    CPC classification number: G06F17/30864

    Abstract: Systems and method are provided for setting a respective reuse flag for a corresponding document in a plurality of documents based on a query-independent score associated with the corresponding document. A document crawling operation is performed on the plurality of documents in accordance with the reuse flag for respective documents in the plurality of documents. This document crawling operation includes reusing a previously downloaded version of a respective document in the plurality of documents instead of downloading a current version of the respective document from a host computer in accordance with a determination that the reuse flag associated with the respective document meets a predefined criterion.

Patent Agency Ranking