Representative Document Selection for a Set of Duplicate Documents
    1.
    Invention patent application
    Representative Document Selection for a Set of Duplicate Documents (under examination, published)

    Publication No.: US20150026170A1

    Publication date: 2015-01-22

    Application No.: US14510775

    Filing date: 2014-10-09

    Applicant: GOOGLE INC.

    Abstract: Systems and methods are provided for obtaining a plurality of documents. A respective document in the plurality of documents is associated with a score and each document in the plurality of documents is from a different data structure in a plurality of data structures. Each data structure in the plurality of data structures represents a different portion of a document address space. A first document in the plurality of documents is selected in accordance with the score associated with the first document. The first document has a fingerprint that indicates that the first document has substantially identical content to every other document in the plurality of documents. In accordance with the score, the first document is indexed thereby producing an indexed first document. With respect to the plurality of documents, the indexed first document is included in a document index as representative of each document in the plurality of documents.
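The selection step described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `Doc` record, the `select_representatives` function, the use of plain dictionaries as the per-partition data structures, and the "highest score wins" rule are all assumptions made for illustration. The abstract says only that selection is made "in accordance with the score".

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Doc:
    """Hypothetical document record; field names are illustrative only."""
    url: str
    fingerprint: str  # equal fingerprints are taken to mean substantially identical content
    score: float      # assumed: a higher score marks a better representative


def select_representatives(partitions):
    """Pick one representative per group of duplicate documents.

    `partitions` is a list of dicts (url -> Doc); each dict stands in for a
    data structure covering a different portion of the document address space.
    Returns a dict mapping fingerprint -> the Doc chosen to be indexed.
    """
    # Gather documents that share a fingerprint across all partitions.
    groups = defaultdict(list)
    for partition in partitions:
        for doc in partition.values():
            groups[doc.fingerprint].append(doc)

    # Assumed rule: the highest-scoring duplicate represents the whole group.
    # On a tie, max() keeps the first document encountered.
    return {fp: max(docs, key=lambda d: d.score) for fp, docs in groups.items()}


if __name__ == "__main__":
    partitions = [
        {"a.example/x": Doc("a.example/x", "fp1", 0.4)},
        {"b.example/x": Doc("b.example/x", "fp1", 0.9)},
        {"c.example/y": Doc("c.example/y", "fp2", 0.1)},
    ]
    for fp, doc in select_representatives(partitions).items():
        print(fp, doc.url)
```

In this sketch only the selected document per fingerprint would be written to the index, where it stands in for every duplicate in its group.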


    ENDORSEMENT SMEARING AMONG RELATED WEBPAGES
    3.
    Invention patent application

    Publication No.: US20180052807A1

    Publication date: 2018-02-22

    Application No.: US14080721

    Filing date: 2013-11-14

    Applicant: Google Inc.

    CPC classification number: G06F16/951

    Abstract: A system and method for combining endorsements in related webpages, the method including receiving an indication of an endorsement at a first webpage, incrementing a primary count of the first webpage in response to receiving the indication, determining if the first page is related to one or more other webpages, identifying the one or more other webpages related to the first page, if it is determined that the first page is related to one or more other webpages, incrementing a secondary count of the first webpage and the one or more other webpages if it is determined that the first page is related to one or more other webpages in response to receiving the indication and providing the secondary count for display at the one or more of the first webpage or the one or more other webpages.
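The counting scheme in this abstract can be sketched as follows. This is an illustrative reading, not the claimed method: the `EndorsementCounter` class, its method names, and the representation of "related webpages" as precomputed sets of URLs are assumptions. The abstract does not say how relatedness is determined.

```python
from collections import defaultdict


class EndorsementCounter:
    """Hypothetical tracker for per-page and combined endorsement counts."""

    def __init__(self, related_groups):
        # Assumption: relatedness is supplied up front as disjoint sets of URLs.
        self.related = {}
        for group in related_groups:
            members = frozenset(group)
            for url in members:
                self.related[url] = members
        self.primary = defaultdict(int)    # endorsements received by each page
        self.secondary = defaultdict(int)  # combined count shared by related pages

    def endorse(self, url):
        """Record one endorsement received at `url`."""
        self.primary[url] += 1
        others = self.related.get(url, frozenset()) - {url}
        if others:
            # The page is related to other pages: increment the secondary
            # count of the endorsed page and of every related page.
            for page in others | {url}:
                self.secondary[page] += 1

    def secondary_count(self, url):
        """Combined count that could be provided for display on `url`."""
        return self.secondary.get(url, 0)


if __name__ == "__main__":
    counter = EndorsementCounter([{"site.example/en", "site.example/fr"}])
    counter.endorse("site.example/en")
    counter.endorse("site.example/fr")
    print(counter.primary["site.example/en"])
    print(counter.secondary_count("site.example/en"))
```

A page with no related pages keeps a secondary count of zero here. The abstract leaves that case open, so a real system might fall back to the primary count for display.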
