Abstract:
Systems and methods are provided for obtaining a plurality of documents. A respective document in the plurality of documents is associated with a score and each document in the plurality of documents is from a different data structure in a plurality of data structures. Each data structure in the plurality of data structures represents a different portion of a document address space. A first document in the plurality of documents is selected in accordance with the score associated with the first document. The first document has a fingerprint that indicates that the first document has substantially identical content to every other document in the plurality of documents. In accordance with the score, the first document is indexed thereby producing an indexed first document. With respect to the plurality of documents, the indexed first document is included in a document index as representative of each document in the plurality of documents.
Abstract:
Systems and methods are provided that, in response to obtaining an email to a recipient from a sender, and in accordance with a determination that an indirect relationship exists between the sender and the recipient, determine a spam probability of the email by evaluating statistical information regarding the historical electronic interactions associated with the sender. In this way, the email is classified according to the identified spam probability.
Abstract:
A system and method for combining endorsements in related webpages, the method including receiving an indication of an endorsement at a first webpage, incrementing a primary count of the first webpage in response to receiving the indication, determining if the first page is related to one or more other webpages, identifying the one or more other webpages related to the first page, if it is determined that the first page is related to one or more other webpages, incrementing a secondary count of the first webpage and the one or more other webpages if it is determined that the first page is related to one or more other webpages in response to receiving the indication and providing the secondary count for display at the one or more of the first webpage or the one or more other webpages.