Near-duplicate filtering in search engine result page of an online shopping system
Abstract:
Reducing near-duplicate entries in online shopping system search results. For each pair of entries in a set of entries, each entry characterizing a product in a data store of an online shopping system and each entry characterized by a set of attributes, determining a distance between the entries in the pair based on the attributes. Determining entry clusters from a graph formed with each determined distance as an edge between nodes representing the entries used to determine the distance, each entry cluster identified by cluster identifier. Returning an ordered list of results responsive to the query from the data store of an online shopping system, filtered as a function of at least one of the distance and the cluster identifier.
Information query
Patent Agency Ranking
0/0