Abstract:
A computer-implemented method is presented herein. The method obtains a first content item from an online source, and then generates a characterizing signature of the first content item. The method continues by finding a previously-saved instance of the characterizing signature and retrieving data associated with a second content item (the second content item is characterized by the characterizing signature). The method continues by analyzing the data associated with the second content item, corresponding data associated with the first content item, and decision criteria. Thereafter, either the first content item or the second content item is identified as an original content item, based on the analyzing. The other content item can be flagged as an aggregated content item.
Abstract:
A computer-implemented method is disclosed herein. The method obtains a first content item from an online source, and selects only a portion of the words in the first content item, in accordance with a defined selection algorithm. The method continues by generating a characterizing signature of the first content item, wherein the characterizing signature is derived from the selected words. The method continues by finding a previously-saved instance of the characterizing signature, retrieving data associated with a second content item (which is linked to the saved characterizing signature), and comparing the data associated with the second content item against corresponding data associated with the first content item. The first content item or the second content item is identified as an original content item, based on the comparing. The other content item is flagged as an aggregated content item.