Abstract:
The specification relates to framework for removing non-authored content documents from an authored-content database by recording a sequence of authorship data for at least one authored-content document over a period of time. The at least one authored-content document can be indexed in an authored-content database. The sequence of authorship data is analyzed to determine if the at least one authored-content document changed in a meaningful way beyond a set threshold. If the at least one authored-content document is changed beyond the set threshold, the at least one authored-content document is removed from the authored-content database.