Abstract:
An enhanced method and system for classifying a target web page and describing a set of web pages using virtual documents, in which a virtual document comprises extended anchortext extracted from each of a plurality of web pages that include at least one hyperlink citing the target web page.
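
As an illustration only, the construction of a virtual document might be sketched as follows. The sketch assumes that "extended anchortext" means the words of the hyperlink plus a fixed number of words on either side. The window size of 25 words, the exact-match comparison of the `href` value, and the names `extended_anchortext` and `build_virtual_document` are assumptions made for this example and are not taken from the abstract.

```python
from html.parser import HTMLParser


class _AnchorTokenizer(HTMLParser):
    """Collect a page's words and the word spans covered by links to target_url."""

    def __init__(self, target_url):
        super().__init__()
        self.target_url = target_url
        self.words = []    # all text words of the page, in document order
        self.spans = []    # (start, end) word indices of anchors citing the target
        self._start = None # word index where the currently open matching anchor began

    def handle_starttag(self, tag, attrs):
        # Assumption: a link cites the target only if href matches exactly.
        if tag == "a" and dict(attrs).get("href") == self.target_url:
            self._start = len(self.words)

    def handle_endtag(self, tag):
        if tag == "a" and self._start is not None:
            self.spans.append((self._start, len(self.words)))
            self._start = None

    def handle_data(self, data):
        self.words.extend(data.split())


def extended_anchortext(html, target_url, window=25):
    """Return one snippet per citing link: anchor words plus `window` words each side."""
    parser = _AnchorTokenizer(target_url)
    parser.feed(html)
    parser.close()
    snippets = []
    for start, end in parser.spans:
        lo = max(0, start - window)
        snippets.append(" ".join(parser.words[lo:end + window]))
    return snippets


def build_virtual_document(citing_pages, target_url, window=25):
    """Concatenate the extended anchortext from every citing page into one text."""
    parts = []
    for html in citing_pages:
        parts.extend(extended_anchortext(html, target_url, window))
    return "\n".join(parts)
```

The resulting text could then be passed to any text classifier, or mined for frequent terms to describe a set of pages; neither step is shown here.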