Abstract:
A method (and system) for extracting information from a document includes segregating a set of documents from a plurality of documents based on a likelihood that at least one document in the set carries an instance of preset information.
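
The segregation step can be illustrated with a minimal sketch. The abstract does not say how the likelihood is computed, so everything below is an assumption made for illustration: the regular expression standing in for the "preset information" (an ISO-style date), the binary `likelihood` score, the `threshold` parameter, and the names `segregate` and `corpus` are all hypothetical. The sketch also scores each document on its own, which is only one way to obtain a set in which at least one document is likely to carry an instance.

```python
import re
from typing import Callable, List

# Hypothetical stand-in for the "preset information": a date such as 2004-05-17.
PRESET_PATTERN = re.compile(r"\b\d{4}-\d{2}-\d{2}\b")


def likelihood(document: str) -> float:
    """Toy score (an assumption): 1.0 if the pattern occurs in the text, else 0.0."""
    return 1.0 if PRESET_PATTERN.search(document) else 0.0


def segregate(
    documents: List[str],
    threshold: float = 0.5,
    score: Callable[[str], float] = likelihood,
) -> List[str]:
    """Return the documents whose score meets or exceeds the threshold."""
    return [doc for doc in documents if score(doc) >= threshold]


corpus = [
    "Invoice issued on 2004-05-17 for services rendered.",
    "Meeting notes with no dates mentioned.",
    "Shipment scheduled for 2004-06-01.",
]

# Documents set aside for the later extraction step.
candidates = segregate(corpus)
```

A statistical or learned scorer could be passed as `score` in place of the pattern match; the filtering logic stays the same.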