摘要:
A system and method of data mining comprises processing contents of a primary posting index; and producing a posting within a secondary posting index based on the processing of the contents of the primary posting index, wherein the processing of contents of the primary posting index comprises submitting a disjunction of terms or phrases to the primary posting index. The processing of contents of the primary posting index comprises generating a query result by submitting a query to the primary posting index using a query language of the primary posting index. Moreover, the processing of contents of the primary posting index comprises processing the primary posting index in order to generate results, wherein the results comprise a set of candidate entries with additional metadata; and filtering the results in order to produce the posting within the secondary posting index.
摘要:
A system and method of indexing a plurality of entities located in a taxonomy, the entities comprising sets of terms, comprises receiving terms in an index structure; building a posting list for an entity with respect to the locations of the set of terms defining the entity and data associated with the respective terms; and indexing a name of a group comprising the entities within this group at the location of the entities with the data of the group comprising the name of the respective entity at each location. The building of the posting list comprises storing the location of the term and data associated with the term in an entry in the posting list for the term. The method comprises indexing aliases of the name of the group comprising the term, and using an inverted list index to associate data with each occurrence of an index term.