Annotation of inverted list text indexes using search queries
    1.
    发明申请
    Annotation of inverted list text indexes using search queries 审中-公开
    使用搜索查询对反向列表文本索引进行注释

    公开(公告)号:US20060248037A1

    公开(公告)日:2006-11-02

    申请号:US11118526

    申请日:2005-04-29

    IPC分类号: G06F17/30

    CPC分类号: G06F16/3331

    摘要: A system and method of data mining comprises processing contents of a primary posting index; and producing a posting within a secondary posting index based on the processing of the contents of the primary posting index, wherein the processing of contents of the primary posting index comprises submitting a disjunction of terms or phrases to the primary posting index. The processing of contents of the primary posting index comprises generating a query result by submitting a query to the primary posting index using a query language of the primary posting index. Moreover, the processing of contents of the primary posting index comprises processing the primary posting index in order to generate results, wherein the results comprise a set of candidate entries with additional metadata; and filtering the results in order to produce the posting within the secondary posting index.

    摘要翻译: 数据挖掘的系统和方法包括处理主要发布索引的内容; 以及基于所述主要发布索引的内容的处理,在次要发布索引内生成发布,其中所述主要发布索引的内容的处理包括向所述主要发布索引提交术语或短语的分离。 主要发布索引的内容的处理包括使用主要发布索引的查询语言向主要发布索引提交查询来生成查询结果。 此外,主要发布索引的内容的处理包括处理主要发布索引以生成结果,其中结果包括具有附加元数据的一组候选条目; 并对结果进行过滤,以便在二次发布索引中产生过帐。

    Method and framework to support indexing and searching taxonomies in large scale full text indexes
    2.
    发明申请
    Method and framework to support indexing and searching taxonomies in large scale full text indexes 有权
    支持大规模全文索引分类和搜索索引的方法和框架

    公开(公告)号:US20070078880A1

    公开(公告)日:2007-04-05

    申请号:US11241687

    申请日:2005-09-30

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30734

    摘要: A system and method of indexing a plurality of entities located in a taxonomy, the entities comprising sets of terms, comprises receiving terms in an index structure; building a posting list for an entity with respect to the locations of the set of terms defining the entity and data associated with the respective terms; and indexing a name of a group comprising the entities within this group at the location of the entities with the data of the group comprising the name of the respective entity at each location. The building of the posting list comprises storing the location of the term and data associated with the term in an entry in the posting list for the term. The method comprises indexing aliases of the name of the group comprising the term, and using an inverted list index to associate data with each occurrence of an index term.

    摘要翻译: 一种对位于分类法中的多个实体进行索引的系统和方法,所述实体包括术语集合,包括在索引结构中接收术语; 为一个实体建立关于定义与各个条款相关联的实体和数据的术语集的位置的实体的发布列表; 并且在包括在每个位置处的相应实体的名称的组的数据的实体的位置处索引包括在该组内的实体的组的名称。 发布列表的构建包括将术语的位置和与该术语相关联的数据存储在该术语的发布列表中的条目中。 该方法包括对包括该术语的组的名称的别名进行索引,并使用反向列表索引将数据与索引项的每次出现相关联。