TEXT INDEXING FOR UPDATEABLE TOKENIZED TEXT

    公开(公告)号:US20120150864A1

    公开(公告)日:2012-06-14

    申请号:US12967419

    申请日:2010-12-14

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30613 G06F17/30017

    摘要: Systems, methods, and other embodiments associated with text indexing for updateable tokenized text are described. One example method includes receiving revised tokenized text intended to replace existing tokenized text in an indexed document. Token location information corresponding to the revised tokenized text is stored in an allocated free space portion of a text index posting.

    摘要翻译: 描述了与用于可更新标记化文本的文本索引相关联的系统,方法和其他实施例。 一个示例性方法包括接收旨在替换索引文档中的现有标记化文本的经修改的标记化文本。 对应于经修改的标记化文本的令牌位置信息被存储在文本索引发布的分配的可用空间部分中。

    REAL-TIME TEXT INDEXING
    2.
    发明申请
    REAL-TIME TEXT INDEXING 有权
    实时文本索引

    公开(公告)号:US20120166404A1

    公开(公告)日:2012-06-28

    申请号:US12979413

    申请日:2010-12-28

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30622 G06F17/30631

    摘要: Systems, methods, and other embodiments associated with real-time text indexing are described. One example method includes receiving a document for indexing in a search system that includes a mature index and indexing the received document in a staging index. The staging index may be stored in direct access memory associated with query processing that does not degrade query performance even when postings become fragmented. The staging index and the mature text index are accessed to process queries on the search system. The example method may also include periodically merging the staging index into the mature index based on query feedback.

    摘要翻译: 描述了与实时文本索引相关联的系统,方法和其他实施例。 一个示例性方法包括在搜索系统中接收用于索引的文档,该搜索系统包括成熟索引并且以分段索引对接收到的文档进行索引。 分级索引可以存储在与查询处理相关联的直接访问存储器中,即使在发布分段时也不降低查询性能。 访问分段索引和成熟文本索引以在搜索系统上处理查询。 示例性方法还可以包括基于查询反馈将登台索引周期性地合并到成熟索引中。