Systems and methods for indexing content for fast and scalable retrieval
    1.
    发明授权
    Systems and methods for indexing content for fast and scalable retrieval 有权
    用于索引内容的系统和方法用于快速和可扩展的检索

    公开(公告)号:US07620624B2

    公开(公告)日:2009-11-17

    申请号:US10966265

    申请日:2004-10-15

    IPC分类号: G06F17/30

    摘要: Systems and methods for query processing and indexing of documents in connection with a content store in a computing system are provided. In various embodiments, an indexing model is provided that is optimized for fast, efficient and scalable retrieval of documents satisfying a query, including the mixed use of forward and inverted indexing representations, including algorithms for achieving a balance between the two representations. When processing queries, fast and efficient generation of reverse chronologically ordered posting lists is enabled for efficient execution of logical operators on query result sets. A term expand index is also provided wherein the overall terms included in the term expand index are decomposed into a plurality of lexicon files, which are combined when convenient for fast, scalable efficiency when performing queries of the content in the content store.

    摘要翻译: 提供了用于与计算系统中的内容存储相关联的文档的查询处理和索引的系统和方法。 在各种实施例中,提供了针对满足查询的文档的快速,有效和可扩展检索进行优化的索引模型,包括前向和反向索引表示的混合使用,包括用于实现两个表示之间的平衡的算法。 在处理查询时,可以快速有效地生成反向时间顺序的发布列表,以便有效执行查询结果集上的逻辑运算符。 还提供了一个术语展开索引,其中包括在术语展开索引中的总体术语被分解成多个词典文件,当在执行内容存储中的内容的查询时方便快速,可扩展的效率时,它们被组合。

    Systems and methods for indexing content for fast and scalable retrieval
    3.
    发明授权
    Systems and methods for indexing content for fast and scalable retrieval 有权
    用于索引内容的系统和方法用于快速和可扩展的检索

    公开(公告)号:US07849063B2

    公开(公告)日:2010-12-07

    申请号:US10966566

    申请日:2004-10-15

    IPC分类号: G06F7/00

    摘要: Systems and methods for query processing and indexing of documents in connection with a content store in a computing system are provided. In various embodiments, an indexing model is provided that is optimized for fast, efficient and scalable retrieval of documents satisfying a query, including the mixed use of forward and inverted indexing representations, including algorithms for achieving a balance between the two representations. When processing queries, fast and efficient generation of reverse chronologically ordered posting lists is enabled for efficient execution of logical operators on query result sets. A term expand index is also provided wherein the overall terms included in the term expand index are decomposed into a plurality of lexicon files, which are combined when convenient for fast, scalable efficiency when performing queries of the content in the content store.

    摘要翻译: 提供了用于与计算系统中的内容存储相关联的文档的查询处理和索引的系统和方法。 在各种实施例中,提供了针对满足查询的文档的快速,有效和可扩展检索进行优化的索引模型,包括前向和反向索引表示的混合使用,包括用于实现两个表示之间的平衡的算法。 在处理查询时,可以快速有效地生成反向时间顺序的发布列表,以便有效执行查询结果集上的逻辑运算符。 还提供了一个术语展开索引,其中包括在术语展开索引中的总体术语被分解成多个词典文件,当在执行内容存储中的内容的查询时方便快速,可扩展的效率时,它们被组合。

    Systems and methods for updating query results based on query deltas
    4.
    发明授权
    Systems and methods for updating query results based on query deltas 有权
    基于查询三角形更新查询结果的系统和方法

    公开(公告)号:US07720845B2

    公开(公告)日:2010-05-18

    申请号:US10917742

    申请日:2004-08-13

    IPC分类号: G06F7/00 G06F17/00

    CPC分类号: G06F17/30693

    摘要: Systems and methods for automatically updating queries on a data store, such as a personal content database (PCDB), are provided. A query engine runs queries against two indexes: a first index that represents a previous state of documents and a second index that represents a current state of documents. The query is run twice and a delta analysis is performed, i.e., a determination is made as to which documents have changed in some respect from the previous state to the current state, and a view or a count associated with at least one query changes in accordance with the delta analysis. Transactions may be batched dynamically by a transaction manager until an optimal number of documents have changed or a certain amount of time has passed prior to re-running the query and performing the delta analysis.

    摘要翻译: 提供了用于自动更新诸如个人内容数据库(PCDB)的数据存储上的查询的系统和方法。 查询引擎针对两个索引运行查询:表示文档的先前状态的第一个索引和表示文档的当前状态的第二个索引。 该查询运行两次并进行增量分析,即,确定哪些文档在某些方面从先前状态改变到当前状态,并且与至少一个查询相关联的视图或计数改变 按照delta分析。 交易可以由事务管理器动态分批,直到文档的最佳数量发生变化或在重新运行查询并执行增量分析之前已经过了一段时间。