Multi-stage query processing system and method for use with tokenspace repository
    18.
    发明申请
    Multi-stage query processing system and method for use with tokenspace repository 有权
    多阶段查询处理系统和方法用于托管存储库

    公开(公告)号:US20060036593A1

    公开(公告)日:2006-02-16

    申请号:US10917746

    申请日:2004-08-13

    IPC分类号: G06F17/30

    摘要: A multi-stage query processing system and method enables multi-stage query scoring, including “snippet” generation, through incremental document reconstruction facilitated by a multi-tiered mapping scheme. At one or more stages of a multi-stage query processing system a set of relevancy scores are used to select a subset of documents for presentation as an ordered list to a user. The set of relevancy scores can be derived in part from one or more sets of relevancy scores determined in prior stages of the multi-stage query processing system. In some embodiments, the multi-stage query processing system is capable of executing one or more passes on a user query, and using information from each pass to expand the user query for use in a subsequent pass to improve the relevancy of documents in the ordered list.

    摘要翻译: 多级查询处理系统和方法通过多层次映射方案促进的增量文档重建实现了多阶段查询评分,包括“代码段”生成。 在多阶段查询处理系统的一个或多个阶段,使用一组相关性分数来选择文档的子集,以作为用户的排序列表呈现。 相关性分数的集合可以部分地从多级查询处理系统的先前阶段中确定的一组或多组相关性得分导出。 在一些实施例中,多级查询处理系统能够执行用户查询的一个或多个传递,并且使用来自每个遍的信息来扩展用户查询以用于随后的传递中以改善订购中的文档的相关性 列表。

    Generating content snippets using a tokenspace repository
    19.
    发明授权
    Generating content snippets using a tokenspace repository 有权
    使用令牌空间存储库生成内容片段

    公开(公告)号:US08321445B2

    公开(公告)日:2012-11-27

    申请号:US13040220

    申请日:2011-03-03

    IPC分类号: G06F7/00 G06F17/30

    摘要: A search engine server system receives from a client system a search query and identifies a set of documents in accordance with the search query. A content snippet corresponding to content in a respective document of the identified set of documents is generated, the content snippet associated with at least one query term of the one or more query terms in the search query. A response to the search query is returned to the client system, the response including information identifying at least the respective document and including the content snippet. Generating the content snippet includes performing a first decompression operation on first token identifiers, from a compressed document repository, to provide a set of second token identifiers, and performing a second decompression operation on the set of second token identifiers to recover uncompressed content comprising a portion of the respective document.

    摘要翻译: 搜索引擎服务器系统从客户端系统接收搜索查询,并根据搜索查询识别一组文档。 产生对应于所识别的一组文档的相应文档中的内容的内容片段,该内容片段与搜索查询中的一个或多个查询词的至少一个查询词相关联。 对搜索查询的响应被返回到客户端系统,响应包括至少标识相应文档并且包括内容片段的信息。 生成内容片段包括对来自压缩文档库的第一令牌标识符执行第一解压缩操作,以提供一组第二令牌标识符,以及对所述第二令牌标识符集合执行第二解压缩操作,以恢复未压缩内容,其包括部分 的相关文件。

    Query Processing System and Method for Use with Tokenspace Repository
    20.
    发明申请
    Query Processing System and Method for Use with Tokenspace Repository 有权
    查询处理系统和方法用于Tokenpace存储库

    公开(公告)号:US20110153577A1

    公开(公告)日:2011-06-23

    申请号:US13040220

    申请日:2011-03-03

    IPC分类号: G06F17/30

    摘要: A search engine server system receives from a client system a search query and identifies a set of documents in accordance with the search query. A content snippet corresponding to content in a respective document of the identified set of documents is generated, the content snippet associated with at least one query term of the one or more query terms in the search query. A response to the search query is returned to the client system, the response including information identifying at least the respective document and including the content snippet. Generating the content snippet includes performing a first decompression operation on first token identifiers, from a compressed document repository, to provide a set of second token identifiers, and performing a second decompression operation on the set of second token identifiers to recover uncompressed content comprising a portion of the respective document.

    摘要翻译: 搜索引擎服务器系统从客户端系统接收搜索查询,并根据搜索查询识别一组文档。 产生对应于所识别的一组文档的相应文档中的内容的内容片段,该内容片段与搜索查询中的一个或多个查询词的至少一个查询词相关联。 对搜索查询的响应被返回到客户端系统,响应包括至少标识相应文档并且包括内容片段的信息。 生成内容片段包括对来自压缩文档库的第一令牌标识符执行第一解压缩操作,以提供一组第二令牌标识符,以及对所述第二令牌标识符集合执行第二解压缩操作,以恢复未压缩内容,其包括部分 的相关文件。