Document compression scheme that supports searching and partial decompression
    31.
    Granted invention patent
    Legal status: in force

    Publication No.: US07319994B1

    Publication date: 2008-01-15

    Application No.: US10444761

    Filing date: 2003-05-23

    Applicant: Olcan Sercinoglu

    Inventor: Olcan Sercinoglu

    IPC classification: G06F7/00 G06F17/00

    Abstract: One embodiment of the present invention provides a system that facilitates accessing a compressed representation of a set of documents, wherein the compressed representation supports searching and partial decompression. During operation, the system receives a search request containing terms to be searched for in the set of documents. In response to the search request, the system identifies occurrences of the terms in the set of documents by following pointers through the compressed representation. This compressed representation encodes occurrences of a term as a pointer to the next occurrence of the term to facilitate rapid enumeration of the occurrences of the term. Moreover, the compressed representation maintains sequential ordering between adjacent terms in the set of documents, which allows fast access to neighboring terms.
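
    The pointer-chained encoding this abstract describes can be illustrated with a minimal Python sketch. It is not the patented format: the function names `build_links` and `occurrences`, the use of plain in-memory lists, and whitespace tokenization are all assumptions made for illustration, and no actual compression is performed. Each position keeps a pointer to the next position holding the same term, while positions stay in document order so that neighboring terms can be read by index.

```python
from typing import Dict, Iterator, List, Optional, Tuple


def build_links(tokens: List[str]) -> Tuple[List[Optional[int]], Dict[str, int]]:
    """Build next-occurrence pointers for a token sequence (hypothetical helper).

    next_ptr[i] is the position of the next occurrence of tokens[i], or None.
    first[term] is the position of that term's first occurrence.
    """
    next_ptr: List[Optional[int]] = [None] * len(tokens)
    first: Dict[str, int] = {}
    last_seen: Dict[str, int] = {}
    for pos, term in enumerate(tokens):
        if term in last_seen:
            # Link the previous occurrence forward to this one.
            next_ptr[last_seen[term]] = pos
        else:
            first[term] = pos
        last_seen[term] = pos
    return next_ptr, first


def occurrences(
    term: str, next_ptr: List[Optional[int]], first: Dict[str, int]
) -> Iterator[int]:
    """Enumerate every position of `term` by following the pointer chain."""
    pos = first.get(term)
    while pos is not None:
        yield pos
        pos = next_ptr[pos]


# Assumed toy input: one document, split on whitespace.
tokens = "the cat sat on the mat near the cat".split()
next_ptr, first = build_links(tokens)

hits = list(occurrences("cat", next_ptr, first))

# Sequential ordering is preserved, so neighbors of each hit are one slice away.
context = [tokens[max(0, p - 1): p + 2] for p in hits]
```

    Enumerating a term touches only its own occurrences rather than scanning every position, which is the property the abstract attributes to the pointer encoding; the `context` slices show the "partial" access to neighboring terms.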


    Systems and Methods for Generating Statistics from Search Engine Query Logs
    32.
    Invention patent application
    Legal status: in force

    Publication No.: US20120215765A1

    Publication date: 2012-08-23

    Application No.: US13396511

    Filing date: 2012-02-14

    IPC classification: G06F17/30

    Abstract: A computer-implemented method includes calculating first statistics about a user-identified event within a first subset of a database of events; selecting a second subset of the database of events based on said first statistics; calculating second statistics about the user-identified event within the second subset of the database of events; merging the first and second statistics as statistics of the user-identified event within the entire database of events; and generating a result including at least a portion of the merged statistics of the user-identified event.
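
    The five steps in this abstract can be sketched in Python as follows. The abstract does not say how the second subset is chosen or how the statistics are merged, so everything concrete here is an assumption: the database is modeled as a list of shards of query strings, the hypothetical rule reads more shards when the event is rare in the first subset, and merging is a simple sum of counts reported as an estimate.

```python
import math
from collections import Counter
from typing import Dict, List, Sequence

Shard = List[str]  # assumed layout: one shard is a list of logged query strings


def count_event(shards: Sequence[Shard], event: str) -> Counter:
    """Count records and occurrences of `event` in the given shards."""
    stats: Counter = Counter()
    for shard in shards:
        stats["records"] += len(shard)
        stats["hits"] += sum(1 for query in shard if query == event)
    return stats


def two_pass_stats(
    db: Sequence[Shard], event: str, first_n: int = 1, target_hits: int = 3
) -> Dict[str, object]:
    # Step 1: first statistics over a first subset (the first `first_n` shards).
    first = count_event(db[:first_n], event)

    # Step 2: select the second subset based on the first statistics.
    # Assumed rule: the rarer the event looked, the more shards are read.
    remaining = db[first_n:]
    if first["hits"] >= target_hits:
        extra = 0
    elif first["hits"] == 0:
        extra = len(remaining)
    else:
        per_shard = first["hits"] / first_n
        needed = target_hits - first["hits"]
        extra = min(len(remaining), math.ceil(needed / per_shard))

    # Step 3: second statistics over the second subset.
    second = count_event(remaining[:extra], event)

    # Step 4: merge both sets of statistics (update() keeps zero counts).
    merged = Counter(first)
    merged.update(second)

    # Step 5: generate a result containing part of the merged statistics.
    records = merged["records"]
    rate = merged["hits"] / records if records else 0.0
    return {"event": event, "hits": merged["hits"], "records": records, "rate": rate}


# Assumed toy query log, split into three shards.
db = [
    ["weather", "news", "weather", "maps"],
    ["news", "weather", "mail"],
    ["maps", "mail", "news"],
]
common = two_pass_stats(db, "weather")
rare = two_pass_stats(db, "mail")
```

    With this toy log, the frequent query is resolved after reading two of the three shards, while the query that is absent from the first shard causes all remaining shards to be read.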
