Method For Generating Score-Optimal R-Trees
    5.
    发明申请
    Method For Generating Score-Optimal R-Trees 审中-公开
    生成分数最优R树的方法

    公开(公告)号:US20100036865A1

    公开(公告)日:2010-02-11

    申请号:US12188169

    申请日:2008-08-07

    IPC分类号: G06F17/30

    CPC分类号: G06F16/322

    摘要: A method of constructing a score-optimal R-tree to support top-k stabbing queries over a set of scored intervals generates a constraint graph from the set, and determines over each node in the constraint graph that has no other nodes pointing to it the node with the smallest left endpoint; for each of these nodes, the associated interval is added to the tree and the node is removed from the constraint graph.

    摘要翻译: 一种构建分数最优R树以支持一组计分间隔的顶部k个刺激查询的方法从该集合生成约束图,并确定约束图中没有其他节点指向它的每个节点 具有最小左端点的节点; 对于这些节点中的每一个,将相关联的间隔添加到树中,并且从约束图中移除该节点。

    Methods and apparatus for representing probabilistic data using a probabilistic histogram
    9.
    发明授权
    Methods and apparatus for representing probabilistic data using a probabilistic histogram 失效
    使用概率直方图表示概率数据的方法和装置

    公开(公告)号:US08145669B2

    公开(公告)日:2012-03-27

    申请号:US12636544

    申请日:2009-12-11

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30536

    摘要: Methods and apparatus for representing probabilistic data using a probabilistic histogram are disclosed. An example method comprises partitioning a plurality of ordered data items into a plurality of buckets, each of the data items capable of having a data value from a plurality of possible data values with a probability characterized by a respective individual probability distribution function (PDF), each bucket associated with a respective subset of the ordered data items bounded by a respective beginning data item and a respective ending data item, and determining a first representative PDF for a first bucket associated with a first subset of the ordered data items by partitioning the plurality of possible data values into a first plurality of representative data ranges and respective representative probabilities based on an error between the first representative PDF and a first plurality of individual PDFs characterizing the first subset of the ordered data items.

    摘要翻译: 公开了使用概率直方图表示概率数据的方法和装置。 一种示例性方法包括将多个有序数据项划分成多个桶,每个数据项能够具有来自多个可能数据值的数据值,其特征在于各自的概率分布函数(PDF), 每个桶与由相应的开始数据项和相应的结束数据项限定的有序数据项的相应子集相关联,并且通过分割多个数据项来确定与有序数据项的第一子集相关联的第一个桶的第一代表性PDF 基于第一代表性PDF和表征有序数据项的第一子集的第一多个单独PDF之间的误差,将可能的数据值转换成第一多个代表性数据范围和相应的代表概率。

    Streaming algorithms for robust, real-time detection of DDoS attacks
    10.
    发明授权
    Streaming algorithms for robust, real-time detection of DDoS attacks 有权
    用于强大,实时检测DDoS攻击的流式算法

    公开(公告)号:US07669241B2

    公开(公告)日:2010-02-23

    申请号:US10954901

    申请日:2004-09-30

    IPC分类号: G06F12/14

    摘要: A distinct-count estimate is obtained in a guaranteed small footprint using a two level hash, distinct count sketch. A first hash fills the first-level hash buckets with an exponentially decreasing number of data-elements. These are then uniformly hashed to an array of second-level-hash tables, and have an associated total-element counter and bit-location counters. These counters are used to identify singletons and so provide a distinct-sample and a distinct-count. An estimate of the total distinct-count is obtained by dividing by the distinct-count by the probability of mapping a data-element to that bucket. An estimate of the total distinct-source frequencies of destination address can be found in a similar fashion. By further associating the distinct-count sketch with a list of singletons, a total singleton count and a heap containing the destination addresses ordered by their distinct-source frequencies, a tracking distinct-count sketch may be formed that has considerably improved query time.

    摘要翻译: 使用两级散列,不同的计数草图在保证的小尺寸中获得不同的计数估计。 第一个散列填充了数据元素数量级数下降的第一级哈希桶。 然后将它们均匀地散列到二级哈希表的阵列,并具有关联的全元计数器和位位计数器。 这些计数器用于识别单例,因此提供了不同的样本和不同的数字。 通过将distinct-count除以将数据元素映射到该存储桶的概率,可以获得总区分计数的估计。 可以以类似的方式找到目的地地址的不同源频率的总体估计。 通过进一步将不同数量的草图与单例列表相关联,总共单例数和包含由其不同源频率排​​序的目的地地址的堆,可以形成具有显着改进的查询时间的跟踪不同计划草图。