Method and apparatus for globally approximating quantiles in a distributed monitoring environment
    1.
    发明授权
    Method and apparatus for globally approximating quantiles in a distributed monitoring environment 有权
    用于在分布式监控环境中全局近似分位数的方法和装置

    公开(公告)号:US07783647B2

    公开(公告)日:2010-08-24

    申请号:US11301387

    申请日:2005-12-13

    IPC分类号: G06F17/30

    摘要: The invention comprises a method and apparatus for determining a rank of a query value. Specifically, the method comprises receiving a rank query request, determining, for each of the at least one remote monitor, a predicted lower-bound rank value and upper-bound rank value, wherein the predicted lower-bound rank value and upper-bound rank value are determined according to at least one respective prediction model used by each of the at least one remote monitor to compute the at least one local quantile summary, computing a predicted average rank value for each of the at least one remote monitor using the at least one predicted lower-bound rank value and the at least one predicted upper-bound rank value associated with the respective at least one remote monitor, and computing the rank of the query value using the at least one predicted average rank value associated with the respective at least one remote monitor.

    摘要翻译: 本发明包括一种用于确定查询值的等级的方法和装置。 具体地说,该方法包括:接收秩查询请求,为所述至少一个远程监视器中的每一个确定预测的下限秩值和上限秩值,其中预测的下限秩值和上限秩 根据由所述至少一个远程监视器中的每一个使用的至少一个相应的预测模型来确定所述值,以计算所述至少一个本地分位数概要,使用所述至少一个远程监视器至少计算所述至少一个远程监视器中的每一个的预测平均等级值 一个预测的下限秩值和与相应的至少一个远程监视器相关联的至少一个预测的上限秩值,以及使用与各自的至少一个远程监视器相关联的至少一个预测平均等级值来计算查询值的等级 至少一个远程监视器。

    Fast approximate wavelet tracking on streams
    2.
    发明授权
    Fast approximate wavelet tracking on streams 有权
    在流上快速近似小波跟踪

    公开(公告)号:US07885911B2

    公开(公告)日:2011-02-08

    申请号:US11389040

    申请日:2006-03-24

    IPC分类号: G06F17/00 G06N5/02 G06F15/18

    CPC分类号: G06K9/00516

    摘要: The first fast solution to the problem of tracking wavelet representations of one-dimensional and multi-dimensional data streams based on a stream synopsis, the Group-Count Sketch (GCS) is provided. By imposing a hierarchical structure of groups over the data and applying the GCS, our algorithms can quickly recover the most important wavelet coefficients with guaranteed accuracy. A tradeoff between query time and update time is established, by varying the hierarchical structure of groups, allowing the right balance to be found for specific data streams. Experimental analysis confirmed this tradeoff, and showed that all the methods significantly outperformed previously known methods in terms of both update time and query time, while maintaining a high level of accuracy.

    摘要翻译: 提供了基于流概要的一维和多维数据流的小波表示的问题的第一个快速解决方案,提供了组计数草图(GCS)。 通过在数据上施加组的层次结构并应用GCS,我们的算法可以保证精度快速恢复最重要的小波系数。 通过改变组的层次结构,建立查询时间和更新时间之间的折衷,从而为特定的数据流找到适当的平衡。 实验分析证实了这种权衡,并且表明所有方法在更新时间和查询时间方面都显着优于先前已知的方法,同时保持高水准的准确性。

    Method for distributed tracking of approximate join size and related summaries
    3.
    发明授权
    Method for distributed tracking of approximate join size and related summaries 有权
    分布式跟踪连接大小和相关摘要的方法

    公开(公告)号:US07756805B2

    公开(公告)日:2010-07-13

    申请号:US11392440

    申请日:2006-03-29

    IPC分类号: G06F17/00 G06N5/02

    摘要: A method of distributed approximate query tracking relies on tracking general-purpose randomized sketch summaries of local streams at remote sites along with concise prediction models of local site behavior in order to produce highly communication-efficient and space/time-efficient solutions. A powerful approximate query tracking framework readily incorporates several complex analysis queries, including distributed join and multi-join aggregates and approximate wavelet representations, thus giving the first known low-overhead tracking solution for such queries in the distributed-streams model.

    摘要翻译: 分布式近似查询跟踪的方法依赖于跟踪远程站点的本地流的通用随机草图摘要以及本地站点行为的简洁预测模型,以生成高通信效率和空间/时间效率的解决方案。 强大的近似查询跟踪框架容易地并入了多个复杂的分析查询,包括分布式连接和多连接聚合以及近似小波表示,从而为分布式流模型中的这种查询提供了第一个已知的低开销跟踪解决方案。

    Methods and apparatus to anonymize a dataset of spatial data
    4.
    发明授权
    Methods and apparatus to anonymize a dataset of spatial data 有权
    对空间数据数据集进行匿名化的方法和装置

    公开(公告)号:US08627488B2

    公开(公告)日:2014-01-07

    申请号:US13311388

    申请日:2011-12-05

    摘要: Methods and apparatus are disclosed to anonymize a dataset of spatial data. An example method includes generating a spatial indexing structure with spatial data, establishing a height value associated with the spatial indexing structure to generate a plurality of tree nodes, each of the plurality of tree nodes associated with spatial data counts, calculating a localized noise budget value for respective ones of the tree nodes based on the height value and an overall noise budget, and anonymizing the plurality of tree nodes with a anonymization process, the anonymization process using the localized noise budget value for respective ones of the tree nodes.

    摘要翻译: 公开了匿名化空间数据的数据集的方法和装置。 示例性方法包括:利用空间数据生成空间索引结构,建立与空间索引结构相关联的高度值,以生成多个树节点,多个树节点中的每一个与空间数据计数相关联,计算局部噪声预算值 基于所述高度值和总体噪声预算对所述树节点中的相应树节点进行匿名处理,并且使用所述树节点中的相应树节点的所述局部噪声预算值对所述多个树节点进行匿名化。

    COMMUNICATION-EFFICIENT DISTRIBUTED MONITORING OF THRESHOLDED COUNTS
    5.
    发明申请
    COMMUNICATION-EFFICIENT DISTRIBUTED MONITORING OF THRESHOLDED COUNTS 有权
    通讯高效的分布式监测

    公开(公告)号:US20070286071A1

    公开(公告)日:2007-12-13

    申请号:US11423322

    申请日:2006-06-09

    IPC分类号: H04L12/26

    CPC分类号: H04L43/16 H04L43/0894

    摘要: A system, method, and computer program product for distributed monitoring of local thresholds at each of a number of monitoring nodes and initiating communication only after the locally observed data exceeds the local threshold. Both static thresholds and adaptive thresholds are considered. In the static case, a combination of two alternate strategies for considering thresholds minimizes communication overhead. In the adaptive case, local thresholds are adjusted based on the observed distributions of updated information in the distributed monitoring system. Both approaches yield significant savings over the naïve approach of performing processing at a centralized location.

    摘要翻译: 一种系统,方法和计算机程序产品,用于在多个监视节点中的每一个上分布式监视本地阈值,并且仅在本地观察数据超过本地阈值后启动通信。 考虑静态阈值和自适应阈值。 在静态情况下,考虑阈值的两种替代策略的组合可最大限度地减少通信开销。 在自适应情况下,基于分布式监控系统中观察到的更新信息分布来调整局部阈值。 这两种方法比在中央位置执行处理的朴素方法产生了显着的节省。

    METHODS AND APPARATUS TO ANONYMIZE A DATASET OF SPATIAL DATA
    6.
    发明申请
    METHODS AND APPARATUS TO ANONYMIZE A DATASET OF SPATIAL DATA 有权
    分析空间数据数据的方法和装置

    公开(公告)号:US20130145473A1

    公开(公告)日:2013-06-06

    申请号:US13311388

    申请日:2011-12-05

    IPC分类号: G06F17/30 G06F21/24

    摘要: Methods and apparatus are disclosed to anonymize a dataset of spatial data. An example method includes generating a spatial indexing structure with spatial data, establishing a height value associated with the spatial indexing structure to generate a plurality of tree nodes, each of the plurality of tree nodes associated with spatial data counts, calculating a localized noise budget value for respective ones of the tree nodes based on the height value and an overall noise budget, and anonymizing the plurality of tree nodes with a anonymization process, the anonymization process using the localized noise budget value for respective ones of the tree nodes.

    摘要翻译: 公开了匿名化空间数据的数据集的方法和装置。 示例性方法包括:利用空间数据生成空间索引结构,建立与空间索引结构相关联的高度值,以生成多个树节点,多个树节点中的每一个与空间数据计数相关联,计算局部噪声预算值 基于所述高度值和总体噪声预算对所述树节点中的相应树节点进行匿名处理,以及使用所述树节点中的相应树节点的所述局部噪声预算值对所述多个树节点进行匿名化。

    Communication-efficient distributed monitoring of thresholded counts
    7.
    发明授权
    Communication-efficient distributed monitoring of thresholded counts 有权
    通信高效的分布式监控阈值计数

    公开(公告)号:US07742424B2

    公开(公告)日:2010-06-22

    申请号:US11423322

    申请日:2006-06-09

    IPC分类号: G01R31/08

    CPC分类号: H04L43/16 H04L43/0894

    摘要: A system, method, and computer program product for distributed monitoring of local thresholds at each of a number of monitoring nodes and initiating communication only after the locally observed data exceeds the local threshold. Both static thresholds and adaptive thresholds are considered. In the static case, a combination of two alternate strategies for considering thresholds minimizes communication overhead. In the adaptive case, local thresholds are adjusted based on the observed distributions of updated information in the distributed monitoring system. Both approaches yield significant savings over the naïve approach of performing processing at a centralized location.

    摘要翻译: 一种系统,方法和计算机程序产品,用于在多个监视节点中的每一个上分布式监视本地阈值,并且仅在本地观察数据超过本地阈值后启动通信。 考虑静态阈值和自适应阈值。 在静态情况下,考虑阈值的两种替代策略的组合可最大限度地减少通信开销。 在自适应情况下,基于分布式监控系统中观察到的更新信息分布来调整局部阈值。 这两种方法比在中央位置执行处理的朴素方法产生了显着的节省。