Fast approximate wavelet tracking on streams
    41.
    发明申请
    Fast approximate wavelet tracking on streams 有权
    在流上快速近似小波跟踪

    公开(公告)号:US20070237410A1

    公开(公告)日:2007-10-11

    申请号:US11389040

    申请日:2006-03-24

    IPC分类号: G06K9/46 G06F17/30

    CPC分类号: G06K9/00516

    摘要: The first fast solution to the problem of tracking wavelet representations of one-dimensional and multi-dimensional data streams based on a stream synopsis, the Group-Count Sketch (GCS) is provided. By imposing a hierarchical structure of groups over the data and applying the GCS, our algorithms can quickly recover the most important wavelet coefficients with guaranteed accuracy. A tradeoff between query time and update time is established, by varying the hierarchical structure of groups, allowing the right balance to be found for specific data streams. Experimental analysis confirmed this tradeoff, and showed that all the methods significantly outperformed previously known methods in terms of both update time and query time, while maintaining a high level of accuracy.

    摘要翻译: 提供了基于流概要的一维和多维数据流的小波表示的问题的第一个快速解决方案,提供了组计数草图(GCS)。 通过在数据上施加组的层次结构并应用GCS,我们的算法可以保证精度快速恢复最重要的小波系数。 通过改变组的层次结构,建立查询时间和更新时间之间的折衷,从而为特定的数据流找到适当的平衡。 实验分析证实了这种权衡,并且表明所有方法在更新时间和查询时间方面都显着优于先前已知的方法,同时保持高水准的准确性。

    Efficient publication of sparse data
    42.
    发明授权
    Efficient publication of sparse data 有权
    有效发布稀疏数据

    公开(公告)号:US09251216B2

    公开(公告)日:2016-02-02

    申请号:US13111154

    申请日:2011-05-19

    IPC分类号: G06F17/30 G06F21/62

    CPC分类号: G06F17/30522 G06F21/6254

    摘要: The present disclosure is directed to systems, methods, and computer-readable storage media for publishing data. A data summary summarizing the data can be generated and published according to several publishing schemes. In some embodiments, non-zero entries are selected and modified and zero entries are sampled according to one or more distribution functions. The sampled and modified values are added to a data summary, or a sample of the sampled and modified values are added to the data summary. The data summary is published, released, used, or otherwise output. In other embodiments, priority values are assigned to each value associated with the data, and a number of entries with the highest values are selected and added to the data summary.

    摘要翻译: 本公开涉及用于发布数据的系统,方法和计算机可读存储介质。 总结数据的数据摘要可以根据多个发布方案生成和发布。 在一些实施例中,选择和修改非零条目,并根据一个或多个分配函数对零条目进行采样。 将采样和修改的值添加到数据摘要中,或将采样和修改的值的样本添加到数据摘要中。 数据摘要已发布,发布,使用或以其他方式输出。 在其他实施例中,将优先级值分配给与数据相关联的每个值,并且选择具有最高值的多个条目并将其添加到数据摘要。

    Generating minimality-attack-resistant data
    43.
    发明授权
    Generating minimality-attack-resistant data 有权
    生成最低限度的抗攻击数据

    公开(公告)号:US08631500B2

    公开(公告)日:2014-01-14

    申请号:US12825466

    申请日:2010-06-29

    IPC分类号: G06F21/00

    CPC分类号: G06F21/6254

    摘要: The present disclosure is directed to systems, methods, and computer-readable storage media for generating data and data sets that are resistant to minimality attacks. Data sets having a number of tuples are received, and the tuples are ordered according to an aspect of the tuples. The tuples can be split into groups of tuples, and each of the groups may be analyzed to determine if the group complies with a privacy requirement. Groups that satisfy the privacy requirement may be output as new data sets that are resistant to minimality attacks.

    摘要翻译: 本公开涉及用于生成抵御最低限度攻击的数据和数据集的系统,方法和计算机可读存储介质。 接收到具有多个元组的数据集,并且元组根据元组的一个方面被排序。 元组可以分为组元组,并且可以分析每个组以确定组是否符合隐私要求。 满足隐私要求的组可以作为抵抗最低限度攻击的新数据集输出。

    Forward decay temporal data analysis
    44.
    发明授权
    Forward decay temporal data analysis 有权
    正向衰减时间数据分析

    公开(公告)号:US08595194B2

    公开(公告)日:2013-11-26

    申请号:US12560214

    申请日:2009-09-15

    IPC分类号: G06F17/30

    摘要: A disclosed method for implementing time decay in the analysis of streaming data objects is based on the age, referred to herein as the forward age, of a data object measured from a landmark time in the past to a time associated with the occurrence of the data object, e.g., an object's timestamp. A forward time decay function is parameterized on the forward age. Because a data object's forward age does not depend on the current time, a value of the forward time decay function is determined just once for each data object. A scaling factor or weight associated with a data object may be weighted according to its decay function value. Forward time decay functions are beneficial in determining decayed aggregates, including decayed counts, sums, and averages, decayed minimums and maximums, and for drawing decay-influenced samples.

    摘要翻译: 用于在流数据对象的分析中实现时间衰减的公开方法基于从过去的地标时间测量到与数据的出现相关联的时间的数据对象的年龄(这里称为远期时间) 对象,例如对象的时间戳。 前进时间衰减函数在前进时间参数化。 因为数据对象的转发时间不依赖于当前时间,因此对于每个数据对象仅确定一次正向时间衰减函数的值。 可以根据其衰减函数值对与数据对象相关联的缩放因子或权重进行加权。 前向时间衰减函数有助于确定衰变的聚集体,包括衰变计数,总和和平均值,衰减最小值和最大值,以及绘制衰变影响样本。

    Interactive proof to validate outsourced data stream processing
    45.
    发明授权
    Interactive proof to validate outsourced data stream processing 有权
    验证外包数据流处理的互动证明

    公开(公告)号:US08538938B2

    公开(公告)日:2013-09-17

    申请号:US12959063

    申请日:2010-12-02

    申请人: Graham Cormode Ke Yi

    发明人: Graham Cormode Ke Yi

    CPC分类号: G06F17/30516

    摘要: A method for validating outsourced processing of a data stream arriving at a streaming data warehouse of a data service provider includes a proof protocol. A verifier acting on behalf of a data owner of the data stream may interact with a prover acting on behalf of the data service provider. The verifier may calculate a first root hash value of a binary tree during single-pass processing of the original data stream with limited computational effort. A second root hash value may be calculated using the proof protocol between the verifier and the prover. The prover is requested to provide certain queried values before receiving random numbers used to generate subsequent responses dependent on the provided values. The proof protocol may be used to validate the data processing performed by the data service provider.

    摘要翻译: 用于验证到达数据服务提供商的流数据仓库的数据流的外包处理的方法包括验证协议。 代表数据流的数据所有者的验证者可以与代表数据服务提供者的证明者交互。 验证者可以在计算量有限的原始数据流的单次处理期间计算二叉树的第一根哈希值。 可以使用验证者和证明者之间的验证协议来计算第二个根哈希值。 要求证明者提供某些查询值,然后才能接收随机数字,用于根据提供的值产生后续响应。 验证协议可以用于验证由数据服务提供商执行的数据处理。

    Methods and apparatus to construct histogram and wavelet synopses for probabilistic data
    46.
    发明授权
    Methods and apparatus to construct histogram and wavelet synopses for probabilistic data 有权
    用于构建概率数据的直方图和小波概要的方法和装置

    公开(公告)号:US08386412B2

    公开(公告)日:2013-02-26

    申请号:US12334264

    申请日:2008-12-12

    IPC分类号: G06F9/44 G06N7/02 G06N7/06

    摘要: Example methods and apparatus to construct histogram and wavelet synopses for probabilistic data are disclosed. A disclosed example method involves receiving probabilistic data associated with probability measures and generating a plurality of histograms based on the probabilistic data. Each histogram is generated based on items represented by the probabilistic data. In addition, each histogram is generated using a different quantity of buckets containing different ones of the items. An error measure associated with each of the plurality of histograms is determined and one of the plurality of histograms is selected based on its associated error measure. The method also involves displaying parameter information associated with the one of the plurality of histograms to represent the data.

    摘要翻译: 公开了构建用于概率数据的直方图和小波概要的示例方法和装置。 所公开的示例性方法包括接收与概率测量相关联的概率数据,并且基于概率数据生成多个直方图。 基于由概率数据表示的项目生成每个直方图。 此外,使用不同数量的包含不同项目的桶来生成每个直方图。 确定与多个直方图中的每一个相关联的误差测量,并且基于其相关联的误差测量来选择多个直方图中的一个。 该方法还涉及显示与多个直方图之一相关联的参数信息以表示数据。

    VALIDATION OF PRIORITY QUEUE PROCESSING
    47.
    发明申请
    VALIDATION OF PRIORITY QUEUE PROCESSING 有权
    验证优先级队列处理

    公开(公告)号:US20120159500A1

    公开(公告)日:2012-06-21

    申请号:US12971913

    申请日:2010-12-17

    IPC分类号: G06F9/46

    CPC分类号: G06F11/3688

    摘要: A method for validating outsourced processing of a priority queue includes configuring a verifier for independent, single-pass processing of priority queue operations that include insertion operations and extraction operations and priorities associated with each operation. The verifier may be configured to validate N operations using a memory space having a size that is proportional to the square root of N using an algorithm to buffer the operations as a series of R epochs. Extractions associated with each individual epoch may be monitored using arrays Y and Z. Insertions for the epoch k may monitored using arrays X and Z. The processing of the priority queue operations may be verified based on the equality or inequality of the arrays X, Y, and Z. Hashed values for the arrays may be used to test their equality to conserve storage requirements.

    摘要翻译: 用于验证优先级队列的外包处理的方法包括配置用于对包括插入操作和提取操作以及与每个操作相关联的优先级的优先级队列操作进行独立,单程处理的验证器。 验证器可以被配置为使用具有与N的平方根成比例的大小的存储器空间来验证N个操作,使用算法将该操作缓冲为一系列R个时期。 可以使用阵列Y和Z来监视与每个单个时期相关联的抽取。可以使用阵列X和Z监视历元k的插入。可以基于阵列X,Y的相等或不等式来验证优先级队列操作的处理 ,并且Z.阵列的哈希值可以用于测试它们的相等性以节省存储要求。

    INTERACTIVE PROOF TO VALIDATE OUTSOURCED DATA STREAM PROCESSING
    48.
    发明申请
    INTERACTIVE PROOF TO VALIDATE OUTSOURCED DATA STREAM PROCESSING 有权
    验证外部数据流处理的交互性证明

    公开(公告)号:US20120143830A1

    公开(公告)日:2012-06-07

    申请号:US12959063

    申请日:2010-12-02

    申请人: Graham Cormode Ke Yi

    发明人: Graham Cormode Ke Yi

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30516

    摘要: A method for validating outsourced processing of a data stream arriving at a streaming data warehouse of a data service provider includes a proof protocol. A verifier acting on behalf of a data owner of the data stream may interact with a prover acting on behalf of the data service provider. The verifier may calculate a first root hash value of a binary tree during single-pass processing of the original data stream with limited computational effort. A second root hash value may be calculated using the proof protocol between the verifier and the prover. The prover is requested to provide certain queried values before receiving random numbers used to generate subsequent responses dependent on the provided values. The proof protocol may be used to validate the data processing performed by the data service provider.

    摘要翻译: 用于验证到达数据服务提供商的流数据仓库的数据流的外包处理的方法包括验证协议。 代表数据流的数据所有者的验证者可以与代表数据服务提供者的证明者交互。 验证者可以在计算量有限的原始数据流的单次处理期间计算二叉树的第一根哈希值。 可以使用验证者和证明者之间的验证协议来计算第二个根哈希值。 要求证明者提供某些查询值,然后才能接收随机数字,用于根据提供的值产生后续响应。 验证协议可以用于验证由数据服务提供商执行的数据处理。

    METHOD AND APPARATUS FOR PROVIDING ANONYMIZATION OF DATA
    49.
    发明申请
    METHOD AND APPARATUS FOR PROVIDING ANONYMIZATION OF DATA 有权
    提供数据解密的方法和装置

    公开(公告)号:US20110041184A1

    公开(公告)日:2011-02-17

    申请号:US12542173

    申请日:2009-08-17

    IPC分类号: G06F21/24 G06F17/30

    CPC分类号: G06F21/6254

    摘要: A method and apparatus for providing an anonymization of data are disclosed. For example, the method receives a request for anonymizing, wherein the request comprises a bipartite graph for a plurality of associations or a table that encodes the plurality of associations for the bipartite graph. The method places each node in the bipartite graph in a safe group and provides an anonymized graph that encodes the plurality of associations of the bipartite graph, if a safe group for all nodes of the bipartite graph is found.

    摘要翻译: 公开了一种用于提供数据匿名化的方法和装置。 例如,该方法接收匿名请求,其中该请求包括用于多个关联的二分图或编码该二分图的多个关联的表。 该方法将两部分图中的每个节点放置在安全组中,并提供一个匿名图,编码两部分图的多个关联,如果发现了二分图中所有节点的安全组。

    METHOD AND APPARATUS FOR MONITORING FUNCTIONS OF DISTRIBUTED DATA
    50.
    发明申请
    METHOD AND APPARATUS FOR MONITORING FUNCTIONS OF DISTRIBUTED DATA 有权
    用于监测分布式数据的功能的方法和装置

    公开(公告)号:US20100312872A1

    公开(公告)日:2010-12-09

    申请号:US11963005

    申请日:2007-12-21

    申请人: Graham Cormode Ke Yi

    发明人: Graham Cormode Ke Yi

    IPC分类号: G06F15/173

    CPC分类号: H04L43/024 H04L43/16

    摘要: This invention discloses continuous functional monitoring of distributed network activity using algorithms based on frequency moment calculations given by Fp=Σimip. The frequency moment calculations are used to raise an alarm when a value exceeds a certain threshold. Frequency moments for p=0, 1, and 2 are described.

    摘要翻译: 本发明公开了基于Fp =&Sgr; imip给出的频率矩计算的算法的分布式网络活动的连续功能监控。 当频率值超过某个阈值时,频率矩计算用于提高警报。 描述p = 0,1和2的频率矩。