Systems and methods for rights protection of datasets with dataset structure preservation
    1.
    发明授权
    Systems and methods for rights protection of datasets with dataset structure preservation 有权
    使用数据集结构保护的数据集权利保护的系统和方法

    公开(公告)号:US08769293B2

    公开(公告)日:2014-07-01

    申请号:US11867826

    申请日:2007-10-05

    IPC分类号: G06F17/30

    CPC分类号: G06F21/16

    摘要: A system and method for rights protection of a dataset that includes multiple trajectory objects includes determining an intensity power for embedding a watermarking key in a data trajectory. The data trajectory is modified to embed a watermarking key at the intensity power such that the intensity power guarantees an original pair-wise relationship between distance-based neighboring objects before and after embedding of the key such that a modified trajectory provides a watermarked version of the data trajectory.

    摘要翻译: 包括多个轨迹对象的数据集的权利保护的系统和方法包括确定用于将数字轨迹中嵌入水印密钥的强度功率。 修改数据轨迹以将水印密钥嵌入强度功率,使得强度功率保证在嵌入密钥之前和之后的基于距离的相邻对象之间的原始成对关系,使得修改的轨迹提供水印版本的 数据轨迹。

    SYSTEMS AND METHODS FOR COMPUTATION OF OPTIMAL DISTANCE BOUNDS ON COMPRESSED TIME-SERIES DATA
    2.
    发明申请
    SYSTEMS AND METHODS FOR COMPUTATION OF OPTIMAL DISTANCE BOUNDS ON COMPRESSED TIME-SERIES DATA 有权
    用于计算压缩时间序列数据的最佳距离边界的系统和方法

    公开(公告)号:US20090204574A1

    公开(公告)日:2009-08-13

    申请号:US12027294

    申请日:2008-02-07

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30548 G06F2216/03

    摘要: There are provided a method and a system for computation of optimal distance bounds on compressed time-series data. In a method for similarity search, the method includes the step of transforming sequence data into a compressed sequence represented by top-k coefficients of the sequence data and a sum of the energy of omitted coefficients of the sequence data. The method further includes the step of computing at least one of a lower bound and an upper bound on a distance range between a query sequence and the compressed sequence, given a first and a second constraint. The first constraint is that a sum of squares of the omitted coefficients is less than a sum of the energy of the omitted coefficients. The second constraint is that the energy of the omitted coefficients is less than the energy of a lowest energy one of the top-k coefficients.

    摘要翻译: 提供了一种用于在压缩时间序列数据上计算最佳距离界限的方法和系统。 在相似搜索的方法中,该方法包括将序列数据变换为由序列数据的顶部k个系数表示的压缩序列和序列数据的省略系数的能量之和的步骤。 该方法还包括在给定第一和第二约束的情况下,计算查询序列和压缩序列之间的距离范围上的下限和上限中的至少一个的步骤。 第一个约束是省略的系数的平方和小于所省略的系数的能量之和。 第二个约束是省略的系数的能量小于顶部k系数中最低能量的能量。

    SYSTEMS AND METHODS FOR RIGHTS PROTECTION OF DATASETS WITH DATASET STRUCTURE PRESERVATION
    3.
    发明申请
    SYSTEMS AND METHODS FOR RIGHTS PROTECTION OF DATASETS WITH DATASET STRUCTURE PRESERVATION 有权
    使用数据库结构保存的数据保护的权利和方法

    公开(公告)号:US20090094265A1

    公开(公告)日:2009-04-09

    申请号:US11867826

    申请日:2007-10-05

    IPC分类号: G06F17/30

    CPC分类号: G06F21/16

    摘要: A system and method for rights protection of a dataset that includes multiple trajectory objects includes determining an intensity power for embedding a watermarking key in a data trajectory. The data trajectory is modified to embed a watermarking key at the intensity power such that the intensity power guarantees an original pair-wise relationship between distance-based neighboring objects before and after embedding of the key such that a modified trajectory provides a watermarked version of the data trajectory.

    摘要翻译: 包括多个轨迹对象的数据集的权利保护的系统和方法包括确定用于将数字轨迹中嵌入水印密钥的强度功率。 修改数据轨迹以将水印密钥嵌入强度功率,使得强度功率保证在嵌入密钥之前和之后的基于距离的相邻对象之间的原始成对关系,使得修改的轨迹提供水印版本的 数据轨迹。

    Systems and methods for computation of optimal distance bounds on compressed time-series data
    4.
    发明授权
    Systems and methods for computation of optimal distance bounds on compressed time-series data 有权
    用于计算压缩时间序列数据的最佳距离界限的系统和方法

    公开(公告)号:US07882126B2

    公开(公告)日:2011-02-01

    申请号:US12027294

    申请日:2008-02-07

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30548 G06F2216/03

    摘要: There are provided a method and a system for computation of optimal distance bounds on compressed time-series data. In a method for similarity search, the method includes the step of transforming sequence data into a compressed sequence represented by top-k coefficients of the sequence data and a sum of the energy of omitted coefficients of the sequence data. The method further includes the step of computing at least one of a lower bound and an upper bound on a distance range between a query sequence and the compressed sequence, given a first and a second constraint. The first constraint is that a sum of squares of the omitted coefficients is less than a sum of the energy of the omitted coefficients. The second constraint is that the energy of the omitted coefficients is less than the energy of a lowest energy one of the top-k coefficients.

    摘要翻译: 提供了一种用于在压缩时间序列数据上计算最佳距离界限的方法和系统。 在相似搜索的方法中,该方法包括将序列数据变换为由序列数据的顶部k个系数表示的压缩序列和序列数据的省略系数的能量之和的步骤。 该方法还包括在给定第一和第二约束的情况下,计算查询序列和压缩序列之间的距离范围上的下限和上限中的至少一个的步骤。 第一个约束是省略的系数的平方和小于所省略的系数的能量之和。 第二个约束是省略的系数的能量小于顶部k系数中最低能量的能量。

    Methods and apparatus for data stream clustering for abnormality monitoring
    5.
    发明授权
    Methods and apparatus for data stream clustering for abnormality monitoring 有权
    用于异常监测的数据流聚类的方法和装置

    公开(公告)号:US07970772B2

    公开(公告)日:2011-06-28

    申请号:US11753232

    申请日:2007-05-24

    IPC分类号: G06F7/00 G06F17/30 G06F15/16

    CPC分类号: G06K9/6284 Y10S707/952

    摘要: Techniques for monitoring abnormalities in a data stream are provided. A plurality of objects are received from the data stream and one or more clusters are created from these objects. At least a portion of the one or more clusters have statistical data of the respective cluster. It is determined from the statistical data whether one or more abnormalities exist in the data stream.

    摘要翻译: 提供了用于监视数据流异常的技术。 从数据流接收多个对象,并从这些对象创建一个或多个聚类。 一个或多个集群的至少一部分具有相应集群的统计数据。 从统计数据确定数据流中是否存在一个或多个异常。

    Method and apparatus for query processing of uncertain data
    6.
    发明授权
    Method and apparatus for query processing of uncertain data 有权
    不确定性数据查询处理方法与装置

    公开(公告)号:US07917517B2

    公开(公告)日:2011-03-29

    申请号:US12039091

    申请日:2008-02-28

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30657

    摘要: Techniques are disclosed for indexing uncertain data in query processing systems. For example, a method for processing queries in an application that involves an uncertain data set includes the following steps. A representation of records of the uncertain data set is created based on mean values and uncertainty values. The representation is utilized for processing a query received on the uncertain data set.

    摘要翻译: 公开了用于在查询处理系统中索引不确定数据的技术。 例如,在涉及不确定数据集的应用程序中处理查询的方法包括以下步骤。 基于平均值和不确定性值创建不确定数据集的记录表示。 该表示用于处理在不确定数据集上接收到的查询。

    Method and Apparatus for Variable Privacy Preservation in Data Mining
    7.
    发明申请
    Method and Apparatus for Variable Privacy Preservation in Data Mining 失效
    数据挖掘中可变隐藏保护的方法和装置

    公开(公告)号:US20090319526A1

    公开(公告)日:2009-12-24

    申请号:US12119766

    申请日:2008-05-13

    IPC分类号: G06F17/30

    摘要: Improved privacy preservation techniques are disclosed for use in accordance with data mining. By way of example, a technique for preserving privacy of data records for use in a data mining application comprises the following steps/operations. Different privacy levels are assigned to the data records. Condensed groups are constructed from the data records based on the privacy levels, wherein summary statistics are maintained for each condensed group. Pseudo-data is generated from the summary statistics, wherein the pseudo-data is available for use in the data mining application. Principles of the invention are capable of handling both static and dynamic data sets

    摘要翻译: 公开了根据数据挖掘使用的改进的隐私保护技术。 作为示例,用于保留用于数据挖掘应用的数据记录的隐私的技术包括以下步骤/操作。 不同的隐私级别被分配给数据记录。 基于隐私级别的数据记录构建简化组,其中为每个缩合组维护概要统计。 从总结统计生成伪数据,其中伪数据可用于数据挖掘应用程序。 本发明的原理能够处理静态和动态数据集

    Method and Apparatus for Aggregation in Uncertain Data
    8.
    发明申请
    Method and Apparatus for Aggregation in Uncertain Data 有权
    不确定数据聚合的方法和装置

    公开(公告)号:US20090222472A1

    公开(公告)日:2009-09-03

    申请号:US12039076

    申请日:2008-02-28

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30489

    摘要: Techniques are disclosed for aggregation in uncertain data in data processing systems. For example, a method of aggregation in an application that involves an uncertain data set includes the following steps. The uncertain data set along with uncertainty information is obtained. One or more clusters of data points are constructed from the data set. Aggregate statistics of the one or more clusters and uncertainty information are stored. The data set may be data from a data stream. It is realized that the use of even modest uncertainty information during an application such as a data mining process is sufficient to greatly improve the quality of the underlying results.

    摘要翻译: 公开了用于在数据处理系统中的不确定数据中聚合的技术。 例如,涉及不确定数据集的应用程序中的聚合方法包括以下步骤。 获得不确定性数据集以及不确定性信息。 从数据集构建一个或多个数据点簇。 存储一个或多个聚类和不确定性信息的聚合统计信息。 数据集可以是来自数据流的数据。 实现在诸如数据挖掘过程的应用中使用甚至适度的不确定性信息足以大大提高底层结果的质量。

    Method and Apparatus for Query Processing of Uncertain Data
    9.
    发明申请
    Method and Apparatus for Query Processing of Uncertain Data 有权
    不确定数据查询处理方法与装置

    公开(公告)号:US20090222410A1

    公开(公告)日:2009-09-03

    申请号:US12039091

    申请日:2008-02-28

    IPC分类号: G06F7/06 G06F17/30

    CPC分类号: G06F17/30657

    摘要: Techniques are disclosed for indexing uncertain data in query processing systems. For example, a method for processing queries in an application that involves an uncertain data set includes the following steps. A representation of records of the uncertain data set is created based on mean values and uncertainty values. The representation is utilized for processing a query received on the uncertain data set.

    摘要翻译: 公开了用于在查询处理系统中索引不确定数据的技术。 例如,在涉及不确定数据集的应用程序中处理查询的方法包括以下步骤。 基于平均值和不确定性值创建不确定数据集的记录表示。 该表示用于处理在不确定数据集上接收到的查询。

    Methods and Apparatus for Perturbing an Evolving Data Stream for Time Series Compressibility and Privacy
    10.
    发明申请
    Methods and Apparatus for Perturbing an Evolving Data Stream for Time Series Compressibility and Privacy 有权
    扰动不断发展的数据流的时间序列压缩和隐私的方法和装置

    公开(公告)号:US20090077148A1

    公开(公告)日:2009-03-19

    申请号:US11855378

    申请日:2007-09-14

    IPC分类号: G06F17/10 G06F17/14

    摘要: Techniques for perturbing an evolving data stream are provided. The evolving data stream is received. An online linear transformation is applied to received values of the evolving data stream generating a plurality of transform coefficients. A plurality of significant transform coefficients are selected from the plurality of transform coefficients. Noise is embedded into each of the plurality of significant transform coefficients, thereby perturbing the evolving data stream. A total noise variance does not exceed a defined noise variance threshold.

    摘要翻译: 提供了扰乱演进数据流的技术。 收到不断发展的数据流。 在线线性变换被应用于产生多个变换系数的演进数据流的接收值。 从多个变换系数中选择多个有效变换系数。 噪声嵌入到多个有效变换系数中的每一个中,从而扰乱演进数据流。 总噪声方差不超过定义的噪声方差阈值。