Using index partitioning and reconciliation for data deduplication
    1.
    Granted patent
    Using index partitioning and reconciliation for data deduplication (in force)

    Publication No.: US09110936B2

    Publication date: 2015-08-18

    Application No.: US12979748

    Filing date: 2010-12-28

    Abstract: The subject disclosure is directed towards a data deduplication technology in which a hash index service's index is partitioned into subspace indexes, with less than the entire hash index service's index cached to save memory. The subspace index is accessed to determine whether a data chunk already exists or needs to be indexed and stored. The index may be divided into subspaces based on criteria associated with the data to index, such as file type, data type, time of last usage, and so on. Also described is subspace reconciliation, in which duplicate entries in subspaces are detected so as to remove entries and chunks from the deduplication system. Subspace reconciliation may be performed at off-peak time, when more system resources are available, and may be interrupted if resources are needed. Subspaces to reconcile may be based on similarity, including via similarity of signatures that each compactly represents the subspace's hashes.
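
The lookup path described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class name `SubspaceIndexService`, the `max_cached` parameter, the LRU eviction rule, and the use of SHA-256 are all assumptions made for illustration, and the `stored` dictionary merely stands in for subspace indexes kept on disk.

```python
import hashlib
from collections import OrderedDict


class SubspaceIndexService:
    """Hypothetical hash index split into subspaces, with only a few cached."""

    def __init__(self, max_cached=2):
        self.stored = {}            # subspace key -> {hash: chunk location}; stands in for disk
        self.cache = OrderedDict()  # subspace indexes currently held in memory (LRU order)
        self.max_cached = max_cached
        self.loads = 0              # how many times a subspace had to be loaded

    def _load(self, key):
        """Return the index for one subspace, loading and evicting as needed."""
        if key in self.cache:
            self.cache.move_to_end(key)
            return self.cache[key]
        self.loads += 1
        index = self.stored.setdefault(key, {})
        self.cache[key] = index
        if len(self.cache) > self.max_cached:
            self.cache.popitem(last=False)  # evict the least recently used subspace
        return index

    def add_chunk(self, subspace_key, chunk):
        """Return True if the chunk was newly indexed, False if already present."""
        digest = hashlib.sha256(chunk).hexdigest()
        index = self._load(subspace_key)
        if digest in index:
            return False
        index[digest] = f"{subspace_key}/{len(index)}"  # placeholder chunk location
        return True


svc = SubspaceIndexService(max_cached=1)
first = svc.add_chunk("docs", b"hello")    # True: new chunk in the "docs" subspace
second = svc.add_chunk("docs", b"hello")   # False: duplicate found in the same subspace
other = svc.add_chunk("media", b"hello")   # True: a different subspace is not consulted
```

In this sketch the subspace key would come from a criterion such as file type. The third call shows why reconciliation is needed: the same chunk ends up indexed in two subspaces because only one subspace is checked per lookup.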

    Data Synchronization Policies
    2.
    Patent application
    Data Synchronization Policies (in force)

    Publication No.: US20130064336A1

    Publication date: 2013-03-14

    Application No.: US13229527

    Filing date: 2011-09-09

    CPC classification number: G06F17/30176

    Abstract: Techniques for data synchronization policies are described. In one or more implementations, techniques may be employed to set data synchronization (“sync”) policies for devices in a data sync environment. The sync policies specify parameters for sync operations in the sync environment, such as how frequently data sync operations are performed, what types of data are synced to particular devices, how frequently particular types of data are synced, and so on. In implementations, the sync policies consider the number of devices that are participating in a sync environment and attributes of the devices in specifying parameters for sync operations. Data can be synchronized among devices in the sync environment based on the sync policies.
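
As a rough illustration of how a policy could depend on the number of participating devices and on device attributes, here is a minimal Python sketch. Every name and rule in it is an assumption made for this example (`Device`, `SyncPolicy`, `build_policies`, the battery and metered-network attributes, the interval scaling, the data type names); the abstract does not specify any of them.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Device:
    name: str
    on_battery: bool        # assumed attribute
    metered_network: bool   # assumed attribute


@dataclass
class SyncPolicy:
    interval_minutes: int   # how frequently sync operations run
    data_types: List[str]   # which types of data are synced to the device


def build_policies(devices: List[Device], base_interval: int = 15) -> Dict[str, SyncPolicy]:
    """Derive one policy per device from the device count and device attributes."""
    # Assumed rule: the more devices participate, the less often each one syncs.
    interval = base_interval * max(1, len(devices))
    policies = {}
    for device in devices:
        types = ["settings", "documents"]
        if not device.metered_network:
            types.append("photos")  # assumed rule: large data only on unmetered links
        # Assumed rule: battery-powered devices sync half as often.
        per_device = interval * 2 if device.on_battery else interval
        policies[device.name] = SyncPolicy(per_device, types)
    return policies


policies = build_policies([
    Device("laptop", on_battery=False, metered_network=False),
    Device("phone", on_battery=True, metered_network=True),
])
# policies["laptop"] syncs every 30 minutes and includes photos;
# policies["phone"] syncs every 60 minutes and skips photos.
```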

    Data synchronization policies
    4.
    Granted patent
    Data synchronization policies (in force)

    Publication No.: US09449016B2

    Publication date: 2016-09-20

    Application No.: US13229527

    Filing date: 2011-09-09

    CPC classification number: G06F17/30176

    Abstract: Techniques for data synchronization policies are described. In one or more implementations, techniques may be employed to set data synchronization (“sync”) policies for devices in a data sync environment. The sync policies specify parameters for sync operations in the sync environment, such as how frequently data sync operations are performed, what types of data are synced to particular devices, how frequently particular types of data are synced, and so on. In implementations, the sync policies consider the number of devices that are participating in a sync environment and attributes of the devices in specifying parameters for sync operations. Data can be synchronized among devices in the sync environment based on the sync policies.

    Using Index Partitioning and Reconciliation for Data Deduplication
    5.
    Patent application
    Using Index Partitioning and Reconciliation for Data Deduplication (in force)

    Publication No.: US20120166401A1

    Publication date: 2012-06-28

    Application No.: US12979748

    Filing date: 2010-12-28

    Abstract: The subject disclosure is directed towards a data deduplication technology in which a hash index service's index is partitioned into subspace indexes, with less than the entire hash index service's index cached to save memory. The subspace index is accessed to determine whether a data chunk already exists or needs to be indexed and stored. The index may be divided into subspaces based on criteria associated with the data to index, such as file type, data type, time of last usage, and so on. Also described is subspace reconciliation, in which duplicate entries in subspaces are detected so as to remove entries and chunks from the deduplication system. Subspace reconciliation may be performed at off-peak time, when more system resources are available, and may be interrupted if resources are needed. Subspaces to reconcile may be based on similarity, including via similarity of signatures that each compactly represents the subspace's hashes.
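
The reconciliation step in the abstract can be sketched in Python under stated assumptions. The signature scheme (the k smallest hashes of a subspace), the Jaccard similarity measure, and the `should_stop` callback are illustrative choices, not details taken from the patent, and each subspace index is modeled as a plain dictionary from hash to chunk location.

```python
from typing import Callable, Dict, FrozenSet, List


def signature(index: Dict[str, str], k: int = 4) -> FrozenSet[str]:
    """Compact stand-in for a subspace: its k smallest hashes (assumed scheme)."""
    return frozenset(sorted(index)[:k])


def similarity(a: FrozenSet[str], b: FrozenSet[str]) -> float:
    """Jaccard similarity of two signatures; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def reconcile(keep: Dict[str, str], other: Dict[str, str],
              should_stop: Callable[[], bool] = lambda: False) -> List[str]:
    """Drop entries of `other` whose hash is also in `keep`.

    Returns the chunk locations that became removable. The loop checks
    should_stop() before each entry, so it can be interrupted and resumed.
    """
    freed = []
    for digest in list(other):
        if should_stop():
            break
        if digest in keep:
            freed.append(other.pop(digest))
    return freed


docs = {"h1": "docs/0", "h2": "docs/1", "h3": "docs/2"}
media = {"h2": "media/0", "h3": "media/1", "h9": "media/2"}

score = similarity(signature(docs), signature(media))  # 0.5: two of four hashes shared
freed = reconcile(docs, media)                          # ["media/0", "media/1"]
```

A scheduler could compare signatures first and reconcile only the pairs of subspaces that score above some threshold. A real system would also have to redirect any file metadata that referenced the removed chunks to the surviving copies, which this sketch leaves out.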
