Elimination of redundant objects in storage systems
    1.
    Type: granted invention patent
    Status: in force

    Publication No.: US08554744B2

    Publication date: 2013-10-08

    Application No.: US13092777

    Filing date: 2011-04-22

    IPC classification: G06F17/30

    CPC classification: G06F17/30489

    Abstract: Provided are a method, system, and article of manufacture, wherein a data structure corresponding to a set of client nodes selected from a plurality of client nodes is generated. Objects from the selected set of client nodes are stored in the data structure. A determination is made that an object corresponding to a client node of the selected set of client nodes has to be stored. An additional determination is made as to whether the object has already been stored in the data structure by any client node of the selected set of client nodes. The object is stored in the data structure, in response to determining that the object has not already been stored in the data structure by any client node of the selected set of client nodes.
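
The store-once check described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say how two objects are recognized as the same, so the sketch assumes a SHA-256 content digest, and the names `NodeSetStore`, `store`, `objects`, and `owners` are hypothetical.

```python
import hashlib


class NodeSetStore:
    """Shared data structure for one selected set of client nodes (hypothetical name)."""

    def __init__(self, node_ids):
        self.node_ids = set(node_ids)
        self.objects = {}  # content digest -> object bytes (one copy per distinct object)
        self.owners = {}   # content digest -> ids of the nodes that stored the object

    def store(self, node_id, data):
        """Store data for node_id; return True only if a new copy was written."""
        if node_id not in self.node_ids:
            raise ValueError(f"{node_id!r} is not in the selected node set")
        # Assumption: identical content means identical object.
        digest = hashlib.sha256(data).hexdigest()
        is_new = digest not in self.objects
        if is_new:
            # No node in the set has stored this object yet, so keep it.
            self.objects[digest] = data
        # Record the requesting node as an owner either way.
        self.owners.setdefault(digest, set()).add(node_id)
        return is_new
```

With a set of two nodes, the first `store` call for a given payload writes a copy and a second call from the other node only adds an owner entry.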

    Data selection for movement from a source to a target
    3.
    Type: granted invention patent
    Status: in force

    Publication No.: US09087011B2

    Publication date: 2015-07-21

    Application No.: US13484119

    Filing date: 2012-05-30

    IPC classification: G06F17/00 G06F11/14

    CPC classification: G06F11/1453

    Abstract: In one aspect of the present description, in connection with storing a first deduplicated data object in a primary storage pool, described operations include determining the duration of time that the first data object has resided in the primary storage pool, and comparing the determined duration of time to a predetermined time interval. In addition, described operations include, after the determined duration of time meets or exceeds the predetermined time interval, determining if the first data object has an extent referenced by another data object, and determining whether to move the first data object from the primary storage pool to a secondary storage pool as a function of whether the first data object has an extent referenced by another data object after the determined duration of time meets or exceeds the predetermined time interval. Other features and aspects may be realized, depending upon the particular application.
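
The two-stage decision in this abstract (residence time first, then shared extents) can be sketched as below. The abstract only says the move decision is "a function of" whether an extent is referenced by another object; the direction chosen here (move only objects whose extents are not shared) is an assumption, and `DataObject`, `should_move`, and `min_age` are hypothetical names.

```python
from dataclasses import dataclass, field


@dataclass
class DataObject:
    name: str
    stored_at: float                           # time the object entered the primary pool
    extents: set = field(default_factory=set)  # ids of the extents the object is built from


def should_move(obj, pool, now, min_age):
    """Decide whether obj should move from the primary to the secondary pool."""
    # Stage 1: the object must have resided in the primary pool long enough.
    if now - obj.stored_at < min_age:
        return False
    # Stage 2: check whether any other object references one of its extents.
    shared = any(obj.extents & other.extents for other in pool if other is not obj)
    # Assumption: objects sharing extents stay in the primary pool.
    return not shared
```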

    System and method for relating files in a distributed data storage environment
    5.
    Type: granted invention patent
    Status: in force

    Publication No.: US06615225B1

    Publication date: 2003-09-02

    Application No.: US09561252

    Filing date: 2000-04-27

    IPC classification: G06F17/30

    Abstract: A system and method for relating files in a distributed data storage environment allows for positive identification of membership of a file within a group, even in a loosely coupled environment where files are not available for comparison in real time. In disclosed embodiments, base files of a client are stored on a server and are accompanied by tokens uniquely identifying the base files. The tokens are generated on the client and may be derived from the contents of the base file using a digital signature. Each file transmitted to the server is accompanied by a token. Incremental backups may be used, and may employ file differencing. Accordingly, sub-files related to the base files may be transmitted to the server for backup. The sub-files are related to their respective base files using the tokens and are cross-linked to the base files so that any sub-files can be retrieved together with the base file from which the sub-file was derived.
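
A minimal sketch of the token linkage described in this abstract follows. The abstract mentions deriving tokens with a digital signature; the sketch substitutes a plain SHA-256 digest as a stand-in, and `make_token`, `BackupServer`, and its methods are hypothetical names, not an interface from the patent.

```python
import hashlib


def make_token(contents):
    """Client side: derive a token from the base file contents (SHA-256 stand-in)."""
    return hashlib.sha256(contents).hexdigest()


class BackupServer:
    """Server side: keeps base files and the sub-files linked to them by token."""

    def __init__(self):
        self.base_files = {}  # token -> base file contents
        self.sub_files = {}   # token -> sub-files (e.g. deltas) derived from that base

    def put_base(self, token, contents):
        self.base_files[token] = contents
        self.sub_files.setdefault(token, [])

    def put_sub(self, base_token, delta):
        # A sub-file is only accepted if its base file is already known.
        if base_token not in self.base_files:
            raise KeyError("unknown base file token")
        self.sub_files[base_token].append(delta)

    def retrieve(self, token):
        """Return the base file together with every sub-file derived from it."""
        return self.base_files[token], list(self.sub_files[token])
```

Because the token is computed on the client from the file contents, the server can relate a later sub-file to its base without comparing the files themselves.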

    Applying replication rules to determine whether to replicate objects
    6.
    Type: granted invention patent
    Status: in force

    Publication No.: US08838529B2

    Publication date: 2014-09-16

    Application No.: US13221691

    Filing date: 2011-08-30

    IPC classification: G06F17/00 G06F17/30

    CPC classification: G06F17/30575

    Abstract: A source server maintains a replication rule specifying a condition for a replication attribute and a replication action to take if the condition with respect to the replication attribute is satisfied, wherein the replication action indicates to include or exclude the object having an attribute value for the replication attribute that satisfies the condition. For each of the objects, the replication rule is applied by determining an attribute value of the object corresponding to the replication attribute in the replication rule and determining whether the determined attribute value satisfies the condition for the replication attribute defined in the determined replication rule. The replication action is performed on the object in response to determining that the determined attribute value satisfies the condition for the replication attribute.
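
The rule application in this abstract can be sketched as below. Objects are modeled as plain dictionaries and a condition as a predicate on the attribute value; both choices, along with the names `ReplicationRule` and `apply_rule`, are assumptions made for illustration. The abstract does not say what happens to objects that do not satisfy the condition, so the sketch leaves them out of the result.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass
class ReplicationRule:
    attribute: str                     # replication attribute the rule looks at
    condition: Callable[[Any], bool]   # condition on that attribute's value
    action: str                        # "include" or "exclude"


def apply_rule(rule, objects):
    """Return {object name: action} for every object whose attribute satisfies the rule."""
    decisions = {}
    for obj in objects:
        if rule.attribute not in obj:
            continue  # object has no value for this attribute; rule does not apply
        value = obj[rule.attribute]
        if rule.condition(value):
            decisions[obj["name"]] = rule.action
    return decisions
```

For example, a rule with attribute `"size"`, condition `lambda v: v > 100`, and action `"exclude"` marks only objects larger than 100 for exclusion from replication.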

    Data recovery in a hierarchical data storage system
    7.
    Type: granted invention patent
    Status: in force

    Publication No.: US08738575B2

    Publication date: 2014-05-27

    Application No.: US11856688

    Filing date: 2007-09-17

    IPC classification: G06F17/30 G06F7/00 G06F11/14

    Abstract: Systems and methods for retrieving data from a storage system having a plurality of storage pools are provided. The method comprises processing configurable data retrieval instructions to determine a first storage pool from which target backup data is to be retrieved, in response to a data restore request; and retrieving the target backup data from the first storage pool to satisfy the restore request. The configurable data retrieval instructions are managed by a source external to the storage system with administrative authority to change the configurable data retrieval instructions to optimize data restoration from the storage system.
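
One way to picture the pool selection in this abstract is sketched below. The abstract does not describe the form of the configurable retrieval instructions; the sketch assumes they are simply an ordered list of pool names supplied from outside the storage system, and `choose_pool` and `restore` are hypothetical names.

```python
def choose_pool(instructions, pools, object_id):
    """Pick the first pool, in the configured order, that holds the requested data."""
    for pool_name in instructions:
        if object_id in pools.get(pool_name, {}):
            return pool_name
    raise LookupError(f"{object_id!r} not found in any configured pool")


def restore(instructions, pools, object_id):
    """Satisfy a restore request from the pool selected by the instructions."""
    return pools[choose_pool(instructions, pools, object_id)][object_id]
```

Changing the order of `instructions` changes which copy is read, without touching the pools themselves.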

    DATA SELECTION FOR MOVEMENT FROM A SOURCE TO A TARGET

    Publication No.: US20130159648A1

    Publication date: 2013-06-20

    Application No.: US13484119

    Filing date: 2012-05-30

    IPC classification: G06F12/16

    CPC classification: G06F11/1453

    Abstract: In one aspect of the present description, in connection with storing a first deduplicated data object in a primary storage pool, described operations include determining the duration of time that the first data object has resided in the primary storage pool, and comparing the determined duration of time to a predetermined time interval. In addition, described operations include, after the determined duration of time meets or exceeds the predetermined time interval, determining if the first data object has an extent referenced by another data object, and determining whether to move the first data object from the primary storage pool to a secondary storage pool as a function of whether the first data object has an extent referenced by another data object after the determined duration of time meets or exceeds the predetermined time interval. Other features and aspects may be realized, depending upon the particular application.

    Policy based tiered data deduplication strategy
    9.
    Type: granted invention patent
    Status: in force

    Publication No.: US07567188B1

    Publication date: 2009-07-28

    Application No.: US12100695

    Filing date: 2008-04-10

    IPC classification: H03M7/46

    Abstract: The present invention provides for a method, system, and computer program for the application of data deduplication according to a policy-based strategy of tiered data. The method operates by defining a plurality of data storage policies for data in a deduplication system, policies which may be arranged in tiers. Data objects are classified according to a selected data storage policy and are split into data chunks. If the selected data storage policy for the data object does not allow deduplication, the data chunks are stored in a deduplication pool. If the selected data storage policy for the data object allows deduplication, deduplication is performed. The data storage policy may specify a maximum number of references to data chunks, facilitating storage of new copies of the data chunks when the maximum number of references is met.
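
The policy check and the reference cap in this abstract can be sketched as follows. Fixed-size chunking, SHA-256 chunk digests, and the names `Policy`, `DedupPool`, `store_chunk`, and `store_object` are all assumptions for illustration. As a simplification, copies written under a policy that forbids deduplication are not distinguished from shareable ones.

```python
import hashlib
from dataclasses import dataclass
from typing import Optional


@dataclass
class Policy:
    allow_dedup: bool
    max_refs: Optional[int] = None  # None means no cap on references per chunk copy


class DedupPool:
    def __init__(self):
        self.copies = {}  # chunk digest -> list of reference counts, one per stored copy

    def store_chunk(self, chunk, policy):
        """Store one chunk; return True if a new physical copy was written."""
        digest = hashlib.sha256(chunk).hexdigest()
        counts = self.copies.setdefault(digest, [])
        if policy.allow_dedup and counts:
            # Reuse the newest copy unless it already holds the maximum references.
            if policy.max_refs is None or counts[-1] < policy.max_refs:
                counts[-1] += 1
                return False
        # Policy forbids deduplication, chunk is new, or the cap was reached.
        counts.append(1)
        return True


def store_object(pool, data, policy, chunk_size=4):
    """Split data into fixed-size chunks and return the number of new copies written."""
    chunks = [data[i:i + chunk_size] for i in range(0, len(data), chunk_size)]
    return sum(pool.store_chunk(c, policy) for c in chunks)
```

With `max_refs=2`, the third request for the same chunk writes a fresh copy instead of adding a third reference to the existing one.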

    Data selection for movement from a source to a target

    Publication No.: US09087010B2

    Publication date: 2015-07-21

    Application No.: US13327571

    Filing date: 2011-12-15

    IPC classification: G06F17/00 G06F11/14

    CPC classification: G06F11/1453

    Abstract: In one aspect of the present description, in connection with storing a first deduplicated data object in a primary storage pool, described operations include determining the duration of time that the first data object has resided in the primary storage pool, and comparing the determined duration of time to a predetermined time interval. In addition, described operations include, after the determined duration of time meets or exceeds the predetermined time interval, determining if the first data object has an extent referenced by another data object, and determining whether to move the first data object from the primary storage pool to a secondary storage pool as a function of whether the first data object has an extent referenced by another data object after the determined duration of time meets or exceeds the predetermined time interval. Other features and aspects may be realized, depending upon the particular application.