System and method for storing redundant information
    12.
    发明授权
    System and method for storing redundant information 有权
    用于存储冗余信息的系统和方法

    公开(公告)号:US08285683B2

    公开(公告)日:2012-10-09

    申请号:US13251022

    申请日:2011-09-30

    IPC分类号: G06F7/00 G06F17/00

    摘要: A method and system for reducing storage requirements and speeding up storage operations by reducing the storage of redundant data includes receiving a request that identifies one or more data objects to which to apply a storage operation. For each data object, the storage system determines if the data object contains data that matches another data object to which the storage operation was previously applied. If the data objects do not match, then the storage system performs the storage operation in a usual manner. However, if the data objects do match, then the storage system may avoid performing the storage operation.

    摘要翻译: 通过减少冗余数据的存储来减少存储要求和加速存储操作的方法和系统包括接收标识用于应用存储操作的一个或多个数据对象的请求。 对于每个数据对象,存储系统确定数据对象是否包含与之前应用存储操作的另一个数据对象相匹配的数据。 如果数据对象不匹配,则存储系统以通常的方式执行存储操作。 但是,如果数据对象匹配,则存储系统可能会避免执行存储操作。

    Systems and methods for managing single instancing data
    14.
    发明授权
    Systems and methods for managing single instancing data 有权
    用于管理单一实例数据的系统和方法

    公开(公告)号:US09015181B2

    公开(公告)日:2015-04-21

    申请号:US12565576

    申请日:2009-09-23

    IPC分类号: G06F7/00 G06F17/30 G06F11/14

    摘要: Described in detail herein are systems and methods for managing single instancing data. Using a single instance database and other constructs (e.g. sparse files), data density on archival media (e.g. magnetic tape) is improved, and the number of files per storage operation is reduced. According to one aspect of a method for managing single instancing data, for each storage operation, a chunk folder is created on a storage device that stores single instancing data. The chunk folder contains three files: 1) a file that contains data objects that have been single instanced; 2) a file that contains data objects that have not been eligible for single instancing; and 3) a metadata file used to track the location of data objects within the other files. A second storage operation subsequent to a first storage operation contains references to data objects in the chunk folder created by the first storage operation instead of the data objects themselves.

    摘要翻译: 在此详细描述了用于管理单个实例数据的系统和方法。 使用单个实例数据库和其他构造(例如稀疏文件),归档介质(例如磁带)上的数据密度得到改善,并且每个存储操作的文件数量减少。 根据用于管理单个实例数据的方法的一个方面,对于每个存储操作,在存储单个实例数据的存储设备上创建块文件夹。 该块文件夹包含三个文件:1)一个包含已被单实例化的数据对象的文件; 2)包含不符合单一实例资格的数据对象的文件; 和3)用于跟踪其他文件中的数据对象的位置的元数据文件。 在第一存储操作之后的第二存储操作包含对由第一存储操作而不是数据对象本身创建的块文件夹中的数据对象的引用。

    SYSTEMS AND METHODS FOR MANAGING SINGLE INSTANCING DATA
    15.
    发明申请
    SYSTEMS AND METHODS FOR MANAGING SINGLE INSTANCING DATA 有权
    用于管理单个实时数据的系统和方法

    公开(公告)号:US20100082672A1

    公开(公告)日:2010-04-01

    申请号:US12565576

    申请日:2009-09-23

    IPC分类号: G06F12/16 G06F12/00

    摘要: Described in detail herein are systems and methods for managing single instancing data. Using a single instance database and other constructs (e.g. sparse files), data density on archival media (e.g. magnetic tape) is improved, and the number of files per storage operation is reduced. According to one aspect of a method for managing single instancing data, for each storage operation, a chunk folder is created on a storage device that stores single instancing data. The chunk folder contains three files: 1) a file that contains data objects that have been single instanced; 2) a file that contains data objects that have not been eligible for single instancing; and 3) a metadata file used to track the location of data objects within the other files. A second storage operation subsequent to a first storage operation contains references to data objects in the chunk folder created by the first storage operation instead of the data objects themselves.

    摘要翻译: 在此详细描述了用于管理单个实例数据的系统和方法。 使用单个实例数据库和其他构造(例如稀疏文件),归档介质(例如磁带)上的数据密度得到改善,并且每个存储操作的文件数量减少。 根据用于管理单个实例数据的方法的一个方面,对于每个存储操作,在存储单个实例数据的存储设备上创建块文件夹。 该块文件夹包含三个文件:1)一个包含已被单实例化的数据对象的文件; 2)包含不符合单一实例资格的数据对象的文件; 和3)用于跟踪其他文件中的数据对象的位置的元数据文件。 在第一存储操作之后的第二存储操作包含对由第一存储操作而不是数据对象本身创建的块文件夹中的数据对象的引用。

    Block-level single instancing
    18.
    发明授权

    公开(公告)号:US08578120B2

    公开(公告)日:2013-11-05

    申请号:US12647906

    申请日:2009-12-28

    IPC分类号: G06F12/00

    摘要: Described in detail herein are systems and methods for single instancing blocks of data in a data storage system. For example, the data storage system may include multiple computing devices (e.g., client computing devices) that store primary data. The data storage system may also include a secondary storage computing device, a single instance database, and one or more storage devices that store copies of the primary data (e.g., secondary copies, tertiary copies, etc.). The secondary storage computing device receives blocks of data from the computing devices and accesses the single instance database to determine whether the blocks of data are unique (meaning that no instances of the blocks of data are stored on the storage devices). If a block of data is unique, the single instance database stores it on a storage device. If not, the secondary storage computing device can avoid storing the block of data on the storage devices.

    BLOCK-LEVEL SINGLE INSTANCING
    19.
    发明申请
    BLOCK-LEVEL SINGLE INSTANCING 有权
    块级单一实现

    公开(公告)号:US20100299490A1

    公开(公告)日:2010-11-25

    申请号:US12647906

    申请日:2009-12-28

    摘要: Described in detail herein are systems and methods for single instancing blocks of data in a data storage system. For example, the data storage system may include multiple computing devices (e.g., client computing devices) that store primary data. The data storage system may also include a secondary storage computing device, a single instance database, and one or more storage devices that store copies of the primary data (e.g., secondary copies, tertiary copies, etc.). The secondary storage computing device receives blocks of data from the computing devices and accesses the single instance database to determine whether the blocks of data are unique (meaning that no instances of the blocks of data are stored on the storage devices). If a block of data is unique, the single instance database stores it on a storage device. If not, the secondary storage computing device can avoid storing the block of data on the storage devices.

    摘要翻译: 在此详细描述的是用于数据存储系统中单个实例数据块的系统和方法。 例如,数据存储系统可以包括存储主数据的多个计算设备(例如,客户端计算设备)。 数据存储系统还可以包括辅助存储计算设备,单个实例数据库以及存储主要数据的副本(例如,次要副本,三级副本等)的一个或多个存储设备。 辅助存储计算设备从计算设备接收数据块并访问单个实例数据库以确定数据块是否是唯一的(意味着数据块的实例不存储在存储设备上)。 如果一个数据块是唯一的,那么单个实例数据库将其存储在存储设备上。 如果不是,则辅助存储计算设备可以避免将数据块存储在存储设备上。

    STORING A VARIABLE NUMBER OF INSTANCES OF DATA OBJECTS
    20.
    发明申请
    STORING A VARIABLE NUMBER OF INSTANCES OF DATA OBJECTS 有权
    存储可变数据数据对象的数据

    公开(公告)号:US20100250549A1

    公开(公告)日:2010-09-30

    申请号:US12649454

    申请日:2009-12-30

    IPC分类号: G06F17/30

    摘要: Described are systems and methods for storing a variable number of instances of data objects (e.g., 1, 2, 3, or up to N-1 instances, where N is the number of instances of the data object included in primary data) in secondary storage across a data storage network. In some examples, a system for storing a variable number of instances of data objects includes, one or more computing devices storing a set of data objects and multiple storage devices distinct from the one or more computing devices. Each of the multiple storage devices is configured to store at least a single instance of a data object. The system also includes a database configured to store information associated with the data objects. This information includes substantially unique identifiers for the data objects and, for each of the data objects, a number of instances of the data object stored on the multiple storage devices.

    摘要翻译: 描述了用于存储可变数目的数据对象(例如,1,2,3或至多N-1个实例的系统和方法,其中N是主数据中包括的数据对象的实例的数量)在次级 存储在数据存储网络中。 在一些示例中,用于存储可变数量的数据对象的实例的系统包括存储一组数据对象的一个​​或多个计算设备和与一个或多个计算设备不同的多个存储设备。 多个存储设备中的每一个被配置为存储数据对象的至少一个实例。 该系统还包括配置为存储与数据对象相关联的信息的数据库。 该信息包括用于数据对象的基本上唯一的标识符,并且对于每个数据对象,存储多个存储设备上存储的数据对象的多个实例。