REDUNDANT DISTRIBUTED DATA STORAGE SYSTEM

    Publication No.: US20210334002A1

    Publication date: 2021-10-28

    Application No.: US17236882

    Filing date: 2021-04-21

    Abstract: An improved information management system is provided herein for managing secondary storage computing devices when one or more become unavailable. For example, a first secondary storage computing device in a pool of secondary storage computing devices can run a storage manager virtual machine (VM) that manages secondary copy and restore operations. A second secondary storage computing device in the pool can run a virtualization engine that manages the storage manager VM. The virtualization engine can replicate the storage manager VM data to other secondary storage computing devices in the pool. If the first secondary storage computing device becomes unavailable, the second secondary storage computing device can instruct a third secondary storage computing device in the pool to launch a replicated storage manager VM using the replicated storage manager VM data stored therein. Thus, the third secondary storage computing device can act as the now unavailable first secondary storage computing device.

    SYSTEMS AND METHODS FOR MANAGING SINGLE INSTANCING DATA

    Publication No.: US20210232460A1

    Publication date: 2021-07-29

    Application No.: US17233187

    Filing date: 2021-04-16

    Abstract: Described in detail herein are systems and methods for managing single instancing data. Using a single instance database and other constructs (e.g. sparse files), data density on archival media (e.g. magnetic tape) is improved, and the number of files per storage operation is reduced. According to one aspect of a method for managing single instancing data, for each storage operation, a chunk folder is created on a storage device that stores single instancing data. The chunk folder contains three files: 1) a file that contains data objects that have been single instanced; 2) a file that contains data objects that have not been eligible for single instancing; and 3) a metadata file used to track the location of data objects within the other files. A second storage operation subsequent to a first storage operation contains references to data objects in the chunk folder created by the first storage operation instead of the data objects themselves.
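The three-file chunk folder described in this abstract can be illustrated with a minimal in-memory sketch. Everything named here is an assumption made for illustration and is not taken from the patent: the `ChunkFolder` class, the `store` function, the SHA-256 digest used to detect duplicates, and the size-based eligibility rule.

```python
import hashlib


class ChunkFolder:
    """In-memory stand-in for one chunk folder (hypothetical layout)."""

    def __init__(self, name):
        self.name = name
        self.single_instanced = bytearray()  # file 1: single-instanced objects
        self.not_eligible = bytearray()      # file 2: objects not eligible
        self.metadata = []                   # file 3: location records


def store(folder, objects, index, min_size=4):
    """Run one storage operation into `folder`.

    `index` maps a digest to (folder name, offset, length) and plays the
    role of the single instance database. The size threshold is an assumed
    eligibility rule.
    """
    for data in objects:
        if len(data) < min_size:
            offset = len(folder.not_eligible)
            folder.not_eligible += data
            folder.metadata.append(("not_eligible", offset, len(data)))
            continue
        digest = hashlib.sha256(data).hexdigest()
        if digest in index:
            # Already stored by an earlier operation: record a reference only.
            folder.metadata.append(("reference",) + index[digest])
        else:
            offset = len(folder.single_instanced)
            folder.single_instanced += data
            index[digest] = (folder.name, offset, len(data))
            folder.metadata.append(("single_instanced", offset, len(data)))


index = {}
first = ChunkFolder("chunk_1")
store(first, [b"report-2021", b"ab"], index)
second = ChunkFolder("chunk_2")
store(second, [b"report-2021", b"new-data"], index)
print(second.metadata)
```

In this sketch the second operation stores only `b"new-data"`; the repeated object is recorded as a reference into the folder created by the first operation.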

    SYSTEMS AND METHODS FOR OPTIMIZING RESTORATION OF DEDUPLICATED DATA STORED IN CLOUD-BASED STORAGE RESOURCES

    Publication No.: US20210173744A1

    Publication date: 2021-06-10

    Application No.: US17079023

    Filing date: 2020-10-23

    Abstract: Techniques disclosed herein are well suited to restoring deduplicated backup data from cloud-based storage and from multi-node replicated file systems, and they also improve performance in more traditional data storage technologies. Pre-restore steps include analysis of deduplication indexes to identify data segments that are stored consecutively on storage media. Reading data in aggregate runs of consecutively stored data segments reduces interactions with storage media that host the deduplicated data and speeds up retrieval. Parallel reads from multiple storage devices in multi-node replicated file systems also speed up retrieval. An illustrative enhanced media agent pre-fetches data (stored in deduplicated form) in anticipation of read requests that are expected in the restore operation. The pre-fetched data is temporarily stored locally at the media agent, which is responsible for interfacing with storage media and is further responsible for orchestrating the disclosed techniques within an illustrative data storage management system.
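The idea of reading consecutively stored segments in aggregate runs can be sketched as follows. This is an illustrative assumption, not the patented method: the `(offset, length)` tuple representation and the `coalesce_runs` name are hypothetical.

```python
def coalesce_runs(segments):
    """Merge (offset, length) segments that sit back to back on the media.

    Returns a list of (offset, length) runs, each of which could be fetched
    with a single read instead of one read per segment.
    """
    runs = []
    for offset, length in sorted(segments):
        if runs and runs[-1][0] + runs[-1][1] == offset:
            # Segment starts exactly where the previous run ends: extend it.
            runs[-1] = (runs[-1][0], runs[-1][1] + length)
        else:
            runs.append((offset, length))
    return runs


reads = coalesce_runs([(0, 4), (4, 4), (8, 2), (20, 4), (24, 4)])
print(reads)
```

Here five segment reads collapse into two aggregate reads, which is the kind of reduction in storage interactions the abstract refers to.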

    EFFICIENT VERIFICATION OF DATA BLOCKS IN DEDUPLICATED STORAGE SYSTEMS

    Publication No.: US20200293213A1

    Publication date: 2020-09-17

    Application No.: US16817399

    Filing date: 2020-03-12

    Abstract: The exemplary system and methods provide a solution for reclaiming the space occupied by invalid blocks of data stored on secondary storage devices. The space reclamation techniques may employ a primary table, a deduplication chunk table, or an index data structure located with the secondary copy of data on secondary storage devices. One exemplary method uses information from the deduplication database media agent and a deduplication chunk table. Another exemplary method uses an index (e.g., single instance file index) that is associated with and stored with each chunk of data. Based on the information provided by the deduplication chunk table and the index, exemplary system and methods identify blocks of invalid data and copy over only valid data blocks to a new container file.
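Copying only valid blocks into a new container file might look like the sketch below. The data shapes are assumptions for illustration: the per-container index is modeled as a list of `(block_id, offset, length)` entries, the deduplication chunk table as a mapping from block ID to reference count, and a block is treated as invalid when its count is zero.

```python
def compact(container, index, chunk_table):
    """Copy only still-referenced blocks into a new container.

    container:   bytes of the old container file
    index:       list of (block_id, offset, length) entries stored with it
    chunk_table: block_id -> reference count (assumed validity signal)
    """
    new_container = bytearray()
    new_index = []
    for block_id, offset, length in index:
        if chunk_table.get(block_id, 0) > 0:
            # Valid block: append it and record its new location.
            new_index.append((block_id, len(new_container), length))
            new_container += container[offset:offset + length]
    return bytes(new_container), new_index


old = b"AAAABBBBCCCC"
old_index = [("a", 0, 4), ("b", 4, 4), ("c", 8, 4)]
refs = {"a": 2, "b": 0, "c": 1}
new, new_index = compact(old, old_index, refs)
print(new, new_index)
```

The space held by block `b` is reclaimed once the old container is discarded.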

    RESTORE OF SECONDARY DATA USING THREAD POOLING

    Publication No.: US20200241766A1

    Publication date: 2020-07-30

    Application No.: US16722756

    Filing date: 2019-12-20

    Abstract: A system according to certain aspects may include a secondary storage controller computer configured to: in response to a first instruction to obtain a first secondary copy of a first data set from a secondary storage device(s), the first instruction associated with a first restore operation: instantiate a first restore thread on a processor of the secondary storage controller computer; using the first restore thread, retrieve the first secondary copy from the secondary storage device(s); and forward the retrieved first secondary copy to a primary storage subsystem for storage; and in response to a second instruction to obtain a second secondary copy of a second data set from the secondary storage device(s), the second instruction associated with a second restore operation: using the first restore thread, retrieve the second secondary copy from the secondary storage device(s); and forward the retrieved second secondary copy to the primary storage subsystem for storage.
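The reuse of one restore thread across two restore operations can be sketched with a long-lived worker fed by a queue. The `RestoreWorker` class and the dictionaries standing in for secondary and primary storage are hypothetical; the abstract does not specify an implementation.

```python
import queue
import threading


class RestoreWorker:
    """One long-lived restore thread that serves successive restore requests."""

    def __init__(self, secondary_storage, primary_storage):
        self.secondary = secondary_storage
        self.primary = primary_storage
        self.requests = queue.Queue()
        self.served_by = []  # thread ids, recorded to show the thread is reused
        self.thread = threading.Thread(target=self._run, daemon=True)
        self.thread.start()

    def _run(self):
        while True:
            copy_id = self.requests.get()
            if copy_id is None:  # sentinel: shut the worker down
                break
            # Retrieve the secondary copy and forward it to primary storage.
            self.primary[copy_id] = self.secondary[copy_id]
            self.served_by.append(threading.get_ident())

    def restore(self, copy_id):
        self.requests.put(copy_id)

    def close(self):
        self.requests.put(None)
        self.thread.join()


secondary = {"copy_1": b"first data set", "copy_2": b"second data set"}
primary = {}
worker = RestoreWorker(secondary, primary)
worker.restore("copy_1")  # first restore operation
worker.restore("copy_2")  # second restore operation, same thread
worker.close()
print(len(set(worker.served_by)))
```

Keeping the thread alive between requests avoids paying thread start-up cost for every restore operation, which is the usual motivation for pooling.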

    ASSIGNING BACKUP RESOURCES BASED ON FAILOVER OF PARTNERED DATA STORAGE SERVERS IN A DATA STORAGE MANAGEMENT SYSTEM

    Publication No.: US20200183802A1

    Publication date: 2020-06-11

    Application No.: US16212485

    Filing date: 2018-12-06

    Abstract: An illustrative data storage management system is aware that certain data storage resources for storing/serving primary data operate in a partnered configuration. Illustrative components of the data storage management system analyze the failover status of the partnered primary data storage resources to determine which is currently serving/storing primary data and/or snapshots targeted for backup. When detecting that a first partnered primary data storage resource has failed over to a second primary data storage resource, the example storage manager changes the assignment of backup resources that are pre-administered for the targeted data. Accordingly, the example storage manager assigns backup resources, including at least one media agent, that are associated with the second primary data storage resource, and which are “closer” thereto from a geography and/or network topology perspective, even if the pre-administered backup resources are available for backup.

    MANAGING DELETIONS FROM A DEDUPLICATION DATABASE

    Publication No.: US20200089659A1

    Publication date: 2020-03-19

    Application No.: US16550094

    Filing date: 2019-08-23

    Abstract: An information management system can manage the removal of data block entries in a deduplicated data store using working copies of the data block entries residing in a local data store of a secondary storage computing device. The system can use the working copies to identify data blocks for removal. Once the deduplication database is updated with the changes to the working copies (e.g., using a transaction based update scheme), the system can query the deduplication database for the database entries identified for removal. Once identified, the system can remove the database entries identified for pruning and/or the corresponding deduplication data blocks from secondary storage.
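The sequence in this abstract (mark working copies, update the database in a transaction, query for marked entries, then prune) can be sketched with Python's built-in `sqlite3` module. The table schema, the `prune` flag, and the zero-reference-count rule are assumptions for illustration only.

```python
import sqlite3

# Stand-in deduplication database (assumed schema).
db = sqlite3.connect(":memory:")
db.execute(
    "CREATE TABLE blocks ("
    "block_id TEXT PRIMARY KEY, ref_count INTEGER, prune INTEGER DEFAULT 0)"
)
db.executemany(
    "INSERT INTO blocks (block_id, ref_count) VALUES (?, ?)",
    [("a", 1), ("b", 0), ("c", 0)],
)
db.commit()

# Working copies of the entries, held in a local data store.
working = {
    row[0]: {"ref_count": row[1], "prune": False}
    for row in db.execute("SELECT block_id, ref_count FROM blocks")
}
for entry in working.values():
    if entry["ref_count"] == 0:  # assumed rule for "eligible for removal"
        entry["prune"] = True

# Push the working-copy changes back in a single transaction.
with db:
    db.executemany(
        "UPDATE blocks SET prune = ? WHERE block_id = ?",
        [(int(e["prune"]), block_id) for block_id, e in working.items()],
    )

# Query the database for entries identified for removal, then prune them.
to_remove = [
    row[0]
    for row in db.execute(
        "SELECT block_id FROM blocks WHERE prune = 1 ORDER BY block_id"
    )
]
with db:
    db.executemany("DELETE FROM blocks WHERE block_id = ?", [(b,) for b in to_remove])

remaining = [row[0] for row in db.execute("SELECT block_id FROM blocks ORDER BY block_id")]
print(to_remove, remaining)
```

Using the connection as a context manager commits the batch on success and rolls it back on an exception, so the database never sees a half-applied set of markings.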

    SNAPSHOT READINESS CHECKING AND REPORTING

    Publication No.: US20180196821A1

    Publication date: 2018-07-12

    Application No.: US15860520

    Filing date: 2018-01-02

    Abstract: An information management system according to certain aspects may determine whether snapshot operations will work prior to executing them. The system may check various factors or parameters relating to a snapshot storage policy to verify whether the storage policy will work at runtime without actually executing the policy. Some examples of factors can include: availability of primary storage devices for which a snapshot should be obtained, availability of secondary storage devices, license availability for snapshot software, user credentials for connecting to primary and/or secondary storage devices, available storage capacity, connectivity to storage devices, etc. The system may also check whether a particular system configuration is supported in connection with snapshot operations. The result of the determination can be provided in the form of a report summarizing any problems found with the snapshot storage policy. The report can include recommended courses of action or solutions for resolving any identified issues.
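A readiness check of this kind can be sketched as a list of predicates evaluated against a policy description, with failures collected into a report. The policy fields, check names, and recommendation strings below are all hypothetical examples, not values from the patent.

```python
def check_readiness(policy, checks):
    """Evaluate each check without running the policy; collect problems found."""
    problems = []
    for name, check, advice in checks:
        if not check(policy):
            problems.append({"check": name, "recommendation": advice})
    return {"ready": not problems, "problems": problems}


# Hypothetical snapshot policy state.
policy = {"primary_online": True, "licenses_free": 0, "free_gb": 500, "needed_gb": 200}

# Each check: (name, predicate over the policy, recommended action on failure).
checks = [
    ("primary storage available", lambda p: p["primary_online"],
     "bring the primary array online"),
    ("snapshot license available", lambda p: p["licenses_free"] > 0,
     "release or add a license"),
    ("enough capacity", lambda p: p["free_gb"] >= p["needed_gb"],
     "free up or add capacity"),
]

report = check_readiness(policy, checks)
print(report)
```

In this example only the license check fails, so the report lists one problem together with its recommended course of action.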

    SCALABLE AUXILIARY COPY PROCESSING IN A DATA STORAGE MANAGEMENT SYSTEM USING MEDIA AGENT RESOURCES

    Publication No.: US20180121115A1

    Publication date: 2018-05-03

    Application No.: US15851539

    Filing date: 2017-12-21

    CPC classification number: G06F3/0619 G06F3/061 G06F3/065 G06F3/067 G06F3/0683

    Abstract: A scalable approach is disclosed for processing auxiliary-copy jobs in a storage management system by using distributed media agent resources instead of a centralized storage manager. Enhanced media agents coordinate and control auxiliary-copy jobs and tap the storage manager to reserve data streams and provide job-specific metadata on demand. An enhanced storage manager may initially select a media agent as “coordinator” to coordinate auxiliary-copy jobs with any number of other media agents, which act as “controllers.” A coordinator media agent is generally responsible for obtaining data stream reservation information from the storage manager and assigning auxiliary-copy jobs to respective controller media agents, based on the components involved in the respective reserved data streams.
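The coordinator's job assignment can be sketched as grouping jobs by the media agent that appears in each reserved data stream. The reservation record shape and the `assign_jobs` name are assumptions; the abstract says only that assignment is based on the components involved in the reserved streams.

```python
def assign_jobs(reservations):
    """Group auxiliary-copy jobs by the media agent named in each reservation.

    reservations: list of dicts with hypothetical keys 'job' and 'media_agent',
    as a coordinator might obtain them from the storage manager.
    Returns controller media agent -> list of jobs it should control.
    """
    assignments = {}
    for reservation in reservations:
        assignments.setdefault(reservation["media_agent"], []).append(reservation["job"])
    return assignments


reservations = [
    {"job": "auxcopy_1", "media_agent": "ma_east"},
    {"job": "auxcopy_2", "media_agent": "ma_west"},
    {"job": "auxcopy_3", "media_agent": "ma_east"},
]
plan = assign_jobs(reservations)
print(plan)
```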
