OPTIMIZED RESTORATION OF DEDUPLICATED DATA STORED IN CLOUD-BASED STORAGE RESOURCES

    Publication No.: US20230315681A1

    Publication Date: 2023-10-05

    Application No.: US18330564

    Filing Date: 2023-06-07

    Abstract: Techniques disclosed herein are well suited to restoring deduplicated backup data from cloud-based storage and from multi-node replicated file systems, and they also improve performance in more traditional data storage technologies. Pre-restore steps include analysis of deduplication indexes to identify data segments that are stored consecutively on storage media. Reading data in aggregate runs of consecutively stored data segments reduces interactions with storage media that hosts the deduplicated data and speeds up retrieval. Parallel reads from multiple storage devices in multi-node replicated file systems also speed up retrieval. An illustrative enhanced media agent pre-fetches data (stored in deduplicated form) in anticipation of read requests that are expected in the restore operation. The pre-fetched data is temporarily stored locally at the media agent, which is responsible for interfacing with storage media and is further responsible for orchestrating the disclosed techniques within an illustrative data storage management system.
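
    Illustrative sketch (not taken from the patent): the pre-restore step of grouping consecutively stored segments into aggregate reads might look roughly like the following. The index layout (container ID, byte offset, length) and the name `coalesce_runs` are assumptions made for this example only.

```python
from typing import List, Tuple

# Hypothetical index entry for one deduplicated segment:
# (container_id, byte_offset, length). The real index format is not given in the abstract.
Segment = Tuple[str, int, int]


def coalesce_runs(segments: List[Segment]) -> List[Segment]:
    """Merge segments stored back-to-back in the same container into single reads."""
    runs: List[Segment] = []
    # Sorting orders entries by container, then by offset within the container.
    for container, offset, length in sorted(segments):
        if runs:
            last_container, last_offset, last_length = runs[-1]
            # Extend the current run only if this segment starts exactly where it ends.
            if container == last_container and last_offset + last_length == offset:
                runs[-1] = (last_container, last_offset, last_length + length)
                continue
        runs.append((container, offset, length))
    return runs


segments = [("c1", 4096, 4096), ("c1", 0, 4096), ("c1", 16384, 4096), ("c2", 0, 8192)]
runs = coalesce_runs(segments)  # four index entries collapse into three read requests
```

    Each resulting run can then be fetched with one request instead of one request per segment, which is the reduction in storage interactions the abstract describes.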

    OPTIMIZING THE RESTORATION OF DEDUPLICATED DATA STORED IN MULTI-NODE REPLICATED FILE SYSTEMS

    Publication No.: US20210173811A1

    Publication Date: 2021-06-10

    Application No.: US17079142

    Filing Date: 2020-10-23

    Abstract: Techniques disclosed herein are well suited to restoring deduplicated backup data from cloud-based storage and from multi-node replicated file systems, and they also improve performance in more traditional data storage technologies. Pre-restore steps include analysis of deduplication indexes to identify data segments that are stored consecutively on storage media. Reading data in aggregate runs of consecutively stored data segments reduces interactions with storage media that hosts the deduplicated data and speeds up retrieval. Parallel reads from multiple storage devices in multi-node replicated file systems also speed up retrieval. An illustrative enhanced media agent pre-fetches data (stored in deduplicated form) in anticipation of read requests that are expected in the restore operation. The pre-fetched data is temporarily stored locally at the media agent, which is responsible for interfacing with storage media and is further responsible for orchestrating the disclosed techniques within an illustrative data storage management system.

    EFFICIENT DEDUPLICATION DATABASE VALIDATION

    Publication No.: US20200226030A1

    Publication Date: 2020-07-16

    Application No.: US16743879

    Filing Date: 2020-01-15

    Abstract: According to certain aspects, a method can include receiving an indication that a restoration of a deduplication database using a secondary copy of a file associated with a secondary copy job is complete; retrieving a first data fingerprint from a data storage database, wherein the first data fingerprint is associated with the secondary copy job used to restore the deduplication database; retrieving a second data fingerprint from a deduplication database media agent, wherein the second data fingerprint is associated with the secondary copy job used to restore the deduplication database; comparing the first data fingerprint with the second data fingerprint to determine whether the first data fingerprint and the second data fingerprint match; and transmitting an instruction to the deduplication database media agent to rebuild the restored deduplication database in response to a determination that the first data fingerprint and the second data fingerprint do not match.
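
    Illustrative sketch (not taken from the patent): the comparison step described above could be modeled as below. SHA-256 as the fingerprint function and the names `fingerprint` and `validate_restored_ddb` are assumptions for this example; the abstract does not say how fingerprints are computed.

```python
import hashlib


def fingerprint(data: bytes) -> str:
    """Hypothetical fingerprint: SHA-256 hex digest of the secondary copy job's data."""
    return hashlib.sha256(data).hexdigest()


def validate_restored_ddb(storage_db_fp: str, media_agent_fp: str) -> str:
    """Return the action to take once the deduplication database restore completes.

    storage_db_fp  -- fingerprint retrieved from the data storage database
    media_agent_fp -- fingerprint retrieved from the deduplication database media agent
    """
    # A mismatch means the restored database does not reflect the job it was
    # restored from, so the media agent is told to rebuild it.
    return "ok" if storage_db_fp == media_agent_fp else "rebuild"


expected = fingerprint(b"secondary copy job 42")
action_when_intact = validate_restored_ddb(expected, fingerprint(b"secondary copy job 42"))
action_when_stale = validate_restored_ddb(expected, fingerprint(b"secondary copy job 41"))
```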

    RESTORE OF SECONDARY DATA USING THREAD POOLING

    Publication No.: US20180329635A1

    Publication Date: 2018-11-15

    Application No.: US15978019

    Filing Date: 2018-05-11

    Abstract: A system according to certain aspects may include a secondary storage controller computer configured to: in response to a first instruction to obtain a first secondary copy of a first data set from a secondary storage device(s), the first instruction associated with a first restore operation: instantiate a first restore thread on a processor of the secondary storage controller computer; using the first restore thread, retrieve the first secondary copy from the secondary storage device(s); and forward the retrieved first secondary copy to a primary storage subsystem for storage; and in response to a second instruction to obtain a second secondary copy of a second data set from the secondary storage device(s), the second instruction associated with a second restore operation: using the first restore thread, retrieve the second secondary copy from the secondary storage device(s); and forward the retrieved second secondary copy to the primary storage subsystem for storage.
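
    Illustrative sketch (not taken from the patent): reusing one restore thread across two restore operations can be approximated with a standard thread pool. The `restore` function is a hypothetical stand-in for retrieving a secondary copy and forwarding it to primary storage.

```python
import threading
from concurrent.futures import ThreadPoolExecutor
from typing import Tuple


def restore(copy_name: str) -> Tuple[str, int]:
    """Hypothetical stand-in: retrieve a secondary copy and forward it to primary storage.

    Returns the copy name and the ID of the thread that handled it.
    """
    return copy_name, threading.get_ident()


# A pool with a single worker creates the restore thread on the first request
# and keeps it alive, so the second restore operation reuses the same thread
# instead of instantiating a new one.
with ThreadPoolExecutor(max_workers=1) as pool:
    first = pool.submit(restore, "secondary-copy-1").result()
    second = pool.submit(restore, "secondary-copy-2").result()

same_thread = first[1] == second[1]
```

    The single-worker pool is chosen here only to make the reuse visible; a real controller would presumably size the pool to its workload.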

    SYSTEMS AND METHODS FOR OPTIMIZING RESTORATION OF DEDUPLICATED DATA STORED IN CLOUD-BASED STORAGE RESOURCES

    Publication No.: US20210173744A1

    Publication Date: 2021-06-10

    Application No.: US17079023

    Filing Date: 2020-10-23

    Abstract: Techniques disclosed herein are well suited to restoring deduplicated backup data from cloud-based storage and from multi-node replicated file systems, and they also improve performance in more traditional data storage technologies. Pre-restore steps include analysis of deduplication indexes to identify data segments that are stored consecutively on storage media. Reading data in aggregate runs of consecutively stored data segments reduces interactions with storage media that hosts the deduplicated data and speeds up retrieval. Parallel reads from multiple storage devices in multi-node replicated file systems also speed up retrieval. An illustrative enhanced media agent pre-fetches data (stored in deduplicated form) in anticipation of read requests that are expected in the restore operation. The pre-fetched data is temporarily stored locally at the media agent, which is responsible for interfacing with storage media and is further responsible for orchestrating the disclosed techniques within an illustrative data storage management system.

    EFFECIENT VERIFICATION OF DATA BLOCKS IN DEDUPLICATED STORAGE SYSTEMS

    Publication No.: US20200293213A1

    Publication Date: 2020-09-17

    Application No.: US16817399

    Filing Date: 2020-03-12

    Abstract: The exemplary system and methods provide a solution for reclaiming the space occupied by invalid blocks of data stored on secondary storage devices. The space reclamation techniques may employ a primary table, a deduplication chunk table, or an index data structure located with the secondary copy of data on secondary storage devices. One exemplary method uses information from the deduplication database media agent and a deduplication chunk table. Another exemplary method uses an index (e.g., single instance file index) that is associated with and stored with each chunk of data. Based on the information provided by the deduplication chunk table and the index, exemplary system and methods identify blocks of invalid data and copy over only valid data blocks to a new container file.
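
    Illustrative sketch (not taken from the patent): copying only valid blocks into a new container can be pictured as below. Modeling a container as a dictionary of block ID to bytes, and the name `compact_container`, are simplifying assumptions; the set of valid IDs stands in for what the deduplication chunk table or per-chunk index would report.

```python
from typing import Dict, Set


def compact_container(blocks: Dict[str, bytes], valid_ids: Set[str]) -> Dict[str, bytes]:
    """Build a new container holding only the blocks still marked valid.

    blocks    -- hypothetical old container: block ID -> block data
    valid_ids -- block IDs that the chunk table or index reports as still referenced
    """
    # Invalid blocks are simply not copied, so their space is reclaimed
    # once the old container is discarded.
    return {block_id: data for block_id, data in blocks.items() if block_id in valid_ids}


old_container = {"b1": b"aaaa", "b2": b"bbbb", "b3": b"cccc"}
new_container = compact_container(old_container, {"b1", "b3"})
reclaimed_bytes = sum(map(len, old_container.values())) - sum(map(len, new_container.values()))
```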

    RESTORE OF SECONDARY DATA USING THREAD POOLING

    Publication No.: US20200241766A1

    Publication Date: 2020-07-30

    Application No.: US16722756

    Filing Date: 2019-12-20

    Abstract: A system according to certain aspects may include a secondary storage controller computer configured to: in response to a first instruction to obtain a first secondary copy of a first data set from a secondary storage device(s), the first instruction associated with a first restore operation: instantiate a first restore thread on a processor of the secondary storage controller computer; using the first restore thread, retrieve the first secondary copy from the secondary storage device(s); and forward the retrieved first secondary copy to a primary storage subsystem for storage; and in response to a second instruction to obtain a second secondary copy of a second data set from the secondary storage device(s), the second instruction associated with a second restore operation: using the first restore thread, retrieve the second secondary copy from the secondary storage device(s); and forward the retrieved second secondary copy to the primary storage subsystem for storage.
