Hash migration using a gold image library management system

    公开(公告)号:US11797206B2

    公开(公告)日:2023-10-24

    申请号:US17200506

    申请日:2021-03-12

    Abstract: Embodiments for migrating hash values for backup data blocks in a network of data protection targets (DPTs) and a common data protection target implementing a Gold image library management system in which backups of Gold images used as templates for physical machines and virtual machines are stored on the CDPT. The CDPT and each DPT stores backup data split into chunks that are uniquely identified by a respective hash of its contents, and maintains data structures comprising the hash, chunk size, chunk data, and a list of DPT and CDPT identifiers. The hashes are partitioned into a set of buckets in the CDPT. A Bloom filter is generated for each bucket of hashes, and stored in each DPT so that each DPT stores Bloom filters for all CDPTs in the network. Each DPT checks its list of hashes against the Bloom filters in each of the DPTs to determine whether to keep or free chunks of data.

    Automatic detection and identification of gold image library files and directories

    公开(公告)号:US11514100B2

    公开(公告)日:2022-11-29

    申请号:US17174840

    申请日:2021-02-12

    Abstract: Reducing backup data by providing a data protection target for storing content data from clients running operating system and applications, and a common data protection target (CDPT) separate from the data protection target for storing Gold image data for the operating system and applications. Gold image data is stored by a user in a defined location or tagged with a defined tag. The stored or tagged Gold image data is identified by the backup software and copied to the CDPT. The identified Gold image data is cataloged. During backup, the CDPT is queried to determine if the Gold image exists, and if so, the DP target does not store the Gold image data in the DP target, but uses the reference to indicate the location of the Gold image data corresponding to the backed up content data. During a restore, the restore stream is built from the DP target and CDPT to combine user content data and Gold image data.

    COPY REUSE USING GOLD IMAGES
    3.
    发明申请

    公开(公告)号:US20220197752A1

    公开(公告)日:2022-06-23

    申请号:US17174921

    申请日:2021-02-12

    Abstract: Facilitating efficient copy reuse of point-in-time (PIT) backup data in a data storage system by providing a data protection target (DPT) for storing user the PIT backup data, and a common data protection target (CDPT) accessible to but separate from the data protection target for storing Gold image data comprising structural data for operating system and application programs as defined by a manufacturer and different from the backed user content data. A Gold image copy reuse coordinator component or process receives a selection of a Gold image to be combined with a specified PIT backup dataset, and combines the specified PIT backup dataset with the selected Gold image to form a synthetic copy of the specified PIT backup dataset stored in the DPT. The synthetic copy can then be exposed to a system through a file share protocol for reuse by a user.

    Primary array data dedup/compression using block backup statistics

    公开(公告)号:US10565068B1

    公开(公告)日:2020-02-18

    申请号:US15491011

    申请日:2017-04-19

    Abstract: A backup copy of a production device is used to quantify suitability of host application data for placement on individual managed drives and virtualized managed drives based on storage capabilities associated with those drives. A data analysis program on a data backup storage array may generate block backup statistics to indicate that a production device or certain chunks, blocks or volumes of host application data are highly compressible or reducible via deduplication. The block backup statistics are sent from the data backup storage array to the primary storage array. The primary storage array uses the block backup statistics to select a particular storage resource with suitable storage capabilities for the data. Highly compressible data may be stored on a storage virtualization storage array with data compression capability, and data that is neither highly compressible nor reducible with deduplication may be stored on local resources.

    Gold image library management system to reduce backup storage and bandwidth utilization

    公开(公告)号:US11513904B2

    公开(公告)日:2022-11-29

    申请号:US17124957

    申请日:2020-12-17

    Abstract: Reducing backup data by providing a data protection target for storing content data from clients running operating system and applications, and a common data protection target (CDPT) separate from the data protection target for storing Gold image data for the operating system and applications. During backup, user content data is copied from the client to the data protection target, and Gold image data is copied to the CDPT. It is also referenced in the DP target to prevent redundant storage. During backup, the CDPT is queried to determine if the Gold image exists, and if so, the DP target does not store the Gold image data in the DP target, but uses the reference to indicate the location of the Gold image data corresponding to the backed up content data. During a restore, the restore stream is built from the DP target and CDPT to combine user content data and Gold image data.

    System and method of improving deduplicated storage tier management for primary storage arrays by including workload aggregation statistics

    公开(公告)号:US10705733B1

    公开(公告)日:2020-07-07

    申请号:US15280686

    申请日:2016-09-29

    Abstract: Embodiments provide for a primary storage array having multiple storage tiers. The multiple storage tiers include one or more performance tiers and at least one deduplicated storage tier storing deduplicated data. One embodiment provides for a data storage system including management logic to manage the deduplicated storage tier. The management logic can receive a request to store deduplicated data and respond to the request to store the deduplicated data based on workload statistics associated with the data. In one embodiment the response to the request includes storing at least a portion of the deduplicated data to a first storage device in the second set of storage devices and the workload statistics include input/output (I/O) statistics associated with the unit of deduplicated data. In other embodiments the response includes reallocating drives between storage tiers and/or providing migration hints with respect to data on the deduplicated storage tier.

    Sustaining backup service level objectives using dynamic resource allocation

    公开(公告)号:US10587686B2

    公开(公告)日:2020-03-10

    申请号:US16259819

    申请日:2019-01-28

    Abstract: A backup engine of a first storage system receives a request to perform a backup session from the first storage system to a second storage system based on a backup service level objective (SLO) that has been configured for the backup session. In response to the request, it is determined that a first backup resource allocated for the backup session by the first storage system cannot satisfy the SLO based on statistics of prior backup sessions in view of characteristics of the backup session to be performed. A dynamic resource allocation (DRA) module is to dynamically perform a first DRA to modify the first backup resource to satisfy the SLO. The backup engine then initiates the backup session by transmitting backup data from the first storage system to the second storage system using the modified first backup resource.

    HASH MIGRATION USING A GOLD IMAGE LIBRARY MANAGEMENT SYSTEM

    公开(公告)号:US20220197526A1

    公开(公告)日:2022-06-23

    申请号:US17200506

    申请日:2021-03-12

    Abstract: Embodiments for migrating hash values for backup data blocks in a network of data protection targets (DPTs) and a common data protection target implementing a Gold image library management system in which backups of Gold images used as templates for physical machines and virtual machines are stored on the CDPT. The CDPT and each DPT stores backup data split into chunks that are uniquely identified by a respective hash of its contents, and maintains data structures comprising the hash, chunk size, chunk data, and a list of DPT and CDPT identifiers. The hashes are partitioned into a set of buckets in the CDPT. A Bloom filter is generated for each bucket of hashes, and stored in each DPT so that each DPT stores Bloom filters for all CDPTs in the network. Each DPT checks its list of hashes against the Bloom filters in each of the DPTs to determine whether to keep or free chunks of data.

Patent Agency Ranking