-
Publication number: US11797206B2
Publication date: 2023-10-24
Application number: US17200506
Application date: 2021-03-12
Applicant: EMC IP Holding Company LLC
Inventor: Arun Murti , Mark Malamut , Stephen Smaldone
CPC classification number: G06F3/0641 , G06F3/067 , G06F3/0608 , G06F3/0619 , G06F3/0647 , G06F9/5083
Abstract: Embodiments are described for migrating hash values for backup data blocks in a network of data protection targets (DPTs) and a common data protection target (CDPT) implementing a Gold image library management system, in which backups of Gold images used as templates for physical machines and virtual machines are stored on the CDPT. The CDPT and each DPT store backup data split into chunks, each uniquely identified by a hash of its contents, and maintain data structures comprising the hash, chunk size, chunk data, and a list of DPT and CDPT identifiers. The hashes are partitioned into a set of buckets in the CDPT. A Bloom filter is generated for each bucket of hashes and stored in each DPT, so that each DPT stores Bloom filters for all CDPTs in the network. Each DPT checks its list of hashes against the Bloom filters in each of the DPTs to determine whether to keep or free chunks of data.
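The bucket-and-filter arrangement in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the SHA-256 chunk hash, the bucket count, the filter size, the probe derivation, and every function name below are assumptions chosen only for illustration. A Bloom filter can report false positives but never false negatives, so a match means only "possibly present on the CDPT".

```python
import hashlib

NUM_BUCKETS = 4      # assumed number of hash buckets on the CDPT
FILTER_BITS = 1024   # assumed Bloom filter size in bits
NUM_PROBES = 3       # assumed number of bit positions set per hash


def chunk_hash(data: bytes) -> bytes:
    """Identify a chunk by a hash of its contents (SHA-256 is an assumption)."""
    return hashlib.sha256(data).digest()


def bucket_of(h: bytes) -> int:
    """Partition hashes into buckets using the first byte of the hash."""
    return h[0] % NUM_BUCKETS


def probe_positions(h: bytes):
    """Derive NUM_PROBES bit positions from disjoint 4-byte slices of the hash."""
    for i in range(NUM_PROBES):
        yield int.from_bytes(h[4 * i + 1: 4 * i + 5], "big") % FILTER_BITS


class BloomFilter:
    def __init__(self):
        self.bits = 0  # a Python int used as a bit array

    def add(self, h: bytes) -> None:
        for p in probe_positions(h):
            self.bits |= 1 << p

    def might_contain(self, h: bytes) -> bool:
        return all((self.bits >> p) & 1 for p in probe_positions(h))


def build_bucket_filters(cdpt_hashes):
    """Build one Bloom filter per bucket of CDPT hashes."""
    filters = [BloomFilter() for _ in range(NUM_BUCKETS)]
    for h in cdpt_hashes:
        filters[bucket_of(h)].add(h)
    return filters


def possibly_on_cdpt(dpt_hashes, filters):
    """Return the DPT hashes that the filters report as possibly held by the CDPT."""
    return [h for h in dpt_hashes if filters[bucket_of(h)].might_contain(h)]
```

Because of false positives, a DPT following this sketch would presumably confirm a match with the CDPT before freeing a chunk; the abstract does not say how that decision is finalized.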
-
Publication number: US11514100B2
Publication date: 2022-11-29
Application number: US17174840
Application date: 2021-02-12
Applicant: EMC IP Holding Company LLC
Inventor: Mark Malamut , Stephen Smaldone , Arun Murti
IPC: G06F16/50 , G06F16/535 , G06F16/51 , G06F9/455
Abstract: Reducing backup data by providing a data protection target for storing content data from clients running operating system and applications, and a common data protection target (CDPT) separate from the data protection target for storing Gold image data for the operating system and applications. Gold image data is stored by a user in a defined location or tagged with a defined tag. The stored or tagged Gold image data is identified by the backup software and copied to the CDPT. The identified Gold image data is cataloged. During backup, the CDPT is queried to determine if the Gold image exists, and if so, the DP target does not store the Gold image data in the DP target, but uses the reference to indicate the location of the Gold image data corresponding to the backed up content data. During a restore, the restore stream is built from the DP target and CDPT to combine user content data and Gold image data.
-
Publication number: US20220197752A1
Publication date: 2022-06-23
Application number: US17174921
Application date: 2021-02-12
Applicant: EMC IP Holding Company LLC
Inventor: Arun Murti , Mark Malamut , Stephen Smaldone
IPC: G06F11/14
Abstract: Facilitating efficient copy reuse of point-in-time (PIT) backup data in a data storage system by providing a data protection target (DPT) for storing the user PIT backup data, and a common data protection target (CDPT) accessible to but separate from the data protection target for storing Gold image data comprising structural data for operating system and application programs as defined by a manufacturer and different from the backed up user content data. A Gold image copy reuse coordinator component or process receives a selection of a Gold image to be combined with a specified PIT backup dataset, and combines the specified PIT backup dataset with the selected Gold image to form a synthetic copy of the specified PIT backup dataset stored in the DPT. The synthetic copy can then be exposed to a system through a file share protocol for reuse by a user.
-
Publication number: US10565068B1
Publication date: 2020-02-18
Application number: US15491011
Application date: 2017-04-19
Applicant: EMC IP HOLDING COMPANY LLC
Inventor: Arieh Don , Gabriel Benhanokh , Ian Wigmore , Stephen Smaldone
Abstract: A backup copy of a production device is used to quantify suitability of host application data for placement on individual managed drives and virtualized managed drives based on storage capabilities associated with those drives. A data analysis program on a data backup storage array may generate block backup statistics to indicate that a production device or certain chunks, blocks or volumes of host application data are highly compressible or reducible via deduplication. The block backup statistics are sent from the data backup storage array to the primary storage array. The primary storage array uses the block backup statistics to select a particular storage resource with suitable storage capabilities for the data. Highly compressible data may be stored on a storage virtualization storage array with data compression capability, and data that is neither highly compressible nor reducible with deduplication may be stored on local resources.
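The placement decision described in this abstract can be sketched as a small Python function. The ratio-based statistics, the threshold values, the resource labels, and the names `BlockBackupStats` and `select_storage` are all hypothetical. The abstract states a destination only for highly compressible data and for data that is neither compressible nor deduplicable, so the "dedup-capable" branch is an assumption.

```python
from dataclasses import dataclass

# Assumed cut-offs for "highly" compressible or deduplicable data.
COMPRESSION_THRESHOLD = 2.0
DEDUP_THRESHOLD = 2.0


@dataclass
class BlockBackupStats:
    """Hypothetical statistics sent from the backup array to the primary array."""
    compression_ratio: float  # original size / compressed size
    dedup_ratio: float        # logical size / size after deduplication


def select_storage(stats: BlockBackupStats) -> str:
    """Pick a storage resource whose capabilities suit the data."""
    if stats.compression_ratio >= COMPRESSION_THRESHOLD:
        # Highly compressible data goes to a virtualized array that compresses.
        return "virtualized-compressing-array"
    if stats.dedup_ratio >= DEDUP_THRESHOLD:
        # Assumption: deduplicable data goes to a dedup-capable resource.
        return "dedup-capable-resource"
    # Neither compressible nor deduplicable: keep on local resources.
    return "local"
```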
-
Publication number: US11513904B2
Publication date: 2022-11-29
Application number: US17124957
Application date: 2020-12-17
Applicant: EMC IP Holding Company LLC
Inventor: Mark Malamut , Stephen Smaldone
IPC: G06F16/24 , G06F11/14 , G06F16/2455
Abstract: Reducing backup data by providing a data protection target for storing content data from clients running operating system and applications, and a common data protection target (CDPT) separate from the data protection target for storing Gold image data for the operating system and applications. During backup, user content data is copied from the client to the data protection target, and Gold image data is copied to the CDPT. The Gold image data is also referenced in the DP target to prevent redundant storage. During backup, the CDPT is queried to determine if the Gold image exists, and if so, the DP target does not store the Gold image data in the DP target, but uses the reference to indicate the location of the Gold image data corresponding to the backed up content data. During a restore, the restore stream is built from the DP target and CDPT to combine user content data and Gold image data.
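The backup and restore flow in this abstract can be sketched in Python as follows. The class names, the dictionary-based storage, the `gold_ref` field, and the byte concatenation used to build the restore stream are illustrative assumptions, not details taken from the patent.

```python
from dataclasses import dataclass, field


@dataclass
class CommonTarget:
    """Stand-in for the CDPT: Gold image data keyed by an image id."""
    images: dict = field(default_factory=dict)

    def has_image(self, image_id: str) -> bool:
        return image_id in self.images


@dataclass
class ProtectionTarget:
    """Stand-in for the DP target: user content plus a Gold image reference."""
    backups: dict = field(default_factory=dict)


def backup(client_id, image_id, image_data, user_data, dpt, cdpt):
    # Query the CDPT first; copy the Gold image only if it is not there yet.
    if not cdpt.has_image(image_id):
        cdpt.images[image_id] = image_data
    # The DP target keeps the user content and a reference, never the image.
    dpt.backups[client_id] = {"gold_ref": image_id, "content": user_data}


def restore(client_id, dpt, cdpt) -> bytes:
    # Build the restore stream from both targets.
    record = dpt.backups[client_id]
    return cdpt.images[record["gold_ref"]] + record["content"]
```

With two clients built from the same Gold image, the CDPT in this sketch ends up holding one copy of the image, while each DP target record holds only that client's content and the reference.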
-
Publication number: US11269554B2
Publication date: 2022-03-08
Application number: US16901388
Application date: 2020-06-15
Applicant: EMC IP Holding Company, LLC
Inventor: Stephen Smaldone , Ian Wigmore , Arieh Don
Abstract: In a data processing system having multiple production hosts, each with a backup agent, interfaced to a storage array and to a backup server, an array I/O priority engine on the storage array establishes and enforces I/O priorities for I/O backup and production communications with the array to ensure that backup does not consume a disproportionate CPU processing and array bandwidth load or disrupt more critical production processing.
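One simple way to picture the priority enforcement in this abstract is a strict-priority request queue, sketched below in Python. The two priority classes, the `ArrayIOQueue` name, and the use of `heapq` are assumptions. The abstract does not describe the engine's actual policy, and a real engine would presumably also bound how long backup I/O may wait.

```python
import heapq
from itertools import count

# Assumed priority classes: a lower number is served first.
PRIORITY = {"production": 0, "backup": 1}


class ArrayIOQueue:
    """Hypothetical array-side queue that serves production I/O before backup I/O."""

    def __init__(self):
        self._heap = []
        self._seq = count()  # tie-breaker that keeps FIFO order within a class

    def submit(self, kind: str, request: str) -> None:
        heapq.heappush(self._heap, (PRIORITY[kind], next(self._seq), request))

    def next_request(self) -> str:
        return heapq.heappop(self._heap)[2]

    def __len__(self) -> int:
        return len(self._heap)
```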
-
Publication number: US10705733B1
Publication date: 2020-07-07
Application number: US15280686
Application date: 2016-09-29
Applicant: EMC IP Holding Company LLC
Inventor: Stephen Smaldone , Ian Wigmore , Arieh Don
IPC: G06F3/06
Abstract: Embodiments provide for a primary storage array having multiple storage tiers. The multiple storage tiers include one or more performance tiers and at least one deduplicated storage tier storing deduplicated data. One embodiment provides for a data storage system including management logic to manage the deduplicated storage tier. The management logic can receive a request to store deduplicated data and respond to the request to store the deduplicated data based on workload statistics associated with the data. In one embodiment the response to the request includes storing at least a portion of the deduplicated data to a first storage device in the second set of storage devices and the workload statistics include input/output (I/O) statistics associated with the unit of deduplicated data. In other embodiments the response includes reallocating drives between storage tiers and/or providing migration hints with respect to data on the deduplicated storage tier.
-
Publication number: US10587686B2
Publication date: 2020-03-10
Application number: US16259819
Application date: 2019-01-28
Applicant: EMC IP Holding Company LLC
Inventor: Ian Wigmore , Stephen Smaldone , Arieh Don
Abstract: A backup engine of a first storage system receives a request to perform a backup session from the first storage system to a second storage system based on a backup service level objective (SLO) that has been configured for the backup session. In response to the request, it is determined that a first backup resource allocated for the backup session by the first storage system cannot satisfy the SLO based on statistics of prior backup sessions in view of characteristics of the backup session to be performed. A dynamic resource allocation (DRA) module is to dynamically perform a first DRA to modify the first backup resource to satisfy the SLO. The backup engine then initiates the backup session by transmitting backup data from the first storage system to the second storage system using the modified first backup resource.
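The SLO check and dynamic resource allocation in this abstract can be sketched in Python. The sketch assumes that the SLO is a maximum session duration in hours, that the backup resource being modified is a number of parallel streams, and that throughput scales linearly with stream count. None of these details, nor the function names, come from the patent.

```python
def per_stream_rate(prior_sessions):
    """Mean GB/hour per stream, from (gb_transferred, hours, streams) tuples."""
    rates = [gb / (hours * streams) for gb, hours, streams in prior_sessions]
    return sum(rates) / len(rates)


def allocate_streams(data_gb, slo_hours, initial_streams, prior_sessions, max_streams):
    """Grow the stream count until the estimated duration meets the SLO."""
    rate = per_stream_rate(prior_sessions)
    streams = initial_streams
    # Estimated duration = data size / (streams * per-stream throughput).
    while streams < max_streams and data_gb / (streams * rate) > slo_hours:
        streams += 1
    return streams
```

If `max_streams` is reached first, the function returns that cap even though the estimate still exceeds the SLO; how such a case is handled is not covered by the abstract.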
-
Publication number: US10452543B1
Publication date: 2019-10-22
Application number: US15717779
Application date: 2017-09-27
Applicant: EMC IP Holding Company LLC
Inventor: Stephen Smaldone , Ian Wigmore , Arieh Don
IPC: G06F13/00 , G06F12/0804 , G06F16/172 , G06F16/182
Abstract: Embodiments are described for a multi-node file system, such as a clustered or distributed file system, with a file system buffer cache and an additional host-side tier non-volatile storage cache such as 3DXP storage. Cache coherency can be maintained by one of three models: (i) host-side tier management, (ii) file system management, or (iii) storage array management. Embodiments further include performing a storage tier-specific file system action in a file system that comprises a namespace that spans multiple tiers of storage.
-
Publication number: US20220197526A1
Publication date: 2022-06-23
Application number: US17200506
Application date: 2021-03-12
Applicant: EMC IP Holding Company LLC
Inventor: Arun Murti , Mark Malamut , Stephen Smaldone
Abstract: Embodiments are described for migrating hash values for backup data blocks in a network of data protection targets (DPTs) and a common data protection target (CDPT) implementing a Gold image library management system, in which backups of Gold images used as templates for physical machines and virtual machines are stored on the CDPT. The CDPT and each DPT store backup data split into chunks, each uniquely identified by a hash of its contents, and maintain data structures comprising the hash, chunk size, chunk data, and a list of DPT and CDPT identifiers. The hashes are partitioned into a set of buckets in the CDPT. A Bloom filter is generated for each bucket of hashes and stored in each DPT, so that each DPT stores Bloom filters for all CDPTs in the network. Each DPT checks its list of hashes against the Bloom filters in each of the DPTs to determine whether to keep or free chunks of data.