-
公开(公告)号:US09846718B1
公开(公告)日:2017-12-19
申请号:US14231162
申请日:2014-03-31
申请人: EMC Corporation
发明人: Richard P. Ruef , Ying Hu , Kurt William Everson
CPC分类号: G06F17/30371 , G06F3/0608 , G06F3/0641 , G06F3/067 , G06F3/0689 , G06F17/30159
摘要: A method is used in deduplicating sets of data blocks. A candidate data object is identified for deduplicating a data object. A digest associated with the candidate data object matches a digest associated with the data object. Digest information of a set of data objects is evaluated. The set of data objects are selected for evaluation based on an association between location of the set of data objects and location of the candidate data object. Based on the evaluation, a deduplicating technique is applied for deduplicating the data object.
-
公开(公告)号:US11327948B1
公开(公告)日:2022-05-10
申请号:US15198425
申请日:2016-06-30
申请人: EMC Corporation
IPC分类号: G06F16/23 , G06F16/215 , G06F16/22
摘要: A method is used in managing deduplication of data in storage systems. A candidate data object is identified for deduplicating a data object by evaluating digests stored in a current digest segment to determine whether another digest matching a digest associated with the data block is stored in the current digest segment. The current digest segment includes a set of digests associated with a set of data blocks previously received for deduplication. Based on the evaluation, a deduplicating technique is applied to the data object. The current digest segment is stored in an index table. A previous digest segment associated with a digest stored in the index table matches the digest associated with the data block is replaced by the current digest segment.
-
公开(公告)号:US11074232B1
公开(公告)日:2021-07-27
申请号:US15198334
申请日:2016-06-30
申请人: EMC Corporation
IPC分类号: G06F16/215 , G06F16/22
摘要: A method is used in managing deduplication of data in storage systems. A digest is determined for a data object received for deduplication. A candidate data object is identified for deduplicating the data object. A digest associated with the candidate data object matches the digest associated with the data object. The digest in a digest segment is maintained based on identification of the candidate data object. The digest segment includes a set of digests associated with a set of data blocks identified for deduplication in an ordered arrangement.
-
-