Managing deduplication of data in storage systems

    公开(公告)号:US11327948B1

    公开(公告)日:2022-05-10

    申请号:US15198425

    申请日:2016-06-30

    申请人: EMC Corporation

    摘要: A method is used in managing deduplication of data in storage systems. A candidate data object is identified for deduplicating a data object by evaluating digests stored in a current digest segment to determine whether another digest matching a digest associated with the data block is stored in the current digest segment. The current digest segment includes a set of digests associated with a set of data blocks previously received for deduplication. Based on the evaluation, a deduplicating technique is applied to the data object. The current digest segment is stored in an index table. A previous digest segment associated with a digest stored in the index table matches the digest associated with the data block is replaced by the current digest segment.

    Managing deduplication of data in storage systems

    公开(公告)号:US11074232B1

    公开(公告)日:2021-07-27

    申请号:US15198334

    申请日:2016-06-30

    申请人: EMC Corporation

    IPC分类号: G06F16/215 G06F16/22

    摘要: A method is used in managing deduplication of data in storage systems. A digest is determined for a data object received for deduplication. A candidate data object is identified for deduplicating the data object. A digest associated with the candidate data object matches the digest associated with the data object. The digest in a digest segment is maintained based on identification of the candidate data object. The digest segment includes a set of digests associated with a set of data blocks identified for deduplication in an ordered arrangement.