Invention Grant
- Patent Title: Data deduplication utilizing extent ID database
-
Application No.: US14559317Application Date: 2014-12-03
-
Publication No.: US09659047B2Publication Date: 2017-05-23
- Inventor: Alok Sharma , Satbir Singh , Sudhanshu Gupta
- Applicant: NetApp, Inc.
- Applicant Address: US CA Sunnyvale
- Assignee: NetApp, Inc.
- Current Assignee: NetApp, Inc.
- Current Assignee Address: US CA Sunnyvale
- Agency: Gilliam IP PLLC
- Main IPC: G06F12/00
- IPC: G06F12/00 ; G06F13/00 ; G06F13/28 ; G06F17/30 ; G06F3/06

Abstract:
An extent map (EMAP) database may include one or more extent map entries configured to map extent IDs to PVBNs. Each extent ID may be apportioned into a most significant bit (MSB) portion, i.e., checksum bits, and a least significant bit (LSB) portion, i.e., duplicate bits. A hash may be applied to the data of the extent to calculate the checksum bits, which illustratively represent a fingerprint of the data. The duplicate bits may be configured to denote any reoccurrence of the checksum bits in the EMAP database, i.e., whether there is an existing extent with potentially identical data in a volume of the aggregate. Each extent map entry may be inserted on a node having one or more key/value pairs, wherein the key is the extent ID and the value is the PVBN. The EMAP database may be scanned and utilized to perform data deduplication.
Public/Granted literature
- US20160162207A1 SYSTEM AND METHOD FOR DATA DEDUPLICATION UTILIZING EXTENT ID DATABASE Public/Granted day:2016-06-09
Information query