- 专利标题: Data deduplication utilizing extent ID database
-
申请号: US14559317申请日: 2014-12-03
-
公开(公告)号: US09659047B2公开(公告)日: 2017-05-23
- 发明人: Alok Sharma , Satbir Singh , Sudhanshu Gupta
- 申请人: NetApp, Inc.
- 申请人地址: US CA Sunnyvale
- 专利权人: NetApp, Inc.
- 当前专利权人: NetApp, Inc.
- 当前专利权人地址: US CA Sunnyvale
- 代理机构: Gilliam IP PLLC
- 主分类号: G06F12/00
- IPC分类号: G06F12/00 ; G06F13/00 ; G06F13/28 ; G06F17/30 ; G06F3/06
摘要:
An extent map (EMAP) database may include one or more extent map entries configured to map extent IDs to PVBNs. Each extent ID may be apportioned into a most significant bit (MSB) portion, i.e., checksum bits, and a least significant bit (LSB) portion, i.e., duplicate bits. A hash may be applied to the data of the extent to calculate the checksum bits, which illustratively represent a fingerprint of the data. The duplicate bits may be configured to denote any reoccurrence of the checksum bits in the EMAP database, i.e., whether there is an existing extent with potentially identical data in a volume of the aggregate. Each extent map entry may be inserted on a node having one or more key/value pairs, wherein the key is the extent ID and the value is the PVBN. The EMAP database may be scanned and utilized to perform data deduplication.
公开/授权文献
信息查询