-
Publication No.: US11068182B2
Publication date: 2021-07-20
Application No.: US16683466
Application date: 2019-11-14
Applicant: NetApp Inc.
Inventor: Alok Sharma , Girish Hebbale Venkata Subbaiah , Kartik Rathnakar , Venkateswarlu Tella , Mukul Sharma
Abstract: Techniques are provided for asynchronous semi-inline deduplication. A multi-tiered storage arrangement comprises a first storage tier, a second storage tier, etc. An in-memory change log of data recently written to the first storage tier is evaluated to identify a fingerprint of a data block recently written to the first storage tier. A donor data store, comprising fingerprints of data blocks already stored within the first storage tier, is queried using the fingerprint. If the fingerprint is found, then deduplication is performed for the data block to create deduplicated data based upon a potential donor data block within the first storage tier. The deduplicated data is moved from the first storage tier to the second storage tier, such as in response to a determination that the deduplicated data has not been recently accessed. The deduplication is performed before cold data is moved from the first storage tier to the second storage tier.
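Illustrative sketch (not part of the patent record): the flow described in the abstract above might look roughly like the following toy model. The class names, the SHA-256 fingerprint, the logical clock, and the `max_idle` threshold are all assumptions made for illustration; the abstract does not specify any of them.

```python
import hashlib
from collections import deque


class Tier:
    """Toy storage tier: logical block numbers map to shared physical blocks."""

    def __init__(self):
        self.physical = {}  # physical block number -> bytes
        self.refs = {}      # logical block number -> physical block number


class FirstTier(Tier):
    def __init__(self):
        super().__init__()
        self.donors = {}           # fingerprint -> physical block number
        self.change_log = deque()  # (logical, physical, fingerprint) per recent write
        self.last_access = {}      # logical block number -> logical clock tick
        self._next_pbn = 0

    def write(self, lbn, data, now):
        pbn = self._next_pbn
        self._next_pbn += 1
        self.physical[pbn] = data
        self.refs[lbn] = pbn
        self.last_access[lbn] = now
        # Assumption: SHA-256 stands in for whatever fingerprint is really used.
        self.change_log.append((lbn, pbn, hashlib.sha256(data).hexdigest()))

    def dedupe_recent_writes(self):
        """Drain the change log, sharing blocks whose fingerprint has a donor."""
        shared = 0
        while self.change_log:
            lbn, pbn, fp = self.change_log.popleft()
            if self.refs.get(lbn) != pbn:
                continue  # block was overwritten after it was logged
            donor = self.donors.get(fp)
            if donor is None:
                self.donors[fp] = pbn  # first copy becomes a potential donor
            elif self.physical[donor] == self.physical[pbn]:  # guard against collisions
                self.refs[lbn] = donor
                del self.physical[pbn]
                shared += 1
        return shared

    def move_cold(self, second, now, max_idle):
        """Move blocks idle for more than max_idle ticks to the second tier."""
        cold = [l for l, t in self.last_access.items() if now - t > max_idle]
        for lbn in cold:
            pbn = self.refs.pop(lbn)
            del self.last_access[lbn]
            second.refs[lbn] = pbn
            second.physical.setdefault(pbn, self.physical[pbn])
        # Drop physical blocks and donor entries no longer referenced here.
        live = set(self.refs.values())
        for pbn in [p for p in self.physical if p not in live]:
            del self.physical[pbn]
        self.donors = {fp: p for fp, p in self.donors.items() if p in live}
        return len(cold)


first, second = FirstTier(), Tier()
first.write(0, b"A" * 4096, now=0)
first.write(1, b"A" * 4096, now=0)
first.write(2, b"B" * 4096, now=5)
shared = first.dedupe_recent_writes()                # deduplicate before tiering
moved = first.move_cold(second, now=10, max_idle=8)  # then move cold data
```

Because deduplication runs before the move, the two cold logical blocks arrive in the second tier backed by a single physical block. Reference counting and garbage collection are deliberately simplified in this sketch.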
-
Publication No.: US20200159432A1
Publication date: 2020-05-21
Application No.: US16774127
Application date: 2020-01-28
Applicant: NetApp Inc.
Inventor: Mukul Sharma , Kartik Rathnakar , Dnyaneshwar Nagorao Pawar , Venkateswarlu Tella , Kiran Nenmeli Srinivasan , Rajesh Khandelwal , Alok Sharma
IPC: G06F3/06
Abstract: One or more techniques and/or computing devices are provided for inline deduplication. For example, a checksum hash table and/or a block number hash table may be maintained within memory (e.g., a storage controller may maintain the hash tables in-core). The checksum hash table may be utilized for inline deduplication to identify potential donor blocks that may comprise the same data as an incoming storage operation. Data within an in-core buffer cache is eligible as potential donor blocks so that inline deduplication may be performed using data from the in-core buffer cache, which may mitigate disk access to underlying storage for which the in-core buffer cache is used for caching. The block number hash table may be used for updating or removing entries from the hash tables, such as for blocks that are no longer eligible as potential donor blocks (e.g., deleted blocks, blocks evicted from the in-core buffer cache, etc.).
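Illustrative sketch (not part of the patent record): the two in-memory hash tables described in the abstract above could be modeled as below. The class and method names, the Adler-32 checksum, and the LRU eviction policy are assumptions for illustration only.

```python
import zlib
from collections import OrderedDict


class InlineDedupCache:
    """Toy in-core buffer cache whose blocks serve as potential donors."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.buffer_cache = OrderedDict()  # block number -> data, in LRU order
        self.by_checksum = {}              # checksum hash table: checksum -> block number
        self.by_block = {}                 # block number hash table: block number -> checksum
        self._next_bn = 0

    def _forget(self, bn):
        """Remove a block that is no longer eligible as a donor from both tables."""
        checksum = self.by_block.pop(bn, None)
        if checksum is not None and self.by_checksum.get(checksum) == bn:
            del self.by_checksum[checksum]

    def _cache(self, bn, data):
        self.buffer_cache[bn] = data
        checksum = zlib.adler32(data)
        self.by_checksum[checksum] = bn
        self.by_block[bn] = checksum
        while len(self.buffer_cache) > self.capacity:
            evicted, _ = self.buffer_cache.popitem(last=False)
            self._forget(evicted)  # evicted blocks stop being donors

    def write(self, data):
        """Return (block number, deduplicated?) for an incoming write."""
        donor = self.by_checksum.get(zlib.adler32(data))
        # Compare bytes in memory, so no read of the underlying storage is needed.
        if donor is not None and self.buffer_cache[donor] == data:
            self.buffer_cache.move_to_end(donor)
            return donor, True
        bn = self._next_bn
        self._next_bn += 1
        self._cache(bn, data)
        return bn, False

    def delete(self, bn):
        self.buffer_cache.pop(bn, None)
        self._forget(bn)


cache = InlineDedupCache(capacity=2)
first = cache.write(b"alpha")
dup = cache.write(b"alpha")    # served by the cached donor block
cache.write(b"beta")
cache.write(b"gamma")          # evicts the least recently used block ("alpha")
again = cache.write(b"alpha")  # donor is gone, so a new block is allocated
```

The block number table exists so that eviction and deletion can find and remove the matching checksum entry without scanning the checksum table.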
-
Publication No.: US20210405882A1
Publication date: 2021-12-30
Application No.: US17028020
Application date: 2020-09-22
Applicant: NetApp Inc.
Inventor: Girish Hebbale Venkatasubbaiah , Rahul Thapliyal , Dnyaneshwar Nagorao Pawar , Kartik Rathnakar , Venkateswarlu Tella , Ananthan Subramanian
IPC: G06F3/06 , G06F16/215 , G06F16/25 , G06F16/28 , H03M7/30
Abstract: Techniques are provided for a layout format for compressed data. A first set of data blocks are grouped into a first group based upon a first frequency of access to the first set of data blocks. A second set of data blocks are grouped into a second group based upon a second frequency of access to the second set of data blocks. The first set of data blocks are compressed into a first compression group using a first compression algorithm. The second set of data blocks are compressed into a second compression group using a second compression algorithm.
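Illustrative sketch (not part of the patent record): grouping blocks by access frequency and compressing each group with a different algorithm might be approximated as follows. The hot/cold threshold, the fixed block size, and the choice of zlib for frequently accessed data and LZMA for rarely accessed data are assumptions; the abstract names no specific algorithms.

```python
import lzma
import zlib

BLOCK_SIZE = 4096  # assumed fixed block size


def build_compression_groups(blocks, access_counts, hot_threshold):
    """Split blocks into hot and cold groups and compress each group as a unit."""
    hot = [bn for bn in sorted(blocks) if access_counts.get(bn, 0) >= hot_threshold]
    cold = [bn for bn in sorted(blocks) if access_counts.get(bn, 0) < hot_threshold]
    plans = (
        ("zlib", hot, lambda raw: zlib.compress(raw, 1)),  # fast, lighter compression
        ("lzma", cold, lzma.compress),                     # slower, denser compression
    )
    groups = []
    for name, members, compress in plans:
        if members:
            raw = b"".join(blocks[bn] for bn in members)
            groups.append({"algorithm": name, "blocks": members, "payload": compress(raw)})
    return groups


def read_block(groups, bn):
    """Decompress the group holding block bn and slice the block out of it."""
    for group in groups:
        if bn in group["blocks"]:
            decompress = zlib.decompress if group["algorithm"] == "zlib" else lzma.decompress
            raw = decompress(group["payload"])
            i = group["blocks"].index(bn)
            return raw[i * BLOCK_SIZE:(i + 1) * BLOCK_SIZE]
    raise KeyError(bn)


blocks = {n: bytes([65 + n]) * BLOCK_SIZE for n in range(4)}
access_counts = {0: 50, 1: 2, 2: 40, 3: 0}
groups = build_compression_groups(blocks, access_counts, hot_threshold=10)
```

Keeping hot blocks in a cheaply decompressed group limits read latency where it matters, while cold blocks trade decompression cost for space savings.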
-
Publication No.: US20180181339A1
Publication date: 2018-06-28
Application No.: US15903328
Application date: 2018-02-23
Applicant: NetApp Inc.
Inventor: Alok Sharma , Girish Hebbale Venkata Subbaiah , Kartik Rathnakar , Venkateswarlu Tella , Mukul Sharma
IPC: G06F3/06
CPC classification number: G06F3/0641 , G06F3/0608 , G06F3/067 , G06F3/0683 , G06F3/0685
Abstract: Techniques are provided for asynchronous semi-inline deduplication. A multi-tiered storage arrangement comprises a first storage tier, a second storage tier, etc. An in-memory change log of data recently written to the first storage tier is evaluated to identify a fingerprint of a data block recently written to the first storage tier. A donor data store, comprising fingerprints of data blocks already stored within the first storage tier, is queried using the fingerprint. If the fingerprint is found, then deduplication is performed for the data block to create deduplicated data based upon a potential donor data block within the first storage tier. The deduplicated data is moved from the first storage tier to the second storage tier, such as in response to a determination that the deduplicated data has not been recently accessed. The deduplication is performed before cold data is moved from the first storage tier to the second storage tier.
-
Publication No.: US11861169B2
Publication date: 2024-01-02
Application No.: US17028020
Application date: 2020-09-22
Applicant: NetApp Inc.
Inventor: Girish Hebbale Venkatasubbaiah , Rahul Thapliyal , Dnyaneshwar Nagorao Pawar , Kartik Rathnakar , Venkateswarlu Tella , Ananthan Subramanian
IPC: G06F3/06 , G06F16/215 , G06F16/25 , H03M7/30 , G06F16/28
CPC classification number: G06F3/0608 , G06F3/067 , G06F3/0641 , G06F16/215 , G06F16/258 , G06F16/285 , H03M7/3091
Abstract: Techniques are provided for a layout format for compressed data. A first set of data blocks are grouped into a first group based upon a first frequency of access to the first set of data blocks. A second set of data blocks are grouped into a second group based upon a second frequency of access to the second set of data blocks. The first set of data blocks are compressed into a first compression group using a first compression algorithm. The second set of data blocks are compressed into a second compression group using a second compression algorithm.
-
Publication No.: US20210342082A1
Publication date: 2021-11-04
Application No.: US17373820
Application date: 2021-07-13
Applicant: NetApp Inc.
Inventor: Alok Sharma , Girish Hebbale Venkata Subbaiah , Kartik Rathnakar , Venkateswarlu Tella , Mukul Sharma
IPC: G06F3/06
Abstract: Techniques are provided for asynchronous semi-inline deduplication. A multi-tiered storage arrangement comprises a first storage tier, a second storage tier, etc. An in-memory change log of data recently written to the first storage tier is evaluated to identify a fingerprint of a data block recently written to the first storage tier. A donor data store, comprising fingerprints of data blocks already stored within the first storage tier, is queried using the fingerprint. If the fingerprint is found, then deduplication is performed for the data block to create deduplicated data based upon a potential donor data block within the first storage tier. The deduplicated data is moved from the first storage tier to the second storage tier, such as in response to a determination that the deduplicated data has not been recently accessed. The deduplication is performed before cold data is moved from the first storage tier to the second storage tier.
-
Publication No.: US11010078B2
Publication date: 2021-05-18
Application No.: US16774127
Application date: 2020-01-28
Applicant: NetApp Inc.
Inventor: Mukul Sharma , Kartik Rathnakar , Dnyaneshwar Nagorao Pawar , Venkateswarlu Tella , Kiran Nenmeli Srinivasan , Rajesh Khandelwal , Alok Sharma
IPC: G06F3/06
Abstract: One or more techniques and/or computing devices are provided for inline deduplication. For example, a checksum hash table and/or a block number hash table may be maintained within memory (e.g., a storage controller may maintain the hash tables in-core). The checksum hash table may be utilized for inline deduplication to identify potential donor blocks that may comprise the same data as an incoming storage operation. Data within an in-core buffer cache is eligible as potential donor blocks so that inline deduplication may be performed using data from the in-core buffer cache, which may mitigate disk access to underlying storage for which the in-core buffer cache is used for caching. The block number hash table may be used for updating or removing entries from the hash tables, such as for blocks that are no longer eligible as potential donor blocks (e.g., deleted blocks, blocks evicted from the in-core buffer cache, etc.).
-
Publication No.: US11620064B2
Publication date: 2023-04-04
Application No.: US17373820
Application date: 2021-07-13
Applicant: NetApp Inc.
Inventor: Alok Sharma , Girish Hebbale Venkata Subbaiah , Kartik Rathnakar , Venkateswarlu Tella , Mukul Sharma
Abstract: Techniques are provided for asynchronous semi-inline deduplication. A multi-tiered storage arrangement comprises a first storage tier, a second storage tier, etc. An in-memory change log of data recently written to the first storage tier is evaluated to identify a fingerprint of a data block recently written to the first storage tier. A donor data store, comprising fingerprints of data blocks already stored within the first storage tier, is queried using the fingerprint. If the fingerprint is found, then deduplication is performed for the data block to create deduplicated data based upon a potential donor data block within the first storage tier. The deduplicated data is moved from the first storage tier to the second storage tier, such as in response to a determination that the deduplicated data has not been recently accessed. The deduplication is performed before cold data is moved from the first storage tier to the second storage tier.
-
Publication No.: US11055005B2
Publication date: 2021-07-06
Application No.: US16254741
Application date: 2019-01-23
Applicant: NetApp Inc.
Inventor: Dnyaneshwar Nagorao Pawar , Kartik Rathnakar
IPC: G06F3/06
Abstract: Techniques are provided for background deduplication using trusted fingerprints. Trusted fingerprints of blocks are inserted into a trusted fingerprint store as the blocks are being allocated by a file system sequentially according to block numbers of the blocks. In this way, the trusted fingerprint store is indexed by block numbers of where the blocks are stored. Blocks that are to be deduplicated are identified by sorting the blocks based upon weak fingerprints, and moving duplicates to a dup file. The dup file is sorted based upon block numbers. Trusted fingerprints are loaded from the trusted fingerprint store for deduplicating the blocks within the dup file.
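Illustrative sketch (not part of the patent record): the sort-by-weak-fingerprint, then sort-by-block-number pipeline in the abstract above might be modeled like this. The 8-bit weak fingerprint derived from CRC-32, SHA-256 as the trusted fingerprint, and the in-memory lists standing in for the trusted fingerprint store and the dup file are all assumptions for illustration.

```python
import hashlib
import zlib
from itertools import groupby


def weak_fingerprint(data):
    # Assumption: a deliberately short fingerprint, so collisions are possible.
    return zlib.crc32(data) & 0xFF


def allocate_sequentially(block_data):
    """Blocks get sequential block numbers; the trusted store is indexed by them."""
    trusted_store = [hashlib.sha256(d).digest() for d in block_data]
    change_log = [(weak_fingerprint(d), bn) for bn, d in enumerate(block_data)]
    return trusted_store, change_log


def background_dedupe(change_log, trusted_store):
    """Return a map of duplicate block number -> donor block number."""
    # 1. Sort by weak fingerprint and keep only records that share one.
    dup_file = []
    for _, group in groupby(sorted(change_log), key=lambda rec: rec[0]):
        members = list(group)
        if len(members) > 1:
            dup_file.extend(members)
    # 2. Sort the dup file by block number so the trusted store is read in order.
    dup_file.sort(key=lambda rec: rec[1])
    # 3. Confirm each candidate against its trusted fingerprint.
    donors = {}  # trusted fingerprint -> donor block number
    remap = {}   # duplicate block number -> donor block number
    for _, bn in dup_file:
        donor = donors.setdefault(trusted_store[bn], bn)
        if donor != bn:
            remap[bn] = donor
    return remap


data = [b"A" * 4096, b"B" * 4096, b"A" * 4096, b"C" * 4096, b"B" * 4096]
trusted_store, change_log = allocate_sequentially(data)
remap = background_dedupe(change_log, trusted_store)
```

Blocks that merely collide on the weak fingerprint still reach step 3, but their trusted fingerprints differ, so they are never remapped.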
-
Publication No.: US20180173449A1
Publication date: 2018-06-21
Application No.: US15386544
Application date: 2016-12-21
Applicant: NetApp Inc.
Inventor: Alok Sharma , Girish Hebbale Venkata Subbaiah , Kartik Rathnakar , Venkateswarlu Tella , Mukul Sharma
IPC: G06F3/06
CPC classification number: G06F3/0641 , G06F3/0608 , G06F3/0683
Abstract: Techniques are provided for asynchronous semi-inline deduplication. A multi-tiered storage arrangement comprises a first storage tier, a second storage tier, etc. An in-memory change log of data recently written to the first storage tier is evaluated to identify a fingerprint of a data block recently written to the first storage tier. A donor data store, comprising fingerprints of data blocks already stored within the first storage tier, is queried using the fingerprint. If the fingerprint is found, then deduplication is performed for the data block to create deduplicated data based upon a potential donor data block within the first storage tier. The deduplicated data is moved from the first storage tier to the second storage tier, such as in response to a determination that the deduplicated data has not been recently accessed. The deduplication is performed before cold data is moved from the first storage tier to the second storage tier.