-
公开(公告)号:US12210431B2
公开(公告)日:2025-01-28
申请号:US17731900
申请日:2022-04-28
Applicant: NetApp Inc.
Inventor: Palak Sharma , Dibyasri Nandi , Sindhushree K N , Cheryl Marie Thompson , Qinghua Zheng , Venkateswarlu Tella , Debanjan Paul , Dinakaran Narayanan
Abstract: Techniques are provided for implementing a unified object format. The unified object format is used to format data in a performance tier (e.g., infrequently accessed data, snapshot data, etc.) into objects that are stored into an object store for low cost, scalable, long term storage compared to storage of the performance tier. With the unified object format, compression of the data may be retained when the data is stored as the objects into the object store. Additional compression may also be provided for the data in the objects. The unified object format includes slot header metadata used to track the location of the data within the object notwithstanding the data being compressed and/or stored at non-fixed boundaries. The slot header metadata may be cached at the performance tier for improved read performance and may be repaired by a repair subsystem (a slot header repair subsystem).
-
公开(公告)号:US11861169B2
公开(公告)日:2024-01-02
申请号:US17028020
申请日:2020-09-22
Applicant: NetApp Inc.
Inventor: Girish Hebbale Venkatasubbaiah , Rahul Thapliyal , Dnyaneshwar Nagorao Pawar , Kartik Rathnakar , Venkateswarlu Tella , Ananthan Subramanian
IPC: G06F3/06 , G06F16/215 , G06F16/25 , H03M7/30 , G06F16/28
CPC classification number: G06F3/0608 , G06F3/067 , G06F3/0641 , G06F16/215 , G06F16/258 , G06F16/285 , H03M7/3091
Abstract: Techniques are provided for a layout format for compressed data. A first set of data blocks are grouped into a first group based upon a first frequency of access to the first set of data blocks. A second set of data blocks are grouped into a second group based upon a second frequency of access to the second set of data blocks. The first set of data blocks are compressed into a first compression group using a first compression algorithm. The second set of data blocks are compressed into a second compression group using a second compression algorithm.
-
公开(公告)号:US20210342082A1
公开(公告)日:2021-11-04
申请号:US17373820
申请日:2021-07-13
Applicant: NetApp Inc.
Inventor: Alok Sharma , Girish Hebbale Venkata Subbaiah , Kartik Rathnakar , Venkateswarlu Tella , Mukul Sharma
IPC: G06F3/06
Abstract: Techniques are provided for asynchronous semi-inline deduplication. A multi-tiered storage arrangement comprises a first storage tier, a second storage tier, etc. An in-memory change log of data recently written to the first storage tier is evaluate to identify a fingerprint of a data block recently written to the first storage tier. A donor data store, comprising fingerprints of data blocks already stored within the first storage tier, is queried using the fingerprint. If the fingerprint is found, then deduplication is performed for the data block to create deduplicated data based upon a potential donor data block within the first storage tier. The deduplicated data is moved from the first storage tier to the second storage tier, such as in response to a determination that the deduplicated data has not been recently accessed. The deduplication is performed before cold data is moved from first storage tier to second storage tier.
-
4.
公开(公告)号:US20240362124A1
公开(公告)日:2024-10-31
申请号:US18308337
申请日:2023-04-27
Applicant: NetApp Inc.
Inventor: Palak Sharma , Cheryl Marie Thompson , Qinghua Zheng , Venkateswarlu Tella
IPC: G06F11/14 , G06F16/215 , G06F16/25
CPC classification number: G06F11/1464 , G06F16/1744 , G06F16/184 , G06F16/215 , G06F16/258 , G06F2201/84
Abstract: Techniques are provided for mirroring objects between object stores hosted by cloud providers that have different data layout requirements. An object may be stored within a first object store that supports a fix offset format where uncompressed data is stored according to fixed offsets and boundaries within fixed size objects. A mirroring operation may be used to mirror the object to a second object store that supports a unified object format where compressed data can be stored at non-fixed offsets and boundaries within variable sized objects. The mirroring operation selects a compression algorithm and compresses the object on the fly to create a mirrored object having the unified object format. The mirrored object, populated with the compressed data and slot header metadata comprising compression information for how to locate and decompress the data in the mirrored object, is stored into the second object store.
-
公开(公告)号:US11068182B2
公开(公告)日:2021-07-20
申请号:US16683466
申请日:2019-11-14
Applicant: NetApp inc.
Inventor: Alok Sharma , Girish Hebbale Venkata Subbaiah , Kartik Rathnakar , Venkateswarlu Tella , Mukul Sharma
Abstract: Techniques are provided for asynchronous semi-inline deduplication. A multi-tiered storage arrangement comprises a first storage tier, a second storage tier, etc. An in-memory change log of data recently written to the first storage tier is evaluate to identify a fingerprint of a data block recently written to the first storage tier. A donor data store, comprising fingerprints of data blocks already stored within the first storage tier, is queried using the fingerprint. If the fingerprint is found, then deduplication is performed for the data block to create deduplicated data based upon a potential donor data block within the first storage tier. The deduplicated data is moved from the first storage tier to the second storage tier, such as in response to a determination that the deduplicated data has not been recently accessed. The deduplication is performed before cold data is moved from first storage tier to second storage tier.
-
公开(公告)号:US20200159432A1
公开(公告)日:2020-05-21
申请号:US16774127
申请日:2020-01-28
Applicant: NetApp Inc.
Inventor: Mukul Sharma , Kartik Rathnakar , Dnyaneshwar Nagorao Pawar , Venkateswarlu Tella , Kiran Nenmeli Srinivasan , Rajesh Khandelwal , Alok Sharma
IPC: G06F3/06
Abstract: One or more techniques and/or computing devices are provided for inline deduplication. For example, a checksum hash table and/or a block number hash table may be maintained within memory (e.g., a storage controller may maintain the hash tables in-core). The checksum hash table may be utilized for inline deduplication to identify potential donor blocks that may comprise the same data as an incoming storage operation. Data within an in-core buffer cache is eligible as potential donor blocks so that inline deduplication may be performed using data from the in-core buffer cache, which may mitigate disk access to underlying storage for which the in-core buffer cache is used for caching. The block number hash table may be used for updating or removing entries from the hash tables, such as for blocks that are no longer eligible as potential donor blocks (e.g., deleted blocks, blocks evicted from the in-core buffer cache, etc.).
-
公开(公告)号:US20240362183A1
公开(公告)日:2024-10-31
申请号:US18308313
申请日:2023-04-27
Applicant: NetApp Inc.
Inventor: Palak Sharma , Cheryl Marie Thompson , Qinghua Zheng , Venkateswarlu Tella
IPC: G06F16/11 , G06F16/174
CPC classification number: G06F16/125 , G06F16/1744
Abstract: Techniques are provided for mirroring objects between object stores hosted by cloud providers that could have different data layout requirements. An object may be stored within an object store that supports a unified object format where the object is capable of storing compressed data. The object may be mirrored to a destination object store that may also support the unified object format or to a destination object store that does not support the unified object format. If the destination object store does not support the unified object format, then slot header metadata within the object is used to decompress the data within the object into an uncompressed format. The data is packaged from being in the uncompressed format into a fixed offset format supported by the destination object store to create a mirrored object that is stored into the destination object store while retaining compression of the data.
-
公开(公告)号:US11620064B2
公开(公告)日:2023-04-04
申请号:US17373820
申请日:2021-07-13
Applicant: NetApp Inc.
Inventor: Alok Sharma , Girish Hebbale Venkata Subbaiah , Kartik Rathnakar , Venkateswarlu Tella , Mukul Sharma
Abstract: Techniques are provided for asynchronous semi-inline deduplication. A multi-tiered storage arrangement comprises a first storage tier, a second storage tier, etc. An in-memory change log of data recently written to the first storage tier is evaluate to identify a fingerprint of a data block recently written to the first storage tier. A donor data store, comprising fingerprints of data blocks already stored within the first storage tier, is queried using the fingerprint. If the fingerprint is found, then deduplication is performed for the data block to create deduplicated data based upon a potential donor data block within the first storage tier. The deduplicated data is moved from the first storage tier to the second storage tier, such as in response to a determination that the deduplicated data has not been recently accessed. The deduplication is performed before cold data is moved from first storage tier to second storage tier.
-
公开(公告)号:US20180173449A1
公开(公告)日:2018-06-21
申请号:US15386544
申请日:2016-12-21
Applicant: NetApp Inc.
Inventor: Alok Sharma , Girish Hebbale Venkata Subbaiah , Kartik Rathnakar , Venkateswarlu Tella , Mukul Sharma
IPC: G06F3/06
CPC classification number: G06F3/0641 , G06F3/0608 , G06F3/0683
Abstract: Techniques are provided for asynchronous semi-inline deduplication. A multi-tiered storage arrangement comprises a first storage tier, a second storage tier, etc. An in-memory change log of data recently written to the first storage tier is evaluate to identify a fingerprint of a data block recently written to the first storage tier. A donor data store, comprising fingerprints of data blocks already stored within the first storage tier, is queried using the fingerprint. If the fingerprint is found, then deduplication is performed for the data block to create deduplicated data based upon a potential donor data block within the first storage tier. The deduplicated data is moved from the first storage tier to the second storage tier, such as in response to a determination that the deduplicated data has not been recently accessed. The deduplication is performed before cold data is moved from first storage tier to second storage tier.
-
公开(公告)号:US11960448B2
公开(公告)日:2024-04-16
申请号:US17731858
申请日:2022-04-28
Applicant: NetApp Inc.
Inventor: Palak Sharma , Dibyasri Nandi , Sindhushree K N , Cheryl Marie Thompson , Qinghua Zheng , Venkateswarlu Tella , Debanjan Paul , Dinakaran Narayanan
CPC classification number: G06F16/1744 , G06F3/0604 , G06F3/0611 , G06F3/0619 , G06F3/064 , G06F3/0653 , G06F3/067 , G06F11/0712 , G06F11/0718 , G06F11/0727 , G06F11/1453 , G06F11/1464 , G06F11/2094 , G06F16/116 , G06F11/1004 , G06F2201/84
Abstract: Techniques are provided for implementing a unified object format. The unified object format is used to format data in a performance tier (e.g., infrequently accessed data, snapshot data, etc.) into objects that are stored into an object store for low cost, scalable, long term storage compared to storage of the performance tier. With the unified object format, compression of the data may be retained when the data is stored as the objects into the object store. Additional compression may also be provided for the data in the objects. The unified object format includes slot header metadata used to track the location of the data within the object notwithstanding the data being compressed and/or stored at non-fixed boundaries. The slot header metadata may be cached at the performance tier for improved read performance and may be repaired by a repair subsystem (a slot header repair subsystem).
-
-
-
-
-
-
-
-
-