-
公开(公告)号:US20250094295A1
公开(公告)日:2025-03-20
申请号:US18962013
申请日:2024-11-27
Applicant: NetApp, Inc.
Inventor: Wei Sun , Anil Paul Thoppil , Anne Maria Vasu
Abstract: Systems and methods that make use of cluster-level redundancy within a distributed storage management system to address various node-level error scenarios are provided. According to one embodiment, an instance of a key-value (KV) store of a first node of a plurality of nodes of a cluster of a distributed storage system manages storage of data blocks as values and corresponding block identifiers (IDs) as keys. A list of missing block IDs that are in use for one or more volumes associated with the first node but that are missing from the instance of the KV store are identified by performing a data integrity check on the instance of the KV store. After identifying the list of missing block IDs, instead of treating the first node as failed, restoring the missing block IDs by writing redundant data blocks retrieved from other nodes within the cluster to the first node.
-
公开(公告)号:US20250044964A1
公开(公告)日:2025-02-06
申请号:US18921156
申请日:2024-10-21
Applicant: NetApp, Inc.
Inventor: Ganga Bhavani Kondapalli , Kevin Daniel Varghese , Ananthan Subramanian , Cheryl Marie Thompson , Anil Paul Thoppil
IPC: G06F3/06
Abstract: Techniques are provided for multi-tier write allocation. A storage system may store data within a multi-tier storage environment comprising a first storage tier (e.g., storage devices maintained by the storage system), a second storage tier (e.g., a remote object store provided by a third party storage provider), and/or other storage tiers. A determination is made that data (e.g., data of a write request received by the storage system) is to be stored within the second storage tier. The data is stored into a staging area of the first storage tier. A second storage tier location identifier, for referencing the data according to a format utilized by the second storage tier, is assigned to the data and provided to a file system hosting the data. The data is then destaged from the staging area into the second storage tier, such as within an object stored within the remote object store.
-
公开(公告)号:US12204800B2
公开(公告)日:2025-01-21
申请号:US17732065
申请日:2022-04-28
Applicant: NetApp Inc.
Inventor: Anil Paul Thoppil , Wei Sun , Meera Odugoudar , Szu-Wen Kuo , Santhosh Selvaraj
Abstract: Techniques are provided for implementing a garbage collection process and a prediction read ahead mechanism to prefetch keys into memory to improve the efficiency and speed of the garbage collection process. A log structured merge tree is used to store keys of key-value pairs within a key-value store. If a key is no longer referenced by any worker nodes of a distributed storage architecture, then the key can be freed to store other data. Accordingly, garbage collection is performed to identify and free unused keys. The speed and efficiency of garbage collection is improved by dynamically adjusting the amount and rate at which keys are prefetched from disk and cached into faster memory for processing by the garbage collection process.
-
公开(公告)号:US12164397B2
公开(公告)日:2024-12-10
申请号:US18478149
申请日:2023-09-29
Applicant: NetApp, Inc.
Inventor: Wei Sun , Anil Paul Thoppil , Anne Maria Vasu
Abstract: Systems and methods that make use of cluster-level redundancy within a distributed storage management system to address various node-level error scenarios are provided. According to one embodiment, a first node of multiple nodes of distributed storage system represented in a form of a cluster of the multiple of nodes, identifies the potential existence of an error associated with a Redundant Array of Independent Disks (RAID) stripe. A list of block identifiers (IDs) associated with the RAID stripe may then be identified. Rather than performing a traditional RAID recovery/reconstruction approach that is resource intensive in nature and that requires an excessive amount of rebuild time, a more efficient RAID stripe resynchronization process may be performed to restore data associated with the RAID stripe.
-
公开(公告)号:US20240281411A1
公开(公告)日:2024-08-22
申请号:US18648989
申请日:2024-04-29
Applicant: NetApp, Inc.
Inventor: Anil Paul Thoppil , Wei Sun , Meera Odugoudar , Szu-Wen Kuo , Santhosh Selvaraj
IPC: G06F16/174 , G06F16/182
CPC classification number: G06F16/1748 , G06F16/182
Abstract: Techniques are provided for implementing a defragmentation process during a merge operation performed by a re-compaction process upon a log structured merge tree. The log structured merge tree is used to store keys of key-value pairs within a key-value store. As the log structured merge tree fills with keys over time, the re-compaction process is performed to merge keys down to lower levels of the log structured merge tree to re-compact the keys. Re-compaction can result in fragmentation because there is a lack of spatial locality of where the re-compaction operations re-writes the keys within storage. Fragmentation increases read and write amplification when accessing the keys stored in different locations within the storage. Accordingly, the defragmentation process is performed during a last merge operation of the re-compaction process in order to store keys together within the storage, thus reducing read and write amplification when accessing the keys.
-
6.
公开(公告)号:US20240220377A1
公开(公告)日:2024-07-04
申请号:US18608742
申请日:2024-03-18
Applicant: NetApp, Inc.
Inventor: Wei Sun , Anil Paul Thoppil , Anne Maria Vasu
CPC classification number: G06F11/1662 , G06F3/0622 , G06F3/064 , G06F3/0679 , G06F11/1088 , G06F11/3034 , G06F16/27
Abstract: Systems and methods that make use of cluster-level redundancy within a distributed storage management system to address various node-level error scenarios are provided. Rather than using a generalized one-size-fits-all approach to reduce complexity, an approach tailored to the node-level error scenario at issue may be performed to avoid doing more than necessary. According to one embodiment, after identifying a missing branch of a tree implemented by a KV store of a first node of a cluster of a distributed storage management system, a branch resynchronization process may be performed, including, for each block ID in the range of block IDs of the missing branch (i) reading a data block corresponding to the block ID from a second node of the cluster that maintains redundant information relating to the block ID; and (ii) restoring the block ID within the KV store by writing the data block to the first node.
-
公开(公告)号:US20230052732A1
公开(公告)日:2023-02-16
申请号:US17976946
申请日:2022-10-31
Applicant: NetApp Inc.
Inventor: Ananthan Subramanian , Sridevi Jantli , Anil Paul Thoppil , Cheryl Marie Thompson , Qinghua Zheng
IPC: G06F16/215 , G06F16/22 , G06F16/23
Abstract: Techniques are provided for orphan object detection, invalid sequence number detection, and asynchronous object cleanup. A storage system may store data within one or more tiers of storage, such as a storage tier (e.g., solid state storage and disks maintained by the storage system), a remote object store (e.g., storage provided by a third party storage provider), and/or other storage tiers. Orphan objects, within the remote object store, that are no longer used by the storage system may be detected and/or deleted. When an aggregate of volumes is deleted, corresponding objects, within the remote object store, may be identified and/or deleted. Invalid sequence numbers (e.g., lost or corrupt sequence numbers locally maintained in a metafile) assigned to objects within the remote object store may be identified, deleted, and/or fixed.
-
公开(公告)号:US20220300178A1
公开(公告)日:2022-09-22
申请号:US17833046
申请日:2022-06-06
Applicant: NetApp Inc.
Inventor: Ganga Bhavani Kondapalli , Kevin Daniel Varghese , Ananthan Subramanian , Cheryl Marie Thompson , Anil Paul Thoppil
IPC: G06F3/06
Abstract: Techniques are provided for multi-tier write allocation. A storage system may store data within a multi-tier storage environment comprising a first storage tier (e.g., storage devices maintained by the storage system), a second storage tier (e.g., a remote object store provided by a third party storage provider), and/or other storage tiers. A determination is made that data (e.g., data of a write request received by the storage system) is to be stored within the second storage tier. The data is stored into a staging area of the first storage tier. A second storage tier location identifier, for referencing the data according to a format utilized by the second storage tier, is assigned to the data and provided to a file system hosting the data. The data is then destaged from the staging area into the second storage tier, such as within an object stored within the remote object store.
-
公开(公告)号:US20200159705A1
公开(公告)日:2020-05-21
申请号:US16751766
申请日:2020-01-24
Applicant: NetApp Inc.
Inventor: Ananthan Subramanian , Sridevi Jantli , Anil Paul Thoppil , Cheryl Marie Thompson , Qinghua Zheng
IPC: G06F16/215 , G06F16/23 , G06F16/22
Abstract: Techniques are provided for orphan object detection, invalid sequence number detection, and asynchronous object cleanup. A storage system may store data within one or more tiers of storage, such as a storage tier (e.g., solid state storage and disks maintained by the storage system), a remote object store (e.g., storage provided by a third party storage provider), and/or other storage tiers. Orphan objects, within the remote object store, that are no longer used by the storage system may be detected and/or deleted. When an aggregate of volumes is deleted, corresponding objects, within the remote object store, may be identified and/or deleted. Invalid sequence numbers (e.g., lost or corrupt sequence numbers locally maintained in a metafile) assigned to objects within the remote object store may be identified, deleted, and/or fixed.
-
公开(公告)号:US12124716B2
公开(公告)日:2024-10-22
申请号:US18357206
申请日:2023-07-24
Applicant: NetApp Inc.
Inventor: Ganga Bhavani Kondapalli , Kevin Daniel Varghese , Ananthan Subramanian , Cheryl Marie Thompson , Anil Paul Thoppil
IPC: G06F3/06
CPC classification number: G06F3/0631 , G06F3/061 , G06F3/0665 , G06F3/067
Abstract: Techniques are provided for multi-tier write allocation. A storage system may store data within a multi-tier storage environment comprising a first storage tier (e.g., storage devices maintained by the storage system), a second storage tier (e.g., a remote object store provided by a third party storage provider), and/or other storage tiers. A determination is made that data (e.g., data of a write request received by the storage system) is to be stored within the second storage tier. The data is stored into a staging area of the first storage tier. A second storage tier location identifier, for referencing the data according to a format utilized by the second storage tier, is assigned to the data and provided to a file system hosting the data. The data is then destaged from the staging area into the second storage tier, such as within an object stored within the remote object store.
-
-
-
-
-
-
-
-
-