-
Publication No.: US20250013544A1
Publication Date: 2025-01-09
Application No.: US18808517
Filing Date: 2024-08-19
Applicant: NetApp, Inc.
Inventor: Daniel McCarthy , Lisa Week
IPC: G06F11/20 , G06F11/14 , H04L41/0668 , H04L67/104 , H04L67/143 , H04L67/75
Abstract: The disclosed technology enables quicker initialization of a new master node for a cluster when a previous master node fails by tracking node state in the cluster prior to being designated the new master node. In a particular example, a method includes, in a first node, designated as a current master node for the cluster, managing the cluster based on states of the nodes determined by the first node. While the first node is designated the master node, the method includes each of the nodes collecting, and storing locally, the states of the nodes. In response to a failure of the first node, the method includes selecting a second node of the nodes as a new master node. Upon being designated the new master node, the method includes the second node managing the cluster of nodes based on the states of the nodes that the second node collected and stored locally.
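The failover flow in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `Node` class, the "up"/"down" state values, and the lowest-name election rule are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    name: str
    alive: bool = True
    # Every node keeps its own local copy of peer states, master or not.
    local_states: dict = field(default_factory=dict)

    def collect_states(self, cluster):
        """Record the current state of every node in this node's local store."""
        self.local_states = {n.name: ("up" if n.alive else "down") for n in cluster}


def elect_new_master(cluster):
    """Hypothetical election rule: pick the surviving node with the lowest name."""
    survivors = [n for n in cluster if n.alive]
    return min(survivors, key=lambda n: n.name)


cluster = [Node("node-a"), Node("node-b"), Node("node-c")]
master = cluster[0]

# While node-a is master, all nodes collect and store states locally.
for n in cluster:
    n.collect_states(cluster)

# The master fails; a new master is selected from the survivors.
master.alive = False
new_master = elect_new_master(cluster)

# The new master starts from the state it already holds instead of
# rebuilding it, then records the failure it just observed.
managed_view = dict(new_master.local_states)
managed_view[master.name] = "down"
```

Because `new_master.local_states` was populated before the failure, the new master has a usable view of the cluster at the moment it is designated.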
-
Publication No.: US12153504B2
Publication Date: 2024-11-26
Application No.: US18089192
Filing Date: 2022-12-27
Applicant: NetApp Inc.
Inventor: Yash Hetal Trivedi , Daniel McCarthy , Ananthan Subramanian , William Arthur Gutknecht , Kevin Daniel Varghese , Swaroop Vikram Choudhari , Abdul Basit , Christopher Clark Corey , Chad Edmund Schneider
Abstract: Techniques are provided for failing over an aggregate from one file system instance to a different file system instance of a distributed scale-out storage system. The aggregate may be stored within distributed storage that is accessible to a plurality of file system instances of the distributed scale-out storage system. When the aggregate is failed over from a first file system instance to a second file system instance, the first file system instance may still have a valid read lease that allows the first file system instance to serve client I/O, directed to the aggregate, using a cache. In order to prevent the first file system instance from serving stale data from the cache before the read lease expires, state machines and a set of control data are used to ensure that the second file system instance attaches to the aggregate only after the read lease has expired.
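The lease condition described in this abstract can be sketched as a simple guard. This is an illustrative assumption, not the state machines or control data of the patent: `ReadLease`, `can_attach`, and the shared clock value `now` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class ReadLease:
    holder: str        # file system instance that may serve reads from its cache
    expires_at: float  # expiry time in seconds on an assumed shared clock

    def is_valid(self, now: float) -> bool:
        return now < self.expires_at


def can_attach(lease: ReadLease, new_owner: str, now: float) -> bool:
    """Allow a different instance to attach only once the old lease has expired."""
    return lease.holder == new_owner or not lease.is_valid(now)


lease = ReadLease(holder="fs-1", expires_at=10.0)

# At t=5 the first instance could still serve cached reads, so the second
# instance must wait; at t=10 the lease has expired and attaching is safe.
early = can_attach(lease, "fs-2", now=5.0)
late = can_attach(lease, "fs-2", now=10.0)
```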
-
Publication No.: US12141104B2
Publication Date: 2024-11-12
Application No.: US17449760
Filing Date: 2021-10-01
Applicant: NetApp, Inc.
Inventor: Ananthan Subramanian , Daniel McCarthy , Arindam Banerjee
IPC: G06F16/00 , G06F16/16 , G06F16/182 , G06F16/188 , H04L67/00
Abstract: A method for reducing write latency in a distributed file system. A write request that includes a volume identifier is received at a data management subsystem deployed on a node within a distributed storage system. The data management subsystem maps the volume identifier to a file system volume and maps the file system volume to a set of logical block addresses in a logical block device in a storage management subsystem deployed on the node. The storage management subsystem maps the logical block device to a metadata object for the logical block device on the node that is used to process the write request. The mapping of the file system volume to the set of logical block addresses in the logical block device enables co-locating the metadata object with the file system volume on the node, which reduces the write latency associated with processing the write request.
-
Publication No.: US12135880B1
Publication Date: 2024-11-05
Application No.: US18307097
Filing Date: 2023-04-26
Applicant: NetApp Inc.
Inventor: Abdul Basit , Daniel McCarthy , Christopher Lee Cason , Jian Hu
IPC: G06F3/06
Abstract: Techniques are provided for dynamically implementing quality of service policies for a distributed storage system based upon resource saturation. A quality of service policy is defined for throttling I/O operations received by a node of the distributed storage system based upon whether resources of the node have become saturated. The quality of service policy is dynamically implemented based upon ever-changing resource utilization and saturation. Dynamically implementing the quality of service policy improves the ability to efficiently utilize resources of the node compared to conventional static policies that cannot adequately react to such changing considerations and resource utilization/saturation. With conventional static policies, an administrator manually defines a minimum amount of guaranteed resources and/or a maximum resource usage cap that could be set to values that result in inefficient operation and resource starvation. Dynamically implementing the quality of service policy results in more efficient operation and mitigates resource starvation.
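One way to picture saturation-based throttling is a limit that stays at its base value until utilization crosses a threshold and then shrinks. The sketch below is an assumption for illustration only: the function name, the 0.8 threshold, the linear ramp, and the 10% floor do not come from the patent.

```python
def throttle_limit(base_iops: int, utilization: float,
                   saturation_threshold: float = 0.8,
                   floor: float = 0.1) -> int:
    """Return the I/O operations per second allowed at a given utilization.

    utilization is a fraction in [0, 1]; all parameters are hypothetical.
    """
    if utilization <= saturation_threshold:
        # Resource is not saturated: no throttling.
        return base_iops
    # Fraction of the way from the threshold to full saturation.
    over = (utilization - saturation_threshold) / (1.0 - saturation_threshold)
    # Shrink linearly, but never below the floor to avoid starving clients.
    scale = max(floor, 1.0 - over)
    return round(base_iops * scale)


unthrottled = throttle_limit(1000, 0.5)
partly_throttled = throttle_limit(1000, 0.9)
fully_saturated = throttle_limit(1000, 1.0)
```

Unlike a static cap set once by an administrator, the limit here is recomputed from the current utilization each time it is queried.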
-
Publication No.: US20240338128A1
Publication Date: 2024-10-10
Application No.: US18744814
Filing Date: 2024-06-17
Applicant: NetApp, Inc.
IPC: G06F3/06
CPC classification number: G06F3/0619 , G06F3/064 , G06F3/067
Abstract: Techniques are provided for repairing a primary slice file, affected by a storage device error, by using one or more dead replica slice files. The primary slice file is used by a node of a distributed storage architecture as an indirection layer between storage containers (e.g., a volume or LUN) and physical storage where data is physically stored. To improve resiliency of the distributed storage architecture, changes to the primary slice file are replicated to replica slice files hosted by other nodes. If a replica slice file falls out of sync with the primary slice file, then the replica slice file is considered dead (out of sync) and could potentially comprise stale data. If a storage device error affects blocks storing data of the primary slice file, then the techniques provided herein can repair the primary slice file using non-stale data from one or more dead replica slice files.
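The repair idea can be sketched under one strong assumption: that each block carries a version number and that the expected version of a damaged block is known from metadata stored apart from the damaged data. The patent abstract does not say how non-stale data is identified, so `expected_versions` and the tuple layout below are hypothetical.

```python
def repair_primary(primary, damaged, expected_versions, dead_replicas):
    """Repair damaged blocks of a primary slice file from dead replicas.

    primary           -- dict: block_id -> (version, data)
    damaged           -- set of block_ids hit by the storage device error
    expected_versions -- dict: block_id -> version the primary last held (assumed known)
    dead_replicas     -- list of dicts with the same layout as primary
    Returns the set of block_ids that could not be repaired.
    """
    unrepaired = set()
    for block_id in damaged:
        want = expected_versions[block_id]
        for replica in dead_replicas:
            entry = replica.get(block_id)
            # A dead replica is stale overall, but this block may still match.
            if entry is not None and entry[0] == want:
                primary[block_id] = entry
                break
        else:
            unrepaired.add(block_id)
    return unrepaired


primary = {1: (3, b"a"), 2: (5, None), 3: (7, None)}
replicas = [{2: (4, b"old")}, {2: (5, b"new"), 3: (6, b"stale")}]
left_over = repair_primary(primary, {2, 3}, {2: 5, 3: 7}, replicas)
```

Block 2 is recovered from the second replica, while block 3 stays unrepaired because every replica copy is older than the expected version.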
-
Publication No.: US12066907B1
Publication Date: 2024-08-20
Application No.: US18308922
Filing Date: 2023-04-28
Applicant: NetApp, Inc.
Inventor: Daniel McCarthy , Lisa Week
IPC: G06F11/20 , G06F11/14 , H04L41/0668 , H04L67/104 , H04L67/143 , H04L67/75
CPC classification number: G06F11/203 , G06F11/1464 , G06F11/2025 , G06F11/2046 , H04L41/0668 , H04L67/1051 , H04L67/143 , H04L67/75
Abstract: The disclosed technology enables quicker initialization of a new master node for a cluster when a previous master node fails by tracking node state in the cluster prior to being designated the new master node. In a particular example, a method includes, in a first node, designated as a current master node for the cluster, managing the cluster based on states of the nodes determined by the first node. While the first node is designated the master node, the method includes each of the nodes collecting, and storing locally, the states of the nodes. In response to a failure of the first node, the method includes selecting a second node of the nodes as a new master node. Upon being designated the new master node, the method includes the second node managing the cluster of nodes based on the states of the nodes that the second node collected and stored locally.
-
Publication No.: US12045207B2
Publication Date: 2024-07-23
Application No.: US17449758
Filing Date: 2021-10-01
Applicant: NetApp, Inc.
Inventor: Ravikanth Dronamraju , Ananthan Subramanian , Daniel McCarthy , Christopher Cason , Arindam Banerjee
IPC: G06F16/18 , G06F9/50 , G06F16/182 , G06F16/188
CPC classification number: G06F16/188 , G06F9/5077 , G06F16/182
Abstract: A distributed storage management system comprising nodes that form a cluster, a distributed block layer that spans the nodes in the cluster, and file system instances deployed on the nodes. Each file system instance comprises a data management subsystem and a storage management subsystem disaggregated from the data management subsystem. The storage management subsystem comprises a node block store that forms a portion of the distributed block layer and a storage manager that manages a key-value store and virtualized storage supporting the node block store. A file system volume hosted by the data management subsystem maps to a logical block device hosted by the virtualized storage in the storage management subsystem. The key-value store includes, for a data block of the logical block device, a key that comprises a block identifier for the logical block device and a value that comprises the data block.
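The volume-to-device-to-block mapping in this abstract can be pictured with a small in-memory sketch. The class name `NodeBlockStore`, the `(device_id, lba)` key layout, and the sample identifiers are assumptions for illustration; the abstract only states that the key comprises a block identifier and the value comprises the data block.

```python
class NodeBlockStore:
    """Toy key-value store holding data blocks of logical block devices."""

    def __init__(self):
        self._kv = {}

    def put(self, device_id: str, lba: int, data: bytes) -> None:
        # Hypothetical block identifier: (logical block device, block address).
        self._kv[(device_id, lba)] = data

    def get(self, device_id: str, lba: int):
        return self._kv.get((device_id, lba))


# The data management side maps a file system volume to a logical block
# device; the storage management side resolves blocks of that device.
volume_to_device = {"vol1": "lbd-7"}

store = NodeBlockStore()
store.put(volume_to_device["vol1"], 0, b"hello")

found = store.get(volume_to_device["vol1"], 0)
missing = store.get(volume_to_device["vol1"], 1)
```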
-
Publication No.: US12038886B2
Publication Date: 2024-07-16
Application No.: US18359192
Filing Date: 2023-07-26
Applicant: NetApp, Inc.
Inventor: Ravikanth Dronamraju , Ananthan Subramanian , Daniel McCarthy , Christopher Cason , Arindam Banerjee
IPC: G06F16/18 , G06F9/50 , G06F16/182 , G06F16/188
CPC classification number: G06F16/188 , G06F9/5077 , G06F16/182
Abstract: In various examples, data storage is managed using a distributed storage management system that is resilient. Data blocks of a logical block device may be distributed across multiple nodes in a cluster. The logical block device may correspond to a file system volume associated with a file system instance deployed on a selected node within a distributed block layer of a distributed file system. Each data block may have a location in the cluster identified by a block identifier associated with each data block. Each data block may be replicated on at least one other node in the cluster. A metadata object corresponding to a logical block device that maps to the file system volume may be replicated on at least another node in the cluster. Each data block and the metadata object may be hosted on virtualized storage that is protected using redundant array of independent disks (RAID).
-
Publication No.: US20240118981A1
Publication Date: 2024-04-11
Application No.: US18089192
Filing Date: 2022-12-27
Applicant: NetApp Inc.
Inventor: Yash Hetal Trivedi , Daniel McCarthy , Ananthan Subramanian , William Arthur Gutknecht , Kevin Daniel Varghese , Swaroop Vikram Choudhari , Abdul Basit , Christopher Clark Corey , Chad Edmund Schneider
CPC classification number: G06F11/2025 , G06F11/1662
Abstract: Techniques are provided for failing over an aggregate from one file system instance to a different file system instance of a distributed scale-out storage system. The aggregate may be stored within distributed storage that is accessible to a plurality of file system instances of the distributed scale-out storage system. When the aggregate is failed over from a first file system instance to a second file system instance, the first file system instance may still have a valid read lease that allows the first file system instance to serve client I/O, directed to the aggregate, using a cache. In order to prevent the first file system instance from serving stale data from the cache before the read lease expires, state machines and a set of control data are used to ensure that the second file system instance attaches to the aggregate only after the read lease has expired.
-
Publication No.: US20220278943A1
Publication Date: 2022-09-01
Application No.: US17187336
Filing Date: 2021-02-26
Applicant: NetApp, Inc.
Inventor: Austino Longo , Christopher Cason , Daniel McCarthy
IPC: H04L12/927
Abstract: Systems and methods for quality of service management are provided. According to one embodiment, a non-transitory computer-readable medium comprises instructions that when executed by the processing resource cause the processing resource to receive, in a normalizing agent, one or more compute load parameters from one or more background compute processes executing on the one or more computer systems and one or more Quality of Service (QoS) parameters for one or more client compute processes executing on the one or more computer systems, convert the one or more compute load parameters to one or more normalized utilization metrics, and execute instructions that cause a processor to adjust a compute resource allocation dedicated to the one or more background compute processes based at least in part on the one or more normalized utilization metrics and the one or more QoS parameters.
-