Collection Of State Information By Nodes In A Cluster To Handle Cluster Management After Master-Node Failover

    公开(公告)号:US20250013544A1

    公开(公告)日:2025-01-09

    申请号:US18808517

    申请日:2024-08-19

    Applicant: NetApp, Inc.

    Abstract: The disclosed technology enables quicker initialization of a new master node for a cluster when a previous master node fails by tracking node state in the cluster prior to being designated the new master node. In a particular example, a method includes, in a first node, designated as a current master node for the cluster, managing the cluster based on states of the nodes determined by the first node. While the first node is designated the master node, the method includes each of the nodes collecting, and storing locally, the states of the nodes. In response to a failure of the first node, the method includes selecting a second node of the nodes a new master node. Upon being designated the new master node, the method includes the second node managing the cluster of nodes based on the states of the nodes that the second node collected and stored locally.

    Distributed file system with reduced write and read latencies

    公开(公告)号:US12141104B2

    公开(公告)日:2024-11-12

    申请号:US17449760

    申请日:2021-10-01

    Applicant: NetApp, Inc.

    Abstract: A method for reducing write latency in a distributed file system. A write request that includes a volume identifier is received at a data management subsystem deployed on a node within a distributed storage system. The data management subsystem maps the volume identifier to a file system volume and maps the file system volume to a set of logical block addresses in a logical block device in a storage management subsystem deployed on the node. The storage management subsystem maps the logical block device to a metadata object for the logical block device on the node that is used to process the write request. The mapping of the file system volume to the set of logical block addresses in the logical block device enables co-locating the metadata object with the file system volume on the node, which reduces the write latency associated with processing the write request.

    Dynamic quality of service implementation based upon resource saturation

    公开(公告)号:US12135880B1

    公开(公告)日:2024-11-05

    申请号:US18307097

    申请日:2023-04-26

    Applicant: NetApp Inc.

    Abstract: Techniques are provided for dynamically implementing quality of service policies for a distributed storage system based upon resources saturation. A quality of service policy is defined for throttling I/O operations received by a node of the distributed storage system based upon whether resources of the node have become saturated. The quality of service policy is dynamically implemented based upon ever changing resource utilization and saturation. Dynamically implementing the quality of service policy improves the ability to efficiently utilize resources of the node compared to conventional static polices that cannot adequately react to such changing considerations and resource utilization/saturation. With conventional static policies, an administrator manually defines a minimum amount of guaranteed resources and/or a maximum resource usage cap that could be set to values that result in inefficient operation and resource starvation. Dynamically implementing the quality of service policy results in more efficient operation and mitigates resource starvation.

    SLICE FILE RECOVERY USING DEAD REPLICA SLICE FILES

    公开(公告)号:US20240338128A1

    公开(公告)日:2024-10-10

    申请号:US18744814

    申请日:2024-06-17

    Applicant: NetApp, Inc.

    CPC classification number: G06F3/0619 G06F3/064 G06F3/067

    Abstract: Techniques are provided for repairing a primary slice file, affected by a storage device error, by using one or more dead replica slice files. The primary slice file is used by a node of a distributed storage architecture as an indirection layer between storage containers (e.g., a volume or LUN) and physical storage where data is physically stored. To improve resiliency of the distributed storage architecture, changes to the primary slice file are replicated to replica slice files hosted by other nodes. If a replica slice file falls out of sync with the primary slice file, then the replica slice file is considered dead (out of sync) and could potentially comprise stale data. If a storage device error affects blocks storing data of the primary slice file, then the techniques provided herein can repair the primary slice file using non-stale data from one or more dead replica slice files.

    Distributed file system that provides scalability and resiliency

    公开(公告)号:US12045207B2

    公开(公告)日:2024-07-23

    申请号:US17449758

    申请日:2021-10-01

    Applicant: NetApp, Inc.

    CPC classification number: G06F16/188 G06F9/5077 G06F16/182

    Abstract: A distributed storage management system comprising nodes that form a cluster, a distributed block layer that spans the nodes in the cluster, and file system instances deployed on the nodes. Each file system instance comprises a data management subsystem and a storage management subsystem disaggregated from the data management subsystem. The storage management subsystem comprises a node block store that forms a portion of the distributed block layer and a storage manager that manages a key-value store and virtualized storage supporting the node block store. A file system volume hosted by the data management subsystem maps to a logical block device hosted by the virtualized storage in the storage management subsystem. The key-value store includes, for a data block of the logical block device, a key that comprises a block identifier for the logical block device and a value that comprises the data block.

    Distributed file system that provides scalability and resiliency

    公开(公告)号:US12038886B2

    公开(公告)日:2024-07-16

    申请号:US18359192

    申请日:2023-07-26

    Applicant: NetApp, Inc.

    CPC classification number: G06F16/188 G06F9/5077 G06F16/182

    Abstract: In various examples, data storage is managed using a distributed storage management system that is resilient. Data blocks of a logical block device may be distributed across multiple nodes in a cluster. The logical block device may correspond to a file system volume associated with a file system instance deployed on a selected node within a distributed block layer of a distributed file system. Each data block may have a location in the cluster identified by a block identifier associated with each data block. Each data block may be replicated on at least one other node in the cluster. A metadata object corresponding to a logical block device that maps to the file system volume may be replicated on at least another node in the cluster. Each data block and the metadata object may be hosted on virtualized storage that is protected using redundant array independent disks (RAID).

    CENTRALIZED QUALITY OF SERVICE MANAGEMENT

    公开(公告)号:US20220278943A1

    公开(公告)日:2022-09-01

    申请号:US17187336

    申请日:2021-02-26

    Applicant: NetApp, Inc.

    Abstract: Systems and methods for quality of service management are provided. According to one embodiment, a non-transitory computer-readable medium comprises instructions that when executed by the processing resource cause the processing resource to receive, in a normalizing agent, one or more compute load parameters from one or more background compute processes executing on the one or more computer systems and one or more Quality of Service (QoS) parameters for one or more client compute processes executing on the one or more computer systems, convert the one or more compute load parameters to one or more normalized utilization metrics, and execute instructions that cause a processor to adjust a compute resource allocation dedicated to the one or more background compute processes based at least in part on the one or more normalized utilization metrics and the one or more QoS parameters

Patent Agency Ranking