Distributed File System that Provides Scalability and Resiliency

    公开(公告)号:US20250013614A1

    公开(公告)日:2025-01-09

    申请号:US18780353

    申请日:2024-07-22

    Applicant: NetApp, Inc.

    Abstract: A distributed storage management system comprising nodes that form a cluster, a distributed block layer that spans the nodes in the cluster, and file system instances deployed on the nodes. Each file system instance comprises a data management subsystem and a storage management subsystem disaggregated from the data management subsystem. The storage management subsystem comprises a node block store that forms a portion of the distributed block layer and a storage manager that manages a key-value store and virtualized storage supporting the node block store. A file system volume hosted by the data management subsystem maps to a logical block device hosted by the virtualized storage in the storage management subsystem. The key-value store includes, for a data block of the logical block device, a key that comprises a block identifier for the logical block device and a value that comprises the data block.

    OBJECT AND SEQUENCE NUMBER MANAGEMENT

    公开(公告)号:US20250005003A1

    公开(公告)日:2025-01-02

    申请号:US18828077

    申请日:2024-09-09

    Applicant: NetApp, Inc.

    Abstract: Techniques are provided for orphan object detection, invalid sequence number detection, and asynchronous object cleanup. A storage system may store data within one or more tiers of storage, such as a storage tier (e.g., solid state storage and disks maintained by the storage system), a remote object store (e.g., storage provided by a third party storage provider), and/or other storage tiers. Orphan objects, within the remote object store, that are no longer used by the storage system may be detected and/or deleted. When an aggregate of volumes is deleted, corresponding objects, within the remote object store, may be identified and/or deleted. Invalid sequence numbers (e.g., lost or corrupt sequence numbers locally maintained in a metafile) assigned to objects within the remote object store may be identified, deleted, and/or fixed.

    METADATA COMPACTION
    93.
    发明申请

    公开(公告)号:US20240419363A1

    公开(公告)日:2024-12-19

    申请号:US18815747

    申请日:2024-08-26

    Applicant: NetApp, Inc.

    Abstract: Techniques are provided for compacting indirect blocks. For example, an object is represented as a structure comprising data blocks within which data of the object is stored and indirect blocks comprising block numbers of where the data blocks are located in storage. Block numbers within a set of indirect blocks are compacted into a compacted indirect block comprising a base block number, a count of additional block numbers after the base block number in the compacted indirect block, and a pattern of the block numbers in the compacted indirect block. The compacted indirect block is stored into memory for processing access operations to the object. Storing compacted indirect blocks into memory allows for more block numbers to be stored within memory. This improves the processing of access operations because reading the block numbers from memory is faster than loading the block numbers from disk.

    Write ordering for persistent memory

    公开(公告)号:US12141481B2

    公开(公告)日:2024-11-12

    申请号:US18513894

    申请日:2023-11-20

    Applicant: NetApp Inc.

    Abstract: Techniques are provided for implementing write ordering for persistent memory. A set of actions are identified for commitment to persistent memory of a node for executing an operation upon the persistent memory. An episode is created to comprise a first subset of actions of the set of actions that can be committed to the persistent memory in any order with respect to one another such that a consistent state of the persistent memory can be reconstructed in the event of a crash of the node during execution of the operation. The first subset of actions within the episode are committed to the persistent memory and further execution of the operation is blocked until the episode completes.

    Distributed file system with reduced write and read latencies

    公开(公告)号:US12141104B2

    公开(公告)日:2024-11-12

    申请号:US17449760

    申请日:2021-10-01

    Applicant: NetApp, Inc.

    Abstract: A method for reducing write latency in a distributed file system. A write request that includes a volume identifier is received at a data management subsystem deployed on a node within a distributed storage system. The data management subsystem maps the volume identifier to a file system volume and maps the file system volume to a set of logical block addresses in a logical block device in a storage management subsystem deployed on the node. The storage management subsystem maps the logical block device to a metadata object for the logical block device on the node that is used to process the write request. The mapping of the file system volume to the set of logical block addresses in the logical block device enables co-locating the metadata object with the file system volume on the node, which reduces the write latency associated with processing the write request.

    SLICE FILE RECOVERY USING DEAD REPLICA SLICE FILES

    公开(公告)号:US20240338128A1

    公开(公告)日:2024-10-10

    申请号:US18744814

    申请日:2024-06-17

    Applicant: NetApp, Inc.

    CPC classification number: G06F3/0619 G06F3/064 G06F3/067

    Abstract: Techniques are provided for repairing a primary slice file, affected by a storage device error, by using one or more dead replica slice files. The primary slice file is used by a node of a distributed storage architecture as an indirection layer between storage containers (e.g., a volume or LUN) and physical storage where data is physically stored. To improve resiliency of the distributed storage architecture, changes to the primary slice file are replicated to replica slice files hosted by other nodes. If a replica slice file falls out of sync with the primary slice file, then the replica slice file is considered dead (out of sync) and could potentially comprise stale data. If a storage device error affects blocks storing data of the primary slice file, then the techniques provided herein can repair the primary slice file using non-stale data from one or more dead replica slice files.

    Distributed file system that provides scalability and resiliency

    公开(公告)号:US12045207B2

    公开(公告)日:2024-07-23

    申请号:US17449758

    申请日:2021-10-01

    Applicant: NetApp, Inc.

    CPC classification number: G06F16/188 G06F9/5077 G06F16/182

    Abstract: A distributed storage management system comprising nodes that form a cluster, a distributed block layer that spans the nodes in the cluster, and file system instances deployed on the nodes. Each file system instance comprises a data management subsystem and a storage management subsystem disaggregated from the data management subsystem. The storage management subsystem comprises a node block store that forms a portion of the distributed block layer and a storage manager that manages a key-value store and virtualized storage supporting the node block store. A file system volume hosted by the data management subsystem maps to a logical block device hosted by the virtualized storage in the storage management subsystem. The key-value store includes, for a data block of the logical block device, a key that comprises a block identifier for the logical block device and a value that comprises the data block.

    Distributed file system that provides scalability and resiliency

    公开(公告)号:US12038886B2

    公开(公告)日:2024-07-16

    申请号:US18359192

    申请日:2023-07-26

    Applicant: NetApp, Inc.

    CPC classification number: G06F16/188 G06F9/5077 G06F16/182

    Abstract: In various examples, data storage is managed using a distributed storage management system that is resilient. Data blocks of a logical block device may be distributed across multiple nodes in a cluster. The logical block device may correspond to a file system volume associated with a file system instance deployed on a selected node within a distributed block layer of a distributed file system. Each data block may have a location in the cluster identified by a block identifier associated with each data block. Each data block may be replicated on at least one other node in the cluster. A metadata object corresponding to a logical block device that maps to the file system volume may be replicated on at least another node in the cluster. Each data block and the metadata object may be hosted on virtualized storage that is protected using redundant array independent disks (RAID).

Patent Agency Ranking