Zero-copy volume move within a distributed storage system

    Publication No.: US12204784B1

    Publication date: 2025-01-21

    Application No.: US18649906

    Filing date: 2024-04-29

    Applicant: NetApp, Inc.

    Abstract: Systems and methods for performing a zero-copy volume move between nodes of a distributed storage system are provided. In one example, an approach for performing a zero-copy volume move is proposed in which volume data may be maintained in place within a storage pod and need not be copied to move a given volume between the source node and the destination node. In one embodiment, metadata (e.g., a top-most physical volume block number (PVBN) of a node tree representing the volume at issue) of a write-anywhere file system is copied from the source node to the destination node. Since the storage pod is associated with a global PVBN space that is visible and accessible to all nodes of the distributed storage system, as a result of copying the top-most PVBN of the volume to the destination node, anything below the top-most PVBN will automatically be visible to the destination node.
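
The idea in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `StoragePod`, `Node`, and `move_volume` names, the dictionary-based block store, and the sample PVBN values are all assumptions made for illustration. The sketch only shows that handing over the top-most PVBN is enough for the destination node to reach every block beneath it, because the PVBN space is shared.

```python
from dataclasses import dataclass, field


@dataclass
class StoragePod:
    """Hypothetical model of a global PVBN space visible to every node."""
    blocks: dict = field(default_factory=dict)  # pvbn -> (payload, child PVBNs)


@dataclass
class Node:
    name: str
    pod: StoragePod
    volumes: dict = field(default_factory=dict)  # volume name -> top-most PVBN

    def read_volume(self, volume):
        """Walk the node tree from the top-most PVBN and collect payloads."""
        out, stack = [], [self.volumes[volume]]
        while stack:
            payload, children = self.pod.blocks[stack.pop()]
            out.append(payload)
            stack.extend(reversed(children))  # visit children left to right
        return out


def move_volume(volume, source, destination):
    """Hand over only the top-most PVBN; no block in the pod is copied."""
    destination.volumes[volume] = source.volumes.pop(volume)


# Illustrative data: one root block with two children (values are made up).
pod = StoragePod()
pod.blocks = {100: ("root", [101, 102]), 101: ("data-a", []), 102: ("data-b", [])}
node_a = Node("node-a", pod, {"vol1": 100})
node_b = Node("node-b", pod)

blocks_before = dict(pod.blocks)
move_volume("vol1", node_a, node_b)
result = node_b.read_volume("vol1")
```

After the move, `node_b` can read the whole tree while the pod's block map is unchanged, which is the "zero-copy" property the abstract describes.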

    Distributed File System that Provides Scalability and Resiliency

    Publication No.: US20240370410A1

    Publication date: 2024-11-07

    Application No.: US18773483

    Filing date: 2024-07-15

    Applicant: NetApp, Inc.

    Abstract: In various examples, data storage is managed using a distributed storage management system that is resilient. Data blocks of a logical block device may be distributed across multiple nodes in a cluster. The logical block device may correspond to a file system volume associated with a file system instance deployed on a selected node within a distributed block layer of a distributed file system. Each data block may have a location in the cluster identified by a block identifier associated with each data block. Each data block may be replicated on at least one other node in the cluster. A metadata object corresponding to a logical block device that maps to the file system volume may be replicated on at least another node in the cluster. Each data block and the metadata object may be hosted on virtualized storage that is protected using redundant array of independent disks (RAID).
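
A small sketch can make the placement idea concrete. The abstract does not say how a block identifier maps to a location, so the hash-based scheme below is purely an assumption, as are the `place_block` function and the node names. It shows one way a block identifier could determine a primary node plus at least one replica on another node.

```python
import hashlib


def place_block(block_id: str, nodes: list[str], replicas: int = 2) -> list[str]:
    """Return the nodes holding a block: a primary plus (replicas - 1) copies.

    Assumed scheme: hash the block identifier to pick a starting node, then
    take the following nodes in ring order so every copy is on a distinct node.
    """
    if not 1 <= replicas <= len(nodes):
        raise ValueError("replicas must be between 1 and the number of nodes")
    digest = hashlib.sha256(block_id.encode()).digest()
    start = int.from_bytes(digest[:8], "big") % len(nodes)
    return [nodes[(start + i) % len(nodes)] for i in range(replicas)]


cluster = ["node-1", "node-2", "node-3"]  # hypothetical cluster
placement = {bid: place_block(bid, cluster) for bid in ("blk-0", "blk-1", "blk-2")}
```

Because the placement is a pure function of the block identifier, any node can recompute where a block and its replica live without consulting a central table; a real system would also have to handle node membership changes, which this sketch ignores.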

    JOURNAL REPLAY OPTIMIZATION
    Document type: Invention publication

    Publication No.: US20240143210A1

    Publication date: 2024-05-02

    Application No.: US18399555

    Filing date: 2023-12-28

    Applicant: NetApp Inc.

    CPC classification number: G06F3/064 G06F3/0619 G06F3/0656 G06F3/0659 G06F3/067

    Abstract: Techniques are provided for journal replay optimization. A distributed storage architecture can implement a journal within memory for logging write operations into log records. Latency of executing the write operations is improved because the write operations can be acknowledged to clients as complete once logged within the journal, without first storing the data to higher-latency disk storage. If there is a failure, then a replay process is performed to replay the write operations logged within the journal in order to bring a file system up-to-date. The time to complete the replay of the write operations is significantly reduced by caching metadata (e.g., indirect blocks, checksums, buftree identifiers, file block numbers, and consistency point counts) directly into log records. Replay can quickly access this metadata for replaying the write operations because the metadata does not need to be retrieved from the higher-latency disk storage into memory.
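
The benefit of caching metadata in log records can be sketched as follows. This is an illustrative model only: `LogRecord`, `Disk`, `replay`, and the `"slot"` field are hypothetical names, and a single cached indirect block stands in for the several kinds of metadata the abstract lists. The sketch counts disk reads during replay with and without cached metadata.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class LogRecord:
    file_block_number: int
    data: bytes
    # Metadata cached directly in the record (None means "not cached").
    cached_indirect_block: Optional[dict] = None


class Disk:
    """Stand-in for higher-latency storage; counts metadata reads."""

    def __init__(self, indirect_blocks):
        self.indirect_blocks = indirect_blocks
        self.reads = 0

    def load_indirect_block(self, fbn):
        self.reads += 1
        return self.indirect_blocks[fbn]


def replay(journal, disk, file_system):
    """Re-apply logged writes, using cached metadata when it is present."""
    for record in journal:
        indirect = record.cached_indirect_block
        if indirect is None:  # fall back to the slow path
            indirect = disk.load_indirect_block(record.file_block_number)
        file_system[indirect["slot"]] = record.data


disk = Disk({7: {"slot": "s7"}, 8: {"slot": "s8"}})
cold_journal = [LogRecord(7, b"A"), LogRecord(8, b"B")]
warm_journal = [LogRecord(7, b"A", {"slot": "s7"}), LogRecord(8, b"B", {"slot": "s8"})]

fs_cold, fs_warm = {}, {}
replay(cold_journal, disk, fs_cold)
cold_reads = disk.reads
disk.reads = 0
replay(warm_journal, disk, fs_warm)
warm_reads = disk.reads
```

Both replays produce the same file system state, but the journal with cached metadata needs no metadata reads from disk, which is where the abstract's replay-time saving comes from.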

    MULTI-TIER WRITE ALLOCATION
    Document type: Invention publication

    Publication No.: US20240012571A1

    Publication date: 2024-01-11

    Application No.: US18357206

    Filing date: 2023-07-24

    Applicant: NetApp Inc.

    CPC classification number: G06F11/1076 G06F11/1004 G06F11/1435 G06F21/602

    Abstract: Techniques are provided for multi-tier write allocation. A storage system may store data within a multi-tier storage environment comprising a first storage tier (e.g., storage devices maintained by the storage system), a second storage tier (e.g., a remote object store provided by a third party storage provider), and/or other storage tiers. A determination is made that data (e.g., data of a write request received by the storage system) is to be stored within the second storage tier. The data is stored into a staging area of the first storage tier. A second storage tier location identifier, for referencing the data according to a format utilized by the second storage tier, is assigned to the data and provided to a file system hosting the data. The data is then destaged from the staging area into the second storage tier, such as within an object stored within the remote object store.
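
The stage-then-destage flow can be sketched in a few lines. Everything here is an assumption for illustration: the `TieredStore` class, the `(object name, slot)` tuple used as the second-tier location identifier, and the in-memory dictionaries standing in for both tiers. The point is that the data receives its second-tier identifier while it is still in the first-tier staging area, so the identifier stays valid after destaging.

```python
from dataclasses import dataclass, field


@dataclass
class TieredStore:
    staging: dict = field(default_factory=dict)       # first tier: location -> data
    object_store: dict = field(default_factory=dict)  # second tier: object -> {slot: data}
    next_slot: int = 0

    def write(self, data: bytes, object_name: str = "obj-0"):
        """Stage the data and return its second-tier location identifier."""
        location = (object_name, self.next_slot)  # assumed identifier format
        self.next_slot += 1
        self.staging[location] = data
        return location

    def destage(self):
        """Move all staged data into objects in the second tier."""
        for (object_name, slot), data in self.staging.items():
            self.object_store.setdefault(object_name, {})[slot] = data
        self.staging.clear()

    def read(self, location):
        """Resolve a location identifier from whichever tier holds the data."""
        if location in self.staging:
            return self.staging[location]
        object_name, slot = location
        return self.object_store[object_name][slot]


store = TieredStore()
loc = store.write(b"payload")
before = store.read(loc)   # served from the staging area
store.destage()
after = store.read(loc)    # served from the object store, same identifier
```

The file system only ever holds `loc`, so it does not need to be updated when the data moves between tiers.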

    Block allocation for persistent memory during aggregate transition

    Publication No.: US11836363B2

    Publication date: 2023-12-05

    Application No.: US17750793

    Filing date: 2022-05-23

    Applicant: NetApp Inc.

    CPC classification number: G06F3/0631 G06F3/0604 G06F3/064 G06F3/065 G06F3/0679

    Abstract: Techniques are provided for block allocation for persistent memory during aggregate transition. In a high availability pair including first and second nodes, the first node makes a determination that control of a first aggregate is to transition from the first node to the second node. A portion of available free storage space is allocated from a first persistent memory of the first node as allocated pages within the first persistent memory. Metadata information for the allocated pages is updated with an identifier of the first aggregate to create updated metadata information reserving the allocated pages for the first aggregate. The updated metadata information is mirrored to the second node, so that the second node also reserves those pages. Control of the first aggregate is transitioned to the second node. As a result, the nodes do not attempt allocating the same free pages to different aggregates during a transition.
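
A minimal sketch of the reservation step, under stated assumptions: each node tracks page ownership in a dictionary, page numbers correspond across the two nodes, and the `reserve_pages` and `mirror_reservation` helpers are hypothetical names. It shows why mirroring the updated metadata keeps the partner from handing the same free pages to a different aggregate.

```python
FREE = None  # marker for an unowned page


def reserve_pages(page_owner, aggregate_id, count):
    """Tag `count` free pages with the aggregate's identifier and return them."""
    free = [page for page, owner in page_owner.items() if owner is FREE][:count]
    if len(free) < count:
        raise RuntimeError("not enough free pages")
    for page in free:
        page_owner[page] = aggregate_id
    return free


def mirror_reservation(reserved, aggregate_id, partner_page_owner):
    """Apply the same reservation to the partner node's page metadata."""
    for page in reserved:
        partner_page_owner[page] = aggregate_id


# Both nodes start with four free pages (illustrative sizes).
node1_pages = {page: FREE for page in range(4)}
node2_pages = {page: FREE for page in range(4)}

# Before handing aggregate "aggr1" to node 2, node 1 reserves pages for it
# and mirrors the updated metadata.
reserved = reserve_pages(node1_pages, "aggr1", 2)
mirror_reservation(reserved, "aggr1", node2_pages)

# A later allocation on node 2 for another aggregate skips the reserved pages.
other = reserve_pages(node2_pages, "aggr2", 1)
```

Without the mirroring step, node 2 would still see pages 0 and 1 as free and could assign them to `aggr2` during the transition.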

    JOURNAL REPLAY OPTIMIZATION
    Document type: Invention publication

    Publication No.: US20230342053A1

    Publication date: 2023-10-26

    Application No.: US17728441

    Filing date: 2022-04-25

    Applicant: NetApp Inc.

    CPC classification number: G06F3/064 G06F3/0659 G06F3/0656 G06F3/0619 G06F3/067

    Abstract: Techniques are provided for journal replay optimization. A distributed storage architecture can implement a journal within memory for logging write operations into log records. Latency of executing the write operations is improved because the write operations can be acknowledged to clients as complete once logged within the journal, without first storing the data to higher-latency disk storage. If there is a failure, then a replay process is performed to replay the write operations logged within the journal in order to bring a file system up-to-date. The time to complete the replay of the write operations is significantly reduced by caching metadata (e.g., indirect blocks, checksums, buftree identifiers, file block numbers, and consistency point counts) directly into log records. Replay can quickly access this metadata for replaying the write operations because the metadata does not need to be retrieved from the higher-latency disk storage into memory.

    PERSISTENT MEMORY FILE SYSTEM RECONCILIATION
    Document type: Invention publication

    Publication No.: US20230251932A1

    Publication date: 2023-08-10

    Application No.: US18302212

    Filing date: 2023-04-18

    Applicant: NetApp Inc.

    CPC classification number: G06F11/1435 G06F13/1668 G06F16/183

    Abstract: Techniques are provided for persistent memory file system reconciliation. As part of the persistent memory file system reconciliation, high level file system metadata associated with a persistent memory file system of persistent memory is reconciled. Client access to the persistent memory file system is blocked until reconciliation of the high level file system metadata has completed. A first scanner is executed to traverse pages of the persistent memory in order to fix local inconsistencies associated with the pages. A local inconsistency of a first set of metadata or data of a page is fixed using a second set of metadata or data of the page. The first scanner is executed asynchronously in parallel with processing client I/O directed to the persistent memory file system.
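
The page scanner can be sketched as below. The abstract does not specify what the two sets of metadata or data are, so this sketch assumes a primary copy guarded by a CRC32 checksum and a backup copy in the same page; `Page`, `make_page`, and `scan_and_fix` are hypothetical names. The sketch runs the scanner synchronously and leaves out the asynchronous, in-parallel-with-client-I/O aspect.

```python
import zlib
from dataclasses import dataclass


@dataclass
class Page:
    primary: bytes      # first set of data in the page
    primary_crc: int    # checksum recorded for the primary copy
    backup: bytes       # second set of data in the same page


def make_page(payload: bytes) -> Page:
    return Page(payload, zlib.crc32(payload), payload)


def scan_and_fix(pages):
    """Traverse pages and repair any primary copy that fails its checksum."""
    fixed = []
    for index, page in enumerate(pages):
        if zlib.crc32(page.primary) != page.primary_crc:
            page.primary = page.backup               # repair from the second set
            page.primary_crc = zlib.crc32(page.backup)
            fixed.append(index)
    return fixed


pages = [make_page(b"a"), make_page(b"b")]
pages[1].primary = b"X"  # simulate a local inconsistency in page 1
fixed = scan_and_fix(pages)
```

Each repair uses only information inside the page itself, which is what makes the inconsistency "local" and lets a scanner of this kind run page by page.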
