Zero-copy volume move within a distributed storage system

    Publication No.: US12204784B1

    Publication date: 2025-01-21

    Application No.: US18649906

    Filing date: 2024-04-29

    Applicant: NetApp, Inc.

    Abstract: Systems and methods for performing a zero-copy volume move between nodes of a distributed storage system are provided. In one example, an approach for performing a zero-copy volume move is proposed in which volume data may be maintained in place within a storage pod and need not be copied to move a given volume between the source node and the destination node. In one embodiment, metadata (e.g., a top-most physical volume block number (PVBN) of a node tree representing the volume at issue) of a write-anywhere file system is copied from the source node to the destination node. Since the storage pod is associated with a global PVBN space that is visible and accessible to all nodes of the distributed storage system, as a result of copying the top-most PVBN of the volume to the destination node, anything below the top-most PVBN will automatically be visible to the destination node.
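The idea in this abstract can be illustrated with a minimal sketch. It assumes a toy model in which the storage pod is a dictionary keyed by PVBN and each node tracks only the top-most PVBN of the volumes it owns; `StoragePod`, `Node`, and `zero_copy_move` are hypothetical names chosen for illustration and are not taken from the patent.

```python
class StoragePod:
    """Hypothetical shared block store: one global PVBN space visible to all nodes."""

    def __init__(self):
        # pvbn -> ("data", bytes) or ("indirect", [child pvbns])
        self.blocks = {}


class Node:
    """Hypothetical node that owns volumes by remembering their top-most PVBN."""

    def __init__(self, name, pod):
        self.name = name
        self.pod = pod
        self.volumes = {}  # volume name -> top-most PVBN

    def read_volume(self, volume):
        """Walk the tree from the top-most PVBN and return the concatenated data."""
        def walk(pvbn):
            kind, payload = self.pod.blocks[pvbn]
            if kind == "data":
                return payload
            return b"".join(walk(child) for child in payload)

        return walk(self.volumes[volume])


def zero_copy_move(volume, source, destination):
    """Move a volume by handing over only its top-most PVBN (metadata)."""
    destination.volumes[volume] = source.volumes.pop(volume)


pod = StoragePod()
pod.blocks[1] = ("data", b"hello ")
pod.blocks[2] = ("data", b"world")
pod.blocks[10] = ("indirect", [1, 2])  # top-most block of the volume

node_a = Node("a", pod)
node_b = Node("b", pod)
node_a.volumes["vol1"] = 10

blocks_before = dict(pod.blocks)
zero_copy_move("vol1", node_a, node_b)
moved_data = node_b.read_volume("vol1")
```

In this sketch the pod's blocks are untouched by the move; only one integer changes hands, and everything below it becomes reachable from the destination node because both nodes share the same PVBN space.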

    JOURNAL REPLAY OPTIMIZATION
    32.
    Invention publication

    Publication No.: US20240143210A1

    Publication date: 2024-05-02

    Application No.: US18399555

    Filing date: 2023-12-28

    Applicant: NetApp Inc.

    CPC classification number: G06F3/064 G06F3/0619 G06F3/0656 G06F3/0659 G06F3/067

    Abstract: Techniques are provided for journal replay optimization. A distributed storage architecture can implement a journal within memory for logging write operations into log records. Latency of executing the write operations is improved because the write operations can be responded back to clients as complete once logged within the journal without having to store the data to higher latency disk storage. If there is a failure, then a replay process is performed to replay the write operations logged within the journal in order to bring a file system up-to-date. The time to complete the replay of the write operations is significantly reduced by caching metadata (e.g., indirect blocks, checksums, buftree identifiers, file block numbers, and consistency point counts) directly into log records. Replay can quickly access this metadata for replaying the write operations because the metadata does not need to be retrieved from the higher latency disk storage into memory.
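A minimal sketch of the replay optimization described above, under stated assumptions: the metadata cached in a log record is reduced to a single hypothetical "location" entry standing in for an indirect block, and `SlowDisk` counts reads to stand in for higher-latency storage. None of these names come from the patent.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class LogRecord:
    file_block_number: int
    data: bytes
    # Metadata cached in the record at logging time (assumed shape).
    cached_indirect_block: Optional[dict] = None


class SlowDisk:
    """Stand-in for higher-latency storage; counts metadata reads."""

    def __init__(self, indirect_blocks):
        self.indirect_blocks = indirect_blocks
        self.reads = 0

    def read_indirect(self, file_block_number):
        self.reads += 1
        return self.indirect_blocks[file_block_number]


def replay(journal, disk):
    """Replay logged writes, going to disk only when metadata is not cached."""
    file_system = {}
    for record in journal:
        meta = record.cached_indirect_block
        if meta is None:
            meta = disk.read_indirect(record.file_block_number)
        file_system[meta["location"]] = record.data
    return file_system


disk = SlowDisk({0: {"location": 100}, 1: {"location": 101}})
plain = [LogRecord(0, b"A"), LogRecord(1, b"B")]
cached = [
    LogRecord(0, b"A", {"location": 100}),
    LogRecord(1, b"B", {"location": 101}),
]

fs_plain = replay(plain, disk)
reads_plain = disk.reads

disk.reads = 0
fs_cached = replay(cached, disk)
reads_cached = disk.reads
```

Both replays produce the same file system state; the cached variant does so without any metadata reads from the slow tier, which is the source of the reduced replay time the abstract claims.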

    MULTI-TIER WRITE ALLOCATION
    33.
    Invention publication

    Publication No.: US20240012571A1

    Publication date: 2024-01-11

    Application No.: US18357206

    Filing date: 2023-07-24

    Applicant: NetApp Inc.

    CPC classification number: G06F11/1076 G06F11/1004 G06F11/1435 G06F21/602

    Abstract: Techniques are provided for multi-tier write allocation. A storage system may store data within a multi-tier storage environment comprising a first storage tier (e.g., storage devices maintained by the storage system), a second storage tier (e.g., a remote object store provided by a third party storage provider), and/or other storage tiers. A determination is made that data (e.g., data of a write request received by the storage system) is to be stored within the second storage tier. The data is stored into a staging area of the first storage tier. A second storage tier location identifier, for referencing the data according to a format utilized by the second storage tier, is assigned to the data and provided to a file system hosting the data. The data is then destaged from the staging area into the second storage tier, such as within an object stored within the remote object store.
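The staging and destaging flow can be sketched as follows. This is an illustration only: the location identifier is assumed to be an `(object_id, slot)` pair, the remote object store is modeled as an in-memory dictionary, and `MultiTierAllocator` is a hypothetical name.

```python
import itertools


class MultiTierAllocator:
    """Toy model: stage data locally, but address it by its future remote location."""

    def __init__(self, slots_per_object=2):
        self.staging = {}       # location id -> data, held in the first tier
        self.object_store = {}  # object id -> {slot: data}; stand-in for the second tier
        self.file_system = {}   # file block -> location id
        self.slots_per_object = slots_per_object
        self._sequence = itertools.count()

    def write(self, file_block, data):
        """Assign a second-tier location id up front and stage the data locally."""
        n = next(self._sequence)
        location = (n // self.slots_per_object, n % self.slots_per_object)
        self.staging[location] = data
        self.file_system[file_block] = location
        return location

    def destage(self):
        """Move staged data into objects; file system references stay unchanged."""
        for (object_id, slot), data in sorted(self.staging.items()):
            self.object_store.setdefault(object_id, {})[slot] = data
        self.staging.clear()

    def read(self, file_block):
        location = self.file_system[file_block]
        if location in self.staging:
            return self.staging[location]
        object_id, slot = location
        return self.object_store[object_id][slot]


allocator = MultiTierAllocator()
first = allocator.write("f0", b"a")
second = allocator.write("f1", b"b")
third = allocator.write("f2", b"c")
read_before = allocator.read("f0")
allocator.destage()
read_after = allocator.read("f0")
```

Because the file system records the second-tier identifier at write time, destaging does not require rewriting any file system reference in this sketch.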

    JOURNAL REPLAY OPTIMIZATION
    35.
    Invention publication

    Publication No.: US20230342053A1

    Publication date: 2023-10-26

    Application No.: US17728441

    Filing date: 2022-04-25

    Applicant: NetApp Inc.

    CPC classification number: G06F3/064 G06F3/0659 G06F3/0656 G06F3/0619 G06F3/067

    Abstract: Techniques are provided for journal replay optimization. A distributed storage architecture can implement a journal within memory for logging write operations into log records. Latency of executing the write operations is improved because the write operations can be responded back to clients as complete once logged within the journal without having to store the data to higher latency disk storage. If there is a failure, then a replay process is performed to replay the write operations logged within the journal in order to bring a file system up-to-date. The time to complete the replay of the write operations is significantly reduced by caching metadata (e.g., indirect blocks, checksums, buftree identifiers, file block numbers, and consistency point counts) directly into log records. Replay can quickly access this metadata for replaying the write operations because the metadata does not need to be retrieved from the higher latency disk storage into memory.

    Multi-tier write allocation
    36.
    Granted invention patent

    Publication No.: US11709603B2

    Publication date: 2023-07-25

    Application No.: US17833046

    Filing date: 2022-06-06

    Applicant: NetApp Inc.

    CPC classification number: G06F3/0631 G06F3/061 G06F3/067 G06F3/0665

    Abstract: Techniques are provided for multi-tier write allocation. A storage system may store data within a multi-tier storage environment comprising a first storage tier (e.g., storage devices maintained by the storage system), a second storage tier (e.g., a remote object store provided by a third party storage provider), and/or other storage tiers. A determination is made that data (e.g., data of a write request received by the storage system) is to be stored within the second storage tier. The data is stored into a staging area of the first storage tier. A second storage tier location identifier, for referencing the data according to a format utilized by the second storage tier, is assigned to the data and provided to a file system hosting the data. The data is then destaged from the staging area into the second storage tier, such as within an object stored within the remote object store.

    CO-LOCATED JOURNALING AND DATA STORAGE FOR WRITE REQUESTS

    Publication No.: US20220342577A1

    Publication date: 2022-10-27

    Application No.: US17849290

    Filing date: 2022-06-24

    Applicant: NetApp, Inc.

    Abstract: A method and system for co-locating journaling and data storage based on write requests. A write request that includes metadata and data is received from a client. A logical storage unit for storing the metadata and the data is identified. The logical storage unit is divided into a journal partition and a volume partition. The journal partition includes a first log and a second log. Which of the first log and the second log is an active log and which of the first log and the second log is an inactive log are identified. The metadata is recorded in a first location in the active log and the data is recorded in a second location in the active log during a single I/O operation. A reply is sent to the client after the metadata and the data are recorded in the journal partition.
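A rough sketch of the co-located layout described above. The abstract does not say how or when the active and inactive logs swap, so the `flush` step here is an assumption added only to show why two logs are useful; `LogicalStorageUnit` and the `"key"` metadata field are likewise hypothetical.

```python
class LogicalStorageUnit:
    """Toy logical storage unit with a two-log journal partition and a volume partition."""

    def __init__(self):
        self.journal = [[], []]  # first log, second log
        self.active = 0          # index of the active log
        self.volume = {}         # volume partition
        self.io_count = 0

    def handle_write(self, metadata, data):
        """Record metadata and data together in the active log, then reply."""
        # A single append carries both pieces, modeling one I/O operation.
        self.journal[self.active].append((metadata, data))
        self.io_count += 1
        return "ack"

    def flush(self):
        """Assumed behavior: swap logs, then drain the old active log into the volume."""
        draining = self.active
        self.active = 1 - self.active
        for metadata, data in self.journal[draining]:
            self.volume[metadata["key"]] = data
        self.journal[draining].clear()


unit = LogicalStorageUnit()
reply = unit.handle_write({"key": "block-7"}, b"payload")
ios_after_write = unit.io_count
unit.flush()
```

The client reply is sent as soon as the journal append completes; the volume partition is updated later, while new writes land in the other log.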

    Object format resilient to remote object store errors

    Publication No.: US10853166B2

    Publication date: 2020-12-01

    Application No.: US16459790

    Filing date: 2019-07-02

    Applicant: NetApp Inc.

    Abstract: Techniques are provided for remote object store error handling. A storage system may store data within one or more tiers of storage, such as a local storage tier (e.g., solid state storage and disks maintained by the storage system), a remote object store (e.g., storage provided by a third party storage provider), and/or other storage tiers. Because the remote object store may not provide the same data consistency and guarantees that the storage system provides for clients such as through the local storage tier, additional validation is provided by the storage system for the remote object store. For example, when data is put into an object of the remote object store, a verification get operation is performed to read and validate information within a header of the object. Other verifications and checks are performed such as using a locally stored metafile to detect corrupt or lost metadata and/or objects.
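The put-then-verify step can be sketched as below. The header layout (an object name plus a SHA-256 checksum of the payload) is an assumption made for illustration; the abstract says only that information within the object's header is read back and validated. `FlakyObjectStore` is a hypothetical stand-in for a remote store that may corrupt what it is given.

```python
import hashlib


class FlakyObjectStore:
    """Stand-in for a remote object store that may silently corrupt some objects."""

    def __init__(self, corrupt_names=()):
        self.objects = {}
        self.corrupt_names = set(corrupt_names)

    def put(self, name, obj):
        if name in self.corrupt_names:
            # Simulate the remote side damaging the stored header.
            damaged_header = {**obj["header"], "checksum": "bad"}
            obj = {"header": damaged_header, "payload": obj["payload"]}
        self.objects[name] = obj

    def get(self, name):
        return self.objects[name]


def put_with_verification(store, name, payload):
    """PUT an object, then GET it back and validate its header."""
    header = {"name": name, "checksum": hashlib.sha256(payload).hexdigest()}
    store.put(name, {"header": header, "payload": payload})
    stored_header = store.get(name)["header"]
    return stored_header == header


ok_result = put_with_verification(FlakyObjectStore(), "obj-1", b"data")
bad_result = put_with_verification(
    FlakyObjectStore(corrupt_names={"obj-1"}), "obj-1", b"data"
)
```

A failed verification tells the storage system that the object cannot be trusted, so it can retry or keep the data in the local tier rather than assume the remote write succeeded.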

    MULTI-TIER WRITE ALLOCATION
    39.
    Invention application

    Publication No.: US20200073567A1

    Publication date: 2020-03-05

    Application No.: US16679914

    Filing date: 2019-11-11

    Applicant: NetApp Inc.

    Abstract: Techniques are provided for multi-tier write allocation. A storage system may store data within a multi-tier storage environment comprising a first storage tier (e.g., storage devices maintained by the storage system), a second storage tier (e.g., a remote object store provided by a third party storage provider), and/or other storage tiers. A determination is made that data (e.g., data of a write request received by the storage system) is to be stored within the second storage tier. The data is stored into a staging area of the first storage tier. A second storage tier location identifier, for referencing the data according to a format utilized by the second storage tier, is assigned to the data and provided to a file system hosting the data. The data is then destaged from the staging area into the second storage tier, such as within an object stored within the remote object store.

    OBJECT FORMAT RESILIENT TO REMOTE OBJECT STORE ERRORS

    Publication No.: US20190324844A1

    Publication date: 2019-10-24

    Application No.: US16459790

    Filing date: 2019-07-02

    Applicant: NetApp Inc.

    Abstract: Techniques are provided for remote object store error handling. A storage system may store data within one or more tiers of storage, such as a local storage tier (e.g., solid state storage and disks maintained by the storage system), a remote object store (e.g., storage provided by a third party storage provider), and/or other storage tiers. Because the remote object store may not provide the same data consistency and guarantees that the storage system provides for clients such as through the local storage tier, additional validation is provided by the storage system for the remote object store. For example, when data is put into an object of the remote object store, a verification get operation is performed to read and validate information within a header of the object. Other verifications and checks are performed such as using a locally stored metafile to detect corrupt or lost metadata and/or objects.
