TECHNIQUE FOR EFFICIENTLY INDEXING DATA OF AN ARCHIVAL STORAGE SYSTEM

    公开(公告)号:US20230029677A1

    公开(公告)日:2023-02-02

    申请号:US17487935

    申请日:2021-09-28

    Applicant: Nutanix, Inc.

    Abstract: An indexing technique provides an index data structure for efficient retrieval of a snapshot from a long-term storage service (LTSS) of an archival storage system. The snapshot is generated from typed data of a logical entity, such as a virtual disk (vdisk). The data of the snapshot is replicated to a frontend data service of the LTSS sequentially and organized as one or more data objects for storage by a backend data service of LTSS in an object store of the archival storage system. Metadata associated with the snapshot (i.e., snapshot metadata) is recorded as a log and persistently stored on storage media local to the frontend data service. The snapshot metadata includes information describing the snapshot data, e.g., a logical offset range of a snapshot of the vdisk and, thus, is used to construct the index data structure. Notably, construction of the index data structure is deferred until after the entirety of the snapshot data has been replicated and received by the frontend data service.

    BYPASSING TECHNIQUE TO ENABLE DIRECT ACCESS TO SNAPSHOT DATA IN OBJECT STORE

    公开(公告)号:US20240297786A1

    公开(公告)日:2024-09-05

    申请号:US18237315

    申请日:2023-08-23

    Applicant: Nutanix, Inc.

    CPC classification number: H04L9/088 G06F16/128

    Abstract: A bypassing technique bypasses an indexing service and provides a bypass data path for transferring/retrieving snapshots from a production cluster to an object store. In an embodiment, the production cluster may determine how extents of the snapshots are packed into objects of the object store and transfers the snapshots directly to the object store over the bypass data path. Once the snapshot transfer is completed, the production cluster provides location metadata as to how the snapshot extents are packed into objects to the indexing service. The indexing service is invoked to create an index of the location metadata and is not involved in the data transfer of the snapshots. In another embodiment, the production cluster identifies a snapshot to restore and queries the indexing service to compute the deltas between the snapshot to be restored and a reference snapshot. The indexing service returns a set of segments that indicates the changed delta regions between the two snapshots. The production cluster then issues a direct read to the object store over the bypass data path to read the changed delta segments.

    TECHNIQUE FOR MANAGING MULTIPLE SNAPSHOT STORAGE SERVICE INSTANCES ON-DEMAND

    公开(公告)号:US20240427733A1

    公开(公告)日:2024-12-26

    申请号:US18826953

    申请日:2024-09-06

    Applicant: Nutanix, Inc.

    Abstract: A technique allows instantiation and running on demand of long-term snapshot storage services of an archival storage system at various geographical locations. Storage service instances are configured to provide storage and retrieval of large amounts of point-in-time images or snapshots (e.g., recovery points) of application workloads stored as objects on one or more buckets of a shared object store. The storage service instances may contemporaneously serve snapshots of a same set of buckets on the shared object store without interfering with each other in a contention-free manner. That is, the technique enables storage service instances that are associated with snapshot workload data and/or metadata stored, e.g., as objects of a recovery point, on the same set of buckets to coexist without knowledge of each other. The storage service instances can be created and destroyed on-demand by splitting and merging the existing instances.

    High frequency snapshot technique for improving data replication in disaster recovery environment

    公开(公告)号:US12259790B2

    公开(公告)日:2025-03-25

    申请号:US17388735

    申请日:2021-07-29

    Applicant: Nutanix, Inc.

    Abstract: A high frequency snapshot technique improves data replication in a disaster recovery (DR) environment. A base snapshot is generated from failover data at a primary site and replicated to a placeholder file at a secondary site. Upon commencement of the base snapshot generation and replication, incremental light weight snapshots (LWSs) of the failover data are captured and replicated to the secondary site. A staging file at the secondary site accumulates the replicated LWSs (“high-frequency snapshots”). The staging file is populated with the LWSs in parallel with the replication of the base snapshot at the placeholder file. At a subsequent predetermined time interval, the accumulated LWSs are synthesized to capture a “checkpoint” snapshot by applying and pruning the accumulated LWSs at the staging file. Once the base snapshot is fully replicated, the pruned LWSs are merged to the base snapshot to synchronize the replicated failover data.

    HIGH FREQUENCY SNAPSHOT TECHNIQUE FOR IMPROVING DATA REPLICATION IN DISASTER RECOVERY ENVIRONMENT

    公开(公告)号:US20220398163A1

    公开(公告)日:2022-12-15

    申请号:US17388735

    申请日:2021-07-29

    Applicant: Nutanix, Inc.

    Abstract: A high frequency snapshot technique improves data replication in a disaster recovery (DR) environment. A base snapshot is generated from failover data at a primary site and replicated to a placeholder file at a secondary site. Upon commencement of the base snapshot generation and replication, incremental light weight snapshots (LWSs) of the failover data are captured and replicated to the secondary site. A staging file at the secondary site accumulates the replicated LWSs (“high-frequency snapshots”). The staging file is populated with the LWSs in parallel with the replication of the base snapshot at the placeholder file. At a subsequent predetermined time interval, the accumulated LWSs are synthesized to capture a “checkpoint” snapshot by applying and pruning the accumulated LWSs at the staging file. Once the base snapshot is fully replicated, the pruned LWSs are merged to the base snapshot to synchronize the replicated failover data.

    LAZY INDEX CONSTRUCTION OF SNAPSHOTS IN A REPLICATION RECEIVER

    公开(公告)号:US20240362185A1

    公开(公告)日:2024-10-31

    申请号:US18243980

    申请日:2023-09-08

    Applicant: Nutanix, Inc.

    CPC classification number: G06F16/128 G06F16/13 G06F16/184

    Abstract: A lazy index construction technique efficiently and cost effectively manages creation and storage of an index data structure based on characteristics of storage media used by an archival storage system. The index data structure (index) is configured to reference snapshot data of snapshots stored in the archival storage system. The technique is configured to defer creation and storage of the index on the archival storage system in a lazy manner until all snapshot data is received by a replication receiver and stored on the storage media so that updates/changes to the index on the storage media are minimized. The technique may be used with any type or combination of (i) “overwrite” data structure embodied as an index (i.e., an index data structure with overwrite capabilities) stored in an (ii) archival storage system having storage media (e.g., an object store) that is not conducive to overwrite capabilities.

    GARBAGE COLLECTION FROM ARCHIVAL OF STORAGE SNAPSHOTS

    公开(公告)号:US20230079621A1

    公开(公告)日:2023-03-16

    申请号:US17514603

    申请日:2021-10-29

    Applicant: Nutanix, Inc.

    Abstract: A technique improves storage efficiency of an object store configured to maintain numerous snapshots for long-term storage in an archival storage system by efficiently determining data that is exclusively owned by an expiring snapshot to allow deletion of the expiring snapshot from the object store. The technique involves managing index data structures to enable efficient garbage collection across a very large number of data objects. When a snapshot expires, the technique obviates the need to scan the numerous snapshot data objects to determine which index structures are no longer needed and can be reclaimed (garbage collected). The technique is directed to management of underlying storage based on different sets of policies. When certain snapshots expire and are ready for deletion, the technique is directed to finding those data blocks that are no longer referenced (used) by any valid snapshots.

Patent Agency Ranking