MANAGING REFERENCE SNAPSHOTS ACROSS MULTIPLE SITES FOR EFFICIENT FAILOVER/FAILBACK

    公开(公告)号:US20240394156A1

    公开(公告)日:2024-11-28

    申请号:US18797741

    申请日:2024-08-08

    Applicant: Nutanix, Inc.

    Abstract: A technique provides network efficient data failover by explicitly protecting one or more common snapshot references at sites of a multi-site data replication environment to improve granularity of control of recovery point objectives (RPO) for data across the sites. A common snapshot reference or recovery point (RP) ensures that, in the event of failure to a site, data designated for failover may be quickly protected by replicating only small incremental changes to the RP so as to maintain RPO requirements across the sites. Illustratively, the technique enhances and extends a disaster recovery (DR) application programming interface (API) protocol through an extension that defines and applies a tag to the RP, wherein the tag enables protection and/or preservation of the RP by ensuring that the sites honor the tag applied to the RP. The tag essentially functions as an advisory lock for the RP that is shared among the sites to prevent deletion of the RP at the sites throughout the duration of the lock.

    TECHNIQUE FOR EFFICIENT DATA FAILOVER IN A MULTI-SITE DATA REPLICATION ENVIRONMENT

    公开(公告)号:US20220374316A1

    公开(公告)日:2022-11-24

    申请号:US17877169

    申请日:2022-07-29

    Applicant: Nutanix, Inc.

    Abstract: A technique provides efficient data failover by creation and deployment of a protection policy that ensures maintenance of frequent common snapshots between sites of a multi-site data replication environment. A global constraint optimizer executes on a node of a cluster to create the protection policy for deployment among other nodes of clusters at the sites. Constraints such as protection rules (PRs) specifying, e.g., an amount of tolerable data loss are applied to a category of data designated for failover from a primary site over a network to a plurality of (secondary and tertiary) sites typically located at geographically separated distances. The optimizer processes the PRs to compute parameters such as frequency of snapshot generation and replication among the sites, as well as retention of the latest common snapshot maintained at each site to create a recovery point and configuration of the protection policy that reduces network traffic for efficient use of the network among the sites.

    System and method for on-demand recovery points

    公开(公告)号:US10963182B2

    公开(公告)日:2021-03-30

    申请号:US16506980

    申请日:2019-07-09

    Applicant: Nutanix, Inc.

    Abstract: An illustrative embodiment disclosed herein is an apparatus including a processor having programmed instructions to determine a frequency rate for tracking changed data of a virtual machine (VM), track the changed data at the determined frequency rate, receive a request to generate a recovery point associated with a specified time, and, responsive to receiving the request to generate the recovery point associated with the specified time, generate the recovery point.

    SYSTEM AND METHOD FOR ON-DEMAND RECOVERY POINTS

    公开(公告)号:US20200379646A1

    公开(公告)日:2020-12-03

    申请号:US16506980

    申请日:2019-07-09

    Applicant: Nutanix, Inc.

    Abstract: An illustrative embodiment disclosed herein is an apparatus including a processor having programmed instructions to determine a frequency rate for tracking changed data of a virtual machine (VM), track the changed data at the determined frequency rate, receive a request to generate a recovery point associated with a specified time, and, responsive to receiving the request to generate the recovery point associated with the specified time, generate the recovery point.

    Technique for creating an in-memory compact state of snapshot metadata

    公开(公告)号:US12189573B2

    公开(公告)日:2025-01-07

    申请号:US17376822

    申请日:2021-07-15

    Applicant: Nutanix, Inc.

    Abstract: A technique creates a compact state of snapshot metadata and associated selected snapshots that are frequently used and maintained in memory of a node of a cluster to facilitate processing of workflow operations associated with a logical entity in a disaster recovery (DR) environment. The compact state represents a minimal subset of snapshot metadata that is frequently used to perform operations in accordance with the DR workflow operations. In addition, metadata associated with the progress of the DR workflow operations processed by the node is periodically consolidated within the compact state. Illustratively, the selected frequently used snapshots of the logical entity include (i) a recently created snapshot; (ii) one or more reference snapshots; (iii) a snapshot scheduled for replication; and (iv) any snapshot that is queued for a current or future-scheduled operation. The technique is also directed to a snapshot and metadata eviction policy that is configured to evict infrequently used snapshots and snapshot metadata to improve memory space consumption of the memory.

    METERING FRAMEWORK FOR IMPROVING RESOURCE UTILIZATION FOR A DISASTER RECOVERY ENVIRONMENT

    公开(公告)号:US20230205653A1

    公开(公告)日:2023-06-29

    申请号:US17675263

    申请日:2022-02-18

    Applicant: Nutanix, Inc.

    Abstract: A framework is described that improves resource utilization during operations executing within workflows of the distributed data processing system (e.g., having a plurality of interconnected nodes) in a disaster recovery (DR) environment configured to support synchronous and asynchronous (i.e., heterogeneous) DR workflows (e.g., generating snapshots and replicating data) that include synchronous replication, asynchronous replication, nearsync (i.e., short duration snapshots of metadata) replication and migration of data objects associated with the workflows for failover (e.g., replication and/or migration) to a secondary site in the event of failure of the primary site. The framework meters (regulates) execution of the operations directed to the workloads so as to efficiently use the resources in a manner that allows timely progress (completion) of certain (e.g., high-frequency) operations and reduction in blocking (stalling) of other (e.g., low-frequency) operations by avoiding unnecessary resource hoarding/consumption and contention. Notably, the framework also provides metering and tuning of properties during execution of the workflows and maintains their state to provide for recovery.

    Technique for efficient data failover in a multi-site data replication environment

    公开(公告)号:US11436097B1

    公开(公告)日:2022-09-06

    申请号:US17322427

    申请日:2021-05-17

    Applicant: Nutanix, Inc.

    Abstract: A technique provides efficient data failover by creation and deployment of a protection policy that ensures maintenance of frequent common snapshots between sites of a multi-site data replication environment. A global constraint optimizer executes on a node of a cluster to create the protection policy for deployment among other nodes of clusters at the sites. Constraints such as protection rules (PRs) specifying, e.g., an amount of tolerable data loss are applied to a category of data designated for failover from a primary site over a network to a plurality of (secondary and tertiary) sites typically located at geographically separated distances. The optimizer processes the PRs to compute parameters such as frequency of snapshot generation and replication among the sites, as well as retention of the latest common snapshot maintained at each site to create a recovery point and configuration of the protection policy that reduces network traffic for efficient use of the network among the sites.

    System and method for efficient replication

    公开(公告)号:US11416351B2

    公开(公告)日:2022-08-16

    申请号:US17135166

    申请日:2020-12-28

    Applicant: Nutanix, Inc.

    Abstract: A system and method determining a Chain Identification Number (CID) of a source snapshot to be replicated from a source site to a target site of a virtual computing system, determining a predetermined number of potential reference snapshots based on the CID of the source snapshot, computing a closeness value between the source snapshot and each of the potential reference snapshots, and creating a list of the potential reference snapshots based on the closeness value of each of the potential reference snapshots. One snapshot from the list is selected as a reference snapshot. The source snapshot is replicated to the target site based on the reference snapshot.

    SYSTEM AND METHOD FOR EFFICIENT REPLICATION

    公开(公告)号:US20220027241A1

    公开(公告)日:2022-01-27

    申请号:US17135166

    申请日:2020-12-28

    Applicant: Nutanix, Inc.

    Abstract: A system and method determining a Chain Identification Number (CID) of a source snapshot to be replicated from a source site to a target site of a virtual computing system, determining a predetermined number of potential reference snapshots based on the CID of the source snapshot, computing a closeness value between the source snapshot and each of the potential reference snapshots, and creating a list of the potential reference snapshots based on the closeness value of each of the potential reference snapshots. One snapshot from the list is selected as a reference snapshot. The source snapshot is replicated to the target site based on the reference snapshot.

    SITE AND STORAGE TIER AWARE REFERENCE RESOLUTION

    公开(公告)号:US20240330118A1

    公开(公告)日:2024-10-03

    申请号:US18236160

    申请日:2023-08-21

    Applicant: Nutanix, Inc.

    CPC classification number: G06F11/1464 G06F11/1466

    Abstract: A site and storage tier aware technique replicates data as one or more recovery points (RPs) from a primary site to a secondary site in a multi-site data replication environment. A storage tier aware reference resolver determines (i) an amount of RP data transfer associated with the replication and (ii) location information associated with a cloud storage tier storing the RP data in an object store. The storage tier aware reference resolution aspect provides two additional factors to consider when retrieving data of a reference RP from cloud storage: (iii) the time (duration) needed to retrieve the data and (iv) the cost (financial expense) needed to retrieve the data. In addition, a site aware reference resolution aspect of the technique determines an optimal RP to use as the reference RP and involves consideration of (v) which RPs have been replicated from the primary site to the secondary site and (vi) which RPs have been retained for storage at the sites.

Patent Agency Ranking