Scalable leadership election in a multi-processing computing environment

    公开(公告)号:US11888599B2

    公开(公告)日:2024-01-30

    申请号:US17646480

    申请日:2021-12-29

    Applicant: Nutanix, Inc.

    CPC classification number: H04L47/70 H04L67/01 H04L67/10

    Abstract: A scalable multi-cluster resource sharing facility. A global witness process runs on a first computing platform that communicates over one or more networks to any number of nodes situated over two or more clusters. The global witness process listens on the network for occurrences of leadership and/or resource requests from nodes of different clusters. The global witness processes a request by retrieving a resource request and a respective last known state value, comparing the last known state value to a global stored state value, then storing a new state value when the respective last known state value is equal to the stored state value. Any number of contemporaneous requests can be processed by the global witness process, however only one request can be granted. The other requestors each receive a rejection of their resource request when their proffered last known state value is not equal to the stored state value.

    Efficient metadata management
    2.
    发明授权

    公开(公告)号:US11734040B2

    公开(公告)日:2023-08-22

    申请号:US17093462

    申请日:2020-11-09

    Applicant: Nutanix, Inc.

    CPC classification number: G06F9/45558 G06F2009/45583 H04L67/1097

    Abstract: Systems for high-performance distributed computing. The systems include techniques for managing data and metadata across multiple nodes. A method embodiment commences by storing data at a node using a first storage mechanism that is local to the node. A first set of metadata is configured to identify a storage location for the stored data. The first set of metadata is stored using the same first storage mechanism that is local to the node. For accessing the first set of metadata, a second set of metadata is configured to identify a storage location for the first set of metadata. The second set of metadata is stored using a second storage mechanism that comprises a distributed metadata storage facility that stores metadata across multiple storage locations having at least one of the multiple storage locations that is not local to the node that stores data and metadata using the first storage mechanism.

    Managing incremental snapshots for fast leader node bring-up

    公开(公告)号:US11768809B2

    公开(公告)日:2023-09-26

    申请号:US16947444

    申请日:2020-07-31

    Applicant: Nutanix, Inc.

    CPC classification number: G06F16/182 G06F16/128 G06F16/14

    Abstract: Methods, systems and computer program products for computer system data management. A leader node that is in a leader-follower relationship with a follower node is deployed to manage metadata across a plurality of interconnected computing nodes of the computer system. Upon a failure of any sort that results in a loss of operation or a disconnection of the leader node, the followers undertake steps to bring up a new leader. Such steps to bring up a new leader include electing a new leader to replace the lost or disconnected leader and initializing the data state of the new leader node by causing the leader to request incremental snapshots from followers. Additional metadata that might have changed since closing the incremental snapshots can also be retrieved from the followers. The metadata that might have changed since the time of closing the incremental snapshot is subjected to verification using a consensus algorithm.

    Scalable leadership election in a multi-processing computing environment

    公开(公告)号:US11218418B2

    公开(公告)日:2022-01-04

    申请号:US15160347

    申请日:2016-05-20

    Applicant: Nutanix, Inc.

    Abstract: A scalable multi-cluster resource sharing facility. A global witness process runs on a first computing platform that communicates over one or more networks to any number of nodes situated over two or more clusters. The global witness process listens on the network for occurrences of leadership and/or resource requests from nodes of different clusters. The global witness processes a request by retrieving a resource request and a respective last known state value, comparing the last known state value to a global stored state value, then storing a new state value when the respective last known state value is equal to the stored state value. Any number of contemporaneous requests can be processed by the global witness process, however only one request can be granted. The other requestors each receive a rejection of their resource request when their proffered last known state value is not equal to the stored state value.

    Efficient metadata management
    6.
    发明授权

    公开(公告)号:US10831521B2

    公开(公告)日:2020-11-10

    申请号:US15965656

    申请日:2018-04-27

    Applicant: Nutanix, Inc.

    Abstract: Systems for high-performance distributed computing. The systems include techniques for managing data and metadata across multiple nodes. A method embodiment commences by storing data at a node using a first storage mechanism that is local to the node. A first set of metadata is configured to identify a storage location for the stored data. The first set of metadata is stored using the same first storage mechanism that is local to the node. For accessing the first set of metadata, a second set of metadata is configured to identify a storage location for the first set of metadata. The second set of metadata is stored using a second storage mechanism that comprises a distributed metadata storage facility that stores metadata across multiple storage locations having at least one of the multiple storage locations that is not local to the node that stores data and metadata using the first storage mechanism.

    Method and apparatus for data driven and cluster specific version/update control

    公开(公告)号:US10824412B2

    公开(公告)日:2020-11-03

    申请号:US15964928

    申请日:2018-04-27

    Applicant: Nutanix, Inc

    Abstract: A method and apparatus for data driven and cluster specific version/update control. The apparatus includes an automated multi-clusters management apparatus that interfaces with a plurality of remote clusters to provide data driven version/update control on a cluster by cluster basis. Generally, operation includes collection/identification of cluster specific data pertaining to software, hardware, and cluster requirements. The cluster specific data is later compared/analyzed against multi-cluster data pertaining to software releases, hardware characteristics, and known bugs/issues for each. The results of the comparison/analysis can then be ranked according to various metrics to different possible solutions and to differentiate the less desirable results from the more desirable results. Thus, the automated multi-cluster management apparatus provides for selection of versions/updates that is dependent on the cluster specific data. Additionally, the present disclosure provides for scheduling and distribution planning for selected versions/updates.

    Low latency access to physical storage locations by implementing multiple levels of metadata

    公开(公告)号:US11562091B2

    公开(公告)日:2023-01-24

    申请号:US17099763

    申请日:2020-11-16

    Applicant: Nutanix, Inc.

    Abstract: Systems for low-latency data access in distributed computing systems. A method embodiment commences upon generating a first storage area in local storage of a first computing node. Access to the first storage area is provided through the first computing node. A second storage area is generated wherein the second storage area comprises a first set of metadata that comprises local storage device locations of at least some of the local storage areas of the first storage area. A set of physical access locations of the second storage area is stored to a database that manages updates to the second set of metadata pertaining to the second storage area. Accesses to the first storage area are accomplished by querying the database to retrieve a location of the second set of metadata, and then accessing the first storage area through one or more additional levels of metadata that are node-wise collocated.

    Two node clusters recovery on a failure

    公开(公告)号:US11194680B2

    公开(公告)日:2021-12-07

    申请号:US16041348

    申请日:2018-07-20

    Applicant: Nutanix, Inc.

    Abstract: Systems and methods for high availability computing systems. Systems and methods include disaster recovery of two-node computing clusters. A method embodiment commences upon identifying a computing cluster having two nodes, the two nodes corresponding to a first node and a second node that each send and receive heartbeat indications periodically while performing storage I/O operations. One or both of the two nodes detect a heartbeat failure between the two nodes, and in response to detecting the heartbeat failure, one or both of the nodes temporarily cease storage I/O operations. A witness node is accessed in an on-demand basis as a result of detecting the heartbeat failure. The witness performs a leadership election operation to provide a leadership lock to only one requestor. The leader then resumes storage I/O operations and performs one or more disaster remediation operations. After remediation, the computing cluster is restored to a configuration having two nodes.

    MANAGING INCREMENTAL SNAPSHOTS FOR FAST LEADER NODE BRING-UP

    公开(公告)号:US20210349858A1

    公开(公告)日:2021-11-11

    申请号:US16947444

    申请日:2020-07-31

    Applicant: Nutanix, Inc.

    Abstract: Methods, systems and computer program products for computer system data management. A leader node that is in a leader-follower relationship with a follower node is deployed to manage metadata across a plurality of interconnected computing nodes of the computer system. Upon a failure of any sort that results in a loss of operation or a disconnection of the leader node, the followers undertake steps to bring up a new leader. Such steps to bring up a new leader include electing a new leader to replace the lost or disconnected leader and initializing the data state of the new leader node by causing the leader to request incremental snapshots from followers. Additional metadata that might have changed since closing the incremental snapshots can also be retrieved from the followers. The metadata that might have changed since the time of closing the incremental snapshot is subjected to verification using a consensus algorithm.

Patent Agency Ranking