-
公开(公告)号:US20200026625A1
公开(公告)日:2020-01-23
申请号:US16041348
申请日:2018-07-20
Applicant: Nutanix, Inc.
Inventor: Pavan Kumar KONKA , Karan GUPTA , Aashray ARORA , Deepthi SRINIVASAN
Abstract: Systems and methods for high availability computing systems. Systems and methods include disaster recovery of two-node computing clusters. A method embodiment commences upon identifying a computing cluster having two nodes, the two nodes corresponding to a first node and a second node that each send and receive heartbeat indications periodically while performing storage I/O operations. One or both of the two nodes detect a heartbeat failure between the two nodes, and in response to detecting the heartbeat failure, one or both of the nodes temporarily cease storage I/O operations. A witness node is accessed in an on-demand basis as a result of detecting the heartbeat failure. The witness performs a leadership election operation to provide a leadership lock to only one requestor. The leader then resumes storage I/O operations and performs one or more disaster remediation operations. After remediation, the computing cluster is restored to a configuration having two nodes.
-
公开(公告)号:US20240422029A1
公开(公告)日:2024-12-19
申请号:US18765837
申请日:2024-07-08
Applicant: NUTANIX, INC.
Inventor: Aashray ARORA , Aditya Vilas JALTADE , Rishi BHARDWAJ
Abstract: Various embodiments set forth a computer-readable media storing program instructions that, when executed by one or more processors, cause the processors to perform steps of maintaining, by a first node of a first cluster, a respective first open transport control protocol (TCP) connection with each of a plurality of second nodes in a second cluster; maintaining, by a third node of the first cluster, a second open TCP connection with the first node, wherein the third node is prevented from establishing a TCP connection with any of the plurality of second nodes; and sending, by the third node, a message to the second cluster by sending the message to the first node via the second open TCP connection, wherein the first node is configured to forward the message to one of the second nodes via a corresponding one of the respective first open TCP connections.
-