-
公开(公告)号:US20050268153A1
公开(公告)日:2005-12-01
申请号:US11101720
申请日:2005-04-07
IPC分类号: G06F11/00 , G06F11/07 , G06F15/177
CPC分类号: G06F11/0709 , G06F11/0793 , G06F11/1425 , G06F11/1482 , H04L43/0811 , H04L43/10
摘要: In a cluster system comprising at least two nodes connected via a communication network and having a name and a host weight assigned to it, a method is implemented comprising the steps of inspecting the communication link, determining which node has to be shut down after a failure, creating an advertisement report for the node to be shut down, sending the advertisement report to at least one node of the cluster system, calculating a delay time depending on the weight of the first node and sending the shut down command to the node for which a failure report was received. In one embodiment of the invention the advertisement reports include a master node, which allows identifying and specifying the surviving subcluster. The method will send shut down signals to those nodes of a subcluster with lower weight than the surviving subcluster. A failsafe mechanism is implemented.
摘要翻译: 在包括通过通信网络连接并具有分配给它的名称和主机权重的至少两个节点的集群系统中,实现一种方法,包括检查通信链路的步骤,确定哪个节点必须在故障之后被关闭 创建关闭节点的广告报告,将广告报告发送给集群系统的至少一个节点,根据第一个节点的权重计算延迟时间,并向其节点发送关闭命令, 收到失败报告。 在本发明的一个实施例中,广告报告包括主节点,其允许识别和指定幸存子集群。 该方法将发送关闭信号到具有比剩余子集群更低的权重的子集群的那些节点。 实施故障安全机制。