System and method for fault tolerance in multi-node system
    Granted invention patent
    Status: lapsed

    Publication No.: US06918063B2

    Publication date: 2005-07-12

    Application No.: US10068434

    Filing date: 2002-02-04

    IPC classification: G06F11/00 H04L1/22

    Abstract: A method and system for promoting fault tolerance in a multi-node computing system. Deadlock-free message routing is provided in the presence of node and/or link faults using only two rounds, so that only two virtual channels are required to ensure deadlock freedom. A lamb set of nodes is introduced for use in message routing: each node in the lamb set is used only as a point along message routes, and not for sending or receiving messages.
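The two-round idea in the abstract can be illustrated with a minimal sketch. It is not the patented method. It assumes a 2D mesh, x-then-y dimension-order routing within each round, and a lamb set supplied by the caller; the record does not say how the lamb set is computed. The names `xy_route`, `is_fault_free`, and `two_round_route` are hypothetical, and allowing a lamb node to act as the turning point between rounds is also an assumption. Each round is tagged with its own virtual channel (0 or 1), which is why two channels suffice here: traffic only ever moves from channel 0 to channel 1, never back.

```python
from typing import List, Optional, Set, Tuple

Node = Tuple[int, int]            # (x, y) coordinate in a 2D mesh (assumed topology)
Round = Tuple[int, List[Node]]    # (virtual channel index, nodes visited in that round)


def xy_route(src: Node, dst: Node) -> List[Node]:
    """Dimension-order path: move along x first, then along y."""
    x, y = src
    path = [src]
    while x != dst[0]:
        x += 1 if dst[0] > x else -1
        path.append((x, y))
    while y != dst[1]:
        y += 1 if dst[1] > y else -1
        path.append((x, y))
    return path


def is_fault_free(path: List[Node], faulty: Set[Node]) -> bool:
    """True if no node on the path is faulty."""
    return all(node not in faulty for node in path)


def two_round_route(src: Node, dst: Node, faulty: Set[Node], lambs: Set[Node],
                    width: int, height: int) -> Optional[List[Round]]:
    """Route src -> dst in at most two rounds, one virtual channel per round.

    Lamb nodes may appear along a route but never as source or destination.
    Returns None if no one- or two-round route exists under these assumptions.
    """
    if {src, dst} & (faulty | lambs):
        raise ValueError("source and destination must be non-faulty, non-lamb nodes")

    direct = xy_route(src, dst)
    if is_fault_free(direct, faulty):
        return [(0, direct)]                      # one round is enough

    # Otherwise look for an intermediate node that splits the trip into two
    # fault-free dimension-order legs (assumption: any non-faulty node qualifies).
    for x in range(width):
        for y in range(height):
            mid = (x, y)
            if mid in faulty or mid in (src, dst):
                continue
            first, second = xy_route(src, mid), xy_route(mid, dst)
            if is_fault_free(first, faulty) and is_fault_free(second, faulty):
                return [(0, first), (1, second)]  # round 1 on VC 0, round 2 on VC 1
    return None
```

For example, on a 4x4 mesh with node (1, 0) faulty, `two_round_route((0, 0), (3, 0), {(1, 0)}, set(), 4, 4)` cannot use the direct row and instead detours through an intermediate node, with the second leg carried on virtual channel 1.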
