Fault tolerance and failover using active copy-cat
    2.
    Type: granted invention patent
    Status: in force

    Publication number: US07480827B2

    Publication date: 2009-01-20

    Application number: US11502998

    Filing date: 2006-08-11

    IPC classification: G06F11/00 G06F11/14

    Abstract: Fault tolerant operation is disclosed for a primary instance, such as a process, thread, application, processor, etc., using an active copy-cat instance, a.k.a. backup instance, that mirrors operations in the primary instance, but only after those operations have successfully completed in the primary instance. Fault tolerant logic monitors inputs and outputs of the primary instance and gates those inputs to the backup instance once a given input has been processed. The outputs of the backup instance are then compared with the outputs of the primary instance to ensure correct operation. The disclosed embodiments further relate to a fault tolerant failover mechanism allowing the backup instance to take over for the primary instance in a fault situation wherein the primary and backup instances are loosely coupled, i.e., they need not be aware that they are operating in a fault tolerant environment. As such, the primary instance need not be specifically designed or programmed to interact with the fault tolerant mechanisms. Instead, the primary instance need only be designed to adhere to specific basic operating guidelines and shut itself down when it cannot do so. By externally controlling the ability of the primary instance to successfully adhere to its operating guidelines, the fault tolerant mechanisms of the disclosed embodiments can recognize error conditions and easily fail over from the primary instance to the backup instance.
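
The gating and comparison described in the abstract can be illustrated with a minimal sketch. It is not the patent's implementation: the names `Accumulator` and `CopyCatMonitor` are hypothetical, the instances are assumed to be deterministic, and everything runs in a single process for simplicity. The monitor hands each input to the primary first, forwards it to the backup only after the primary has completed it, and then compares the two outputs.

```python
from typing import List, Tuple


class Accumulator:
    """Toy deterministic instance (hypothetical): keeps a running total."""

    def __init__(self) -> None:
        self.total = 0
        self.seen: List[int] = []

    def process(self, value: int) -> int:
        # Reject bad input before touching any state.
        if value < 0:
            raise ValueError("negative input rejected")
        self.seen.append(value)
        self.total += value
        return self.total


class CopyCatMonitor:
    """Illustrative fault tolerant logic that sits outside both instances."""

    def __init__(self, primary: Accumulator, backup: Accumulator) -> None:
        self.primary = primary
        self.backup = backup
        # Each entry is (input, primary output, backup output).
        self.mismatches: List[Tuple[int, int, int]] = []

    def submit(self, value: int) -> int:
        # 1. The primary processes the input first. If it raises, the
        #    exception propagates and the backup never sees this input.
        primary_out = self.primary.process(value)
        # 2. Only now is the input gated through to the backup.
        backup_out = self.backup.process(value)
        # 3. Compare the outputs to check for correct operation.
        if backup_out != primary_out:
            self.mismatches.append((value, primary_out, backup_out))
        return primary_out
```

Neither `Accumulator` knows about the monitor, which mirrors the loose coupling the abstract describes: an input that the primary fails to complete is simply never replayed on the backup.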


    FAULT TOLERANCE AND FAILOVER USING ACTIVE COPY-CAT
    3.
    Type: invention patent application
    Status: in force

    Publication number: US20090106328A1

    Publication date: 2009-04-23

    Application number: US12263821

    Filing date: 2008-11-03

    IPC classification: G06F17/30

    Abstract: Fault tolerant operation is disclosed for a primary instance, such as a process, thread, application, processor, etc., using an active copy-cat instance, a.k.a. backup instance, that mirrors operations in the primary instance, but only after those operations have successfully completed in the primary instance. Fault tolerant logic monitors inputs and outputs of the primary instance and gates those inputs to the backup instance once a given input has been processed. The outputs of the backup instance are then compared with the outputs of the primary instance to ensure correct operation. The disclosed embodiments further relate to a fault tolerant failover mechanism allowing the backup instance to take over for the primary instance in a fault situation wherein the primary and backup instances are loosely coupled, i.e., they need not be aware that they are operating in a fault tolerant environment. As such, the primary instance need not be specifically designed or programmed to interact with the fault tolerant mechanisms. Instead, the primary instance need only be designed to adhere to specific basic operating guidelines and shut itself down when it cannot do so. By externally controlling the ability of the primary instance to successfully adhere to its operating guidelines, the fault tolerant mechanisms of the disclosed embodiments can recognize error conditions and easily fail over from the primary instance to the backup instance.
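
The failover idea in the abstract, where the primary follows a basic operating guideline and shuts itself down when it cannot, can be sketched as follows. This is an illustration under stated assumptions, not the patented mechanism: the guideline is assumed to be "every request must be committed before it counts as processed", and `Instance`, `FailoverController`, `declare_fault` and the commit callback are hypothetical names. Mirroring of earlier requests to the backup is omitted to keep the sketch short.

```python
from typing import Callable, List, Optional, Set


class Instance:
    """Hypothetical instance that knows only its own operating guideline."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.running = True
        self.processed: List[str] = []

    def handle(self, request: str, commit: Callable[[str, str], bool]) -> bool:
        if not self.running:
            return False
        # Assumed guideline: a request must be committed. If the commit
        # is refused, the instance shuts itself down.
        if not commit(self.name, request):
            self.running = False
            return False
        self.processed.append(request)
        return True


class FailoverController:
    """Illustrative external logic that controls whether commits succeed."""

    def __init__(self, primary: Instance, backup: Instance) -> None:
        self.active: Instance = primary
        self.standby: Optional[Instance] = backup
        self.blocked: Set[str] = set()

    def _commit(self, name: str, request: str) -> bool:
        # Commits succeed unless this instance has been blocked.
        return name not in self.blocked

    def declare_fault(self) -> None:
        # Refuse future commits from the active instance, so that it can no
        # longer follow its guideline and shuts down on its next request.
        self.blocked.add(self.active.name)

    def dispatch(self, request: str) -> str:
        if self.active.handle(request, self._commit):
            return self.active.name
        if self.standby is None:
            raise RuntimeError("no healthy instance available")
        # The active instance has shut itself down: promote the standby
        # and retry the same request on it.
        self.active, self.standby = self.standby, None
        return self.dispatch(request)
```

The instances never talk to the controller directly. The controller forces a failover only by making the guideline impossible to satisfy, which is one way to read "externally controlling the ability of the primary instance to successfully adhere to its operating guidelines".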

    摘要翻译: 对于主要实例(例如进程,线程,应用程序,处理器等),使用反映主要实例中的操作的活动副本 - 实例(也称为备份实例),但仅在这些操作具有 在初级实例中成功完成。 容错逻辑监控主实例的输入和输出,并且一旦处理了给定的输入,就将这些输入输入到备份实例。 然后将备份实例的输出与主实例的输出进行比较,以确保正确的操作。 所公开的实施例还涉及容错故障转移机制,允许备份实例在主要和备份实例松耦合的故障情况下接管主要实例,即,他们不需要意识到它们在容错环境中运行 。 因此,主要实例不需要专门设计或编程以与容错机制交互。 相反,主要实例只需要设计为遵守特定的基本操作指南,并在不能这样做的时候关闭它。 通过外部控制主实例成功遵守其操作指南的能力,所公开实施例的容错机制可以识别错误状况并且容易地从主实例故障转移到备份实例。