-
1.
公开(公告)号:US5883939A
公开(公告)日:1999-03-16
申请号:US705423
申请日:1996-08-29
申请人: Roy Friedman , Kenneth P. Birman
发明人: Roy Friedman , Kenneth P. Birman
CPC分类号: H04M3/4228 , H04M3/2254 , H04Q3/0029
摘要: Group communication technology, such as the Horus process, is used to implement a fault-tolerant high performance, reactive, real-time distributed IN coprocessor. The architecture of the distributed IN coprocessor comprises workstation clusters, external adaptors and an update interface interconnected by high speed communication links. Each workstation of the IN architecture represents a query element, so all the databases used by the IN coprocessor in the course of servicing incoming requests are split between query elements, provided that each of the workstations has access to the information stored in a certain database or databases. Group communication systems provide necessary features for managing and obtaining high reliability and operability of the IN coprocessor including failure detection, notification of other members of the system about the failures, reconfiguration of the system to exclude failed members, bringing back into the system the members that have been recovered, and updating the recovered or new members according to the new state of the system.
摘要翻译: 组通信技术(如Horus过程)用于实现容错的高性能,无功,实时分布式IN协处理器。 分布式IN协处理器的架构包括工作站集群,外部适配器和通过高速通信链路互连的更新接口。 IN架构的每个工作站都代表一个查询元素,因此IN协处理器在处理传入请求的过程中所使用的所有数据库都会在查询元素之间分开,前提是每个工作站都可以访问存储在特定数据库中的信息, 数据库。 组通信系统提供管理和获得IN协处理器的高可靠性和可操作性的必要特征,包括故障检测,系统其他成员关于故障的通知,重新配置系统以排除失败的成员,将系统中的成员 已经恢复,并根据系统的新状态更新已恢复的或新的成员。
-
公开(公告)号:US06393581B1
公开(公告)日:2002-05-21
申请号:US09073381
申请日:1998-05-06
IPC分类号: G06F1116
CPC分类号: H04M3/2254 , G01B17/00 , G01B21/24 , H04M3/4228 , H04Q3/0029 , Y10S707/99953
摘要: Apparatus and method of cluster computing are described. The present invention provides a useful compromise between the manageability, power, and ease of use of centralized systems and the reliability, fault-tolerance, upgradability, and scalability of distributed systems. Moreover, the present invention provides fault-tolerance and security while adhering to real-time to respond constraints or bounds. The invention is described in preferred embodiment examples in the context of two clustered applications: a telecommunication switch-controller and a Web servers, although many practical applications will benefit from the present invention.
摘要翻译: 描述了集群计算的装置和方法。 本发明提供了集中式系统的可管理性,功率和易用性以及分布式系统的可靠性,容错性,可升级性和可扩展性之间的有益的折中。 此外,本发明提供容错和安全性,同时坚持实时响应约束或界限。 在两个集群应用的上下文中,在优选实施例的示例中描述了本发明:电信交换机控制器和Web服务器,尽管许多实际应用将受益于本发明。
-
公开(公告)号:US5968185A
公开(公告)日:1999-10-19
申请号:US116770
申请日:1998-07-16
申请人: Thomas C. Bressoud , John E. Ahern , Kenneth P. Birman , Robert C. B. Cooper , Bradford B. Glade , Fred B. Schneider , John D. Service
发明人: Thomas C. Bressoud , John E. Ahern , Kenneth P. Birman , Robert C. B. Cooper , Bradford B. Glade , Fred B. Schneider , John D. Service
CPC分类号: G06F11/2097 , G06F11/1482 , G06F11/2023
摘要: In a fault-tolerant computer system, a primary replica supervisor is interposed between an operating system and a primary replica of an application program being executed by a primary processor. An object-code editor locates calls to the operating system and loops in the application program and inserts instruction sequences that enable the replica supervisor to intercept the calls to the operating system, results returned by the operating system as a result of the calls and asynchronous events delivered by the operating system to the replica. A backup replica supervisor is similarly interposed between an operating system and a backup replica of the application program being executed by a backup processor. The primary replica interacts with an environment. The replica supervisors ensure that the backup replica undergoes state transformations, as a result of the calls to the operating system and asynchronous events, that are equivalent to state transformations that the primary replica undergoes as a result of corresponding calls and asynchronous events. Thus, after a failure in the primary processor, the backup replica can interact with the environment in a manner consistent with interactions between the primary replica and the environment prior to the failure.
摘要翻译: 在容错计算机系统中,主复制主管介于操作系统和由主处理器执行的应用程序的主副本之间。 对象代码编辑器定位到操作系统的调用并在应用程序中循环,并插入指令序列,使得副本管理器能够拦截对操作系统的调用,由于调用和异步事件,操作系统返回的结果 由操作系统交付给副本。 备份副本管理程序类似地插入在由备份处理器执行的应用程序的操作系统和备份副本之间。 主要副本与环境交互。 由于对操作系统和异步事件的调用,副本监视器确保备份副本经历状态转换,这些转换等效于主副本作为相应调用和异步事件的结果进行的状态转换。 因此,在主处理器发生故障之后,备份副本可以以与故障之前主要副本和环境之间的交互一致的方式与环境进行交互。
-
公开(公告)号:US5802265A
公开(公告)日:1998-09-01
申请号:US565145
申请日:1995-12-01
申请人: Thomas C. Bressoud , John E. Ahern , Kenneth P. Birman , Robert C. B. Cooper , Bradford B. Glade , Fred B. Schneider , John D. Service
发明人: Thomas C. Bressoud , John E. Ahern , Kenneth P. Birman , Robert C. B. Cooper , Bradford B. Glade , Fred B. Schneider , John D. Service
CPC分类号: G06F11/2097 , G06F11/1482 , G06F11/2023
摘要: In a fault-tolerant computer system, a primary replica supervisor is interposed between an operating system and a primary replica of an application program being executed by a primary processor. An object-code editor locates calls to the operating system and loops in the application program and inserts instruction sequences that enable the replica supervisor to intercept the calls to the operating system, results returned by the operating system as a result of the calls and asynchronous events delivered by the operating system to the replica. A backup replica supervisor is similarly interposed between an operating system and a backup replica of the application program being executed by a backup processor. The primary replica interacts with an environment. The replica supervisors ensure that the backup replica undergoes state transformations, as a result of the calls to the operating system and asynchronous events, that are equivalent to state transformations that the primary replica undergoes as a result of corresponding calls and asynchronous events. Thus, after a failure in the primary processor, the backup replica can interact with the environment in a manner consistent with interactions between the primary replica and the environment prior to the failure.
-
-
-