System and method for detecting and managing HPC node failure
    2.
    发明申请
    System and method for detecting and managing HPC node failure 有权
    用于检测和管理HPC节点故障的系统和方法

    公开(公告)号:US20050246569A1

    公开(公告)日:2005-11-03

    申请号:US10826959

    申请日:2004-04-15

    IPC分类号: G06F11/00

    摘要: A method for managing HPC node failure includes determining that one of a plurality of HPC nodes has failed, with each HPC node comprising an integrated fabric. The failed node is then removed from a virtual list of HPC nodes, with the virtual list comprising one logical entry for each of the plurality of HPC nodes.

    摘要翻译: 用于管理HPC节点故障的方法包括确定多个HPC节点中的一个已经发生故障,每个HPC节点包括集成结构。 然后从HPC节点的虚拟列表中移除故障节点,虚拟列表包括用于多个HPC节点中的每一个的一个逻辑条目。

    High performance computing system and method
    4.
    发明申请
    High performance computing system and method 有权
    高性能计算系统及方法

    公开(公告)号:US20050235092A1

    公开(公告)日:2005-10-20

    申请号:US10824874

    申请日:2004-04-15

    IPC分类号: G06F9/50 G06F15/80 G06F13/00

    摘要: A High Performance Computing (HPC) node comprises a motherboard, a switch comprising eight or more ports integrated on the motherboard, and at least two processors operable to execute an HPC job, with each processor communicably coupled to the integrated switch and integrated on the motherboard.

    摘要翻译: 高性能计算(HPC)节点包括主板,包括集成在主板上的八个或更多个端口的交换机以及可操作以执行HPC作业的至少两个处理器,每个处理器可通信地耦合到集成开关并集成在主板上 。