Abstract:
A system and method can implement highly available Internet Protocol (IP) based communication across multiple independent communication paths. The system can have different IP addresses associated with different interfaces and communication paths and can implement communication fail-over as part of the communication layers above the IP layer, e.g. at the application level. The system can provide a balance between an average fail-over time and implementation complexity, and can achieve simplicity and robustness while providing high communication performance.
Abstract:
Systems and methods for supporting a single logical IP subnet across multiple independent layer 2 subnets in a high performance computing environment. A method can provide, at a computer including one or more microprocessors, a logical device, the logical device being addressed by a layer 3 address, wherein the logical device comprises a plurality of network adapters, each of the network adapters comprising a physical port, and a plurality of switches. The method can arrange the plurality of switches into a plurality of discrete layer 2 subnets. The method can provide a mapping table at the logical device.
Abstract:
Systems and methods for supporting a single logical IP subnet across multiple independent layer 2 subnets in a high performance computing environment. A method can provide, at a computer including one or more microprocessors, a logical device, the logical device being addressed by a layer 3 address, wherein the logical device comprises a plurality of network adapters, each of the network adapters comprising a physical port, and a plurality of switches. The method can arrange the plurality of switches into a plurality of discrete layer 2 subnets. The method can provide a mapping table at the logical device.
Abstract:
Systems and methods for multicast send duplication instead of replication in a high performance computing environment. A method can provide a plurality of switches, a plurality of hosts, the plurality of hosts being interconnected via the plurality of switches, wherein a host of the plurality of hosts comprises a multicast sender node, the sender node comprising a system image generation module and a current message sequence module. The method can organize the plurality of switches into two rails, the two or more rails providing redundant connectivity between the plurality of hosts. The method can send two or more duplicate multicast packets on different rails. Upon a receiving node receiving at least two versions of the same multicast packet, only one will be delivered to the communication stack/clients above the layer that handles the encapsulation header.
Abstract:
Systems and methods for multicast send duplication instead of replication in a high performance computing environment. A method can provide a plurality of switches, a plurality of hosts, the plurality of hosts being interconnected via the plurality of switches, wherein a host of the plurality of hosts comprises a multicast sender node, the sender node comprising a system image generation module and a current message sequence module. The method can organize the plurality of switches into two rails, the two or more rails providing redundant connectivity between the plurality of hosts. The method can send two or more duplicate multicast packets on different rails. Upon a receiving node receiving at least two versions of the same multicast packet, only one will be delivered to the communication stack/clients above the layer that handles the encapsulation header.
Abstract:
A system and method can support security management in a network environment. A switch in the network environment includes a switch chip, which is configured with a secure management key (M_Key) prior to one or more external links becoming operational. Furthermore, a local daemon in the switch can monitor the secure M_key on the switch chip, and persistently store a current M_key used by a local subnet manager (SM). The current M_key is a state that is dynamically updated in a fabric in the network environment.
Abstract:
A system and method can ensure Internet Protocol (IP) address and node name consistency when performing remote transactions via multiple un-related IP addresses for the same remote peer. The system can ensure that all cooperating peer nodes are in full agreement of the names and IP addresses at any point in time. In particular, when network configurations can be updated dynamically, the system can ensure that such updates do not lead to inconsistent or failed transactions because a peer node has a stale view of what addresses to use. Furthermore, the peer node that initiates the transaction can verify that all the other peer nodes have exactly the same view of the overall system configuration, in order to ensure that each distributed transaction is carried out using consistent address information.
Abstract:
A system and method can support security management in a network environment. A switch in the network environment includes a switch chip, which is configured with a secure management key (M_Key) prior to one or more external links becoming operational. Furthermore, a local daemon in the switch can monitor the secure M_key on the switch chip, and persistently store a current M_key used by a local subnet manager (SM). The current M_key is a state that is dynamically updated in a fabric in the network environment.
Abstract:
A system and method can support network management in a network environment. The network environment can include a plurality of configuration daemons (CDs), wherein each CD resides on a switch in the network environment. The CD operates to receive a configuration file that includes a list of known management key (M_Key) values. Furthermore, the CD operates to store the configuration file, and make the configuration file available to a local subnet manager(SM) on the switch, wherein the local SM is associated with a currently used M_Key value. Then, the CD operates to update the local SM with a new M_Key, after receiving an instruction from a master CD that is associated with a master SM in the network environment.
Abstract:
Systems and methods for multicast send duplication instead of replication in a high performance computing environment. A method can provide a plurality of switches, a plurality of hosts, the plurality of hosts being interconnected via the plurality of switches, wherein a host of the plurality of hosts comprises a multicast sender node, the sender node comprising a system image generation module and a current message sequence module. The method can organize the plurality of switches into two rails, the two or more rails providing redundant connectivity between the plurality of hosts. The method can send two or more duplicate multicast packets on different rails. Upon a receiving node receiving at least two versions of the same multicast packet, only one will be delivered to the communication stack/clients above the layer that handles the encapsulation header.