Abstract:
System and method for supporting a partitioned switch forwarding table in a high performance computing environment. Described methods and systems can support partitioned switch forwarding tables (e.g., partitioned LFTs) by setting up hardware registers that divide the LFT into at least two partitions, a first partition that supports legacy forwarding (e.g., standard LID based forwarding without the need to use portions of the GRH), and a second partition to support the GRH based forwarding that is described above. In such a manner, switches and other hardware within a core fabric can behave as legacy nodes/switches having standard LFTs, while also being able to support the extended addressing supplied through the use of portions of the GRH.
Abstract:
A system for communicating a multi-destination packet through a network switch fabric with a plurality of input and output ports is described. This system receives the multi-destination packet at an input port, wherein the multi-destination packet includes a multicast packet or a broadcast packet that is directed to multiple output ports, and wherein the network switch fabric maintains a separate virtual output queue (VOQ) for each output port. Next, the system sends the multi-destination packet from the input port to the multiple output ports by inserting the multi-destination packet into VOQs associated with the multiple output ports. The multi-destination packet is inserted into one VOQ at a time, so that after the multi-destination packet is read out of a VOQ and is sent to a corresponding output port, the multi-destination packet is inserted in another VOQ until the multi-destination packet is sent to all of the multiple output ports.
Abstract:
Upon detecting a failure of a switch link in a set of multiple redundant switch links that directly connect two nodes in the switch fabric network, the system performs a failover operation that remaps network traffic associated with the failed switch link to an alternative switch link in the set of multiple redundant switch links. Each node in the switch fabric network maintains a mapping table that translates from logical ports and associated logical virtual lanes to physical ports and associated virtual lanes. The system also provisions switch links with private virtual lanes comprising resources to facilitate failover operations. While remapping the network traffic, the system changes the mapping table so that network traffic, which is presently directed to a physical port and associated virtual lane for the failed switch link, is remapped to an alternative physical port and an associated private virtual lane, for the alternative switch link.
Abstract:
The disclosed system handles a switch link failure in a switch fabric network. When a node in the switch fabric network detects a failure of a switch link coupled to the node, the system remaps traffic that is directed to a logical port number, which is currently mapped to a primary physical port number associated with the failed switch link, to a secondary physical port number associated with the alternative switch link. This remapping involves performing a lookup in a local mapping table at the node, wherein the local mapping table stores associations between logical port numbers and physical port numbers, wherein for each logical port number, the mapping table includes a primary physical port number and one or more secondary physical port numbers, which are associated with alternative switch links. The system notifies a subnet manager for the switch fabric network about the link failure and the remapping.
Abstract:
System and method for supporting flexible forwarding domain boundaries in a high performance computing environment. In accordance with an embodiment, flexible forwarding domain boundaries can be supported by dividing/partitioning a physical switch into two or more logical switches, where each switch is logically in a different domain, and allowing a fabric to be decomposed into independent subnets with one two or more physical end ports at the physical switch. By doing so, the same hierarchical forwarding structure and management structure between subnets can be provided as when complete physical switches are used as building blocks.
Abstract:
System and method for supporting multiple concurrent SL to VL mappings in a high performance computing environment. In accordance with an embodiment, systems and methods can provide for two or more SL to VL mappings per ingress switch port in a network switched fabric. By allowing for multiple such mappings, greater virtual lane independence can be achieved while continuing to achieve quality of service guarantees.
Abstract:
A system for communicating packets through a network switch fabric is described. At an aggregation point in the network switch fabric, the system segregates packet flows from multiple sources into a set of quality-of-service (QoS) buckets. The system also associates packet flows from the multiple sources with a global QoS bucket. The system monitors traffic rates for each QoS bucket in the set of QoS buckets and the global QoS bucket. The system determines a state for each QoS bucket by comparing a traffic rate for the QoS bucket with state-specific thresholds. The system also determines a state for the global QoS bucket by comparing a traffic rate for the global QoS bucket with state-specific global thresholds. When a packet is received for a given QoS bucket, the system performs an action based on a state of the given QoS bucket and a state of the global QoS bucket.
Abstract:
The disclosed embodiments relate to a system for communicating packets through a network switch fabric. During operation, at an aggregation point in the network switch fabric, the system segregates packet flows from multiple sources into a set of quality-of-service (QoS) buckets. Next, the system monitors traffic rates for each QoS bucket. The system then determines a state for each QoS bucket by comparing a traffic rate for the QoS bucket with one or more state-specific thresholds. When a packet is subsequently received for a given QoS bucket, the system performs an action based on a state of the given QoS bucket.
Abstract:
A system for communicating packets through a network switch fabric is described. At an aggregation point in the network switch fabric, the system segregates packet flows from multiple sources into a set of quality-of-service (QoS) buckets. The system also associates packet flows from the multiple sources with a global QoS bucket. The system monitors traffic rates for each QoS bucket in the set of QoS buckets and the global QoS bucket. The system determines a state for each QoS bucket by comparing a traffic rate for the QoS bucket with state-specific thresholds. The system also determines a state for the global QoS bucket by comparing a traffic rate for the global QoS bucket with state-specific global thresholds. When a packet is received for a given QoS bucket, the system performs an action based on a state of the given QoS bucket and a state of the global QoS bucket.
Abstract:
An input/output (I/O) switch fabric includes input physical ports that convey packets associated with at least a first network flow. Moreover, virtual routers in the I/O switch fabric, which have associated routing tables, provide types of service and/or routes for different source-destination pairs based on link-layer information and network-layer information in the packets. Note that different virtual routers can provide different types of service and/or different routes. For example, a type of service associated with a first virtual router may include changing packet headers when crossing service domains in a global network, and a type of service associated with a second virtual router may avoid changing packet headers when providing connectivity in the network. Furthermore, the I/O switch fabric includes output physical ports that convey packets associated with at least a second network flow. The virtual routers may facilitate InfiniBand inter-subnet crossing.