Abstract:
A computer implemented method, data processing system, and computer usable program code are provided for routing a packet in a switch system. A set of paths through the switch system to a destination address are determined. The set of paths are analyzed to identify a path with a least used link, where the least used link leads to a next stage of the switch system. The routing to the destination address is set through the path with the least used link to form a selected path in order to improve congestion control and load balancing in response to identifying the path with a least used link.
Abstract:
An I/O adapter sharing mechanism in a logically partitioned computer system allows sharing a network I/O adapter between logical partitions in an efficient manner. A first logical partition owns (or controls) the I/O adapter, and a second logical partition desires to use (or share) the I/O adapter. An I/O adapter device driver that includes a hardware interface is provided in the first logical partition. A virtual device driver is provided in the second logical partition that provides a set of functions that is at least partially defined by querying the I/O adapter device driver in the first logical partition. The I/O adapter sharing mechanism includes a transfer mechanism that allows data to be transferred directly from the virtual device driver in the second logical partition to the I/O adapter without passing through the I/O adapter device driver in the first logical partition.
Abstract:
At least one input/output (I/O) firmware partition is provided in a partitioned environment to facilitate access to I/O resources owned by the at least one I/O firmware partition. The I/O resources of an I/O firmware partition are shared by one or more other partitions of the environment, referred to as consumer partitions. The consumer partitions use the I/O firmware partition to access the I/O resources. Since the I/O firmware partitions are responsible for providing access to the I/O resources owned by those partitions, the consumer partitions are relieved of this task, reducing complexity and costs in the consumer partitions.
Abstract:
A system, method, and computer-readable medium for detecting errors on a network. According to a preferred embodiment of the present invention, a network error manager retrieves a network topology from a master subnet manager, wherein the network includes a collection of devices coupled by a first interconnect type. When a connectivity failure is detected in the first interconnect type, the network error manager receives from the master subnet manager at least one event notification via a second interconnect type. An error log analysis component identifies at least one device among the collection of devices as a possible cause of the connectivity failure in the first interconnect type. The network error manager retrieves events from at least one device among the collection of devices that can influence a state of the first interconnect type.
Abstract:
A method, system, and apparatus for synchronizing device, node, and drawer addresses between two networks within a data processing system is provided. In one embodiment, a service processor assigns a plurality of SPCN addresses to each of a plurality of devices in the data processing system. System firmware then determines the RIO addresses corresponding to the plurality of devices. If one of the SPCN addresses is not the same as the RIO address for the corresponding device, node, or drawer, then the service processor reassigns a new SPCN address to the corresponding device, node, or drawer such that the new SPCN address is identical to the RIO address for a corresponding device, node, or drawer.
Abstract:
A method, system, and product in a data processing system are disclosed for providing centralized management of a InfiniBand distributed system-area network that includes multiple end nodes. A manager application is established in one of the end nodes. An agent application is established in one or more end nodes. Each agent application is independent from the manager application. The manager application maintains a current list of active agent applications and uses the list to manage the agent applications in the end nodes.
Abstract:
To emulate multi-threaded processing in an operating system supporting only single-threaded processes and single-level interrupts, the processor timer is started with a selected time-out period during execution of a master code thread. Processing of the master code thread proceeds until the timer interrupt, at which time the operating system timer interrupt service routine (ISR) transfers execution control to a slave code thread or slave code thread component. The slave code thread or component is executed in its entirety, at which time the timer is reset and execution control is returned to the master code thread, where processing resumes at the point during which the timer interrupt was asserted. To minimize disruption of the master code thread execution, a maximum latency should be enforced on the slave code thread, which may be accomplished by breaking the slave code thread into multiple components. The timer ISR maintains an index of the predetermined starting points within the slave code thread(s) with a pointer identifying the next slave code thread component to be selected when the timer interrupt is asserted. Processing thus alternates between the master code thread and the slave code thread or components, with different slave code thread components being selected in round-robin fashion. The duty cycle between the master code thread and the slave code thread or components may be varied by selection of the time-out period and the maximum latency allowed to slave code thread processing.
Abstract:
A method, system, and product in a data processing system are disclosed for providing centralized management of an INFINIBAND distributed system-area network that includes multiple end nodes. A manager application is established in one of the end nodes. An agent application is established in one or more end nodes. Each agent application is independent from the manager application. The manager application maintains a current list of active agent applications and uses the list to manage the agent applications in the end nodes.
Abstract:
An I/O adapter sharing mechanism in a logically partitioned computer system allows sharing a network I/O adapter between logical partitions in an efficient manner. A first logical partition owns (or controls) the I/O adapter, and a second logical partition desires to use (or share) the I/O adapter. An I/O adapter device driver that includes a hardware interface is provided in the first logical partition. A virtual device driver is provided in the second logical partition that provides a set of functions that is at least partially defined by querying the I/O adapter device driver in the first logical partition. The I/O adapter sharing mechanism includes a transfer mechanism that allows data to be transferred directly from the virtual device driver in the second logical partition to the I/O adapter without passing through the I/O adapter device driver in the first logical partition.
Abstract:
A system, method, and computer program product are disclosed for executing a reliable warm reboot of one of multiple partitions included in a logically partitioned data processing system. The data processing system includes partition hardware. A request to reboot a particular partition is received within the partition where the particular partition includes multiple processors. Prior to executing the reboot request, the partition hardware is set to a predetermined state. The reboot request is then executed within the particular partition. The predetermined state is preferably achieved by resetting the partition hardware to a predetermined state.