摘要:
A method, system and computer program product for remotely debugging a malfunctioning node controller of a node in a distributed node network through a functioning node controller of the same node. The method comprises establishing a serial link between the malfunctioning node controller and a functioning node controller and configuring the functioning node controller as a virtual console by the remotely-located central data processing system (DPS). The method further includes receiving, via an internal Fru Support Interface (FSI) link, serial data from the malfunctioning node controller through the virtual console, and debugging, by the DPS, a failure condition of the malfunctioning node controller, in response to receipt of the serial data through the virtual console.
摘要:
A computer implemented method, a tangible computer readable medium, and a data processing system intelligently propagate link status information received by a blade server to the various ports of an embedded multi-port switch. The link status of a switch port in an external switch module can be communicated to the operating systems of individual blade servers that are affected by that link status. When an external switch module is unplugged from a server blade chassis, the bus controller broadcasts a link down event, such as a link down interrupt, to the individual server blades where it is received by the embedded multi-port switch for those server blades. The embedded multi-port switch translates the link down interrupt into a hardware link down event, and forwards the hardware link down event to the other elements connected to the embedded multi-port switch.
摘要:
A computer implemented method, a tangible computer medium, and a data processing system are provided for waking a blade server from an operational state of reduced power. When server blade enters the state of reduced power, a service firmware configures a multi-port blade switch of the server blade to direct incoming packets to the service firmware. The service firmware then polls for receipt of a Wake-on-Lan magic packet. When the Wake-on-Lan magic packet is received by the service firmware, the service firmware reconfigures the multi-port blade switch to direct incoming packets to a network interface card of the server blade. The service firmware then initiates a reboot of the server blade.
摘要:
A computer implemented method, a tangible computer medium, and a data processing system are provided for waking a blade server from an operational state of reduced power. When server blade enters the state of reduced power, a service firmware configures a multi-port blade switch of the server blade to direct incoming packets to the service firmware. The service firmware then polls for receipt of a Wake-on-Lan magic packet. When the Wake-on-Lan magic packet is received by the service firmware, the service firmware reconfigures the multi-port blade switch to direct incoming packets to a network interface card of the server blade. The service firmware then initiates a reboot of the server blade.
摘要:
A computer implemented method, a tangible computer readable medium, and a data processing system intelligently propagate link status information received by a blade server to the various ports of an embedded multi-port switch. The link status of a switch port in an external switch module can be communicated to the operating systems of individual blade servers that are affected by that link status. When an external switch module is unplugged from a server blade chassis, the bus controller broadcasts a link down event, such as a link down interrupt, to the individual server blades where it is received by the embedded multi-port switch for those server blades. The embedded multi-port switch translates the link down interrupt into a hardware link down event, and forwards the hardware link down event to the other elements connected to the embedded multi-port switch.
摘要:
A method, system and computer program product for remotely debugging a malfunctioning node controller of a node in a distributed node network through a functioning node controller of the same node. The method comprises establishing a serial link between the malfunctioning node controller and a functioning node controller and configuring the functioning node controller as a virtual console by the remotely-located central data processing system (DPS). The method further includes receiving, via an internal Fru Support Interface (FSI) link, serial data from the malfunctioning node controller through the virtual console, and debugging, by the DPS, a failure condition of the malfunctioning node controller, in response to receipt of the serial data through the virtual console.
摘要:
A method for enabling a Node Controller (NC), which claims a duplicate or invalid service processor Node Controller Identification (NCID) in a distributed service processor system, to be integrated into the system includes reading an NCID by the NC after the NC is booted, saving the NCID into a non-volatile storage and broadcasting an NC Present Message (NPM) to a System Controller (SC) repeatedly until the SC initiates communication, updating the NCID for the NC in the non-volatile storage when the NC receives an NCID change message from the SC and rating any future NPM as a new NCID, and checking a record of a new NC when the SC receives the NPM from the NC. If the SC has a record of a recorded NC with the same NCID as the new NC, then the SC checks its role as a primary SC. If the SC does not have the record of the recorded NC with the same NCID as the new NC, then the SC checks validity of the NCID.
摘要:
A method for enabling a Node Controller (NC), which claims a duplicate or invalid service processor Node Controller Identification (NCID) in a distributed service processor system, to be integrated into the system includes reading an NCID by the NC after the NC is booted, saving the NCID into a non-volatile storage and broadcasting an NC Present Message (NPM) to a Service Processor (SC) repeatedly until the SC initiates communication, updating the NCID for the NC in the non-volatile storage when the NC receives an NCID change message from the SC and rating any future NPM as a new NCID, and checking a record of an new NC in the non-volatile storage when the SC receives the NPM from the NC. If the SC has a record of a recorded NC with the same NCID as the new NC, then the SC checks its role as a primary SC. If the SC does not have the record of the recorded NC with the same NCID as the new NC, then the SC checks validity of the NCID.
摘要:
One embodiment for dynamically loading a service onto a node includes requesting, by a service requester, of a service from a service broker. Further, the method includes selecting, by the service broker, of the node among the nodes. Further still, the method includes re-booting, by the service broker, of the node based on the selecting, and then requesting, by the node, of an image from a TFTP relay agent for the service for the node. Yet further, the method includes querying, by the TFTP relay agent, for a type of the service from the service broker for the node, and then downloading of the image for the type of the service from a software repository accessed by the TFTP relay agent to the node. Finally, the method includes booting, by the node, to the image, informing the service requester that the service is ready for use, and using the service.
摘要:
A mechanism, in a data processing system, is provided for logical partition defragmentation. The mechanism gathers resource requirements for a plurality of logical partitions running in a plurality of power domains within one or more servers. The mechanism determines optimal hardware utilization for the plurality of logical partitions. The mechanism migrates one or more of the plurality of logical partitions to run in a subset of the plurality of power domains such that at least one power domain within the plurality of power domains is unused. The mechanism puts the at least one unused power domain in a low power state.