摘要:
A method and system for handling errors on an Input/Output (I/O) link of a system is provided. The link is being shared by a plurality of devices of the system; each device may be shared by one or more operating systems. The method of error handling during configure of the system includes steps of checking a device for an error during configuration of the system and restricting the device from being configured, if an error is detected. The method includes the steps of examining the device capabilities to recover from errors, and; configuring the device in a first mode if the device has capabilities to recover from errors, else configuring the device in a second mode or not configuring the device. Further the handling of error may also be performed during runtime of the system, for doing so the step of checking is followed by a step of isolating and deleting the device if value of a number indicative of errors encountered for the device is greater than a threshold value. Else the method provides step of initiating a recovery process for the device and reporting result of the recovery process.
摘要:
A method and system for handling errors on an Input/Output (I/O) link of a system is provided. The link is being shared by a plurality of devices of the system; each device may be shared by one or more operating systems. The method of error handling during configure of the system includes steps of checking a device for an error during configuration of the system and restricting the device from being configured, if an error is detected. The method includes the steps of examining the device capabilities to recover from errors, and; configuring the device in a first mode if the device has capabilities to recover from errors, else configuring the device in a second mode or not configuring the device. Further the handling of error may also be performed during runtime of the system, for doing so the step of checking is followed by a step of isolating and deleting the device if value of a number indicative of errors encountered for the device is greater than a threshold value. Else the method provides step of initiating a recovery process for the device and reporting result of the recovery process.
摘要:
A method for determining the criticality of a device in a multi-path computer configuration comprising the steps of: traversing a directed acyclic graph representing a platform hierarchy; and determining paths within the directed acyclic graph affected by the removal of the device. A computer system comprising a directed acyclic graph data structure representing a platform hierarchy; and a control arrangement for traversing the directed acyclic graph to determining paths therein affected by removal of a devices
摘要:
A method is provided of managing hardware triggered hotplug operations of one or more input/output (I/O) cards of a computer system.The method comprises receiving hardware triggers, each of which relates to a hotplug operation to be carried out on an I/O card associated with a card slot, placing the hardware triggers in a queue, and processing the queue of hardware triggers. The method further comprises processing one or more of said hardware triggers. This comprises analysing a hardware trigger to determine the card slot to which said hardware trigger relates, and consulting a hotplug operation policy to determine whether hotplug operations are enabled for said card slot. If hotplug operations are not enabled for said card slot, this further comprises ignoring said hardware trigger, and if hotplug operations are enabled for said card slot, this further comprise querying said slot to determine whether it contains a card. If said card slot does not contain a card, this further comprises ignoring said hardware trigger, and if said card slot does contain a card, determining the nature of the hotplug operation of said hardware trigger. If said hotplug operation is a delete card hotplug operation or a suspend card hotplug operation, this further comprises determining whether said card is essential or non-essential to said computer system, and ignoring said delete card hotplug operation or said suspend card hotplug operation when said card is essential, and performing said delete card hotplug operation or said suspend card hotplug operation when said card is non-essential. If said hotplug operation is an add card hotplug operation or a resume card hotplug operation, this further comprises performing said add card hotplug operation or said resume card hotplug operation.
摘要:
A method for determining the criticality of a device in a multi-path computer configuration comprising the steps of: traversing a directed acyclic graph representing a platform hierarchy; and determining paths within the directed acyclic graph affected by the removal of the device. A computer system comprising a directed acyclic graph data structure representing a platform hierarchy; and a control arrangement for traversing the directed acyclic graph to determining paths therein affected by removal of a device.