摘要:
A method for performing wellness checking on a plurality of distributed networks of independent subsystems, the plurality of distributed networks including a plurality of first nodes and a plurality of second nodes, the method comprising: allowing initialization of a wellness check on the plurality second nodes; allowing each of the plurality of first nodes to send a request to corresponding plurality of second nodes; commencing a first wellness check for checking a first wellness status of each of the plurality of second nodes; checking for the physical network connection of each of the plurality of second nodes; sending wellness status with a determined severity level of each of the plurality of second nodes to corresponding plurality of first nodes; establishing errors of each of the plurality of second nodes; commencing a second wellness check for re-checking a second wellness status of each of the plurality of second nodes with the established errors; sending a notification identifying the established errors; and scheduling a third wellness check for re-checking a third wellness status of each of the plurality of second nodes after a predetermined period of time.
摘要:
A database of action plans carried out by a service provider is provided that stores the action plan as a series of action codes as well as the associated information such as error code, error type and whether the action plan resolved the problem. When an error occurs and is reported automatically, the database is searched for the error that occurred. Action plans as well as success rates are collected with most probable solutions being presented first. Each action code in the action plan corresponds to a particular point in maintenance documentation that is stored, e.g., on a management console, at the customer location. After reporting the error, the management console receives action plans for the error based on actual service reports as well as action plans suggested by documentation. When a service representative accesses the management console for information about the error, appropriate documentation is presented for each step in the action plan, allowing the service representative to follow along the suggested action plans and associated maintenance documentation onsite.
摘要:
A system for continuous health monitoring includes a computer system including a locking mechanism configured to allow multiple health point checks to be accessed simultaneously, a plurality of component health point checks configured to monitor at least one component of the system and configured to store health monitoring statistics in the computer system, and a scheduler configured to periodically enable the plurality of component health point checks based on one of a user request and a predefined amount of time.
摘要:
According to one embodiment, a method for handling errors includes detecting that an error has occurred in a machine in a system, determining an error ID associated with the error, determining that the error should be reported to a host system for further analysis based on the error ID, determining that one or more actions should be taken by the system based on the error ID, and sending information corresponding to the error to the host system after it is determined that the error should be reported to the host system. In another embodiment, a system includes a processor and logic for performing the method. According to another embodiment, a computer program product includes a computer readable storage medium having computer readable code embodied therewith for performing the method. Other methods, systems, and computer program products are also described according to more embodiments.
摘要:
According to one embodiment, a method for handling errors includes detecting that an error has occurred in a machine in a system, determining an error ID associated with the error, determining that the error should be reported to a host system for further analysis based on the error ID, determining that one or more actions should be taken by the system based on the error ID, and sending information corresponding to the error to the host system after it is determined that the error should be reported to the host system. In another embodiment, a system includes a processor and logic for performing the method. According to another embodiment, a computer program product includes a computer readable storage medium having computer readable code embodied therewith for performing the method. Other methods, systems, and computer program products are also described according to more embodiments.
摘要:
An apparatus, system, and method are disclosed for automating adapter replacement. An input module receives an identification of a first adapter. A storage configuration module removes the first adapter from a storage subsystem configuration. A host configuration module adds the second adapter to the host operating system configuration. In one embodiment, a failover configuration module removes the first adapter from a failover driver configuration. The host configuration module adds the second adapter to the host operating system configuration. The storage configuration module adds the second adapter to the storage subsystem configuration. Further the failover configuration module may add the second adapter to the failover driver configuration. The identifier module retrieves an identifier from the second adapter in the form of a world-wide port number from the second adapter.
摘要:
An apparatus, system, and method are disclosed for automating adapter replacement. An input module receives an identification of a first adapter. A storage configuration module removes the first adapter from a storage subsystem configuration. A host configuration module adds the second adapter to the host operating system configuration. In one embodiment, a failover configuration module removes the first adapter from a failover driver configuration. The host configuration module adds the second adapter to the host operating system configuration. The storage configuration module adds the second adapter to the storage subsystem configuration. Further the failover configuration module may add the second adapter to the failover driver configuration. The identifier module retrieves an identifier from the second adapter in the form of a world-wide port number from the second adapter.