Abstract:
Monitoring of storage systems, including virtual tape libraries (VTLs), may be performed with automated systems. Automated monitoring may be performed through an SSH connection to a storage manager for the storage system. For example, a method may include initiating, by a monitoring server, an SSH connection to a storage manager; issuing commands, by the monitoring server, through the SSH connection to monitor a storage device coupled to the storage manager; receiving, by the monitoring server, responses to the issued commands through the SSH connection; matching, by the monitoring server, the responses to automation rules; and executing, by the monitoring server, commands based on the matched automation rules.
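The issue, receive, match, and execute steps above can be sketched as a small loop. This is a minimal illustration, not the claimed implementation: the SSH session is replaced by a hypothetical `send` callable, and the command strings, the `AutomationRule` type, and the `monitor_once` function are assumptions introduced for the example.

```python
import re
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class AutomationRule:
    pattern: str  # regular expression tested against each response
    action: str   # follow-up command issued when the pattern matches


def monitor_once(send: Callable[[str], str],
                 commands: List[str],
                 rules: List[AutomationRule]) -> List[str]:
    """Issue each command, match its response to the rules, run matched actions."""
    executed = []
    for command in commands:
        response = send(command)  # in practice this would travel over the SSH connection
        for rule in rules:
            if re.search(rule.pattern, response):
                send(rule.action)
                executed.append(rule.action)
    return executed


def fake_send(command: str) -> str:
    """Stand-in for the storage manager; the replies are invented for the demo."""
    canned = {
        "show library status": "library: OK",
        "show drive status": "drive 3: OFFLINE",
    }
    return canned.get(command, "")


rules = [AutomationRule(pattern=r"OFFLINE", action="restart drive")]
executed = monitor_once(fake_send, ["show library status", "show drive status"], rules)
print(executed)
```

Passing the transport in as a callable keeps the rule-matching logic testable without a live storage manager.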
Abstract:
Read-and-reply console messages may be received and a reply automatically generated. Further, a message system may test the availability of an automation sequence for a computing environment. Test commands may be transmitted to a system to test if automation is available. If automation is available, a message system may receive a read-and-reply console message and transmit the message to a host system where an automation sequence may be performed. After the automation sequence is verified, the read-and-reply console message may be answered.
Abstract:
Systems and methods for multi-threaded server control automation for disaster recovery are described. A method may include initiating a disaster recovery sequence on two or more processors, wherein the disaster recovery sequence comprises a plurality of subsequences. The method may also include implementing the disaster recovery sequence on the two or more processors in parallel, wherein one or more subsequences of the disaster recovery sequence are implemented on the two or more processors in parallel. Upon completion of the disaster recovery sequence, at least one server partition is repurposed from a first configuration, such as a test configuration, to a second configuration, such as a production configuration.
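A rough sketch of running the recovery sequence in parallel follows. It assumes worker threads stand in for the two or more processors, and that each partition runs its subsequences in order while the partitions proceed concurrently. The step functions and partition names are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List

Step = Callable[[str], str]


def stop_test_workload(partition: str) -> str:
    # Hypothetical subsequence: shut down the test configuration.
    return f"{partition}: test workload stopped"


def load_production_image(partition: str) -> str:
    # Hypothetical subsequence: bring up the production configuration.
    return f"{partition}: production image loaded"


def run_subsequences(partition: str, steps: List[Step]) -> List[str]:
    """Run the subsequences for one partition in order."""
    return [step(partition) for step in steps]


def run_disaster_recovery(partitions: List[str],
                          steps: List[Step],
                          workers: int = 2) -> Dict[str, List[str]]:
    """Run the full sequence for every partition, one worker per partition at a time."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = {p: pool.submit(run_subsequences, p, steps) for p in partitions}
        return {p: future.result() for p, future in futures.items()}


results = run_disaster_recovery(
    ["partition-a", "partition-b"],
    [stop_test_workload, load_production_image],
)
print(results)
```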
Abstract:
Systems and methods for automated monitoring of automated server control components are provided. Embodiments may include transmitting a request for state information associated with a component of the automated server control system. At least one of a reception of the state information for the component and a threshold time period without reception of the state information for the component may be detected. Some embodiments may include determining if the received state information for the component comprises an error condition. In some instances, an alert may be output based on at least one of detecting the threshold time period without reception of the state information for the component and determining that the received state information comprises an error condition.
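The two alert conditions, a timeout without state information and a received error state, can be illustrated as follows. This sketch assumes the component's replies arrive on an in-process queue and that an error state is a string beginning with `ERROR`; both are assumptions made for the example.

```python
import queue
from typing import Optional


def check_component(responses: queue.Queue, timeout_s: float) -> Optional[str]:
    """Return an alert message, or None when the component reports a healthy state."""
    try:
        # Wait for state information, but only up to the threshold time period.
        state = responses.get(timeout=timeout_s)
    except queue.Empty:
        return f"no state information received within {timeout_s} s"
    if state.startswith("ERROR"):  # assumed error convention
        return f"component reported an error: {state}"
    return None


healthy = queue.Queue()
healthy.put("OK")

failing = queue.Queue()
failing.put("ERROR disk full")

silent = queue.Queue()  # never receives a reply

alerts = [
    check_component(healthy, 0.01),
    check_component(failing, 0.01),
    check_component(silent, 0.01),
]
print(alerts)
```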
Abstract:
A method and apparatus for sending a message from a first operator at a first workstation to a second operator at a second workstation in an operations-computing arrangement for operating one or more computer systems. The operations-computing arrangement includes the first and second workstations, which are coupled to respective operations servers. Each operations server is coupled to at least one computer system, each workstation executes software that provides an operator interface for operating a computer system, and each operations server provides an interface to a computer system. The infrastructure used to transmit system-generated event reports to operators at the workstations is used to transmit text messages between operators.
Abstract:
Method and apparatus for providing a timely, automated re-assignment of resources, such as peripheral devices, memory, and/or processing capacity, among a number of host data processing systems. In a preferred embodiment, the present invention allows peripheral devices, such as tape drives, to be configured as shareable units and accessed by any participating host data processing system as the need arises. The invention preferably includes a central coordinating facility, which evaluates the device status information gathered from each participating host data processing system. The device status information is used to determine which host data processing systems have free devices available for use. Within these constraints, the invention automatically orchestrates the re-assignment of selected peripheral devices from where they are not currently needed to where they are needed, with little or no operator interaction.
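One way to picture the central coordinating facility is as a planner that takes the free devices reported by each host and the number of devices each host still needs, then produces a list of moves. The data shapes, host names, device names, and the first-come selection policy below are all assumptions for illustration; the abstract does not specify them.

```python
from typing import Dict, List, Tuple

Move = Tuple[str, str, str]  # (device, from_host, to_host)


def plan_reassignments(free: Dict[str, List[str]],
                       needed: Dict[str, int]) -> List[Move]:
    """Plan moves of free devices to hosts that need them."""
    # Flatten the per-host status reports into one pool of (device, owner) pairs.
    pool = [(device, host) for host, devices in free.items() for device in devices]
    moves: List[Move] = []
    for to_host, count in needed.items():
        for _ in range(count):
            # Pick the first free device that is not already on the requesting host.
            index = next(
                (i for i, (dev, owner) in enumerate(pool) if owner != to_host),
                None,
            )
            if index is None:
                break  # nothing left to give this host
            device, from_host = pool.pop(index)
            moves.append((device, from_host, to_host))
    return moves


free_devices = {"host-a": ["tape0", "tape1"], "host-b": []}
devices_needed = {"host-b": 1}
moves = plan_reassignments(free_devices, devices_needed)
print(moves)
```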
Abstract:
Automation scripts may be examined and compared to a set of new or changed operating system messages to identify potential issues with the automation script that occur with updates to the operating system. When the issue is identified, the reference in the automation script may be updated to refer to the changed operating system message. Further, the reference in the automation script may be modified to address other changes in the system messages, such as changes in information and/or parameters reported in the message.
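The comparison and update step might look like the following sketch, which assumes the automation script is a list of text lines and the operating system update is summarized as a mapping from old message text to new message text. The script syntax shown is invented for the example.

```python
import re
from typing import Dict, List


def update_references(script_lines: List[str], changed: Dict[str, str]) -> List[str]:
    """Rewrite references to changed operating system messages in a script."""
    if not changed:
        return list(script_lines)
    # Match longer message texts first so a shorter one cannot shadow them.
    keys = sorted(changed, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(key) for key in keys))
    return [pattern.sub(lambda m: changed[m.group(0)], line) for line in script_lines]


script = [
    'WHEN "TAPE MOUNT REQUIRED" THEN REPLY "Y"',  # hypothetical script syntax
    'WHEN "JOB ENDED" THEN LOG',
]
changed_messages = {"TAPE MOUNT REQUIRED": "TAPE MOUNT REQUESTED"}

updated = update_references(script, changed_messages)
# Lines that differ after the update are the ones the change would have broken.
affected = [old for old, new in zip(script, updated) if old != new]
print(updated)
```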
Abstract:
In a disaster recovery context, device operation states can be automatically changed from their expected operating states for steady-state operation as soon as a disaster recovery event is triggered. Every operating device with a first expected operating state for steady-state operation may be automatically changed to a second expected operating state when a disaster recovery event is triggered. Every operating device with the second expected operating state for steady-state operation may be automatically changed to the first expected state when a disaster recovery event is triggered.
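The state swap described above amounts to exchanging two values across a table of expected states. In this sketch the state names `UP` and `DOWN`, the device names, and the rule that any other state is left unchanged are assumptions for illustration.

```python
from typing import Dict


def apply_disaster_recovery(expected: Dict[str, str],
                            first: str = "UP",
                            second: str = "DOWN") -> Dict[str, str]:
    """Swap the first and second expected states for every device."""
    swap = {first: second, second: first}
    # Devices in any other state keep their steady-state expectation (assumed).
    return {device: swap.get(state, state) for device, state in expected.items()}


steady_state = {"prod-db": "UP", "standby-db": "DOWN", "printer": "RESERVED"}
recovery_state = apply_disaster_recovery(steady_state)
print(recovery_state)
```

Applying the function twice returns the original table, which matches the symmetric wording of the abstract.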
Abstract:
Non-automated read-and-reply console messages may be automated. These messages may be classified into impact groups in which the messages may be removed from the database or sent to an automation analyzer for analysis. As more messages become automated, a debugging mode may be enabled to allow an operator to respond to a message with a proposed action. If the proposed action is aligned with an action predetermined in response to the automation analysis, the operator may be allowed to respond to future actions.
Abstract:
Data channels of a computer system may be remotely monitored to detect data channel degradation. A monitoring agent on a client may execute script commands on the remote computer system to monitor input/output (I/O) rates of a set of channels. The monitoring agent may compute an average data rate of the data channels and compare the I/O rate of each channel to the average. When the rate of a channel falls below the average by at least a threshold amount, an alert may be generated to indicate to an administrator a possible failure with the data channel.
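The comparison against the average can be sketched in a few lines. The example assumes the I/O rates have already been collected into a mapping from channel name to rate, and that the threshold is an absolute amount in the same unit as the rates; the channel names and numbers are invented.

```python
from statistics import mean
from typing import Dict, List


def degraded_channels(rates: Dict[str, float], threshold: float) -> List[str]:
    """Return channels whose rate is below the average by at least the threshold."""
    average = mean(rates.values())
    return [channel for channel, rate in rates.items() if average - rate >= threshold]


io_rates = {"ch0": 100.0, "ch1": 98.0, "ch2": 40.0}  # hypothetical rates
flagged = degraded_channels(io_rates, threshold=30.0)
print(flagged)
```

Comparing each channel to the group average, rather than to a fixed rate, lets the check adapt to overall load: when every channel slows down together, none is flagged.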