Abstract:
Provided is a method executed in a cluster system comprising a first computer and at least one second computer that stores data transmitted from the first computer. The method comprises the steps of: judging, based on system information, whether processing in which a failure has occurred is to be restarted by the first computer or switched over to the second computer; when it is judged that the processing is to be restarted by the first computer, receiving, by the first computer, the stored data from the second computer and re-executing the processing in which the failure has occurred; and when it is judged that the processing is to be switched over to the at least one second computer, executing, by the second computer, the processing in which the failure has occurred.
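The restart-or-switchover judgment can be illustrated with a minimal Python sketch. The abstract does not say what the system information contains, so the fields `first_computer_alive`, `restart_count`, and `max_restarts`, and the decision rule itself, are assumptions made only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class SystemInfo:
    # Hypothetical fields: the abstract only refers to "system information".
    first_computer_alive: bool
    restart_count: int
    max_restarts: int = 3


def choose_recovery(info: SystemInfo) -> str:
    """Return "restart" (first computer) or "switchover" (second computer)."""
    if info.first_computer_alive and info.restart_count < info.max_restarts:
        return "restart"
    return "switchover"


def recover(info: SystemInfo, stored_data: List[int],
            process: Callable[[List[int]], int]) -> Tuple[str, int]:
    """Run the failed processing on whichever computer the judgment selects."""
    if choose_recovery(info) == "restart":
        # The first computer lost its state, so it receives a copy of the
        # data stored on the second computer before re-executing.
        received = list(stored_data)
        return "first", process(received)
    # The second computer already holds the data and executes directly.
    return "second", process(stored_data)
```

For example, `recover(SystemInfo(True, 1), [1, 2, 3], sum)` re-executes on the first computer, while the same call with `first_computer_alive=False` runs on the second.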
Abstract:
Provided is a data replication method capable of reducing the number of communications required when a processing result of an active system is replicated to a standby system. The data replication method, in which a first computer receives a first message containing a first processing request and a plurality of second computers replicate the first message, includes the steps of: sending, by a third computer, the first message to the first computer and the second computers; sending, by each of the second computers, a message receive notification of the first message to the first computer; sending, by the first computer, after receiving the message receive notifications from the second computers, the message receive notification of the first message to the third computer; and sending, by the first computer, to the second computers, a notification indicating that the first processing request has become executable by the first computer.
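The message sequence in this abstract can be traced with a small Python simulation. It is only a sketch: the node names `"client"` (third computer) and `"active"` (first computer), the tuple layout, and the absence of any real networking are assumptions made for illustration.

```python
from typing import List, Tuple

Message = Tuple[str, str, str]  # (sender, receiver, kind)


def replicate(standbys: List[str]) -> List[Message]:
    """Return the ordered messages exchanged for one processing request."""
    trace: List[Message] = []

    # Step 1: the third computer sends the request to the first computer
    # and to every second computer.
    trace.append(("client", "active", "request"))
    for s in standbys:
        trace.append(("client", s, "request"))

    # Step 2: each second computer reports reception to the first computer.
    acked = set()
    for s in standbys:
        trace.append((s, "active", "received"))
        acked.add(s)

    # Step 3: only after all second computers have reported does the first
    # computer report reception to the third computer.
    if acked == set(standbys):
        trace.append(("active", "client", "received"))
        # Step 4: the first computer tells the second computers that the
        # request has become executable.
        for s in standbys:
            trace.append(("active", s, "executable"))
    return trace
```

With N second computers this sketch exchanges 3N + 2 messages per request, and the third computer is acknowledged only once the request is held by every replica.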
Abstract:
An application processing system including a database server and a storage device is connected with a standby application processing system, which includes a secondary database server and a secondary storage device, so that application processing can continue in the case of a fault or disaster, or during maintenance. The storage device of the application processing system issues a log disk update request to the secondary database server of the standby system to update a log disk of the standby system and thereby create a duplicate of the log disk for the DBMS of the application processing system. This allows the DBMS of the standby system to roll forward the log data in the duplicated log to update duplicated data on the secondary storage device.
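Rolling forward a duplicated log can be sketched as follows. The record layout `(lsn, key, value)` and the in-memory dictionary standing in for the secondary storage device are assumptions; the abstract does not describe the log format.

```python
from typing import Dict, List, Tuple

LogRecord = Tuple[int, str, str]  # (log sequence number, key, new value)


def roll_forward(data: Dict[str, str], log: List[LogRecord],
                 applied_upto: int = 0) -> int:
    """Apply log records newer than applied_upto to the standby copy.

    Returns the sequence number of the last record applied, so that the
    next call can resume from there when more log arrives.
    """
    for lsn, key, value in sorted(log):
        if lsn <= applied_upto:
            continue  # already reflected in the duplicated data
        data[key] = value
        applied_upto = lsn
    return applied_upto
```

Calling the function again with the returned sequence number skips records that were already applied, so repeated log disk updates do not reapply old changes.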
Abstract:
Provided is a method of changing the configuration of a shared-nothing database system without shutting down the system. The shared-nothing database system includes a plurality of database servers, a storage that provides data areas to the database servers, and a management server that has a database management system and divides a transaction to allocate pieces of the transaction to the database servers; no two of the database servers are allowed to access the same data area. The configuration changing method changes the configurations of the database servers by specifying which data area is to be reallocated, judging whether or not the specified data area can be reallocated, and reallocating the specified data area without shutting down the database management system when it is judged that the specified data area can be reallocated.
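The specify, judge, and reallocate steps can be sketched in Python as below. The class `AreaMap`, the `busy` set, and the rule that an area with in-flight work cannot be moved are hypothetical; the abstract does not state the criterion used for the judgment.

```python
from typing import Dict, Set


class AreaMap:
    """Maps each data area to exactly one owning database server.

    Storing a single owner per area keeps the shared-nothing rule
    (no two servers access the same area) true by construction.
    """

    def __init__(self, owners: Dict[str, str]) -> None:
        self.owners = dict(owners)
        self.busy: Set[str] = set()  # areas with in-flight work (assumed)

    def can_reallocate(self, area: str, target: str) -> bool:
        return (area in self.owners
                and area not in self.busy
                and self.owners[area] != target)

    def reallocate(self, area: str, target: str) -> bool:
        """Move the area online; return False if the judgment fails."""
        if not self.can_reallocate(area, target):
            return False
        self.owners[area] = target
        return True
```

A rejected request leaves the map unchanged, so the caller can retry later without stopping the database management system.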
Abstract:
Provided is a failover method for a cluster system that realizes smooth failover of guest OSs, even when there are many guest OSs, while reducing consumption of the computer resources of a server. Smooth failover is realized by preventing contention during failover even when the number of guest OSs increases. In a cluster configuration in which a slave cluster program runs in each guest OS and a master cluster program runs in the host OS, the master cluster program (510) collects and transmits the heartbeats of the slave cluster programs, so that failure monitoring requires a fixed amount of heartbeat traffic that does not depend on the number of guest OSs. Further, when the master cluster program monitors the slave cluster programs of its own computer for failures and finds that a guest OS is operating normally, the amount of heartbeat communication is reduced because no communication to the slave cluster program of the standby system is needed.
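The heartbeat aggregation can be sketched as below. The dictionary layout and the function names are assumptions; the point illustrated is that one host-level message is sent regardless of how many guest OSs exist, and that standby slave cluster programs are contacted only for failed guests.

```python
from typing import Dict, List


def build_host_heartbeat(host: str, guest_alive: Dict[str, bool]) -> dict:
    """Collapse per-guest liveness into a single host-level heartbeat."""
    return {
        "host": host,
        "guest_count": len(guest_alive),
        "failed_guests": sorted(g for g, ok in guest_alive.items() if not ok),
    }


def standby_slaves_to_notify(heartbeat: dict) -> List[str]:
    """Guests whose standby slave cluster program must be contacted.

    Healthy guests need no message to the standby slave, which is where
    the reduction in heartbeat traffic comes from in this sketch.
    """
    return list(heartbeat["failed_guests"])
```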
Abstract:
A high availability cluster computer system can realize exclusive control of a resource shared between computers and effect failover by resetting the currently active system computer if a malfunction occurs in it. If a malfunction occurs in a certain system in a cluster, another system in the cluster that has detected the malfunction issues a reset based on a priority to realize failover, in which a standby system takes over the processing of the malfunctioning system once the malfunctioning system has stopped.
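One way to read "issues a reset based on a priority" is that, among the systems that detected the malfunction, only the one with the highest priority issues the reset. The sketch below encodes that reading; the convention that a smaller number means a higher priority, and the function name, are assumptions not stated in the abstract.

```python
from typing import Dict, List, Optional


def select_reset_issuer(priorities: Dict[str, int], failed: str,
                        detectors: List[str]) -> Optional[str]:
    """Pick the single system that resets the malfunctioning one.

    Assumes a smaller number means a higher priority. Returns None when
    no healthy system has detected the malfunction.
    """
    candidates = [n for n in detectors if n != failed]
    if not candidates:
        return None
    return min(candidates, key=lambda n: (priorities[n], n))
```

Selecting exactly one issuer keeps two standby systems from resetting each other or both claiming the shared resource.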
Abstract:
Provided is a degraded operation that equalizes loads on servers to prevent performance degradation in a server system having a cluster configuration from which a node in which an error has occurred is excluded. The server system includes a plurality of DB servers that divide a transaction of database processing for execution, a storage system including a preset data area and a preset log area that are accessed by the DB servers, and a management server that manages the transactions to be allocated to the plurality of DB servers. A data area and a log area used by the DB server with the error among the plurality of DB servers are designated, and the data area accessed by the DB server with the error is recovered based on the log area accessed by that server.
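Equalizing load after a DB server is excluded can be sketched with a greedy assignment. This is an illustrative technique, not necessarily the one used in the patent: the per-area cost figures and the largest-first, least-loaded heuristic are assumptions.

```python
from typing import Dict, Tuple


def redistribute(load: Dict[str, int], failed: str,
                 failed_areas: Dict[str, int]
                 ) -> Tuple[Dict[str, str], Dict[str, int]]:
    """Assign the failed server's data areas to the surviving servers.

    load maps server -> current load; failed_areas maps area -> cost.
    Returns (area -> new owner, resulting load per surviving server).
    """
    survivors = {s: l for s, l in load.items() if s != failed}
    if not survivors:
        raise ValueError("no surviving DB server to take over")
    assignment: Dict[str, str] = {}
    # Place the most expensive areas first, each on the least-loaded server.
    for area, cost in sorted(failed_areas.items(), key=lambda kv: (-kv[1], kv[0])):
        target = min(survivors, key=lambda s: (survivors[s], s))
        assignment[area] = target
        survivors[target] += cost
    return assignment, survivors
```

In a full system each area would be recovered from the failed server's log before its new owner starts serving it; the sketch covers only the balancing step.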
Abstract:
A high availability database system subject to a failure that causes data corruption in the active server system may recover valid data from before the corruption and use the protected and recovered data to effect a failover. At time T1, when a stand-by computer system detects the occurrence of a failure, a pair split is performed on the data area to protect the data area as of time T0, just before the occurrence of the failure. Another pair split is performed on the log area, either at the time when the log area is corrupted or at a time T3 at which the failover can take effect without corrupting data, to protect the log area containing data as of a given time between time T0 and time T3. Using the data and log areas thus protected, the stand-by computer system may take over valid data and log areas as of the given time to perform a failover.