摘要:
Provided are a method, system and program for dumping data in processing systems to a shared storage. A plurality of processing systems receive a signal indicating an event. Each of the processing systems write data used by the processing system to a shared storage device in response to receiving the signal, wherein each processing system writes the data to the shared storage device.
摘要:
A dual cluster storage server maintains track control blocks (TCBs) in a data structure to describe the data stored in cache in corresponding track images or segments. Following a cluster failure and reboot, the surviving cluster uses the TCBs to rebuild data structures such as a scatter table, which is a hash table that identifies a location of a track image, and a least recently used (LRU)/most recently used (MRU) list for the track images. This allows the cache data to be recovered. The TCBs describe whether the data in the track images is modified and valid, and describe forward and backward pointers for the data in the LRU/MRU lists. A separate non-volatile memory that is updated as the track images are updated is used to verify the integrity of the TCBs.
摘要:
Disclosed are a storage controller, and a method of operating a storage controller, for interfacing between host systems and a storage devices system. The storage controller includes a first cluster including a first processor and a first cache, and a second cluster including a second processor and a second cache. The method comprises the step of directing data from the host systems through first and second data paths to the storage system. The first processor and cache are associated with the first data path, and the second processor and cache are associated with the second data path. Under one set of conditions, the controller enters a failover mode, wherein data directed to the first data path are routed to the second data path. Under another set of conditions, the controller deconfigures the first cache without entering the failover mode.
摘要:
Disclosed are a storage controller, and a method of operating a storage controller, for interfacing between host systems and a storage devices system. The storage controller includes a first cluster including a first processor and a first cache, and a second cluster including a second processor and a second cache. The method comprises the step of directing data from the host systems through first and second data paths to the storage system. The first processor and cache are associated with the first data path, and the second processor and cache are associated with the second data path. Under one set of conditions, the controller enters a failover mode, wherein data directed to the first data path are routed to the second data path. Under another set of conditions, the controller deconfigures the first cache without entering the failover mode.
摘要:
Method for selectively clearing persistent data such as metadata in a storage subsystem. The method includes identifying one or more metadata tracks to clear, receiving a request from a client to clear the identified metadata tracks, passing the request to a storage device driver, and clearing the identified metadata tracks. The metadata tracks to be cleared may be identified by a specific storage function associated with the metadata to be cleared or by a specific storage target associated with metadata. Alternatively, both a storage function and a storage target may be specified. Preferably, the request to clear one or more identified metadata tracks may be executed without affecting unrelated storage functions. In addition, the request may be executed while preserving metadata tracks unrelated to the identified metadata tracks. Thus, execution of a request to selectively clear identified global metadata tracks in accordance with the present invention will typically not cause the loss of configuration information and will not necessitate a reboot of the system. In addition, a request as described above may be executed at any point in time while the storage system is running.
摘要:
A method, system, and article of manufacture for recovering from a track format error detected by a host computer associated with a storage controller associated with a storage disk array. The recovery method begins with saving a copy of the track format information associated with the data track that triggered the track format error in a predetermined location. Next, the track format information associated with the data track that caused the error is invalidated and the subject data is restaged. Subsequently, the restaged data is compared to the saved copy of the track format information to determine if the track format error exists with respect to the restaged data. If the track format error is detected with respect to the restaged data, the method further consists of reconstructing the data, preferably by performing a reconstruct read recovery. The reconstructed data is then compared to the saved copy of the track format information to determine if the track format error exists with respect to the reconstructed data. The method is completed by rebuilding the track format information to match the reconstructed data, writing the reconstructed data to the storage disk array, and passing the reconstructed data to the host.
摘要:
A bad track in a disk device in a data storage system is recovered by determining which cluster in the system the bad track belongs to. If the track belongs to the cluster that discovered that the track was bad, that cluster recovers the track by locating a good copy of the track, staging the good copy of the track to cache, and destaging the good copy of the track storage unit from the cache to all copies of the track, whether good or bad, in the data storage system. If the track belongs to another cluster, the cluster that discovered that the track was bad sends a message to the other cluster to inform it of the bad track, and the other cluster recovers the bad track. In another aspect, all tracks are owned by one server or server cluster, which performs the staging and destaging.
摘要:
An apparatus, system, and method are disclosed for predicting storage device failure. A technology descriptor module associates a technology descriptor with a storage device. A failure threshold module sets a predictive failure threshold for the storage device in response to the technology descriptor. In one embodiment, a workload management detection module detects workload management of the storage device. A threshold modification module may modify the predictive failure threshold in response to detecting the workload management of the storage device. A performance detection module detects a storage device error that exceeds the modified predictive failure threshold as a storage device predictive failure.
摘要:
An apparatus, system, and method are disclosed for data tracking and, in particular, for facilitating failure management within an electronic data communication system. The apparatus includes a tracking module and an error analysis module. The tracking module stores an adapter identifier in a tracking array. The adapter identifier corresponds to a source adapter from which data is received. The error analysis module determines a source of a data failure in response to recognition of the data failure. The data failure may occur on a host adapter, a device adapter, a communication fabric, a multi-processor, or another communication device. The apparatus, system, and method may be implemented in place of or in addition to hardware-assisted data integrity checking within a data storage system.
摘要:
Records such as error information records are stored across a power loss in a data storage system so that the records can be retrieved following a power loss without the use of a file management system of an operating system of the data storage system. Records are generated for system events such as errors, buffered, and stored in a raw data storage device such as a disk device without the use of a file management system. Following a power loss and subsequent restoring of power, the records are read again without the benefit of the file management system, and processed.