Abstract:
A storage system can analyze the failure of one of its storage devices by examining relevant information about the device and its environment. Information about the storage device is collected in real time and stored as an ongoing process, so that some information is always available. It can include information about the storage device itself, such as input/output-related information, and information about the storage shelf where the device is located, such as the status of adjacent storage devices on the shelf. All of the relevant information is analyzed to determine the reason for the failure. Optionally, the storage system may collect and analyze additional information to help determine that reason. The analysis and supporting information can be stored in a log, presented to a storage system administrator, or both.
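The analysis described in this abstract can be illustrated with a minimal sketch. It is not the claimed implementation: `DeviceRecord`, `Shelf`, `analyze_failure`, the 100-entry history limit, and the status strings are all hypothetical, and "adjacent devices" is simplified to "every other device on the same shelf".

```python
from collections import deque
from dataclasses import dataclass, field
from typing import Deque, Dict, List


@dataclass
class DeviceRecord:
    """Continuously collected per-device data (hypothetical structure)."""
    # Bounded history, so some recent I/O information is always available.
    io_errors: Deque[str] = field(default_factory=lambda: deque(maxlen=100))


@dataclass
class Shelf:
    """Status of each device on one storage shelf: "ok" or "failed"."""
    device_status: Dict[str, str]


def analyze_failure(device_id: str, record: DeviceRecord, shelf: Shelf) -> List[str]:
    """Combine device-level and shelf-level data into a list of findings."""
    findings: List[str] = []
    if record.io_errors:
        findings.append(f"{len(record.io_errors)} recent I/O errors on {device_id}")
    # Simplification: treat every other device on the shelf as "adjacent".
    failed_neighbors = sorted(
        dev for dev, status in shelf.device_status.items()
        if dev != device_id and status == "failed"
    )
    if failed_neighbors:
        findings.append("other failed devices on shelf: " + ", ".join(failed_neighbors))
    if not findings:
        # Corresponds to the optional step of collecting additional information.
        findings.append("no cause identified from collected data; more data needed")
    return findings
```

The returned list of findings stands in for the analysis that would be written to a log or shown to an administrator.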
Abstract:
A method and system by which a storage server reads exception data from a storage controller. An exception event is detected at the storage server. The storage server registers with the controller, so that only one storage server can read the exception data at a time. If registration succeeds, the storage server reads the exception data. If registration fails, the controller keeps track of which storage servers have not yet read the exception data, to ensure that every storage server can eventually read it.
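The registration scheme in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the class `ExceptionDataController` and its methods `register`, `read`, `release`, and `all_servers_have_read` are hypothetical names, and the abstract does not say how a server gives up its registration, so the explicit `release` step is an assumption.

```python
from typing import Iterable, Optional, Set


class ExceptionDataController:
    """Hypothetical controller that serializes reads of exception data."""

    def __init__(self, server_ids: Iterable[str], exception_data: bytes) -> None:
        self.exception_data = exception_data
        # Servers that have not yet read the exception data.
        self.pending: Set[str] = set(server_ids)
        # The one server currently registered to read, if any.
        self.reader: Optional[str] = None

    def register(self, server_id: str) -> bool:
        """Grant exclusive read access if no other server holds it."""
        if self.reader is None:
            self.reader = server_id
            return True
        # Registration failed: remember that this server still has to read.
        self.pending.add(server_id)
        return False

    def read(self, server_id: str) -> bytes:
        """Return the exception data to the registered server only."""
        if self.reader != server_id:
            raise PermissionError(f"{server_id} is not registered")
        return self.exception_data

    def release(self, server_id: str) -> None:
        """Assumed step: the reader finishes and frees the registration."""
        if self.reader == server_id:
            self.pending.discard(server_id)
            self.reader = None

    def all_servers_have_read(self) -> bool:
        return not self.pending
```

In use, a server that fails to register stays in `pending` and can retry after the current reader calls `release`, which is one way to ensure that every server eventually reads the data.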