Method for automatically diagnosing hardware faults in a data storage system
    1.
    发明授权
    Method for automatically diagnosing hardware faults in a data storage system 有权
    自动诊断数据存储系统硬件故障的方法

    公开(公告)号:US07779306B1

    公开(公告)日:2010-08-17

    申请号:US11690147

    申请日:2007-03-23

    IPC分类号: G06F11/00

    摘要: A method for automatically diagnosing faults a data storage system. The system includes a plurality of enclosures each having: a primary port; an expansion port; a plurality of disk drives; and a link control card coupled to the primary port and to the expansion port and the plurality of disk drives. The link control card includes a cut through switch having: disk drive port error counters for counting at ports of the plurality of disk drives; a primary port error counter for counting cumulative errors at the primary port, and an expansion port error counter for counting cumulative errors at the expansion port. The primary ports and expansion ports are serially interconnected to the storage processor through a fiber channel loop. The method sequentially reads counters in each one of the enclosures to determine whether errors counted in any one of such counters exceeds a predetermined threshold over a predetermined period of time.

    摘要翻译: 一种自动诊断数据存储系统故障的方法。 该系统包括多个外壳,每个外壳具有:主端口; 一个扩展端口; 多个磁盘驱动器; 以及链接控制卡,其耦合到主端口以及扩展端口和多个盘驱动器。 链路控制卡包括切割开关,具有:用于在多个盘驱动器的端口进行计数的盘驱动器端口错误计数器; 用于计算主端口累积错误的主端口错误计数器和用于计算扩展端口累积错误的扩展端口错误计数器。 主端口和扩展端口通过光纤通道环路与存储处理器串联连接。 该方法顺序地读取每个外壳中的计数器,以确定在任何一个这样的计数器中计数的错误是否在预定时间段内超过预定阈值。

    Diagnosing hardware faults in a data storage system
    2.
    发明授权
    Diagnosing hardware faults in a data storage system 有权
    诊断数据存储系统中的硬件故障

    公开(公告)号:US08407527B1

    公开(公告)日:2013-03-26

    申请号:US12494467

    申请日:2009-06-30

    IPC分类号: G06F11/00

    CPC分类号: G06F11/076 G06F11/0727

    摘要: Hardware faults in data storage systems are diagnosed. User I/O errors are received. Disk drive port error counters, primary port error counters, and expansion port error counters are read. A user I/O error threshold is modified based on the error counter readings. Depending on the type of errors counted, the user I/O error threshold may be increased or decreased. Once a first quantity of user I/O errors exceeds the modified user I/O error threshold, a faulty component is identified.

    摘要翻译: 诊断数据存储系统中的硬件故障。 接收到用户I / O错误。 读取磁盘驱动器端口错误计数器,主端口错误计数器和扩展端口错误计数器。 基于错误计数器读数修改用户I / O错误阈值。 根据计数的错误类型,可以增加或减少用户I / O错误阈值。 一旦第一数量的用户I / O错误超出修改后的用户I / O错误阈值,就会识别故障组件。

    Managing storage stability
    3.
    发明授权
    Managing storage stability 有权
    管理存储稳定性

    公开(公告)号:US07624300B2

    公开(公告)日:2009-11-24

    申请号:US11640668

    申请日:2006-12-18

    IPC分类号: G06F11/00

    CPC分类号: G06F11/1092 G06F11/008

    摘要: Storage stability is managed. It is detected that a disk drive is requesting to be taken offline. The disk drive is begun to be treated as being in a probation state. If within an acceptable period of time the disk drive requests to be put back online, treatment of the disk drive as being in a probation state is stopped, and only any portions of the disk drive data that were the subject of write requests involving the disk drive while the disk drive was being treated as being in a probation state are rebuilt.

    摘要翻译: 管理存储稳定性。 检测到磁盘驱动器正在请求脱机。 磁盘驱动器开始被视为处于缓刑状态。 如果在可接受的时间段内磁盘驱动器请求恢复联机,则停止对磁盘驱动器处于缓刑状态的处理,并且只有磁盘驱动器数据的任何部分是涉及磁盘的写入请求的对象 磁盘驱动器被视为处于缓刑状态的驱动器被重建。

    Managing loop interface failure
    5.
    发明授权
    Managing loop interface failure 有权
    管理循环接口故障

    公开(公告)号:US07861123B1

    公开(公告)日:2010-12-28

    申请号:US12004311

    申请日:2007-12-20

    IPC分类号: G06F11/00

    CPC分类号: G06F11/221 G06F11/2089

    摘要: Loop interface failure is managed. A first device on a loop is identified as a potential cause of the loop interface failure. The loop is tested with the first device functionally removed from the loop. Depending on the results of the test, it is determined that the first device is not the cause of the loop interface failure and a second device on the loop is identified as the cause of the loop interface failure.

    摘要翻译: 环路接口故障被管理。 循环中的第一个设备被识别为环路接口故障的潜在原因。 使用从循环中功能移除的第一个设备来测试循环。 根据测试结果,确定第一个设备不是环路接口故障的原因,并且环路上的第二个设备被识别为环路接口故障的原因。

    Upgrading firmware of a power supply
    6.
    发明授权
    Upgrading firmware of a power supply 有权
    升级电源的固件

    公开(公告)号:US08782633B1

    公开(公告)日:2014-07-15

    申请号:US13238646

    申请日:2011-09-21

    IPC分类号: G06F9/44

    CPC分类号: G06F8/654

    摘要: A method, a system and a computer program product for upgrading firmware is disclosed. In one embodiment data storage is managed in a data storage system comprising a first enclosure having a first storage processor and a first power supply. A firmware upgrade is saved in the first storage processor. The firmware upgrade in the first storage processor and firmware in the first power supply are compared. The firmware upgrade is downloaded to the first power supply in response to the comparison determining a difference between the firmware upgrade in the first storage processor and the firmware in the first power supply. The firmware is upgraded in the first power supply with the firmware upgrade.

    摘要翻译: 公开了一种用于升级固件的方法,系统和计算机程序产品。 在一个实施例中,在包括具有第一存储处理器和第一电源的第一外壳的数据存储系统中管理数据存储。 固件升级保存在第一个存储处理器中。 比较第一个存储处理器中的固件升级和第一个电源中的固件。 响应于比较确定第一存储处理器中的固件升级与第一电源中的固件之间的差异,固件升级被下载到第一电源。 固件在第一个电源中进行升级,并进行固件升级。

    Managing loop interface instability
    7.
    发明授权
    Managing loop interface instability 有权
    管理循环界面不稳定

    公开(公告)号:US08161316B1

    公开(公告)日:2012-04-17

    申请号:US12241708

    申请日:2008-09-30

    IPC分类号: G06F11/00

    CPC分类号: G06F11/1092 G06F11/076

    摘要: A method is used in managing loop interface instability. It is determined that a loop has excessive intermittent failures. It is determined, based on whether the intermittent failures are detectable on another loop, whether the cause of the excessive intermittent failures is within a specific category of components. A search procedure is executed that is directed to the specific category of components, to isolate the cause of the excessive intermittent failures.

    摘要翻译: 一种方法用于管理循环接口不稳定性。 确定循环有过多的间歇性故障。 根据间歇性故障是否可以在另一回路上检测出来确定过度断续故障的原因是否在特定类别的部件内。 执行针对特定类别组件的搜索过程,以隔离过度间歇性故障的原因。

    Method for operating disk drives in a data storage system
    9.
    发明授权
    Method for operating disk drives in a data storage system 有权
    在数据存储系统中操作磁盘驱动器的方法

    公开(公告)号:US07454561B1

    公开(公告)日:2008-11-18

    申请号:US11153932

    申请日:2005-06-16

    IPC分类号: G06F13/00

    摘要: A system sets a disk access inhibitor flag whenever a disk drive is placed by the system in an inaccessible condition. The drive operates to set a bit therein when the drive has placed itself in a by-pass condition. During each polling event, the system determines: (1) whether the bit has been set; and (2) whether the disk access inhibitor flag has been set. If the bit has been set and such disk access inhibitor flag has been set, the system maintains the drive in the inaccessible condition; otherwise, the drive is accessible to the system. If, during a polling event, the bit has been set but that drive has not had a bit set during a relatively long period of time, the system maintains the drive accessible to the system unless the drive sets the bit during a subsequent predetermined wait period, after which the system sets the flagdisk access inhibitor flag and places the drive in the inaccessible condition.

    摘要翻译: 每当系统在不可访问的条件下放置磁盘驱动器时,系统设置磁盘访问抑制器标志。 当驱动器自身处于旁路状态时,变频器会将其置于其中。 在每个轮询事件期间,系统确定:(1)该位是否被设置; 和(2)磁盘访问禁止标志是否被设置。 如果该位已经设置,并且已经设置了这样的磁盘访问抑制器标志,则系统将驱动器维持在不可访问的状态; 否则,系统可以访问驱动器。 如果在轮询事件期间该位已被设置,但是该驱动器在较长的时间段内没有设置位置,则系统将该驱动器保持为可访问的驱动器,除非该驱动器在随后的预定等待期间内设置该位 之后,系统将设置flagdisk访问禁止标志,并将驱动器置于无法访问的状态。

    Managing storage stability
    10.
    发明申请
    Managing storage stability 有权
    管理存储稳定性

    公开(公告)号:US20080148094A1

    公开(公告)日:2008-06-19

    申请号:US11640668

    申请日:2006-12-18

    IPC分类号: G06F11/00

    CPC分类号: G06F11/1092 G06F11/008

    摘要: Storage stability is managed. It is detected that a disk drive is requesting to be taken offline. The disk drive is begun to be treated as being in a probation state. If within an acceptable period of time the disk drive requests to be put back online, treatment of the disk drive as being in a probation state is stopped, and only any portions of the disk drive data that were the subject of write requests involving the disk drive while the disk drive was being treated as being in a probation state are rebuilt.

    摘要翻译: 管理存储稳定性。 检测到磁盘驱动器正在请求脱机。 磁盘驱动器开始被视为处于缓刑状态。 如果在可接受的时间段内磁盘驱动器请求恢复联机,则停止对磁盘驱动器处于缓刑状态的处理,并且只有磁盘驱动器数据的任何部分是涉及磁盘的写入请求的对象 磁盘驱动器被视为处于缓刑状态的驱动器被重建。