-
Publication No.: US07165192B1
Publication date: 2007-01-16
Application No.: US10741399
Filing date: 2003-12-19
Applicant: Christian Cadieux, Gavin G. Gibson
Inventor: Christian Cadieux, Gavin G. Gibson
IPC classification: G06F11/00
CPC classification: H04L41/0631, H04L41/0677, H04L43/12
Abstract: In some embodiments, a computer accessible medium comprises a plurality of instructions which, when executed, probe nodes in a network to determine if one or more nodes are experiencing any events indicative of a fault. The nodes are probed in a sequence. The instructions, when executed, in response to receiving a first alert transmitted by a first node in the network asynchronous to the probes performed according to the sequence, probe one or more neighbor nodes of the first node. In some other embodiments, the instructions, when executed, in response to receiving a first alert transmitted by a first node in the network asynchronous to the probes performed according to the sequence, interrupt probing according to the sequence to probe at least the first node.
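The probing behavior described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function name `probe_order`, the `alerts` input format (an alert keyed by the sequence step at which it arrives), and the choice to combine both embodiments (probing the alerting node and its neighbors) are assumptions made for illustration.

```python
def probe_order(nodes, neighbors, alerts):
    """Return the order in which nodes are probed.

    nodes:     node names probed in a fixed sequence.
    neighbors: dict mapping a node to a list of its neighbor nodes.
    alerts:    dict mapping a step index to the node whose asynchronous
               alert arrives just before that step (hypothetical format).
    """
    order = []
    for step, node in enumerate(nodes):
        alerting = alerts.get(step)
        if alerting is not None:
            # Interrupt the regular sequence: probe the alerting node
            # and its neighbors before resuming.
            order.append(alerting)
            order.extend(neighbors.get(alerting, []))
        # Regular sequential probe.
        order.append(node)
    return order


if __name__ == "__main__":
    print(probe_order(["a", "b", "c", "d"], {"c": ["b", "d"]}, {1: "c"}))
```

In this sketch the regular sequence resumes where it left off after the alert has been handled, so some nodes are probed twice in one pass.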
-
Publication No.: US07131032B2
Publication date: 2006-10-31
Application No.: US10389642
Filing date: 2003-03-13
IPC classification: G06F11/00
CPC classification: H04L1/22
Abstract: Provided are a method, system and article of manufacture for fault determination. A duration of time is determined for receiving an event. A plurality of events are received in a time period that is at least twice the determined duration. A plurality of factors are determined corresponding to the plurality of events. At least one factor is determined from the plurality of factors, wherein the at least one factor is a cause of at least one of the plurality of events.
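A rough sketch of the windowing idea in this abstract follows. The abstract does not say how the causal factor is selected, so ranking factors by how many events in the window they are associated with is purely an assumption, as are the names `likely_causes` and `factors_for` and the use of a window of exactly twice the duration.

```python
from collections import Counter


def likely_causes(events, now, duration, factors_for):
    """Pick the factor(s) shared by the most events in the sampling window.

    events:      list of (timestamp, event_name) pairs.
    duration:    the determined time for receiving one event.
    factors_for: dict mapping an event name to its candidate factors
                 (hypothetical lookup table).
    """
    window = 2 * duration  # the abstract requires at least twice the duration
    recent = [name for t, name in events if now - window <= t <= now]
    tally = Counter(f for name in recent for f in factors_for.get(name, ()))
    if not tally:
        return []
    top = max(tally.values())
    return sorted(f for f, n in tally.items() if n == top)


if __name__ == "__main__":
    events = [(0, "old"), (5, "link_down"), (8, "crc_error")]
    table = {"link_down": ["cable", "switch_port"],
             "crc_error": ["cable"],
             "old": ["power"]}
    print(likely_causes(events, now=10, duration=3, factors_for=table))
```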
-
Publication No.: US07058844B2
Publication date: 2006-06-06
Application No.: US10172303
Filing date: 2002-06-13
CPC classification: H04L1/0061, H04L1/0082, H04L1/22, H04L41/0659
Abstract: A fault region identification system adapted for use in a network, such as a storage area network (SAN), includes logic and/or program modules configured to identify errors that occur in the transmission of command, data and response packets between at least one host, switches and target devices on the network. The system maintains a count at each of a plurality of packet-receiving components of the network, the count indicating a number of CRC or other errors that have been detected by each component. The error counts are stored with the time of detection. The system alters the EOF (end-of-file) delimiter for each packet for which an error was counted such that other components ignore that packet, i.e. do not increment their error counts for that packet. Link segments adjacent single- or multiple-device components of the network are identified as fault regions, based upon the error counts of those components.
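The counting scheme in this abstract can be sketched on a single linear path. This is a simplified illustration, not the patented system: the names `count_errors` and `fault_regions`, the model in which a faulty link corrupts every packet, the `marked` flag standing in for the altered EOF delimiter, and the choice of the upstream link as the fault region are all assumptions. Detection timestamps are omitted.

```python
def count_errors(path, corrupting_links, packets):
    """Simulate per-component error counts along one path.

    path:             ordered component names; link i joins path[i] and path[i+1].
    corrupting_links: set of link indices assumed to corrupt every packet.
    """
    counts = {component: 0 for component in path[1:]}
    for _ in range(packets):
        corrupted = False
        marked = False  # stands in for the altered EOF delimiter
        for i, receiver in enumerate(path[1:]):
            if i in corrupting_links:
                corrupted = True
            if corrupted and not marked:
                counts[receiver] += 1  # first detector counts the error
                marked = True          # downstream components ignore the packet
    return counts


def fault_regions(path, counts):
    """Report the link segment just upstream of each component that counted errors."""
    return [(path[i], path[i + 1])
            for i in range(len(path) - 1)
            if counts[path[i + 1]] > 0]


if __name__ == "__main__":
    path = ["host", "switch1", "switch2", "target"]
    counts = count_errors(path, corrupting_links={1}, packets=3)
    print(counts, fault_regions(path, counts))
```

Because only the first detecting component counts each bad packet, a nonzero count points at the segment feeding that component rather than at everything downstream of the fault.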
-
Publication No.: US06990609B2
Publication date: 2006-01-24
Application No.: US10172302
Filing date: 2002-06-13
Applicant: Stephen A. Wiley, John Schell, Christian Cadieux
Inventor: Stephen A. Wiley, John Schell, Christian Cadieux
IPC classification: G06F11/00
CPC classification: G06F11/0709, G06F11/079, H04L41/0659, H04L41/0677
Abstract: A fault isolation system in a network is disclosed, particularly suited for use in a unidirectional fibre channel arbitrated loop. Information relating to read and write errors occurring on the loop is stored, and fault regions are located by determining areas on the loop downstream of write errors and upstream of read errors. The system may be extended to networks with bidirectional communications by storing directionality information with the detected errors. Command and response error information is not needed to deterministically locate the fault regions. When a given fault region is identified, loop and device diagnostics are executed for that region of the loop to specifically identify the failed components.
-
-
-