-
Publication number: US20160224277A1
Publication date: 2016-08-04
Application number: US14613085
Application date: 2015-02-03
Applicant: NetApp, Inc.
Inventor: Ankit Batra, Amarnath Jolad, Kazunobu Nishime
CPC classification number: G06F3/0653, G06F3/0619, G06F3/067, H04L41/065, H04L41/0893, H04L41/12, H04L43/0817, H04L67/1097
Abstract: Monitoring health of associated, but separated storage clusters can be done at both a node scope and a cluster scope. Monitoring the storage clusters at the cluster scope includes monitoring the network elements that support the storage clusters and connect the storage clusters. Initially, a fabric monitor in each cluster discovers cluster topology. This cluster topology is communicated and maintained throughout the managing storage elements of the storage clusters. After the storage cluster topologies have been discovered, the fabric monitors of each cluster can periodically determine status of network elements of the storage clusters. This allows the storage clusters to maintain awareness of interconnect status, and react to changes in status. In addition, each managing storage element monitors its own health. This information is aggregated to determine when to trigger corrective actions, alerts, and/or storage features in accordance with rules defined at the managing storage elements.
-
Publication number: US20180095849A1
Publication date: 2018-04-05
Application number: US15820784
Application date: 2017-11-22
Applicant: NetApp Inc.
Inventor: Amarnath Jolad, Kazunobu Nishime, Iswarya Ayyappan, Ankit Batra
IPC: G06F11/20
CPC classification number: G06F11/2007, G06F11/2025, G06F11/2028, G06F11/2046, G06F11/2048, G06F11/2097
Abstract: Direct monitoring of a plurality of storage nodes in a primary cluster is performed based on connectivity with the storage nodes. Indirect monitoring of a first storage node is performed, in response to direct monitoring of the first storage node indicating failure of the connectivity with the first storage node, wherein a second storage node of the plurality of nodes is a backup node for the first storage node. The indirect monitor of the first storage node indicates failure of the first storage node in response to performance of storage access operations by the second storage node that were previously performed by the first storage node. A cluster-switch operation is initiated to switch from the primary cluster to a backup cluster based on an occurrence of at least one cluster-failure condition that comprises the indirect monitor of the first storage node indicating failure of the first storage node.
-
Publication number: US20160132411A1
Publication date: 2016-05-12
Application number: US14718346
Application date: 2015-05-21
Applicant: NetApp, Inc.
Inventor: Amarnath Jolad, Kazunobu Nishime, Iswarya Ayyappan, Ankit Batra
IPC: G06F11/20
CPC classification number: G06F11/2007, G06F11/2025, G06F11/2028, G06F11/2046, G06F11/2048, G06F11/2097
Abstract: Direct monitoring of a plurality of storage nodes in a primary cluster is performed based on connectivity with the storage nodes. Indirect monitoring of a first storage node is performed, in response to direct monitoring of the first storage node indicating failure of the connectivity with the first storage node, wherein a second storage node of the plurality of nodes is a backup node for the first storage node. The indirect monitor of the first storage node indicates failure of the first storage node in response to performance of storage access operations by the second storage node that were previously performed by the first storage node. A cluster-switch operation is initiated to switch from the primary cluster to a backup cluster based on an occurrence of at least one cluster-failure condition that comprises the indirect monitor of the first storage node indicating failure of the first storage node.
-
Publication number: US11163653B2
Publication date: 2021-11-02
Application number: US16679823
Application date: 2019-11-11
Applicant: NetApp Inc.
Inventor: Amarnath Jolad, Kazunobu Nishime, Iswarya Ayyappan, Ankit Batra
Abstract: Direct monitoring of a plurality of storage nodes in a primary cluster is performed based on connectivity with the storage nodes. Indirect monitoring of a first storage node is performed, in response to direct monitoring of the first storage node indicating failure of the connectivity with the first storage node, wherein a second storage node of the plurality of nodes is a backup node for the first storage node. The indirect monitor of the first storage node indicates failure of the first storage node in response to performance of storage access operations by the second storage node that were previously performed by the first storage node. A cluster-switch operation is initiated to switch from the primary cluster to a backup cluster based on an occurrence of at least one cluster-failure condition that comprises the indirect monitor of the first storage node indicating failure of the first storage node.
-
Publication number: US10437510B2
Publication date: 2019-10-08
Application number: US14613085
Application date: 2015-02-03
Applicant: NetApp, Inc.
Inventor: Ankit Batra, Amarnath Jolad, Kazunobu Nishime
Abstract: Monitoring health of associated, but separated storage clusters can be done at both a node scope and a cluster scope. Monitoring the storage clusters at the cluster scope includes monitoring the network elements that support the storage clusters and connect the storage clusters. Initially, a fabric monitor in each cluster discovers cluster topology. This cluster topology is communicated and maintained throughout the managing storage elements of the storage clusters. After the storage cluster topologies have been discovered, the fabric monitors of each cluster can periodically determine status of network elements of the storage clusters. This allows the storage clusters to maintain awareness of interconnect status, and react to changes in status. In addition, each managing storage element monitors its own health. This information is aggregated to determine when to trigger corrective actions, alerts, and/or storage features in accordance with rules defined at the managing storage elements.
-
Publication number: US11106388B2
Publication date: 2021-08-31
Application number: US16591714
Application date: 2019-10-03
Applicant: NetApp Inc.
Inventor: Ankit Batra, Amarnath Jolad, Kazunobu Nishime
Abstract: Monitoring health of associated, but separated storage clusters can be done at both a node scope and a cluster scope. Monitoring the storage clusters at the cluster scope includes monitoring the network elements that support the storage clusters and connect the storage clusters. Initially, a fabric monitor in each cluster discovers cluster topology. This cluster topology is communicated and maintained throughout the managing storage elements of the storage clusters. After the storage cluster topologies have been discovered, the fabric monitors of each cluster can periodically determine status of network elements of the storage clusters. This allows the storage clusters to maintain awareness of interconnect status, and react to changes in status. In addition, each managing storage element monitors its own health. This information is aggregated to determine when to trigger corrective actions, alerts, and/or storage features in accordance with rules defined at the managing storage elements.
-
Publication number: US20200073768A1
Publication date: 2020-03-05
Application number: US16679823
Application date: 2019-11-11
Applicant: NetApp Inc.
Inventor: Amarnath Jolad, Kazunobu Nishime, Iswarya Ayyappan, Ankit Batra
IPC: G06F11/20
Abstract: Direct monitoring of a plurality of storage nodes in a primary cluster is performed based on connectivity with the storage nodes. Indirect monitoring of a first storage node is performed, in response to direct monitoring of the first storage node indicating failure of the connectivity with the first storage node, wherein a second storage node of the plurality of nodes is a backup node for the first storage node. The indirect monitor of the first storage node indicates failure of the first storage node in response to performance of storage access operations by the second storage node that were previously performed by the first storage node. A cluster-switch operation is initiated to switch from the primary cluster to a backup cluster based on an occurrence of at least one cluster-failure condition that comprises the indirect monitor of the first storage node indicating failure of the first storage node.
-
Publication number: US20200034069A1
Publication date: 2020-01-30
Application number: US16591714
Application date: 2019-10-03
Applicant: NetApp Inc.
Inventor: Ankit Batra, Amarnath Jolad, Kazunobu Nishime
Abstract: Monitoring health of associated, but separated storage clusters can be done at both a node scope and a cluster scope. Monitoring the storage clusters at the cluster scope includes monitoring the network elements that support the storage clusters and connect the storage clusters. Initially, a fabric monitor in each cluster discovers cluster topology. This cluster topology is communicated and maintained throughout the managing storage elements of the storage clusters. After the storage cluster topologies have been discovered, the fabric monitors of each cluster can periodically determine status of network elements of the storage clusters. This allows the storage clusters to maintain awareness of interconnect status, and react to changes in status. In addition, each managing storage element monitors its own health. This information is aggregated to determine when to trigger corrective actions, alerts, and/or storage features in accordance with rules defined at the managing storage elements.
-
Publication number: US09842033B2
Publication date: 2017-12-12
Application number: US14718346
Application date: 2015-05-21
Applicant: NetApp, Inc.
Inventor: Amarnath Jolad, Kazunobu Nishime, Iswarya Ayyappan, Ankit Batra
CPC classification number: G06F11/2007, G06F11/2025, G06F11/2028, G06F11/2046, G06F11/2048, G06F11/2097
Abstract: Direct monitoring of a plurality of storage nodes in a primary cluster is performed based on connectivity with the storage nodes. Indirect monitoring of a first storage node is performed, in response to direct monitoring of the first storage node indicating failure of the connectivity with the first storage node, wherein a second storage node of the plurality of nodes is a backup node for the first storage node. The indirect monitor of the first storage node indicates failure of the first storage node in response to performance of storage access operations by the second storage node that were previously performed by the first storage node. A cluster-switch operation is initiated to switch from the primary cluster to a backup cluster based on an occurrence of at least one cluster-failure condition that comprises the indirect monitor of the first storage node indicating failure of the first storage node.