-
Publication number: US20160292054A1
Publication date: 2016-10-06
Application number: US15175818
Filing date: 2016-06-07
Applicant: Huawei Technologies Co., Ltd.
Inventor: Junjie Wang, Ruiling Wang, Yan Ye
CPC classification number: G06F11/2025, G06F11/2023, G06F11/2289, G06F11/3027, G06F13/4282
Abstract: A failover method, apparatus and system to implement fast failover between a primary processor and a secondary processor, where the method includes receiving, by a second device, a transaction processing packet, where the transaction processing packet includes processing information about access of a host to a peripheral component interconnect express (PCIe) device, the processing information is used to describe information required for resuming a transaction when the transaction is interrupted, the second device further stores topology information of the PCIe device, and a driver for the PCIe device is loaded to the second device, and when detecting that the first device fails, continuing to process, by the second device according to the topology information, the driver, and the processing information, the transaction that is about the access of the host to the PCIe device and is being processed when a first device fails.
-
Publication number: US10095592B2
Publication date: 2018-10-09
Application number: US15175818
Filing date: 2016-06-07
Applicant: Huawei Technologies Co., Ltd.
Inventor: Junjie Wang, Ruiling Wang, Yan Ye
Abstract: A failover method, apparatus and system to implement fast failover between a primary processor and a secondary processor, where the method includes receiving, by a second device, a transaction processing packet, where the transaction processing packet includes processing information about access of a host to a peripheral component interconnect express (PCIe) device, the processing information is used to describe information required for resuming a transaction when the transaction is interrupted, the second device further stores topology information of the PCIe device, and a driver for the PCIe device is loaded to the second device, and when detecting that the first device fails, continuing to process, by the second device according to the topology information, the driver, and the processing information, the transaction that is about the access of the host to the PCIe device and is being processed when a first device fails.
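The resume step this abstract describes can be illustrated with a minimal sketch. It is not the patented implementation: the class names, the block-count progress field, and the topology and driver lookups are all hypothetical stand-ins for the "processing information", "topology information", and "driver" named in the abstract.

```python
from dataclasses import dataclass, field


@dataclass
class TransactionPacket:
    """Hypothetical transaction processing packet sent to the second device."""
    txn_id: int
    device_id: str     # assumed identifier of the PCIe device being accessed
    operation: str     # e.g. "write"
    total_blocks: int
    blocks_done: int   # stands in for the processing information needed to resume


@dataclass
class StandbyDevice:
    """Second device: holds topology info, loaded drivers, and pending transactions."""
    topology: dict     # device_id -> path in the PCIe tree, stored in advance
    drivers: set       # device_ids whose driver is already loaded
    pending: dict = field(default_factory=dict)

    def receive(self, pkt: TransactionPacket) -> None:
        # Keep only the latest progress record; drop transactions that finished.
        if pkt.blocks_done >= pkt.total_blocks:
            self.pending.pop(pkt.txn_id, None)
        else:
            self.pending[pkt.txn_id] = pkt

    def on_primary_failure(self) -> list:
        """Return (txn_id, device_id, remaining_blocks) for each resumable transaction."""
        resumed = []
        for txn_id in sorted(self.pending):
            pkt = self.pending[txn_id]
            # Resuming needs both the device's topology entry and its driver.
            if pkt.device_id in self.topology and pkt.device_id in self.drivers:
                remaining = list(range(pkt.blocks_done, pkt.total_blocks))
                resumed.append((txn_id, pkt.device_id, remaining))
        for txn_id, _, _ in resumed:
            del self.pending[txn_id]
        return resumed


standby = StandbyDevice(topology={"nvme0": "root/switch0/port2"}, drivers={"nvme0"})
standby.receive(TransactionPacket(1, "nvme0", "write", total_blocks=4, blocks_done=1))
standby.receive(TransactionPacket(2, "nvme0", "read", total_blocks=2, blocks_done=2))
print(standby.on_primary_failure())
```

In this sketch the standby picks up transaction 1 at the first unprocessed block rather than restarting it, which is the point of sending progress information ahead of the failure. How the failure of the first device is detected is left out.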
-
Publication number: US10795785B2
Publication date: 2020-10-06
Application number: US16128956
Filing date: 2018-09-12
Applicant: Huawei Technologies Co., Ltd.
Inventor: Junjie Wang, Ruiling Wang, Yan Ye
Abstract: A failover method, apparatus and system to implement fast failover between a primary processor and a secondary processor, where the method includes receiving, by a first device, transaction content of a transaction and transaction status data of the transaction, the transaction status data being used to resume the transaction when the transaction is interrupted by a failure of a second device, and continuing to process, by the first device, the transaction according to the transaction content and the transaction status data when detecting that the second device fails.
-
Publication number: US09678826B2
Publication date: 2017-06-13
Application number: US14549395
Filing date: 2014-11-20
Applicant: Huawei Technologies Co., Ltd.
Inventor: Muhui Lin, Junjie Wang, Ruiling Wang
CPC classification number: G06F11/0793, G06F11/0745, G06F11/0751, G06F11/0772, G06F11/0796, G06F11/3027, G06F11/3041, G06F11/3051, G06F11/3485, G06F11/349, G06F13/28
Abstract: A fault isolation method, computer system, and apparatus, which are capable of monitoring a state of a second endpoint device in the extended domain, and setting a device state record according to the state of the second endpoint device; after an access request between the second endpoint device and the primary domain is received, querying the device state record according to identifier information that is of the second endpoint device and in the access request, and determining the state of the second endpoint device; and if the state of the second endpoint device is a fault state, discarding the access request to prevent communication between the faulty second endpoint device and the primary domain and prevent spreading a fault to the primary domain, thereby ensuring system reliability.
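The monitor, query, and discard sequence in this abstract can be sketched as follows. This is an illustration under assumptions, not the patented design: the `FaultIsolator` class, the dictionary-shaped request, and the choice to forward requests from endpoints with no recorded state are all hypothetical.

```python
from enum import Enum


class DeviceState(Enum):
    NORMAL = "normal"
    FAULT = "fault"


class FaultIsolator:
    """Sits between the extended domain and the primary domain (assumed placement)."""

    def __init__(self):
        # Device state record: endpoint identifier -> last observed state.
        self.state_record = {}

    def update_state(self, endpoint_id: str, healthy: bool) -> None:
        # Called by whatever monitors the endpoint devices in the extended domain.
        state = DeviceState.NORMAL if healthy else DeviceState.FAULT
        self.state_record[endpoint_id] = state

    def handle_request(self, request: dict):
        """Return the request to forward it, or None to discard it."""
        endpoint_id = request["endpoint_id"]
        if self.state_record.get(endpoint_id) is DeviceState.FAULT:
            return None  # discard: keep the fault from reaching the primary domain
        # Assumption: endpoints with no recorded state are treated as normal.
        return request


isolator = FaultIsolator()
isolator.update_state("ep2", healthy=False)
print(isolator.handle_request({"endpoint_id": "ep2", "op": "read"}))
print(isolator.handle_request({"endpoint_id": "ep3", "op": "read"}))
```

The lookup is keyed on the identifier carried in the access request, so the check costs one dictionary access per request and the faulty endpoint is cut off without touching traffic from healthy ones.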
-
Publication number: US20190012245A1
Publication date: 2019-01-10
Application number: US16128956
Filing date: 2018-09-12
Applicant: Huawei Technologies Co., Ltd.
Inventor: Junjie Wang, Ruiling Wang, Yan Ye
CPC classification number: G06F11/2025, G06F11/2023, G06F11/2289, G06F11/3027, G06F13/385, G06F13/4282, H04L1/22
Abstract: A failover method, apparatus and system to implement fast failover between a primary processor and a secondary processor, where the method includes receiving, by a first device, transaction content of a transaction and transaction status data of the transaction, the transaction status data being used to resume the transaction when the transaction is interrupted by a failure of a second device, and continuing to process, by the first device, the transaction according to the transaction content and the transaction status data when detecting that the second device fails.
-
Publication number: US20150082080A1
Publication date: 2015-03-19
Application number: US14549395
Filing date: 2014-11-20
Applicant: Huawei Technologies Co., Ltd.
Inventor: Muhui Lin, Junjie Wang, Ruiling Wang
CPC classification number: G06F11/0793, G06F11/0745, G06F11/0751, G06F11/0772, G06F11/0796, G06F11/3027, G06F11/3041, G06F11/3051, G06F11/3485, G06F11/349, G06F13/28
Abstract: A fault isolation method, computer system, and apparatus, which are capable of monitoring a state of a second endpoint device in the extended domain, and setting a device state record according to the state of the second endpoint device; after an access request between the second endpoint device and the primary domain is received, querying the device state record according to identifier information that is of the second endpoint device and in the access request, and determining the state of the second endpoint device; and if the state of the second endpoint device is a fault state, discarding the access request to prevent communication between the faulty second endpoint device and the primary domain and prevent spreading a fault to the primary domain, thereby ensuring system reliability.