-
公开(公告)号:US10176033B1
公开(公告)日:2019-01-08
申请号:US14750967
申请日:2015-06-25
Applicant: Amazon Technologies, Inc.
Inventor: Kai Wang , Peter Cheng-Shiang Fang , Haoyu Huang , Qi Li , Yuanyuan Song , Lechang Cheng , Fyaaz Mohammad Barakati
IPC: G06F11/07
Abstract: A system and method for detecting the occurrence of an event causing multiple hosts to be unresponsive. The system and method including, for a set of hosts providing services to one or more customers of a computing resource service provider, determining one or more subsets of hosts that are unresponsive, determining whether the one or more subsets of hosts that are unresponsive meet a set of criteria for an occurrence of an large-scale event affecting multiple hosts, based at least in part on a determination that the set of criteria is met, initiating a remediation action.