Automatically Detecting Time-Of-Fault Bugs in Cloud Systems

    Publication No.: US20190303233A1

    Publication Date: 2019-10-03

    Application No.: US15938841

    Filing Date: 2018-03-28

    Abstract: A method implemented by a network element (NE) in a distributed system comprises: tracing an execution of a program in the distributed system to produce a record of the execution, wherein the record indicates states of shared resources at various times during the execution; identifying, based on the record, a vulnerable operation that occurred during the execution, wherein the record indicates that a first shared resource of the shared resources is in a flawed state after the node that caused the first shared resource to be in the flawed state crashed; and determining, based on performing a fault-tolerance mechanism, that the vulnerable operation results in a time-of-fault (TOF) bug.
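
The three steps of the abstract (trace, identify, determine) can be illustrated with a minimal sketch. This is not the patented implementation. Everything named here is a hypothetical assumption made for illustration: the `Event` record layout, the `FLAWED_STATES` set, the helper names, and the modelling of the fault-tolerance mechanism as a `recover` callable that returns the resource states after recovery.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical set of states treated as "flawed" (an assumption of this sketch).
FLAWED_STATES = {"locked", "partial"}


@dataclass(frozen=True)
class Event:
    """One entry of the execution record (hypothetical layout)."""
    time: int
    node: str
    kind: str            # "write" or "crash"
    resource: str = ""   # shared resource touched by a write
    state: str = ""      # state the write left the resource in


def find_vulnerable_writes(record: List[Event]) -> List[Event]:
    """Return writes whose resource was still flawed when the writer crashed."""
    last_write: Dict[str, Event] = {}
    vulnerable: List[Event] = []
    for ev in sorted(record, key=lambda e: e.time):
        if ev.kind == "write":
            # Track the most recent write per shared resource.
            last_write[ev.resource] = ev
        elif ev.kind == "crash":
            # A write is vulnerable if the crashed node was the last writer
            # and it left the resource in a flawed state.
            for w in last_write.values():
                if w.node == ev.node and w.state in FLAWED_STATES:
                    vulnerable.append(w)
    return vulnerable


def is_tof_bug(
    write: Event,
    states_at_crash: Dict[str, str],
    recover: Callable[[Dict[str, str], str], Dict[str, str]],
) -> bool:
    """Run the (modelled) fault-tolerance mechanism and check the outcome.

    The write is reported as a TOF bug if its resource is still flawed
    after recovery has been performed for the crashed node.
    """
    states_after = recover(dict(states_at_crash), write.node)
    return states_after.get(write.resource) in FLAWED_STATES


# Example record: node A takes a lock and crashes before releasing it.
record = [
    Event(1, "A", "write", "lock", "locked"),
    Event(2, "B", "write", "table", "ok"),
    Event(3, "A", "crash"),
]
states_at_crash = {"lock": "locked", "table": "ok"}


def naive_recover(states: Dict[str, str], crashed: str) -> Dict[str, str]:
    """A recovery routine that forgets to clean up shared resources."""
    return states


def cleaning_recover(states: Dict[str, str], crashed: str) -> Dict[str, str]:
    """A recovery routine that resets every flawed resource."""
    return {r: ("ok" if s in FLAWED_STATES else s) for r, s in states.items()}


vulnerable = find_vulnerable_writes(record)
```

In this sketch, `is_tof_bug(vulnerable[0], states_at_crash, naive_recover)` flags the lock write, while the same call with `cleaning_recover` does not, which mirrors the idea that a vulnerable operation only counts as a TOF bug if the fault-tolerance mechanism fails to repair the flawed state.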
