Publication number: US20240303142A1
Publication date: 2024-09-12
Application number: US18041035
Filing date: 2022-06-07
Inventor: Shuaijian Wang, Shiyong Li, Henghua Zhang, Panpan Li, Zaibin Hu, Baotong Luo
CPC classification number: G06F11/0712, G06F9/45558, G06F11/079, G06F11/1438, G06F2009/45591, G06F2201/815
Abstract: The present disclosure provides a method, an apparatus, a device, a medium and a program product for controlling a distributed operation system, which relate to the field of computer application technology, and in particular to the field of distributed operation technology. A specific implementation includes: for a first container carrying a first process, in response to detecting that the first process is triggered to terminate because of a failure in the first container, determining the current fault type of that failure; and, in response to determining that the current fault type is consistent with a target fault type, reconstructing the first container and restarting the first process in the reconstructed first container. A container is thus reconstructed only for fault types that allow it to be reconstructed successfully, and is not reconstructed for fault types that do not, which saves system operation costs and meets operation requirements.
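
Illustrative sketch: the decision flow described in the abstract can be outlined in Python as follows. This is a minimal sketch under stated assumptions, not the implementation claimed in the disclosure. The names `FaultType`, `TARGET_FAULT_TYPES`, `Container` and `handle_process_termination` are hypothetical, and the fault categories are invented for illustration, since the abstract does not enumerate which fault types count as target fault types.

```python
from dataclasses import dataclass, field
from enum import Enum, auto


class FaultType(Enum):
    # Hypothetical fault categories; the abstract does not list any.
    OUT_OF_MEMORY = auto()
    NODE_HARDWARE = auto()


# Assumption: the target fault types are those for which a reconstructed
# container is expected to start successfully.
TARGET_FAULT_TYPES = frozenset({FaultType.OUT_OF_MEMORY})


@dataclass
class Container:
    name: str
    generation: int = 0           # incremented on every reconstruction
    process_running: bool = True  # state of the process carried by the container
    events: list = field(default_factory=list)


def handle_process_termination(container, fault_type,
                               target_fault_types=TARGET_FAULT_TYPES):
    """Handle a process that was terminated by a failure in its container.

    Returns True if the container was reconstructed and the process
    restarted, False if reconstruction was skipped.
    """
    container.process_running = False

    # Reconstruct only when the current fault type matches a target type.
    if fault_type not in target_fault_types:
        container.events.append("skipped")
        return False

    # Stand-in for real container reconstruction and process restart.
    container.generation += 1
    container.events.append("reconstructed")
    container.process_running = True
    container.events.append("restarted")
    return True
```

In this sketch, a call with `FaultType.OUT_OF_MEMORY` reconstructs the container and restarts the process, while a call with `FaultType.NODE_HARDWARE` leaves the container untouched, which mirrors the cost-saving behaviour the abstract describes.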