-
Publication number: US20240303142A1
Publication date: 2024-09-12
Application number: US18041035
Application date: 2022-06-07
Inventor: Shuaijian Wang, Shiyong Li, Henghua Zhang, Panpan Li, Zaibin Hu, Baotong Luo
CPC classification number: G06F11/0712, G06F9/45558, G06F11/079, G06F11/1438, G06F2009/45591, G06F2201/815
Abstract: The present disclosure provides a method for controlling a distributed operation system, an apparatus for controlling a distributed operation system, a device, a medium and a program product, which relate to the field of computer application technology, and in particular to the field of distributed operation technology. A specific implementation includes: for a first container carrying a first process, determining a current fault type of a failure in the first container in response to detecting that the first process is triggered to terminate based on the failure in the first container; and, in response to determining that the current fault type is consistent with a target fault type, reconstructing the first container and restarting the first process based on the reconstructed first container. In the present disclosure, a container is reconstructed when its fault type allows the container to be successfully reconstructed, and is not reconstructed when its fault type does not, so as to save system operation costs and meet operation requirements.
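Illustrative sketch (not part of the patent text): the decision described in the abstract can be modelled in a few lines of Python. The fault type names, the `TARGET_FAULT_TYPES` set, the `Container` class and the `handle_process_termination` function are all hypothetical and chosen only for illustration; the abstract does not say which fault types count as target fault types or how a container is reconstructed.

```python
from dataclasses import dataclass
from enum import Enum, auto


class FaultType(Enum):
    # Hypothetical fault categories; the abstract does not enumerate any.
    OUT_OF_MEMORY = auto()
    NODE_LOST = auto()
    IMAGE_INVALID = auto()


# Assumption: fault types for which a rebuilt container is expected to work.
TARGET_FAULT_TYPES = frozenset({FaultType.OUT_OF_MEMORY, FaultType.NODE_LOST})


@dataclass
class Container:
    name: str
    generation: int = 0          # incremented each time the container is rebuilt
    process_running: bool = True


def handle_process_termination(container: Container,
                               fault_type: FaultType,
                               target_types=TARGET_FAULT_TYPES) -> bool:
    """Rebuild the container and restart its process only for target fault types.

    Returns True if the container was reconstructed, False otherwise.
    """
    container.process_running = False      # the process terminated on failure
    if fault_type not in target_types:
        return False                       # rebuilding would not help; skip it
    container.generation += 1              # stands in for real reconstruction
    container.process_running = True       # restart the process in the new container
    return True
```

Under these assumptions, a container that fails with `FaultType.OUT_OF_MEMORY` is rebuilt and its process restarted, while one that fails with `FaultType.IMAGE_INVALID` is left stopped, which is where the cost saving mentioned in the abstract would come from.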
-
Publication number: US12158801B2
Publication date: 2024-12-03
Application number: US18157429
Application date: 2023-01-20
Inventor: Zhigang Zeng, Zhenyuan Sun, Bingqing Shao, Pengfei Yan, Shiyong Li, Yanpeng Wang
IPC: G06F11/07
Abstract: A method of responding to an operation, an electronic device and a storage medium are provided, which relate to the field of cloud computing, and in particular to the field of cluster technology. The specific implementation solution includes: performing, in response to determining that a target operation performed by a target client on a shared resource has timed out, a fault detection on the target client to obtain a fault detection result; and implementing, in response to determining that the fault detection result represents that the target client has a fault, an update operation to obtain a target authority identifier, so that the target client is prevented from continuing to perform the target operation by using the target authority identifier.
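Illustrative sketch (not part of the patent text): one way to read the abstract is as a fencing scheme, where the shared resource only accepts operations that carry its current authority identifier, and the identifier is updated once a timed-out client is found to be faulty. The Python below follows that reading. The `SharedResource` class, the integer identifier, and the `respond_to_timeout` function are hypothetical; the abstract does not specify the form of the identifier or how the fault detection is performed.

```python
from typing import Callable, List


class SharedResource:
    """Accepts writes only from holders of the current authority identifier."""

    def __init__(self) -> None:
        self.authority_id = 1          # assumption: a monotonically increasing integer
        self.data: List[str] = []

    def write(self, client_authority_id: int, value: str) -> bool:
        if client_authority_id != self.authority_id:
            return False               # stale identifier: the operation is rejected
        self.data.append(value)
        return True


def respond_to_timeout(resource: SharedResource,
                       timed_out: bool,
                       client_has_fault: Callable[[], bool]) -> int:
    """Run fault detection after a timeout and update the identifier on a fault.

    Returns the authority identifier that is valid after the call.
    """
    if timed_out and client_has_fault():
        resource.authority_id += 1     # update operation: old identifier is now stale
    return resource.authority_id
```

In this sketch a faulty client that later resumes still holds the old identifier, so its `write` calls return `False`, while a client given the updated identifier can proceed. If fault detection reports no fault, the identifier is left unchanged and the original client keeps its access.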
-