SCALABLE MULTI-FRAMEWORK MULTI-TENANT LIFECYCLE MANAGEMENT OF DEEP LEARNING APPLICATIONS
Abstract:
A lifecycle management method, system, and computer program product include coordinating hardware, platform, and application-level health checks for framework-independent and application-specific monitoring, failure detection, and recovery; performing that coordination by state-specific aggregation of distributed atomic status events; and creating a recovery policy based on the state-specific aggregation of the distributed atomic status events.
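
The following is a minimal illustrative sketch, not the claimed implementation, of how state-specific aggregation of atomic status events might feed a recovery policy. Everything named here is an assumption introduced for illustration: the `JobState` values, the `StatusEvent` fields, the rule that application-level failures are ignored while a job is still deploying, and the recovery action names.

```python
from collections import Counter
from dataclasses import dataclass
from enum import Enum


class JobState(Enum):
    # Hypothetical lifecycle states; the abstract does not enumerate them.
    DEPLOYING = "deploying"
    TRAINING = "training"


@dataclass(frozen=True)
class StatusEvent:
    # One atomic status event reported by a single health check.
    source: str  # assumed levels: "hardware", "platform", "application"
    node: str    # identifier of the reporting node
    status: str  # assumed values: "ok" or "failed"


def aggregate(events, state):
    """Count failed events per level, using a rule that depends on the job state."""
    failed = Counter(e.source for e in events if e.status == "failed")
    if state is JobState.DEPLOYING:
        # Assumed rule: the application may not have started yet during
        # deployment, so application-level failures are not counted.
        failed.pop("application", None)
    return dict(failed)


def recovery_policy(failures):
    """Map aggregated failures to a recovery action (action names are hypothetical)."""
    if failures.get("hardware"):
        return "reschedule_on_new_node"
    if failures.get("platform"):
        return "restart_container"
    if failures.get("application"):
        return "resume_from_checkpoint"
    return "none"


events = [
    StatusEvent("application", "node-1", "failed"),
    StatusEvent("hardware", "node-2", "ok"),
]
policy_while_deploying = recovery_policy(aggregate(events, JobState.DEPLOYING))
policy_while_training = recovery_policy(aggregate(events, JobState.TRAINING))
```

In this sketch the same set of events yields a different policy depending on the job state, which is the point of making the aggregation state-specific: a failed application-level check is ignored during deployment but triggers a checkpoint resume during training.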