Method and system for providing storage checkpointing to a group of independent computer applications
    14.
    Granted invention patent (status: in force)

    Publication number: US09449015B1

    Publication date: 2016-09-20

    Application number: US14532713

    Filing date: 2014-11-04

    IPC classification: G06F17/30

    Abstract: A system and method for providing storage checkpointing to a group of independent computer applications. The system has a storage disk that stores files, a storage access interface to access the storage disk, and a computer. The computer runs the group of independent computer applications and uses the files stored on the storage disk. A file system on the server accesses the files stored on the storage disk. The file system can call an operating system and at least one device driver, and at least one buffer buffers first data written to the storage disk and second data read from the storage disk.

    Method and computer readable medium for providing checkpointing to windows application groups
    15.
    Granted invention patent (status: in force)

    Publication number: US09201737B1

    Publication date: 2015-12-01

    Application number: US14532336

    Filing date: 2014-11-04

    Abstract: A computer readable medium and method for providing checkpointing to Windows application groups. The checkpointing may be triggered asynchronously using Asynchronous Procedure Calls. The computer readable medium includes computer-executable instructions for execution by a processing system. The computer-executable instructions may be for reviewing one or more command line arguments to determine whether to start at least one of the application groups, and, when determining to start the at least one of the application groups, creating a process table in a shared memory to store information about each process of the at least one of the application groups. Further, the instructions may be for registering with a kernel module to create an application group barrier, creating a named pipe for applications of the application group to register and unregister, triggering a checkpoint thread to initiate an application group checkpoint, and launching an initial application of the applications of the application group.

    Method and system for providing storage checkpointing to a group of independent computer applications
    16.
    Granted invention patent (status: in force)

    Publication number: US09158626B1

    Publication date: 2015-10-13

    Application number: US14291152

    Filing date: 2014-05-30

    IPC classification: G06F17/30 G06F11/14

    Abstract: A method and system for storage checkpointing of an independent computer application. The independent computer application is launched by a coordinator, and the coordinator installs at least one of an exec interceptor and a fork interceptor. The coordinator also installs at least one file operations interceptor for all file operations and registers the independent computer application with the coordinator. The independent computer application is run, and the at least one file operations interceptor is called upon encountering a file operation. The file operations interceptor logs a file event in a file operations database and passes the operation to at least one of a file system, an operating system, at least one or more device drivers, and a storage disk via a storage interface. The file operations interceptor also verifies that the file operation has been issued.
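
The interceptor flow in the abstract above (log the file event, pass the operation down, then verify it) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the class name `FileOpsInterceptor`, the in-memory list standing in for the file operations database, and the use of `os.fsync` plus a byte-count check as a stand-in for verification. The patent itself describes interception of application file operations, not a Python wrapper.

```python
import os
import tempfile


class FileOpsInterceptor:
    """Hypothetical interceptor: logs each write, then forwards it to the OS."""

    def __init__(self):
        # Stand-in for the "file operations database" (an assumption).
        self.events = []

    def write(self, path, data):
        # 1. Log the file event before the operation is issued.
        self.events.append(("write", path, len(data)))
        # 2. Pass the operation down to the file system / operating system.
        with open(path, "ab") as f:
            written = f.write(data)
            f.flush()
            os.fsync(f.fileno())  # push buffered data toward the storage disk
        # 3. Verify that the operation was issued in full.
        if written != len(data):
            raise OSError("short write on " + path)
        return written


def demo():
    """Run one intercepted write in a temporary directory; return the log and file content."""
    with tempfile.TemporaryDirectory() as d:
        path = os.path.join(d, "data.bin")
        interceptor = FileOpsInterceptor()
        interceptor.write(path, b"hello")
        with open(path, "rb") as f:
            content = f.read()
        return interceptor.events, content
```

Logging before forwarding means the event log never lags behind the disk, which is the property a storage checkpoint would rely on; whether the patented system orders the steps exactly this way is not stated in the abstract.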

    Method and system for providing coordinated checkpointing to a group of independent computer applications
    17.
    Granted invention patent (status: in force)

    Publication number: US08775871B1

    Publication date: 2014-07-08

    Application number: US13951759

    Filing date: 2013-07-26

    IPC classification: G06F11/00

    Abstract: A system and method for performing loss-less migration of an application group. In an exemplary embodiment, the system may include a high-availability services module structured for execution in conjunction with an operating system, and one or more computer nodes of a distributed system upon which at least one independent application can be executed. The high-availability services module may be structured to be executable on the one or more computer nodes for loss-less migration of the one or more independent applications, and is operable to perform checkpointing of all state in a transport connection.

    System and method for hybrid kernel- and user-space incremental and full checkpointing

    Publication number: US10990487B1

    Publication date: 2021-04-27

    Application number: US16675154

    Filing date: 2019-11-05

    Inventor: Allan Havemose

    IPC classification: G06F11/00 G06F11/14

    Abstract: A system includes a multi-process application that runs on primary hosts and is checkpointed by a checkpointer comprised of a kernel-mode checkpointer module and one or more user-space interceptors providing at least one of barrier synchronization, checkpointing thread, resource flushing, and an application virtualization space. Checkpoints may be written to storage and the application restored from said stored checkpoint at a later time. Checkpointing may be incremental using Page Table Entry (PTE) pages and Virtual Memory Areas (VMA) information. Checkpointing is transparent to the application and requires no modification to the application, operating system, networking stack, or libraries. In an alternate embodiment, the kernel-mode checkpointer is built into the kernel.

    System and method for statistical application agnostic fault detection

    Publication number: US10817364B1

    Publication date: 2020-10-27

    Application number: US16167988

    Filing date: 2018-10-23

    Inventor: Allan Havemose

    IPC classification: G06F11/00 G06F11/07

    Abstract: A system, method, and computer readable medium for statistical application-agnostic fault detection of multi-process applications. The computer readable medium includes computer-executable instructions for execution by a processing system. A multi-process application runs on a host. Interceptors collect statistical events and send said events to a statistical fault detector. The statistical fault detector creates one or more distributions, compares recent statistical event data to historical statistical event data, and uses deviation from the historical norm for fault detection. The present invention detects faults both within the application and within the environment wherein the application executes, if conditions within the environment cause impaired application performance. The invention also teaches consensus fault detection and elimination of cascading fault notifications based on a hierarchy of events and event groups. Interception and fault detection are transparent to the application, operating system, networking stack, and libraries.
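
The core comparison in the abstract above, recent event data against the historical norm, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `is_fault`, the use of a z-score on the mean, and the cutoff of 3.0 are all hypothetical choices, and the abstract does not say which distributions or deviation measures the patented detector uses.

```python
from statistics import mean, stdev


def is_fault(history, recent, threshold=3.0):
    """Flag a fault when the recent mean deviates from the historical norm.

    history, recent: sequences of one numeric event metric (hypothetical,
    for example call latency in milliseconds).
    threshold: assumed z-score cutoff, not a value taken from the patent.
    """
    if len(history) < 2 or len(recent) < 1:
        raise ValueError("need >= 2 historical samples and >= 1 recent sample")
    mu = mean(history)
    sigma = stdev(history)
    if sigma == 0:
        # Degenerate history: any change at all counts as a deviation.
        return mean(recent) != mu
    z_score = abs(mean(recent) - mu) / sigma
    return z_score > threshold
```

For instance, with a history hovering around 10 ms, a recent window of `[10, 11]` stays within the norm, while `[25, 30]` is far outside it and would be reported.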

    System and method for trend estimation for application-agnostic statistical fault detection

    Publication number: US10656989B1

    Publication date: 2020-05-19

    Application number: US16043480

    Filing date: 2018-07-24

    Inventor: Allan Havemose

    IPC classification: G06F11/00 G06F11/07

    Abstract: Trend estimation for application-agnostic statistical fault detection of multi-process applications in environments with trending data. A multi-process application runs on a host. Statistical events are collected and sent to a statistical fault detector. The statistical fault detector creates one or more distributions, compares recent statistical event data to historical statistical event data, and uses deviation from the historical norm for fault detection. Trend is estimated and, if needed, removed from event data prior to the creation of distributions. Trend is estimated using spectral techniques, filter banks, and Maximum Entropy Spectral Estimation, and dominant frequencies are estimated and utilized to adapt to the environment.
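
To show what "removing trend prior to the creation of distributions" means in practice, here is a minimal Python sketch. Note the substitution: the abstract names spectral techniques, filter banks, and maximum entropy spectral estimation, whereas this sketch uses a plain least-squares straight-line fit because it is the simplest detrending method to demonstrate. The function name `remove_linear_trend` and the assumption of evenly spaced samples are hypothetical.

```python
def remove_linear_trend(values):
    """Subtract a least-squares straight line from evenly spaced samples.

    Returns the residuals, which can then be fed into a distribution.
    A linear fit is an illustrative stand-in, not the patent's estimator.
    """
    n = len(values)
    if n < 2:
        return [0.0] * n  # a single point carries no trend information
    x_mean = (n - 1) / 2
    y_mean = sum(values) / n
    sxx = sum((i - x_mean) ** 2 for i in range(n))
    sxy = sum((i - x_mean) * (v - y_mean) for i, v in enumerate(values))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return [v - (intercept + slope * i) for i, v in enumerate(values)]
```

Applied to a steadily rising metric, the residuals fluctuate around zero, so a growing baseline is not itself mistaken for a deviation from the historical norm.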