Scheduling work in a multi-node computer system based on checkpoint characteristics
    81.
    Granted invention patent (status: in force)

    Publication number: US09417909B2

    Publication date: 2016-08-16

    Application number: US12270164

    Filing date: 2008-11-13

    CPC classification number: G06F9/485 G06F9/4881 G06F9/5038 G06F11/1438

    Abstract: Efficient application checkpointing uses the checkpointing characteristics of a job to determine how to schedule jobs for execution on a multi-node computer system. A checkpoint profile in the job description includes information on the expected frequency and duration of a checkpoint cycle for the application. The checkpoint profile may be based on user/administrator input as well as historical information. The job scheduler will attempt to group applications (jobs) that have the same checkpoint profile on the same nodes or group of nodes. Additionally, the job scheduler may control when new jobs start based on when the next checkpoint cycle(s) are expected. The checkpoint monitor will monitor the checkpoint cycles, updating the checkpoint profiles of running jobs. The checkpoint monitor will also keep track of an overall system checkpoint profile to determine the available checkpointing capacity before scheduling jobs on the cluster.
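
    The grouping step in this abstract can be illustrated with a minimal sketch. It is not the patented scheduler: the field names `interval_s` and `duration_s`, the node group names, and the round-robin placement are all assumptions made for illustration, since the abstract only says that jobs with the same checkpoint profile are grouped on the same nodes.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class CheckpointProfile:
    # Hypothetical fields: the abstract only mentions "frequency and duration".
    interval_s: int   # expected seconds between checkpoint cycles
    duration_s: int   # expected seconds one checkpoint cycle takes


@dataclass(frozen=True)
class Job:
    name: str
    profile: CheckpointProfile


def group_by_profile(jobs):
    """Bucket job names by identical checkpoint profile."""
    groups = defaultdict(list)
    for job in jobs:
        groups[job.profile].append(job.name)
    return dict(groups)


def assign_node_groups(jobs, node_groups):
    """Place each profile bucket on one node group.

    Buckets are assigned round-robin (an assumed policy), so two buckets
    share a node group only when buckets outnumber node groups.
    """
    placement = {}
    for i, names in enumerate(group_by_profile(jobs).values()):
        target = node_groups[i % len(node_groups)]
        for name in names:
            placement[name] = target
    return placement


jobs = [
    Job("a", CheckpointProfile(600, 30)),
    Job("b", CheckpointProfile(3600, 120)),
    Job("c", CheckpointProfile(600, 30)),
]
placement = assign_node_groups(jobs, ["rack0", "rack1"])
```

    Here jobs "a" and "c" share a profile and land on the same node group, while "b" is placed separately.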

    Virtual machine cluster backup in a multi-node environment
    82.
    Granted invention patent (status: in force)

    Publication number: US09411628B2

    Publication date: 2016-08-09

    Application number: US14540640

    Filing date: 2014-11-13

    Abstract: Embodiments are directed to backing up a virtual machine cluster and to determining virtual machine node ownership prior to backing up a virtual machine cluster. In one scenario, a computer system determines which virtual machine nodes are part of the virtual machine cluster, determines which shared storage resources are part of the virtual machine cluster and determines which virtual machine nodes own the shared storage resources. The computer system then indicates to the virtual machine node owners that at least one specified application is to be quiesced over the nodes of the virtual machine cluster, such that a consistent, cluster-wide checkpoint can be created. The computer system further creates a cluster-wide checkpoint which includes a checkpoint for each virtual machine in the virtual machine cluster.
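
    The ordering described in this abstract (find the storage owners, quiesce, then checkpoint every virtual machine) might be sketched as follows. This is a toy model under stated assumptions: the function name, the `ckpt-` identifiers, and the final resume step are hypothetical and do not come from the patent or from any hypervisor API.

```python
def cluster_checkpoint(vm_nodes, storage_owner):
    """Simulate a cluster-wide checkpoint.

    vm_nodes: list of VM node names in the cluster.
    storage_owner: dict mapping each shared disk to the node that owns it.
    Returns (checkpoints, log), where log records the order of actions.
    """
    owners = sorted(set(storage_owner.values()))
    unknown = [o for o in owners if o not in vm_nodes]
    if unknown:
        raise ValueError(f"owners not in cluster: {unknown}")

    # 1. Ask every owning node to quiesce the application.
    log = [("quiesce", owner) for owner in owners]
    # 2. Take one checkpoint per VM while the application is quiet.
    checkpoints = {vm: f"ckpt-{vm}" for vm in vm_nodes}
    # 3. Resume afterwards (assumed step, not stated in the abstract).
    log += [("resume", owner) for owner in owners]
    return checkpoints, log


checkpoints, log = cluster_checkpoint(
    ["vm1", "vm2", "vm3"],
    {"diskA": "vm2", "diskB": "vm1", "diskC": "vm2"},
)
```

    Only "vm1" and "vm2" own shared storage, so only they receive the quiesce request, but all three machines are checkpointed.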

    DISTRIBUTED SYSTEM, SERVER COMPUTER, DISTRIBUTED MANAGEMENT SERVER, AND FAILURE PREVENTION METHOD
    83.
    Invention patent application (status: under examination, published)

    Publication number: US20160224444A1

    Publication date: 2016-08-04

    Application number: US15096422

    Filing date: 2016-04-12

    Inventor: Yoshihiro OKADA

    Abstract: A distributed system according to an exemplary embodiment includes first and second servers capable of executing the same application, wherein when a failure occurs in the application in the first server, the first server generates failure information identifying a cause of the failure in the application, and the second server performs failure prevention processing which is determined based on the failure information and intended to prevent a failure in the application.

    Recovering step and batch-based processes
    85.
    Granted invention patent (status: in force)

    Publication number: US09405636B2

    Publication date: 2016-08-02

    Application number: US13906608

    Filing date: 2013-05-31

    CPC classification number: G06F11/1482 G06F11/1438 G06F11/1474

    Abstract: A method of recovering batch-based processes may include providing an interface for receiving process recoverability information. The recoverability information may include (i) information describing a mutual exclusivity of data affected by a process, (ii) information describing sub-processes associated with the process, and/or (iii) information describing scope cleanup procedures associated with the process. The method may also include receiving the recoverability information through the interface, and receiving an indication that the process experienced an error while being executed on a client system. The method may additionally include providing the process recoverability information to make a recoverability determination for the process.

    Customizable Backup and Restore from NonVolatile Logic Array
    86.
    Invention patent application (status: under examination, published)

    Publication number: US20160217840A1

    Publication date: 2016-07-28

    Application number: US15089607

    Filing date: 2016-04-04

    Abstract: Design and operation of a processing device are configurable to optimize wake-up time and peak power cost during restoration of a machine state from non-volatile storage. The processing device includes a plurality of non-volatile logic element arrays configured to store a machine state represented by a plurality of volatile storage elements of the processing device. A stored machine state is read out from the plurality of non-volatile logic element arrays to the plurality of volatile storage elements. During manufacturing, a number of rows and a number of bits per row in non-volatile logic element arrays are based on a target wake-up time and a peak power cost. In another approach, writing data to or reading data from the plurality of non-volatile arrays can be done in parallel, sequentially, or in any combination to optimize operation characteristics.
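
    The trade-off between wake-up time and peak power that this abstract refers to can be made concrete with a small calculation. The model below is an assumption, not taken from the patent: every array is taken to need the same read time and the same read power, reading `k` arrays at once multiplies peak power by `k`, and the restore runs in `ceil(n / k)` rounds. The function and parameter names are hypothetical.

```python
import math


def restore_plan(num_arrays, read_time_us, read_power_mw,
                 target_wake_us, peak_power_mw):
    """Find the smallest parallelism that meets the wake-up target.

    Returns (arrays_in_parallel, wake_time_us, peak_mw), or None when no
    degree of parallelism satisfies both the time and the power limit.
    """
    for k in range(1, num_arrays + 1):
        peak = k * read_power_mw
        if peak > peak_power_mw:
            break  # more parallelism only raises peak power further
        wake = math.ceil(num_arrays / k) * read_time_us
        if wake <= target_wake_us:
            return k, wake, peak
    return None


# 8 arrays, 2 us and 5 mW per read, 5 us wake-up target, 25 mW budget.
plan = restore_plan(8, 2, 5, 5, 25)
# Same hardware, but a 1 us target that the power budget cannot support.
no_plan = restore_plan(8, 2, 5, 1, 25)
```

    Under these assumed numbers, fully sequential reads take 16 us, while reading four arrays at a time takes 4 us at 20 mW, which is the first setting that meets both limits.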

    PROVIDING A RECOVERY PLACEHOLDER WITHIN AN APPLICATION
    88.
    Invention patent application (status: in force)

    Publication number: US20160210207A1

    Publication date: 2016-07-21

    Application number: US15086188

    Filing date: 2016-03-31

    Abstract: Embodiments described herein provide recovery placeholders within an application. Specifically, one approach includes providing an application operating on a client device, and generating a recovery placeholder that defines a current state of the application by analyzing a queue containing a set of messages, and identifying one or more selected events corresponding to the application from the queue. In one approach, the current state defines, at the time the recovery placeholder is generated, at least one of: a position within a window of the application, a current activity of the application, a position of the window within a display of a display device, and a placement order of the window of the application in relation to a stack of other cascaded windows. At a later point in time, the application may then be restored to the current state by accessing the recovery placeholder to replay the one or more selected events.
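
    The record-then-replay idea in this abstract can be sketched in a few lines. Everything named here is hypothetical: the `Event` and `Placeholder` types, the event kinds, and the rule that a later event of the same kind overwrites an earlier one are assumptions chosen to keep the example small, not details from the application.

```python
from dataclasses import dataclass, field


@dataclass
class Event:
    app: str
    kind: str       # e.g. "move_window", "scroll", "raise"
    value: object


@dataclass
class Placeholder:
    app: str
    events: list = field(default_factory=list)


# Assumed filter: only these event kinds are considered state-defining.
SELECTED_KINDS = {"move_window", "scroll", "raise"}


def make_placeholder(queue, app):
    """Scan the message queue and keep the selected events for one app."""
    events = [e for e in queue if e.app == app and e.kind in SELECTED_KINDS]
    return Placeholder(app, events)


def restore(placeholder):
    """Replay the stored events in order to rebuild the recorded state."""
    state = {}
    for event in placeholder.events:
        state[event.kind] = event.value  # later events overwrite earlier ones
    return state


queue = [
    Event("editor", "move_window", (10, 20)),
    Event("mail", "scroll", 5),
    Event("editor", "scroll", 120),
    Event("editor", "keypress", "x"),
    Event("editor", "move_window", (40, 60)),
]
placeholder = make_placeholder(queue, "editor")
state = restore(placeholder)
```

    The placeholder keeps three of the five queued messages, and replaying them yields the editor's last window position and scroll offset.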
