PROACTIVE HIGH AVAILABILITY IN A VIRTUALIZED COMPUTER SYSTEM

    公开(公告)号:US20190384648A1

    公开(公告)日:2019-12-19

    申请号:US16549032

    申请日:2019-08-23

    Applicant: VMware, Inc.

    Abstract: Disclosed are aspects of proactive high availability that proactively identify and predict hardware failure scenarios and migrate virtual resources to healthy hardware resources. In some aspects, a mapping that maps virtual resources to hardware resources. A plurality of hardware events are identified in association with a hardware resource. A hardware failure scenario is predicted based on a health score of a first hardware resource. A health score is determined based on the hardware events, and a fault model that indicates a level of severity of the hardware events. A particular virtual resource is migrated from the hardware resource to another hardware that has a greater health score.

    Maintaining high availability of a group of virtual machines using heartbeat messages
    4.
    发明授权
    Maintaining high availability of a group of virtual machines using heartbeat messages 有权
    使用心跳消息维护一组虚拟机的高可用性

    公开(公告)号:US09558070B2

    公开(公告)日:2017-01-31

    申请号:US14561672

    申请日:2014-12-05

    Applicant: VMware, Inc.

    CPC classification number: G06F11/1438 G06F9/45533 G06F9/542 G06F11/0757

    Abstract: Embodiments maintain high availability of software application instances in a fault domain. Subordinate hosts are monitored by a master host. The subordinate hosts publish heartbeats via a network and datastores. Based at least in part on the published heartbeats, the master host determines the status of each subordinate host, distinguishing between subordinate hosts that are entirely inoperative and subordinate hosts that are operative but partitioned (e.g., unreachable via the network). The master host may restart software application instances, such as virtual machines, that are executed by inoperative subordinate hosts or that cease executing on partitioned subordinate hosts.

    Abstract translation: 实施例在故障域中保持软件应用程序实例的高可用性。 主机由主机监控。 下级主机通过网络和数据存储发布心跳。 至少部分地基于所发布的心跳,主主机确定每个从属主机的状态,区分完全不起作用的从属主机和可操作但被分区的下级主机(例如,经由网络不可达)。 主主机可以重新启动由不能操作的下属主机执行的软件应用程序实例,例如虚拟机,或者停止在分区的下级主机上执行。

    Orchestrating high availability failover for virtual machines stored on distributed object-based storage
    5.
    发明授权
    Orchestrating high availability failover for virtual machines stored on distributed object-based storage 有权
    为存储在基于分布式对象的存储上的虚拟机协调高可用性故障转移

    公开(公告)号:US09495259B2

    公开(公告)日:2016-11-15

    申请号:US14317669

    申请日:2014-06-27

    Applicant: VMware, Inc.

    Abstract: Techniques are disclosed for orchestrating high availability (HA) failover for virtual machines (VMs) running on host systems of a host cluster, where the host cluster aggregates locally-attached storage resources of the host systems to provide an object store, and where persistent data for one or more of the VMs is stored as per-VM storage objects across the locally-attached storage resources comprising the object store. In one embodiment, a host system in the host cluster executing a HA module determines a VM to be restarted on an active host system in the host cluster. The host system further determines if the VM's persistent data is stored in the object store. If so, the host system adds the VM to a list of VMs to be immediately restarted. Otherwise, the host system checks whether the VM is accessible to the host system by querying a storage layer of the host system configured to manage the object store.

    Abstract translation: 披露了用于为在主机集群的主机系统上运行的虚拟机(VM)的高可用性(HA)故障切换进行协调的技术,其中主机集群聚集主机系统的本地连接的存储资源以提供对象存储,以及持久数据 对于一个或多个VM,通过包括对象存储的本地连接的存储资源被存储为每个VM存储对象。 在一个实施例中,执行HA模块的主机集群中的主机系统确定要在主机集群中的主动主机系统上重新启动的VM。 主机系统还确定VM的持久数据是否存储在对象存储中。 如果是这样,主机系统会将VM添加到要立即重新启动的VM列表中。 否则,主机系统通过查询配置为管理对象存储的主机系统的存储层来检查主机系统是否可访问VM。

    Interdependent virtual machine management
    7.
    发明授权
    Interdependent virtual machine management 有权
    相互依赖的虚拟机管理

    公开(公告)号:US09213572B2

    公开(公告)日:2015-12-15

    申请号:US14094344

    申请日:2013-12-02

    Applicant: VMware, Inc.

    CPC classification number: G06F9/45558 G06F9/45533 G06F9/485 G06F2009/45575

    Abstract: Exemplary methods, apparatuses, and systems determine a list of virtual machines to be subject to a corrective action. When one or more of the listed virtual machines have dependencies upon other virtual machines, network connections, or storage devices, the determination of the list includes determining that the dependencies of the one or more virtual machines have been met. An attempt to restart or take another corrective action for the first virtual machine within the list is made. A second virtual machine that is currently deployed and running or powered off or paused in response to the corrective action for the first virtual machine is determined to be dependent upon the first virtual machine. In response to the second virtual machine's dependencies having been met by the attempt to restart or take corrective action for the first virtual machine, the second virtual machine is added to the list of virtual machines.

    Abstract translation: 示例性方法,装置和系统确定要进行纠正动作的虚拟机的列表。 当列出的一个或多个虚拟机具有与其他虚拟机,网络连接或存储设备的依赖关系时,该列表的确定包括确定一个或多个虚拟机的依赖性已被满足。 对列表中的第一个虚拟机尝试重新启动或采取其他纠正措施。 当前部署并运行或关闭或暂停响应于第一虚拟机的纠正措施的第二虚拟机被确定为依赖于第一虚拟机。 响应于通过尝试重新启动或对第一虚拟机采取纠正措施来满足第二虚拟机的依赖性,第二虚拟机被添加到虚拟机列表中。

    Managing Availability of Virtual Machines in Cloud Computing Services
    8.
    发明申请
    Managing Availability of Virtual Machines in Cloud Computing Services 有权
    管理云计算服务中虚拟机的可用性

    公开(公告)号:US20140344805A1

    公开(公告)日:2014-11-20

    申请号:US13896038

    申请日:2013-05-16

    Applicant: VMware, Inc.

    CPC classification number: G06F9/45558 G06F2009/45575

    Abstract: Recovery of virtual machines when one or more hosts fail includes identifying virtual machines running on the remaining functioning hosts. Some of the identified powered on virtual machines are suspended in favor of restarting some of the failed virtual machines from the failed host(s). A subsequent round of identifying virtual machines for suspension and virtual machines for restarting is performed. Virtual machines for suspension and restarting may be identified based on their associated “recovery time objective” (RTO) values or their “maximum number of RTO violations” value.

    Abstract translation: 当一个或多个主机出现故障时恢复虚拟机包括识别在其余功能主机上运行的虚拟机。 一些已识别的带电虚拟机被暂停,有利于从失败的主机重新启动一些故障虚拟机。 执行下一轮识别用于重新启动的悬挂和虚拟机的虚拟机。 可以根据其相关的“恢复时间目标”(RTO)值或其“最大RTO违规数量”值来识别用于暂停和重新启动的虚拟机。

    Policy-based selection and configuration of target site resources for data replication

    公开(公告)号:US11422980B2

    公开(公告)日:2022-08-23

    申请号:US14863314

    申请日:2015-09-23

    Applicant: VMware, Inc.

    Abstract: Exemplary methods, apparatuses, and systems include a target site management server transmitting, to a source site management server, a plurality of protection service plans available for replication of data from the source site to the target site. The transmission of the protection service plans includes a description of one or more service level characteristics provided by each protection service plan and excludes a listing of physical and virtual resources within the target site that are to provide the service level characteristics. The target site management server receives selection of one of the protection service plans and determines the physical resources within the target site to provide the advertised service level characteristics for the data replication. The target site management server further transmits configuration details to one or more of the determined physical resources to implement the replication infrastructure within the target site according to the selected protection service plan.

Patent Agency Ranking