Decoupling compute and storage resources in cloud-based HCI (hyper-converged infrastructure)

    Publication No.: US10949238B2

    Publication date: 2021-03-16

    Application No.: US16211047

    Filing date: 2018-12-05

    Applicant: VMware, Inc.

    Abstract: Techniques for decoupling compute and storage resources in a hyper-converged infrastructure (HCI) are provided. In one set of embodiments, a control plane of the HCI deployment can provision a host from a host platform of an infrastructure on which the HCI deployment is implemented and can provision one or more storage volumes from a storage platform of the infrastructure, where the storage platform runs on physical server resources in the infrastructure that are separate from the host platform. The control plane can then cause the one or more storage volumes to be network-attached to the host in a manner that enables a hypervisor of the host to make the one or more storage volumes available, as part of a virtual storage pool, to one or more virtual machines in the HCI deployment for data storage.

    Management of applications across nodes using exo-clones

    Publication No.: US10812582B2

    Publication date: 2020-10-20

    Application No.: US15191228

    Filing date: 2016-06-23

    Applicant: VMware, Inc.

    Abstract: Examples disclosed herein relate to propagating changes made on a file system volume of a primary cluster of nodes to the same file system volume also being managed by a secondary cluster of nodes. An application is executed on both clusters, and data changes on the primary cluster are mirrored to the secondary cluster using an exo-clone file. The exo-clone file includes the differences between two or more snapshots of the volume on the primary cluster, along with identifiers of the change blocks and (optionally) state information thereof. Just these changes, identifiers, and state information are packaged in the exo-clone file and then exported to the secondary cluster, which in turn makes the changes to its version of the volume. Exporting just the changes to the data blocks and the corresponding block identifiers drastically reduces the information needed to be exchanged and processed to keep the two volumes consistent.
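
    Illustrative sketch (not taken from the patent): the snapshot-difference idea in the abstract above can be modeled in a few lines of Python. The sketch assumes a snapshot is a plain mapping from block identifier to block contents; the names ExoClone, make_exo_clone, and apply_exo_clone are hypothetical, and the optional state information mentioned in the abstract is omitted.

```python
from dataclasses import dataclass, field


@dataclass
class ExoClone:
    """Hypothetical container for the differences between two snapshots."""
    changed: dict = field(default_factory=dict)  # block id -> new contents
    removed: set = field(default_factory=set)    # block ids dropped since the base


def make_exo_clone(base, latest):
    """Package only the blocks that differ between two snapshots."""
    changed = {bid: data for bid, data in latest.items() if base.get(bid) != data}
    removed = set(base) - set(latest)
    return ExoClone(changed, removed)


def apply_exo_clone(volume, clone):
    """Replay the packaged changes on the secondary cluster's copy of the volume."""
    for bid in clone.removed:
        volume.pop(bid, None)
    volume.update(clone.changed)


# Usage: the secondary volume starts from the base snapshot and catches up.
snap1 = {0: b"a", 1: b"b", 2: b"c"}
snap2 = {0: b"a", 1: b"B", 3: b"d"}
clone = make_exo_clone(snap1, snap2)
secondary = dict(snap1)
apply_exo_clone(secondary, clone)
```

    Only blocks 1 and 3 (changed or new) and the identifier of block 2 (removed) travel between clusters; block 0 is never re-sent.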

    Distributed transaction log
    Type: Granted patent

    Publication No.: US10769036B2

    Publication date: 2020-09-08

    Application No.: US15810650

    Filing date: 2017-11-13

    Applicant: VMware, Inc.

    Abstract: Embodiments of the disclosure provide techniques for updating a distributed transaction log on a previously offline resource object component using distributed transaction logs from active host computer nodes from separate RAID mirror configurations. Each component object maintains a journal (log) where distributed transactions are recorded. If a component object goes offline and subsequently returns (e.g., if the node hosting the component object reboots), the component object is marked as stale. To return the component object to an active state, a distributed resources module retrieves the journals from other resource component objects from other RAID configurations where the data is mirrored. The module filters corresponding data that is missing in the journal of the previously offline corresponding object and merges the filtered data to the journal.
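
    Illustrative sketch (not taken from the patent): the filter-and-merge step described above might look as follows, assuming each journal is a list of (transaction id, payload) pairs and that transaction ids are unique and increasing. The function name catch_up_journal is hypothetical.

```python
def catch_up_journal(stale_journal, mirror_journals):
    """Return the stale journal extended with entries it missed while offline.

    Assumption: entries are (txn_id, payload) tuples and a txn_id identifies
    the same transaction in every mirror.
    """
    seen = {txn_id for txn_id, _ in stale_journal}
    missing = {}
    for journal in mirror_journals:
        for txn_id, payload in journal:
            # Filter: keep only transactions absent from the stale journal.
            if txn_id not in seen:
                missing.setdefault(txn_id, payload)
    # Merge: combine and restore transaction order.
    merged = list(stale_journal) + list(missing.items())
    merged.sort(key=lambda entry: entry[0])
    return merged


stale = [(1, "a"), (2, "b")]
mirrors = [
    [(1, "a"), (2, "b"), (3, "c")],
    [(2, "b"), (3, "c"), (4, "d")],
]
recovered = catch_up_journal(stale, mirrors)
```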

    Flexible efficient runtime placement of data across multiple disks

    Publication No.: US10268408B2

    Publication date: 2019-04-23

    Application No.: US15853110

    Filing date: 2017-12-22

    Applicant: VMware, Inc.

    Abstract: In a storage cluster having nodes, blocks of a logical storage space of a storage object are allocated flexibly by a parent node to component nodes that are backed by physical storage. The method includes maintaining a first allocation map for the parent node, and second and third allocation maps for the first and second component nodes, respectively, executing a first write operation on the first component node and updating the second allocation map to indicate that the first block is a written block, selecting the second component node for executing a second write operation, and executing the second write operation on the second component node. Upon execution of the second write operation, the third allocation map is updated to indicate that the second block is a written block and the first allocation map is updated to indicate that the second block is allocated to the second component node.
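
    Illustrative sketch (not taken from the patent): the three allocation maps in the abstract can be modeled as one parent map (block to owning component) plus one written-block map per component. The class names and the "most free capacity" selection policy are assumptions made for this example; the abstract does not say how the second component node is selected.

```python
class ComponentNode:
    """Component backed by physical storage, with its own allocation map."""

    def __init__(self, name, capacity):
        self.name = name
        self.capacity = capacity
        self.written = {}  # component allocation map: logical block -> data

    def free(self):
        return self.capacity - len(self.written)

    def write(self, block, data):
        self.written[block] = data  # marks the block as written


class ParentNode:
    """Parent that places each logical block on a component at write time."""

    def __init__(self, components):
        self.components = components
        self.owner = {}  # parent allocation map: logical block -> component

    def write(self, block, data):
        node = self.owner.get(block)
        if node is None:
            # Assumed policy: pick the component with the most free capacity.
            node = max(self.components, key=lambda c: c.free())
            if node.free() == 0:
                raise RuntimeError("no component has free capacity")
            self.owner[block] = node
        node.write(block, data)
        return node.name


c1, c2 = ComponentNode("c1", 2), ComponentNode("c2", 2)
parent = ParentNode([c1, c2])
first = parent.write(0, b"x")    # first write lands on the first component
second = parent.write(1, b"y")   # second write is steered to the second component
rewrite = parent.write(0, b"z")  # an already allocated block stays where it is
```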

    Maintaining high availability during network partitions for virtual machines stored on distributed object-based storage
    Type: Granted patent

    Legal status: In force

    Publication No.: US09513946B2

    Publication date: 2016-12-06

    Application No.: US14317712

    Filing date: 2014-06-27

    Applicant: VMware, Inc.

    CPC classification number: G06F9/45558 G06F9/542 G06F2009/4557

    Abstract: Techniques are disclosed for maintaining high availability (HA) for virtual machines (VMs) running on host systems of a host cluster, where each host system executes a HA module in a plurality of HA modules and a storage module in a plurality of storage modules, where the host cluster aggregates, via the plurality of storage modules, locally-attached storage resources of the host systems to provide an object store, where persistent data for the VMs is stored as per-VM storage objects across the locally-attached storage resources comprising the object store, and where a failure causes the plurality of storage modules to observe a network partition in the host cluster that the plurality of HA modules do not. In one embodiment, a host system in the host cluster executing a first HA module invokes an API exposed by the plurality of storage modules for persisting metadata for a VM to the object store. If the API is not processed successfully, the host system: (1) identifies a subset of second HA modules in the plurality of HA modules; (2) issues an accessibility query for the VM to the subset of second HA modules in parallel, the accessibility query being configured to determine whether the VM is accessible to the respective host systems of the subset of second HA modules; and (3) if at least one second HA module in the subset indicates that the VM is accessible to its respective host system, transmits a command to the at least one second HA module to invoke the API on its respective host system.
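
    Illustrative sketch (not taken from the patent): steps (1) to (3) above can be approximated with a thread pool that queries peer HA modules in parallel. HAModule, can_access, persist_metadata, and persist_with_fallback are hypothetical names; a real HA module would use network RPCs rather than in-process calls, and this sketch simply delegates to the first peer that reports access.

```python
from concurrent.futures import ThreadPoolExecutor


class HAModule:
    """Hypothetical stand-in for an HA module on one host."""

    def __init__(self, name, accessible_vms):
        self.name = name
        self.accessible_vms = set(accessible_vms)
        self.persisted = {}

    def can_access(self, vm):
        return vm in self.accessible_vms

    def persist_metadata(self, vm, metadata):
        # Models the storage API failing when the VM's object is unreachable.
        if not self.can_access(vm):
            raise OSError(f"{self.name}: storage object for {vm} unreachable")
        self.persisted[vm] = metadata


def persist_with_fallback(local, peers, vm, metadata):
    """Try locally; on failure, ask peers in parallel and delegate to one."""
    try:
        local.persist_metadata(vm, metadata)
        return local.name
    except OSError:
        pass
    if not peers:
        return None
    with ThreadPoolExecutor(max_workers=len(peers)) as pool:
        answers = list(pool.map(lambda peer: peer.can_access(vm), peers))
    for peer, accessible in zip(peers, answers):
        if accessible:
            peer.persist_metadata(vm, metadata)
            return peer.name
    return None


local = HAModule("host-a", [])
peers = [HAModule("host-b", []), HAModule("host-c", ["vm-1"])]
handled_by = persist_with_fallback(local, peers, "vm-1", {"protected": True})
```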


    Persisting High Availability Protection State for Virtual Machines Stored on Distributed Object-Based Storage
    Type: Patent application

    Legal status: In force

    Publication No.: US20150378857A1

    Publication date: 2015-12-31

    Application No.: US14317637

    Filing date: 2014-06-27

    Applicant: VMware, Inc.

    Abstract: Techniques are disclosed for persisting high availability (HA) protection state for virtual machines (VMs) running on host systems of a host cluster, where the host cluster aggregates locally-attached storage resources of the host systems to provide an object store, and where persistent data for the VMs is stored as per-VM storage objects across the locally-attached storage resources comprising the object store. In one embodiment, a host system in the host cluster executing a HA module determines an identity of a VM that has been powered-on in the host cluster. The host system then persists HA protection state for the VM in a storage object of the VM, where the HA protection state indicates that the VM should be restarted on an active host system in the case of a failure in the host cluster.


    Load balancing of resources
    Type: Granted patent

    Publication No.: US11704166B2

    Publication date: 2023-07-18

    Application No.: US17321299

    Filing date: 2021-05-14

    Applicant: VMware, Inc.

    CPC classification number: G06F9/5083 G06F9/5016

    Abstract: Embodiments presented herein provide techniques for balancing a multidimensional set of resources of different types within a distributed resources system. Each host computer providing the resources publishes a status on current resource usage by guest clients. Upon identifying a local imbalance, the host computer determines a source workload to migrate to or from the resources container to minimize the variance in resource usage. Additionally, when placing a new resource workload, the host computer selects a resources container that minimizes the variance to further balance resource usage.
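
    Illustrative sketch (not taken from the patent): variance-minimizing placement can be shown with the standard library. The sketch assumes each host publishes a tuple of normalized usage figures, one per resource type, and scores a candidate placement by summing the per-dimension population variance across hosts; that scoring rule and the names total_variance and place_workload are assumptions.

```python
from statistics import pvariance


def total_variance(usage):
    """Sum, over resource dimensions, of the variance of usage across hosts."""
    dims = len(next(iter(usage.values())))
    return sum(pvariance([u[d] for u in usage.values()]) for d in range(dims))


def place_workload(usage, demand):
    """Return the host whose selection leaves usage most evenly spread."""
    best_host, best_score = None, None
    for host in usage:
        trial = dict(usage)
        trial[host] = tuple(a + b for a, b in zip(usage[host], demand))
        score = total_variance(trial)
        if best_score is None or score < best_score:
            best_host, best_score = host, score
    return best_host


# Usage per host as (cpu, memory) fractions; the new workload needs (0.2, 0.1).
published = {"h1": (0.8, 0.5), "h2": (0.2, 0.4), "h3": (0.5, 0.5)}
chosen = place_workload(published, (0.2, 0.1))
```

    The same scoring function could be reused to pick a workload to migrate away from an overloaded host, by evaluating each candidate move and keeping the one with the lowest resulting score.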
