Snapshots and clones in a block-based data deduplication storage system

    Publication number: US10031672B2

    Publication date: 2018-07-24

    Application number: US14726572

    Filing date: 2015-05-31

    Applicant: VMware, Inc.

    Abstract: A deduplication storage system with snapshot and clone capability includes storing logical pointer objects and organizing a first set of the logical pointer objects into a hierarchical structure. A second set of the logical pointer objects may be associated with corresponding logical data blocks of a client data object. The second set of the logical pointer objects may point to physical data blocks having deduplicated data that comprise data of the corresponding logical data blocks. Some of the logical pointer objects in the first set may point to the logical pointer objects in the second set, so that the hierarchical structure represents the client data object. A root of the hierarchical structure may be associated with the client data object. A snapshot or clone may be created by making a copy of the root and associating the copied root with the snapshot or clone.
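
The root-copy idea in this abstract can be illustrated with a minimal Python sketch. It assumes a two-level hierarchy (one root pointing directly at leaf pointer objects) and SHA-256 content hashing for deduplication; the names `Node`, `DedupStore`, `snapshot`, and `write` are hypothetical and do not come from the patent.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Node:
    """A logical pointer object: the root points to leaves, a leaf to one physical block."""
    children: List["Node"] = field(default_factory=list)
    block_hash: Optional[str] = None  # set on leaves only


class DedupStore:
    def __init__(self) -> None:
        self.blocks: Dict[str, bytes] = {}  # content hash -> physical block (deduplicated)
        self.roots: Dict[str, Node] = {}    # object or snapshot name -> root node

    def put_block(self, data: bytes) -> str:
        # Identical content hashes to the same key, so it is stored only once.
        digest = hashlib.sha256(data).hexdigest()
        self.blocks.setdefault(digest, data)
        return digest

    def create(self, name: str, chunks: List[bytes]) -> None:
        leaves = [Node(block_hash=self.put_block(c)) for c in chunks]
        self.roots[name] = Node(children=leaves)

    def snapshot(self, src: str, dst: str) -> None:
        # Copy only the root; the leaf pointer objects are shared with the source.
        self.roots[dst] = Node(children=list(self.roots[src].children))

    def write(self, name: str, index: int, data: bytes) -> None:
        # Replace the pointer in this root only; other roots keep the old leaf.
        self.roots[name].children[index] = Node(block_hash=self.put_block(data))

    def read(self, name: str) -> bytes:
        return b"".join(self.blocks[leaf.block_hash] for leaf in self.roots[name].children)
```

Because a write swaps a pointer in one root's own child list, a snapshot taken earlier continues to see the old block. A deeper hierarchy would need to copy each node on the path from the root to the modified leaf, which this sketch does not attempt.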

    Scalable distributed storage architecture

    Publication number: US09811531B2

    Publication date: 2017-11-07

    Application number: US14010293

    Filing date: 2013-08-26

    Applicant: VMware, Inc.

    CPC classification number: G06F17/30194 G06F17/30233 G06F17/30283

    Abstract: Techniques are disclosed for providing a file system interface for an object store intended to support simultaneous access to objects stored in the object store by multiple clients. In accordance with one method, an abstraction of a root directory to a hierarchical namespace for the object store is exposed to clients. The object store is backed by a plurality of physical storage devices housed in or directly attached to the plurality of host computers and internally tracks its stored objects using a flat namespace that maps unique identifiers to the stored objects. The creation of top-level objects appearing as subdirectories of the root directory is enabled, wherein each top-level object represents a separate abstraction of a storage device having a separate namespace that can be organized in accordance with any designated file system.
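
As a rough illustration of the layering described in this abstract, the Python sketch below puts a root directory of named top-level objects in front of a flat store keyed by unique identifiers. Everything here is an assumption made for illustration: the class names, the use of UUIDs as identifiers, and the dictionary that stands in for each top-level object's separate namespace.

```python
import uuid
from typing import Any, Dict, List


class ObjectStore:
    """Flat namespace: unique identifier -> stored object."""

    def __init__(self) -> None:
        self._objects: Dict[str, Any] = {}

    def create(self, obj: Any) -> str:
        oid = str(uuid.uuid4())
        self._objects[oid] = obj
        return oid

    def get(self, oid: str) -> Any:
        return self._objects[oid]


class RootDirectory:
    """Hierarchical view: each subdirectory name maps to one top-level object."""

    def __init__(self, store: ObjectStore) -> None:
        self._store = store
        self._entries: Dict[str, str] = {}  # subdirectory name -> object identifier

    def mkdir(self, name: str, fs_type: str) -> str:
        if name in self._entries:
            raise FileExistsError(name)
        # Each top-level object carries its own namespace and file system type.
        volume = {"fs_type": fs_type, "namespace": {}}
        self._entries[name] = self._store.create(volume)
        return self._entries[name]

    def listdir(self) -> List[str]:
        return sorted(self._entries)

    def open(self, name: str) -> Any:
        return self._store.get(self._entries[name])
```

Clients see only names under the root, while the store itself never deals with paths. A file created inside one top-level object is invisible from another, because each holds its own namespace.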

    Persisting high availability protection state for virtual machines stored on distributed object-based storage

    Publication number: US09367414B2

    Publication date: 2016-06-14

    Application number: US14317637

    Filing date: 2014-06-27

    Applicant: VMware, Inc.

    Abstract: Techniques are disclosed for persisting high availability (HA) protection state for virtual machines (VMs) running on host systems of a host cluster, where the host cluster aggregates locally-attached storage resources of the host systems to provide an object store, and where persistent data for the VMs is stored as per-VM storage objects across the locally-attached storage resources comprising the object store. In one embodiment, a host system in the host cluster executing an HA module determines an identity of a VM that has been powered on in the host cluster. The host system then persists HA protection state for the VM in a storage object of the VM, where the HA protection state indicates that the VM should be restarted on an active host system in the case of a failure in the host cluster.
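
The flow in this abstract (record protection state in the VM's own storage object at power-on, then consult it after a failure) might be sketched as follows. The dictionary standing in for the object store, the `ha_protected` key, and both function names are hypothetical. The abstract does not say how protected VMs are later discovered, so the scan in `vms_to_restart` is an assumption.

```python
from typing import Dict, Iterable, List

# Hypothetical stand-in for the object store: VM id -> that VM's storage object.
ObjectStore = Dict[str, dict]


def persist_protection(store: ObjectStore, vm_id: str) -> None:
    """Called when a VM is powered on: record HA protection in its storage object."""
    store.setdefault(vm_id, {})["ha_protected"] = True


def vms_to_restart(store: ObjectStore, running: Iterable[str]) -> List[str]:
    """After a failure: protected VMs that are no longer running anywhere."""
    still_running = set(running)
    return sorted(
        vm_id
        for vm_id, obj in store.items()
        if obj.get("ha_protected") and vm_id not in still_running
    )
```

Keeping the flag in the per-VM storage object means it travels with the VM's data rather than living only in the memory of one host.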

    Writing data to an LSM tree file structure using consistent cache staging

    Publication number: US11620261B2

    Publication date: 2023-04-04

    Application number: US16213815

    Filing date: 2018-12-07

    Applicant: VMware, Inc.

    Abstract: The disclosure herein describes writing data to a log-structured merge (LSM) tree file system on an object storage platform. Write data instructions indicating data for writing to the LSM tree file system are received. Based on the received instructions, the data is written to a first data cache configured as a live data cache. Based on an instruction to transfer data in the live data cache to the LSM tree file system, the first data cache is converted to a stable cache. A second data cache configured as a live data cache is then generated based on cloning the first data cache. The data in the first data cache is then written to the LSM tree file system. Use of a stable cache and a cloned live data cache allows the stable cache to be written to the file system in parallel with the live data cache handling new write data instructions.
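
The staging sequence in this abstract can be walked through with a small Python sketch. It runs sequentially for simplicity, whereas the abstract describes the flush and new writes proceeding in parallel. The class `StagedWriter`, its method names, and the list that stands in for the LSM tree file system are all hypothetical.

```python
import copy
from typing import Dict, List, Optional


class StagedWriter:
    def __init__(self) -> None:
        self.live: Dict[str, bytes] = {}                  # accepts incoming writes
        self.stable: Optional[Dict[str, bytes]] = None    # frozen cache being flushed
        self.lsm: List[Dict[str, bytes]] = []             # stand-in for the LSM tree file system

    def write(self, key: str, value: bytes) -> None:
        # Writes always go to the live cache, even while a flush is in progress.
        self.live[key] = value

    def begin_flush(self) -> None:
        # The first (live) cache becomes the stable cache; a new live cache
        # is generated by cloning it.
        self.stable = self.live
        self.live = copy.deepcopy(self.stable)

    def finish_flush(self) -> None:
        # Write the stable cache's contents to the file system, then release it.
        if self.stable is None:
            raise RuntimeError("no flush in progress")
        self.lsm.append(dict(self.stable))
        self.stable = None
```

Because the stable cache is never modified after `begin_flush`, what reaches the file system is a consistent point-in-time view. The sketch does not track which entries in the cloned live cache were already flushed, which a real system would need in order to avoid rewriting them.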

    Efficient garbage collection of variable size chunking deduplication

    Publication number: US11461229B2

    Publication date: 2022-10-04

    Application number: US16552954

    Filing date: 2019-08-27

    Applicant: VMware, Inc.

    Abstract: The present disclosure provides techniques for deallocating previously allocated storage blocks. The techniques include obtaining a list of chunk IDs to analyze, choosing a chunk ID, and determining the storage blocks spanned by the chunk corresponding to the chosen chunk ID. The techniques further include determining whether any file references any storage blocks spanned by the chunk. The determining may be performed by comparing an internal reference count to a total reference count, where the internal reference count is the number of references to the storage block by a chunk ID data structure. If no files reference any of the storage blocks spanned by the chunk, then all the storage blocks of the chunk can be deallocated.
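
The reference-count comparison in this abstract can be expressed as a short Python sketch. It assumes that each block records both counts directly and that chunks do not share blocks; the names `BlockRefs` and `collect` and the shape of the inputs are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Set


@dataclass
class BlockRefs:
    total: int     # all references to the storage block
    internal: int  # references held by the chunk ID data structure itself


def collect(
    chunk_ids: Iterable[str],
    chunk_blocks: Dict[str, List[int]],
    refs: Dict[int, BlockRefs],
) -> Set[int]:
    """Return the storage blocks that can be deallocated."""
    freed: Set[int] = set()
    for chunk_id in chunk_ids:
        blocks = chunk_blocks[chunk_id]
        # If total == internal for a block, every reference to it is internal,
        # so no file references it. The chunk is freed only if that holds
        # for all of the blocks it spans.
        if all(refs[b].total == refs[b].internal for b in blocks):
            freed.update(blocks)
    return freed
```

A chunk with even one block whose total count exceeds its internal count is left alone, since some file still references that block.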

    Scalable distributed storage architecture

    Publication number: US11249956B2

    Publication date: 2022-02-15

    Application number: US16781673

    Filing date: 2020-02-04

    Applicant: VMware, Inc.

    Abstract: Techniques are disclosed for providing a file system interface for an object store intended to support simultaneous access to objects stored in the object store by multiple clients. In accordance with one method, an abstraction of a root directory to a hierarchical namespace for the object store is exposed to clients. The object store is backed by a plurality of physical storage devices housed in or directly attached to the plurality of host computers and internally tracks its stored objects using a flat namespace that maps unique identifiers to the stored objects. The creation of top-level objects appearing as subdirectories of the root directory is enabled, wherein each top-level object represents a separate abstraction of a storage device having a separate namespace that can be organized in accordance with any designated file system.

    Decoupling Compute and Storage Resources in Cloud-Based HCI (Hyper-Converged Infrastructure)

    Publication number: US20210173695A1

    Publication date: 2021-06-10

    Application number: US17180640

    Filing date: 2021-02-19

    Applicant: VMware, Inc.

    Abstract: Techniques for decoupling compute and storage resources in a hyper-converged infrastructure (HCI) are provided. In one set of embodiments, a control plane of the HCI deployment can provision a host from a host platform of an infrastructure on which the HCI deployment is implemented and can provision one or more storage volumes from a storage platform of the infrastructure, where the storage platform runs on physical server resources in the infrastructure that are separate from the host platform. The control plane can then cause the one or more storage volumes to be network-attached to the host in a manner that enables a hypervisor of the host to make the one or more storage volumes available, as part of a virtual storage pool, to one or more virtual machines in the HCI deployment for data storage.

    Translating high level requirements policies to distributed configurations

    Publication number: US11018988B2

    Publication date: 2021-05-25

    Application number: US14469424

    Filing date: 2014-08-26

    Applicant: VMware, Inc.

    Abstract: Embodiments of the disclosure provide techniques for partitioning a resource object into multiple resource components of a cluster of host computer nodes in a distributed resources system. The distributed resources system translates high-level policy requirements into a resource configuration that the system accommodates. The system determines an allocation based on the policy requirements and identifies resource configurations that are available. Upon selecting a resource configuration, the distributed resources system assigns the allocation and associated values to the selected configuration and publishes the new configuration to other host computer nodes in the cluster.
