Translating high level requirements policies to distributed configurations

    公开(公告)号:US11018988B2

    公开(公告)日:2021-05-25

    申请号:US14469424

    申请日:2014-08-26

    Applicant: VMware Inc.

    Abstract: Embodiments of the disclosure provide techniques for partitioning a resource object into multiple resource components of a cluster of host computer nodes in a distributed resources system. The distributed resources system translates high-level policy requirements into a resource configuration that the system accommodates. The system determines an allocation based on the policy requirements and identifies resource configurations that are available. Upon selecting a resource configuration, the distributed resources system assigns the allocation and associated values to the selected configuration and publishes the new configuration to other host computer nodes in the cluster.

    RESOURCE FAIRNESS CONTROL IN DISTRIBUTED STORAGE SYSTEMS USING CONGESTION DATA

    公开(公告)号:US20190317665A1

    公开(公告)日:2019-10-17

    申请号:US15954512

    申请日:2018-04-16

    Applicant: VMware, Inc.

    Abstract: Computer system and method for managing storage requests in a distributed storage system uses congestion data related to processing of storage requests for local storage to adaptively adjust a bandwidth limit for a first class of storage requests to be processed. The bandwidth limit is enforced on the storage requests belonging to the first class of storage requests without enforcing any bandwidth limit on the storage requests belonging to a second class of storage requests.

    END-TO-END CHECKSUM IN A MULTI-TENANT ENCRYPTION STORAGE SYSTEM

    公开(公告)号:US20190215152A1

    公开(公告)日:2019-07-11

    申请号:US15866185

    申请日:2018-01-09

    Applicant: VMware, Inc.

    Abstract: A multi-tenant storage system can store clear text data and associated clear text checksum received from a storage tenant using their associated cryptographic key (“cryptokey”). When the clear text data is compressible, cryptographic data (“cryptodata”) is generated from a concatenation of the clear text checksum and compressed clear text data using the cryptokey. A cryptographic checksum (“cryptochecksum”) is generated from the cryptodata. When the clear text data is uncompressible, cryptographic data (“cryptodata”) is generated by encrypting the clear text data using the cryptokey with an extra verification step to make sure the clear text checksum can be rebuilt during the read request. A cryptographic checksum (“cryptochecksum”) is generated from the cryptodata. The cryptodata and associated cryptochecksum are stored in the multi-tenant storage system, so that repairs to damaged cryptodata can be made using the associated cryptochecksum.

    Synchronizing replicas with media errors in distributed storage systems

    公开(公告)号:US10198312B2

    公开(公告)日:2019-02-05

    申请号:US15943293

    申请日:2018-04-02

    Applicant: VMware, Inc.

    Abstract: Systems and techniques are described for transferring data. A described technique includes receiving a request to transmit a data block from a first data storage device to a second data storage device. An attempt to read the data block from the first data storage device is made. A media error resulting from the attempt to read the data block from the first data storage device is detected. In response to detecting the media error, a new data block is generated and includes mismatched checksum data that causes a checksum mismatched error when the new data block is accessed. The new data block is transmitted for storage at the second data storage device in place of the data block.

    Partition tolerance in cluster membership management

    公开(公告)号:US09672115B2

    公开(公告)日:2017-06-06

    申请号:US14209426

    申请日:2014-03-13

    Applicant: VMware, Inc.

    Abstract: Techniques are disclosed for managing a cluster of computing nodes following a division of the cluster into at least a first and second partition, where the cluster aggregates local storage resources of the nodes to provide an object store, and objects stored in the object store are divided into data components stored across the nodes. In accordance with one method, it is determined that a majority of data components comprising a first object are stored within nodes in the first partition. It is determined that a majority of data components comprising a second object are stored within nodes in the second partition. Configuration objects are permitted to be performed on the first object in the first partition while denying access to the first object from the second partition, and on the second object in the second partition while denying access to the second object from the first partition.

    Enhanced locking mechanism for B+ tree data structures

    公开(公告)号:US12130791B2

    公开(公告)日:2024-10-29

    申请号:US18144797

    申请日:2023-05-08

    Applicant: VMware, Inc.

    CPC classification number: G06F16/2246 G06F16/2343 G06F16/24552 G06F16/288

    Abstract: A method for modifying key-value pairs of a B+ tree is provided. The method receives a request to modify a particular key-value pair. Each node of the tree has a modification number. The method traverses a path on the tree from the root node toward the particular node. The traversing includes upon reaching a parent node of the path, acquiring a shared lock on both the parent node and a child node one level below the parent node. Upon determining that the child node is the particular node, the method stores the modification number of the particular node, releases the shared lock on the particular node, compares a current modification number of the node with its stored number, and acquires an exclusive lock on the node if the numbers are the same. The method increments the current modification number of the node and modifies it while in the exclusive lock.

    Storing data in a log-structured format in a two-tier storage system

    公开(公告)号:US11803469B2

    公开(公告)日:2023-10-31

    申请号:US17410673

    申请日:2021-08-24

    Applicant: VMware, Inc.

    CPC classification number: G06F12/0804 G06F12/1009 G06F16/2246 G06F2212/1032

    Abstract: The disclosure herein describes storing data using a capacity data storage tier and a smaller performance data storage tier. The capacity data storage tier includes capacity data storage hardware configured to store log-structured leaf pages (LLPs), and the performance data storage tier includes performance data storage hardware. A virtual address table (VAT) includes a set of virtual address entries referencing the LLPs. A tree-structured index includes index nodes referencing the set of virtual address entries of the VAT. Data to be stored is received, and at least a first portion of metadata associated with the received data is stored in the LLPs using the VAT, and at least a second portion of metadata associated with the received data is stored in the performance data storage tier. The architecture reduces space usage of the performance data storage tier.

    Combining the metadata and data address spaces of a distributed storage object via a composite object configuration tree

    公开(公告)号:US11474719B1

    公开(公告)日:2022-10-18

    申请号:US17320023

    申请日:2021-05-13

    Applicant: VMware, Inc.

    Abstract: Techniques for combining the metadata and data address spaces of a distributed storage object are provided. In one set of embodiments, a distributed storage system can receive a request to provision a storage object. In response, the distributed storage system can create, in accordance with an erasure coding scheme, one or more capacity components for holding data of the storage object; create, in accordance with a mirroring scheme having an equivalent level of fault tolerance as the erasure coding scheme, one or more metadata components for holding metadata of the storage object; and create a composite object configuration tree for the storage object that includes first and second subtrees, where the first subtree comprises an indication of the mirroring scheme and references to the one or more metadata components, and where the second subtree comprises an indication of the erasure coding scheme and references to the one or more capacity components.

Patent Agency Ranking