Distributed, scalable key-value store

    公开(公告)号:US10891264B2

    公开(公告)日:2021-01-12

    申请号:US14700615

    申请日:2015-04-30

    Applicant: VMware, Inc.

    Abstract: Techniques for implementing a distributed, scalable key-value store (DSKVS) across a plurality of nodes are provided. In one embodiment, each node in the plurality of nodes can store: (1) a hash table in a nonvolatile storage of the node, where the hash table is configured to hold a partition of a total set of key-value data maintained by the DSKVS; (2) a logical log in the nonvolatile storage, where the logical log is configured to hold transaction log records corresponding to key-value update operations performed on the node; and (3) a cache in a volatile memory of the node, where the cache is configured to hold key-value data that has been recently updated on the node via one or more of the key-value update operations.

    Method of rebuilding real world storage environment

    公开(公告)号:US10789139B2

    公开(公告)日:2020-09-29

    申请号:US15952229

    申请日:2018-04-12

    Applicant: VMware, Inc.

    Abstract: A method for replicating a first virtual storage system of a customer includes receiving periodically collected configuration data, workload data, service failure data, and management workflow data on the first virtual storage system, creating a first multi-dimensional array of observed variables based on periodically collected data, applying dimensionality reduction to the first multi-dimensional array to determine an artificial variable having a largest variance, determining a smaller, second multi-dimensional array that represents the first multi-dimensional array based on the artificial variable, and building a second virtual storage system to replicate the first virtual storage system based on the second multi-dimensional array.

    System and methods of zero-copy data path among user level processes

    公开(公告)号:US10747594B1

    公开(公告)日:2020-08-18

    申请号:US16256713

    申请日:2019-01-24

    Applicant: VMware, Inc.

    Abstract: The disclosure provides an approach for performing an operation by a first process on behalf of a second process, the method comprising: obtaining, by the first process, a memory handle from the second process, wherein the memory handle allows access, by the first process, to at least some of the address space of the second process; dividing the address space of the memory handle into a plurality of sections; receiving, by the first process, a request from the second process to perform an operation; determining, by the first process, a section of the plurality of sections that is to be mapped from the address space of the memory handle to the address space of the first process for the performance of the operation by the first process; mapping the section from the address space of the memory handle to the address space of the first process; and performing the operation by the first process on behalf of the second process.

    ISOLATION OF CONCURRENT READ AND WRITE TRANSACTIONS ON THE SAME FILE

    公开(公告)号:US20200183890A1

    公开(公告)日:2020-06-11

    申请号:US16213561

    申请日:2018-12-07

    Applicant: VMware, Inc.

    Abstract: The disclosure provides for isolation of concurrent read and write transactions on the same file, thereby enabling higher file system throughput relative to serial-only transactions. Race conditions and lock contentions in multi-writer scenarios are avoided in file stat (metadata) updates by the use of an aggregator to merge updates of committed transactions to maintain file stat truth, and an upgrade lock that enforces atomicity of file stat access, even while still permitting multiple processes to concurrently read from and/or write to the file data. The disclosure is applicable to generic file systems, whether native or virtualized, and may be used, for example, to speed access to database files that require prolonged input/output (I/O) transaction time periods.

    Multi-level access control for distributed storage systems

    公开(公告)号:US10678932B2

    公开(公告)日:2020-06-09

    申请号:US16153633

    申请日:2018-10-05

    Applicant: VMware, Inc.

    Abstract: System and method for accessing a distributed storage system uses a storage-level access control process at a distributed file system that interfaces with the distributed storage system to determine whether a particular client has access to a particular first file system object using an identifier of the particular client and storage-level access control rules in response to a file system request from the particular client to access a second file system object in the particular first file system. The storage-level access control rules are defined for a plurality of clients and a plurality of first file system objects of the distributed storage system to allow the particular client access to the second file system object in the particular first file system object only if the particular client has been determined to have access to the particular first file system object according to the storage-level access control rules.

    Write-optimized nested trees
    116.
    发明授权

    公开(公告)号:US10649959B2

    公开(公告)日:2020-05-12

    申请号:US15717613

    申请日:2017-09-27

    Applicant: VMware, Inc.

    Abstract: A Bε-tree associated with a file system on a storage volume includes a hierarchy of nodes. Each node includes a buffer portion that can be characterized by a fixed maximum allowable size to store key-value pairs as messages in the buffer. Messages can be initially buffered in the root node of the Bε-tree, and flushed to descendent children from the root node. Messages stored in the buffers can be indexed using a B+-tree data structure. As the B+-tree data structure in a buffer grows (due to receiving flushed messages) and shrinks (due to messages being flushed), disk blocks can be allocated from the storage volume to increase the actual size of the buffer and deallocated from the buffer to reduce the actual size of the buffer.

    End-to-end checksum in a multi-tenant encryption storage system

    公开(公告)号:US10581602B2

    公开(公告)日:2020-03-03

    申请号:US15866185

    申请日:2018-01-09

    Applicant: VMware, Inc.

    Abstract: A multi-tenant storage system can store clear text data and associated clear text checksum received from a storage tenant using their associated cryptographic key (“cryptokey”). When the clear text data is compressible, cryptographic data (“cryptodata”) is generated from a concatenation of the clear text checksum and compressed clear text data using the cryptokey. A cryptographic checksum (“cryptochecksum”) is generated from the cryptodata. When the clear text data is uncompressible, cryptographic data (“cryptodata”) is generated by encrypting the clear text data using the cryptokey with an extra verification step to make sure the clear text checksum can be rebuilt during the read request. A cryptographic checksum (“cryptochecksum”) is generated from the cryptodata. The cryptodata and associated cryptochecksum are stored in the multi-tenant storage system, so that repairs to damaged cryptodata can be made using the associated cryptochecksum.

    Consistent snapshots and clones in an asymmetric virtual distributed file system

    公开(公告)号:US10515192B2

    公开(公告)日:2019-12-24

    申请号:US15013877

    申请日:2016-02-02

    Applicant: VMware, Inc.

    Abstract: Processing in an asymmetrically distributed file system may include storing first data representative of the content of the files in a file system volume among a plurality of storage nodes. Second data representative of attributes of the files in the file system volume may be stored in only one of the storage nodes. Time-limited leases allow clients direct access to the plurality of storage node in order to access portions of the file system volume. The time-limited leases may be provided to client lessors. Snapshots of the file system volume may be generated after sending a revocation to the client lessors to revoke time-limited leases provided to the client lessors and having received the acknowledgements of the revocations or after the leases have expired for non-responding lessors, to ensure that changes are not made to the file system volume during snapshot processing.

    Exclusive session mode resilient to failure

    公开(公告)号:US10419498B2

    公开(公告)日:2019-09-17

    申请号:US14956284

    申请日:2015-12-01

    Applicant: VMware, Inc.

    Abstract: Examples perform input/output (I/O) requests, issued by a plurality of clients to an owner-node, in a virtual storage area network (vSAN) environment. I/O requests are guaranteed, as all I/O requests are performed during non-overlapping, exclusive sessions between one client at a time and the owner node. The owner node rejects requests for simultaneous sessions, and duplicate sessions are prevented by requiring that a client refresh its memory state after termination of a previous session.

Patent Agency Ranking