Distributed storage system and method for managing storage access bandwidth for multiple clients

    公开(公告)号:US10810143B2

    公开(公告)日:2020-10-20

    申请号:US15944743

    申请日:2018-04-03

    Applicant: VMware, Inc.

    Abstract: System and method for managing storage requests issued from multiple sources in a distributed storage system utilizes different queues at a host computer in the distributed storage system to place different classes of storage requests for access to a virtual storage area network. The storage requests in the queues are processed using a fair scheduling algorithm. For each queue, when the storage requests in the queue exceeds a threshold, a backpressure signal is generated and transmitted to at least one source for a class of storage requests queued in one of the queues corresponding to that backpressure signal to delay issuance of new storage requests of that class of storage requests.

    TIME-BASED CONGESTION DISCOUNTING FOR I/O FAIRNESS CONTROL

    公开(公告)号:US20190312925A1

    公开(公告)日:2019-10-10

    申请号:US15947313

    申请日:2018-04-06

    Applicant: VMware, Inc.

    Abstract: Computer system and method for managing storage requests in a distributed storage system uses congestion signals associated with storage requests, which are generated based on congestion at local storage of the computer system that supports a virtual storage area network. The storage requests are differentiated between a first class of storage requests and at least one other class of storage requests. For a storage request of the first class of storage requests, an actual ratio of a current average bandwidth of the first class of storage requests to a current average bandwidth of a second class of storage requests is calculated and compared with an expected ratio. The congestion signal associated with the storage request is then adjusted and transmitted to at least one source of storage requests for storage request fairness control.

    Techniques for reducing data log recovery time and metadata write amplification

    公开(公告)号:US11625370B2

    公开(公告)日:2023-04-11

    申请号:US16842657

    申请日:2020-04-07

    Applicant: VMware, Inc.

    Abstract: Techniques for reducing data log recovery time and metadata write amplification when checkpointing a data log of a storage object in a distributed storage system are provided. In one set of embodiments, a node of the system can determine whether the data log has reached a first threshold size, where the data log comprises a plurality of data log records, and where each data log record includes data and metadata for a write request directed to the storage object. If the data log has reached the first threshold size, the node can copy, from each of the plurality of data log records, the metadata for the write request to a corresponding metadata log entry in a metadata log of the storage object. The node can then truncate the data log by removing the plurality of data log records.

    Efficient erasure-coded storage in distributed data systems

    公开(公告)号:US11507544B2

    公开(公告)日:2022-11-22

    申请号:US16894663

    申请日:2020-06-05

    Applicant: VMware, Inc.

    Abstract: Techniques for efficiently storing client data blocks on a distributed-computing system are provided. The system includes a fast performance tier and a large capacity tier. The capacity tier stores the client data blocks in erasure encoded data stripes. The performance tier stores logical map data including an address map indicating a correspondence between logical addresses associated with a first layer of the system and physical addresses associated with a second layer. A method includes receiving a request to include additional client data blocks in the client blocks. The request indicates logical addresses for additional blocks. Corresponding physical addresses for additional block are determined. Each additional block is stored at the physical address. Additional logical map data is stored in the performance tier. Storing the additional logical map data includes updating the address map to indicate the correspondence between the logical addresses and the physical addresses for the additional blocks.

    Systems and methods of maintaining fault tolerance for new writes in degraded erasure coded distributed storage

    公开(公告)号:US11494090B2

    公开(公告)日:2022-11-08

    申请号:US17033610

    申请日:2020-09-25

    Applicant: VMware Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for maintaining fault tolerance for new writes in a storage system when one or more components of the storage system are unavailable. One of the methods includes determining that one or more first disks of a capacity object of a storage system are unavailable, wherein the storage system comprises a segment usage table identifying the plurality of segments of the capacity object; in response: identifying a plurality of available second disks, adding a plurality of new segments corresponding to the second disks to the capacity object, and adding data identifying the plurality of new segments to the segment usage table; and for each of one or more new write requests to the capacity object: identifying an available segment from the plurality of new segments, and writing data associated with the new write request to the identified available segment.

    Data encryption in a two-tier storage system

    公开(公告)号:US11379383B2

    公开(公告)日:2022-07-05

    申请号:US17002649

    申请日:2020-08-25

    Applicant: VMware, Inc.

    Abstract: A method for encrypting data blocks is provided. The method receives a plurality of data blocks and encrypts each data block using an LBA of the data block as a tweak. The method writes the plurality of encrypted data blocks to physical blocks of the plurality of physical disks. The method then performs deduplication on the physical disks by determining that first and second physical blocks in the physical disks are duplicates, decrypting encrypted data in the first physical block using a first LBA associated with the first physical block as the tweak, and re-encrypting decrypted data in the first physical block using a PBA associated with the first physical block as the tweak. When reading the data back, either the LBA or PBA is used as the tweak, depending on whether the data was encrypted using LBA or re-encrypted using PBA during the deduplication process.

    Intelligently scheduling resynchronization jobs in a distributed object-based storage system

    公开(公告)号:US11023493B2

    公开(公告)日:2021-06-01

    申请号:US16182448

    申请日:2018-11-06

    Applicant: VMware, Inc.

    Abstract: Techniques for intelligently scheduling resynchronization jobs in a distributed object-based storage system are provided. In one set of embodiments, a storage node of the system can create a resynchronization job for a component of an object maintained by the system, where the resynchronization job defines one or more input/output (I/O) operations to be carried out with respect to the component. If a number of currently running resynchronization jobs on the storage node has reached a threshold, the storage node can further determine a priority level associated with the object; add the resynchronization job to an object queue for the object; and if the added resynchronization job is a first job in the object queue, add the object queue as a new queue entry to a global priority queue corresponding to the priority level associated with the object.

Patent Agency Ranking