FRAMEWORK FOR DISTRIBUTED KEY-VALUE STORE IN A WIDE AREA NETWORK

    公开(公告)号:US20180302474A1

    公开(公告)日:2018-10-18

    申请号:US16015820

    申请日:2018-06-22

    Applicant: VMware, Inc.

    Abstract: A distributed key-value storage system may include a master node. The key-value store may be distributed among first and second nodes. The master node may receive a publish request to publish one or more key-value pairs. Each key-value pair may be stored in a retransmit buffer and sent to all the first nodes using a communication protocol of a first kind that does not include a retransmit protocol mechanism. Some of the key-value pairs may be sent to one or more second node using a communication protocol of a second kind that includes a retransmit protocol mechanism.

    SYNCHRONIZING REPLICAS WITH MEDIA ERRORS IN DISTRIBUTED STORAGE SYSTEMS

    公开(公告)号:US20180225171A1

    公开(公告)日:2018-08-09

    申请号:US15943293

    申请日:2018-04-02

    Applicant: VMware, Inc.

    CPC classification number: G06F11/1004 G06F3/0619 G06F3/064 G06F3/0683

    Abstract: Systems and techniques are described for transferring data. A described technique includes receiving a request to transmit a data block from a first data storage device to a second data storage device. An attempt to read the data block from the first data storage device is made. A media error resulting from the attempt to read the data block from the first data storage device is detected. In response to detecting the media error, a new data block is generated and includes mismatched checksum data that causes a checksum mismatched error when the new data block is accessed. The new data block is transmitted for storage at the second data storage device in place of the data block.

    Synchronizing replicas with media errors in distributed storage systems

    公开(公告)号:US09952923B2

    公开(公告)日:2018-04-24

    申请号:US15199128

    申请日:2016-06-30

    Applicant: VMware, Inc.

    CPC classification number: G06F11/1004 G06F3/0619 G06F3/064 G06F3/0683

    Abstract: Systems and techniques are described for transferring data. A described technique includes receiving a request to transmit a data block from a first data storage device to a second data storage device. An attempt to read the data block from the first data storage device is made. A media error resulting from the attempt to read the data block from the first data storage device is detected. In response to detecting the media error, a new data block is generated and includes mismatched checksum data that causes a checksum mismatched error when the new data block is accessed. The new data block is transmitted for storage at the second data storage device in place of the data block.

    Managing extent sharing between snapshots using mapping addresses

    公开(公告)号:US11822804B2

    公开(公告)日:2023-11-21

    申请号:US17492950

    申请日:2021-10-04

    Applicant: VMware, Inc.

    Abstract: The present disclosure is related to methods, systems, and machine-readable media for managing extent sharing between snapshots using mapping addresses. A first mapping address can be assigned to a first extent responsive to a request to write the first extent. A second mapping address can be assigned to a second extent responsive to a request to write the second extent. A snapshot can be created. A snapshot mapping address, that is monotonically increased from the second mapping address, can be assigned to the snapshot. A third mapping address, that is monotonically increased from the second mapping address, can be assigned to a third extent of the snapshot responsive to a request to write the third extent. An ownership status of each of the first extent, the second extent, and the third extent can be determined based respectively on: a comparison between the first mapping address and the snapshot mapping address, a comparison between the second mapping address and the snapshot mapping address, and a comparison between the third mapping address and the snapshot mapping address.

    SYSTEM AND METHOD FOR DELETING PARENT SNAPSHOTS OF RUNNING POINTS OF STORAGE OBJECTS USING EXTENT OWNERSHIP VALUES

    公开(公告)号:US20230169036A1

    公开(公告)日:2023-06-01

    申请号:US17522820

    申请日:2021-11-09

    Applicant: VMware, Inc.

    CPC classification number: G06F16/128

    Abstract: System and method for deleting parent snapshots of running points of storage objects stored in a storage system, in response to a request to delete a parent snapshot of a running point of a storage object stored in the storage system, changes the minimum extent ownership value of the running point to the minimum extent ownership value of the parent snapshot so that any physical extent with an extent ownership value equal to or greater than the changed minimum extent ownership value is deemed to be owned by the running point. For each logical block of the parent snapshot, depending on whether the physical extent corresponding to that logical block is determined to be exclusively accessible to the parent snapshot, the physical extent is removed or no action is taken on the physical extent so that the physical extent is used by the running point.

    System and method for managing B tree node sharing using operation sequence numbers

    公开(公告)号:US11593399B2

    公开(公告)日:2023-02-28

    申请号:US17355125

    申请日:2021-06-22

    Applicant: VMware, Inc.

    Abstract: System and method for managing copy-on-write (COW) B tree structures for metadata of storage objects stored in a storage system determine, when a request to modify a target storage object stored in the storage system that requires a modification of a target leaf node in a B tree structure for metadata of the target storage object is received, whether an operation sequence number of the target leaf node is greater than a snapshot sequence number of a parent snapshot of a running point of the B tree structure. When the operation sequence number is greater than the snapshot sequence number, the target leaf mode is modified in place without copying the target leaf node. When the operation sequence number is not greater than the snapshot sequence number, the target leaf node is copied as a new leaf node for the B tree structure and the new leaf node is modified.

    DETERMINING SHARED NODES BETWEEN SNAPSHOTS USING PROBABILISTIC DATA STRUCTURES

    公开(公告)号:US20230028678A1

    公开(公告)日:2023-01-26

    申请号:US17383087

    申请日:2021-07-22

    Applicant: VMware, Inc.

    Abstract: The present disclosure is related to methods, systems, and machine-readable media for determining shared nodes between snapshots using probabilistic data structures. A unique identifier can be assigned to each node of a first tree data structure corresponding to a first snapshot of a virtual computing instance (VCI). A first probabilistic data structure representing the first tree data structure can be created that includes hashes of the identifiers assigned to the nodes of the first tree data structure. A unique identifier can be assigned to each node of a second tree data structure corresponding to a second snapshot of the VCI. A second probabilistic data structure representing the second tree data structure can be created that includes hashes of the identifiers assigned to the nodes of the second tree data structure. A particular node of the second tree data structure can be determined to be shared by the first tree data structure responsive to a determination that the first probabilistic data structure includes a hash of an identifier assigned to the particular node.

    Identifying a fault domain for a delta component of a distributed data object

    公开(公告)号:US11327841B2

    公开(公告)日:2022-05-10

    申请号:US16875838

    申请日:2020-05-15

    Applicant: VMware, Inc.

    Abstract: The disclosure herein describes placing a delta component of a base component in a target fault domain. A delta component associated with a base component is generated. The generation includes selecting a first fault domain as a target fault domain for the delta component based on the first fault domain including a witness component associated with the distributed data object of the base component. Otherwise, the generation includes selecting a second fault domain as the target fault domain based on the second fault domain including at least one data component that includes a different address space than the base component. Otherwise, the generation includes selecting a third fault domain as the target fault domain based on the third fault domain being unused. Then, the delta component is placed on the target fault domain, whereby data durability of the distributed data object is enhanced, and available fault domains are preserved.

Patent Agency Ranking