DATA ENCRYPTION IN A TWO-TIER STORAGE SYSTEM

    公开(公告)号:US20220066950A1

    公开(公告)日:2022-03-03

    申请号:US17002649

    申请日:2020-08-25

    Applicant: VMware, Inc.

    Abstract: A method for encrypting data blocks is provided. The method receives a plurality of data blocks and encrypts each data block using an LBA of the data block as a tweak. The method writes the plurality of encrypted data blocks to physical blocks of the plurality of physical disks. The method then performs deduplication on the physical disks by determining that first and second physical blocks in the physical disks are duplicates, decrypting encrypted data in the first physical block using a first LBA associated with the first physical block as the tweak, and re-encrypting decrypted data in the first physical block using a PBA associated with the first physical block as the tweak. When reading the data back, either the LBA or PBA is used as the tweak, depending on whether the data was encrypted using LBA or re-encrypted using PBA during the deduplication process.

    SYSTEM AND METHOD OF RESYNCING N-WAY MIRRORED METADATA ON DISTRIBUTED STORAGE SYSTEMS WITHOUT REQUIRING CHECKSUM IN THE UNDERLYING STORAGE

    公开(公告)号:US20210349791A1

    公开(公告)日:2021-11-11

    申请号:US16870852

    申请日:2020-05-08

    Applicant: VMware, Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for resynchronizing data in a storage system. One of the methods includes receiving, by a first storage subsystem, a plurality of write requests corresponding to respective meta data blocks, wherein the first storage subsystem comprises a meta object; storing, by the first storage subsystem and for each write request, in each disk of the meta object, a version of the corresponding meta data block; determining that a particular disk of the meta object has failed; determining whether one or more valid versions of the meta data block are stored in respective other disks of the meta object; and in response to determining that one or more valid versions of the meta data block are stored in respective other disks of the meta object, resynchronizing the meta data block in the particular disk.

    SHRINKING SEGMENT CLEANING ALGORITHM IN AN OBJECT STORAGE

    公开(公告)号:US20220164125A1

    公开(公告)日:2022-05-26

    申请号:US17100663

    申请日:2020-11-20

    Applicant: VMware, Inc.

    Abstract: A method for cleaning an object storage having a plurality of segments is provided. Each segment includes an identifier through which the segment is accessed. The method identifies a first segment in the plurality of segments. The first segment includes a first identifier and a first size. The method determines that a utilization ratio for the first segment is below a threshold. As a result, the method generates a second segment from the first segment, such that the second segment includes a second identifier that is the same as the first identifier and a second size that is smaller than the first size. The method then writes the second segment to the object storage.

    SCALABLE I/O OPERATIONS ON A LOG-STRUCTURED MERGE (LSM) TREE

    公开(公告)号:US20220156231A1

    公开(公告)日:2022-05-19

    申请号:US17097473

    申请日:2020-11-13

    Applicant: VMware, Inc.

    Abstract: A method for managing data associated with objects stored in a cloud storage is provided. The method receives, at a first compute node, first data associated with an object stored in the cloud storage, the first compute node being one of a plurality of compute nodes that store data associated with different objects as storage objects in a log-structured merging (LSM) tree data structure. The method then assigns a first unique name to a first storage object associated with the first data, the first unique name comprising a combination of at least an identifier identifying the first compute node and a first incremental local value. The method stores the first storage object in a first level (L0) of the LSM tree data structure.

    ENHANCED HASH CALCULATION IN DISTRIBUTED DATASTORES

    公开(公告)号:US20210294495A1

    公开(公告)日:2021-09-23

    申请号:US16827648

    申请日:2020-03-23

    Applicant: VMware, Inc.

    Abstract: A method for generating one or more hashes for one or more data blocks is provided. The method receives a data block to write on at least one physical disk of a set of physical disks associated with a set of host machines. The method then calculates a hash for the received data block and writes a first entry to a data log in a cache disk, the first entry comprising a first header and data indicative of the received block, the first header comprising the hash. The method further writes the data to the at least one physical disk as part of data blocks of a stripe, and stores the hash in a summary block on the at least one physical disk. The summary block is associated with the data blocks of the stripe stored on the at least one physical disk.

    SYSTEM AND METHOD FOR REDUCING READ AMPLIFICATION OF ARCHIVAL STORAGE USING PROACTIVE CONSOLIDATION

    公开(公告)号:US20220197861A1

    公开(公告)日:2022-06-23

    申请号:US17131155

    申请日:2020-12-22

    Applicant: VMware, Inc.

    Abstract: System and method for managing snapshots of storage objects in a storage system use a consolidation operation to reduce read amplification for stored snapshots of a storage object that are stored in log segments in the storage system according to a log-structured file system as storage service objects. The consolidation operation involves identifying target log segments among the log segments that include live blocks that are associated with the latest snapshot of the storage object and determining the number of the live blocks included in each of the target log segments. Based on the number of the live blocks in each of the target log segments, candidate consolidation log segments are determined from the target log segments. The live blocks in the candidate consolidation log segments are then consolidated to new log segments, which are uploaded to the storage system as new storage service objects.

    SYSTEM AND METHODS OF EFFICIENTLY RESYNCING FAILED COMPONENTS WITHOUT BITMAP IN AN ERASURE-CODED DISTRIBUTED OBJECT WITH LOG-STRUCTURED DISK LAYOUT

    公开(公告)号:US20210349793A1

    公开(公告)日:2021-11-11

    申请号:US16870861

    申请日:2020-05-08

    Applicant: VMware, Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for resynchronizing data in a storage system. One of the methods includes determining that a particular disk of a capacity object of a storage system was offline for an interval of time, wherein the capacity object comprises a plurality of segments, and wherein the storage system comprises a segment usage table identifying a linked list of particular segments of the capacity object that are in use; determining a time point at which the particular disk went offline; determining one or more first segments of the capacity object that were modified after the time point, wherein determining one or more first segments comprises determining each segment of the segment usage table having a transaction ID that is larger than the time point; and resynchronizing, for each first segment, a portion of the particular disk corresponding to the first segment.

    SYSTEM AND METHOD OF RESYNCING DATA IN ERASURE-CODED OBJECTS ON DISTRIBUTED STORAGE SYSTEMS WITHOUT REQUIRING CHECKSUM IN THE UNDERLYING STORAGE

    公开(公告)号:US20210349790A1

    公开(公告)日:2021-11-11

    申请号:US16870801

    申请日:2020-05-08

    Applicant: VMware, Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for resynchronizing data in a storage system. One of the methods includes determining that a particular primary disk of a capacity object of a storage system has failed, wherein the capacity comprises a plurality of segments, and wherein the each segment comprises: a plurality of primary columns each corresponding to a respective primary disk of the capacity object, and a plurality of parity columns each corresponding to a respective parity disk of the capacity object; and resynchronizing, for each segment of one or more segments of the capacity object, the primary column of the segment corresponding to the particular primary disk using i) the primary columns of the segment corresponding to each other primary disk of the capacity object, ii) one or more parity columns of the segment, and iii) the column summaries of the segment.

    ENHANCED DATA ENCRYPTION IN DISTRIBUTED DATASTORES USING RANDOM TWEAKS STORED IN DATA BLOCKS

    公开(公告)号:US20210294502A1

    公开(公告)日:2021-09-23

    申请号:US16827692

    申请日:2020-03-23

    Applicant: VMware, Inc.

    Abstract: A method for encrypting data in one or more data blocks is provided. The method receives a first data block to be written to a physical storage that includes one or more physical disks. The method applies a first random tweak to data indicative of the first data block to generate a first encrypted data block, and writes the first encrypted data block and the first random tweak to a first physical block of the physical storage. The method receives a second data block to be written to the physical storage. The method then applies a second random tweak, different than the first random tweak, to data indicative of the second data block to generate a second encrypted data block, and writes the second encrypted data block and the second random tweak to a second physical block of the physical storage.

    ENHANCED DATA COMPRESSION IN DISTRIBUTED DATASTORES

    公开(公告)号:US20210294499A1

    公开(公告)日:2021-09-23

    申请号:US16827618

    申请日:2020-03-23

    Applicant: VMware, Inc.

    Abstract: A method for performing write operations on a set of one or more physical disks of a set of one or more host machines is provided. The method receives a data block to write on at least one physical disk in the set of physical disks and generates a first set of one or more compressed sectors based on the received data block. The method writes (i) a first entry having a first header and the first set of compressed sectors to a data log that is maintained in a cache, and (ii) the first set of compressed sectors to a bank in memory. The method further determines if a size of data including compressed sectors in the bank satisfies a threshold, and when the size of data in the bank satisfies the threshold, writes the data to the at least one physical disk in the set of physical disks.

Patent Agency Ranking