-
公开(公告)号:US20220058094A1
公开(公告)日:2022-02-24
申请号:US16998060
申请日:2020-08-20
Applicant: VMware, Inc.
Inventor: Vamsi Gunturu , Wenguang Wang , Junlong Gao , Ilia Langouev , Petr Vandrovec , Maxime Austruy , Ilia Sokolinski , Satish Pudi
Abstract: Solutions for managing archived storage include receiving, at a first node, a snapshot comprising object data (e.g., a virtual machine disk snapshot) from a second node (e.g., a software defined data center), and storing the snapshot in a tiered structure that includes a data tier and a metadata tier. Snapshots may be used for fail-over operations and/or backups, to support disaster recovery. The data tier comprises a log-structured file system (LFS), and the metadata tier comprises a content addressable storage (CAS) identifying addresses within the LFS. The metadata tier also comprises a logical layer indicating content in the CAS. Segment cleaning of the data tier is performed using a segment usage table (SUT). Some examples include performing a fail-over operation from the second node to a third node using at least the stored snapshot for workload recovery. In some examples, the CAS comprises a log-structured merge-tree (LSM-tree).
-
公开(公告)号:US20220019528A1
公开(公告)日:2022-01-20
申请号:US16933162
申请日:2020-07-20
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Vamsi Gunturu
IPC: G06F12/06
Abstract: A logical map represents fragments from separate versions of a data object. Migration of data from a first (old) version to the second (new) version happens gradually, where write operations go to the new version of the data object. The logical map initially points to the old data object, but is updated to point to the portions of the new data object as write operations are performed on the new data object. A background migration copies data from the old data object to the new data object.
-
公开(公告)号:US20210365318A1
公开(公告)日:2021-11-25
申请号:US16882246
申请日:2020-05-22
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Vamsi Gunturu , Junlong Gao
Abstract: Techniques for using erasure coding in a single region to reduce the likelihood of losing objects in a cloud object storage platform are provided. In one set of embodiments, a computer system can upload a plurality of data objects to a region of a cloud object storage platform, where the plurality of data objects including modifications to a data set. The computer system can further compute a parity object based on the plurality of data objects, where the parity object encodes parity information for the plurality of data objects. The computer system can then upload the parity object to the same region where the plurality of data objects was uploaded.
-
44.
公开(公告)号:US20210311653A1
公开(公告)日:2021-10-07
申请号:US16842649
申请日:2020-04-07
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Vamsi Gunturu , Eric Knauft , Pascal Renauld
IPC: G06F3/06
Abstract: Techniques for issuing efficient writes to an erasure coded storage object in a distributed storage system are provided. In one set of embodiments, a node of the system can receive a write request for updating a logical data block of the storage object, write data/metadata for the block to a record in a data log of a metadata object of the storage object (where the metadata object is stored on a performance storage tier), place the block data in a free slot of an in-memory bank, and determine whether the in-memory bank has become full. If the in-memory bank is full, the node can further allocate a segment in a capacity object of the storage object for holding contents of the in-memory bank (where the capacity object is stored on a capacity storage tier), and write the in-memory bank contents via a full stripe write to the allocated segment.
-
公开(公告)号:US20210311651A1
公开(公告)日:2021-10-07
申请号:US16842604
申请日:2020-04-07
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Vamsi Gunturu
IPC: G06F3/06
Abstract: Techniques for supporting large segments when issuing writes to an erasure coded storage object in a distributed storage system are provided. In one set of embodiments, a node of the system can receive a write request for updating a logical data block of the storage object, write data/metadata for the block to a record in a data log of a metadata object of the storage object (where the metadata object is stored on a performance storage tier), and determine whether the data log has accumulated a threshold number of records. If so, the node can further allocate an in-memory bank, place the data from the data log records into free slots of the bank, allocate a segment in a capacity object of the storage object for holding contents of the bank (where the capacity object is stored on a capacity storage tier), and write the bank contents via a full stripe write to the allocated segment.
-
公开(公告)号:US11625370B2
公开(公告)日:2023-04-11
申请号:US16842657
申请日:2020-04-07
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Vamsi Gunturu , Eric Knauft
IPC: G06F16/215 , H04L67/1097 , G06F12/0804 , G06F16/23
Abstract: Techniques for reducing data log recovery time and metadata write amplification when checkpointing a data log of a storage object in a distributed storage system are provided. In one set of embodiments, a node of the system can determine whether the data log has reached a first threshold size, where the data log comprises a plurality of data log records, and where each data log record includes data and metadata for a write request directed to the storage object. If the data log has reached the first threshold size, the node can copy, from each of the plurality of data log records, the metadata for the write request to a corresponding metadata log entry in a metadata log of the storage object. The node can then truncate the data log by removing the plurality of data log records.
-
公开(公告)号:US11494276B2
公开(公告)日:2022-11-08
申请号:US16925168
申请日:2020-07-09
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Vamsi Gunturu , Eric Knauft
Abstract: Processing write requests from clients includes logging the associated data into logs corresponding to the data objects targeted by the write requests. The logs are persisted by combining log entries from each of the logs into one ore more fixed-size data blocks. The fixed-size data blocks are inserted into a data tree stored on a block-based storage device.
-
公开(公告)号:US11494090B2
公开(公告)日:2022-11-08
申请号:US17033610
申请日:2020-09-25
Applicant: VMware Inc.
Inventor: Asit Desai , Wenguang Wang , Eric Knauft , Vamsi Gunturu
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for maintaining fault tolerance for new writes in a storage system when one or more components of the storage system are unavailable. One of the methods includes determining that one or more first disks of a capacity object of a storage system are unavailable, wherein the storage system comprises a segment usage table identifying the plurality of segments of the capacity object; in response: identifying a plurality of available second disks, adding a plurality of new segments corresponding to the second disks to the capacity object, and adding data identifying the plurality of new segments to the segment usage table; and for each of one or more new write requests to the capacity object: identifying an available segment from the plurality of new segments, and writing data associated with the new write request to the identified available segment.
-
公开(公告)号:US11481319B2
公开(公告)日:2022-10-25
申请号:US16882327
申请日:2020-05-22
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Vamsi Gunturu , Junlong Gao
Abstract: Techniques for using data mirroring across regions to reduce the likelihood of losing objects in a cloud object storage platform are provided. In one set of embodiments, a computer system can upload first and second copies of a data object to first and second regions of the cloud object storage platform respectively, where the first and second copies are identical. The computer system can then attempt to read the first copy of the data object from the first region. If the read attempt fails, the computer system can retrieve the second copy of the data object from the second region.
-
公开(公告)号:US11385817B2
公开(公告)日:2022-07-12
申请号:US17028312
申请日:2020-09-22
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Vamsi Gunturu , Junlong Gao , Ilya Languev , Petr Vandrovec , Maxime Austruy , Ilia Sokolinski , Satish Pudi
IPC: G06F3/06 , G06F12/1018
Abstract: The present disclosure is related to methods, systems, and machine-readable media for supporting deduplication in object storage using subset hashes. A plurality of hashes of a plurality of blocks of a plurality of log segments can be received from a software defined data center, wherein each block corresponds to a respective logical address. Each of the plurality of logical addresses can be associated with a respective sequentially-allocated chunk identifier in a logical map. A subset hash comprising a hash of a subset of the plurality of blocks can be determined that corresponds to a contiguous range of the plurality of logical addresses. A search of a hash map for the subset hash can be performed to determine if the subset hash is a duplicate. The subset of the plurality of blocks can be deduplicated responsive to a determination that the subset hash is a duplicate.
-
-
-
-
-
-
-
-
-