-
公开(公告)号:US11663160B2
公开(公告)日:2023-05-30
申请号:US17002669
申请日:2020-08-25
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Vamsi Gunturu , Junlong Gao , Petr Vandrovec , Ilya Languev , Maxime Austruy , Ilia Sokolinski , Satish Pudi
CPC classification number: G06F16/128 , G06F11/1471 , G06F16/20 , G06F16/273 , H04L67/1097 , G06F11/1451 , G06F16/258 , G06F2201/80 , G06F2201/84
Abstract: In one set of embodiments, a computer system can create a snapshot of a data set, where the snapshot includes a plurality of data blocks of the data set. The computer system can upload the snapshot to a cloud object storage platform of a cloud infrastructure, where the snapshot is uploaded as a plurality of log segments conforming to an object format of the cloud object storage platform, and where each log segment includes one or more data blocks in the plurality of data blocks, and a set of metadata comprising, for each of the one or more data blocks, an identifier of the data set, an identifier of the snapshot, and a logical block address (LBA) of the data block. The computer system can then communicate the set of metadata to a server component running in a cloud compute and block storage platform of the cloud infrastructure.
-
公开(公告)号:US20220019456A1
公开(公告)日:2022-01-20
申请号:US17013727
申请日:2020-09-07
Applicant: VMware, Inc.
Inventor: MOUNESH BADIGER , Wenguang Wang , Adrian Drzewiecki , Maxime Austruy , Satish Pudi
IPC: G06F9/455 , G06F12/1036 , G06F12/02
Abstract: Examples provide a method of communication between a client driver and a filesystem server. The client driver executes in a virtual machine (VM) and the filesystem server executes in a hypervisor. The method includes: allocating, by the client driver, shared memory in an address space of the VM for the communication; sending identification information for the shared memory from the client driver to the filesystem server through an inter-process communication channel between the client driver and the filesystem server; identifying, by the filesystem server in cooperation with a kernel of the hypervisor, the shared memory within an address space of the hypervisor, based on the identification information, to create a shared memory channel; sending commands from the client driver to the filesystem server through the shared memory channel; and receiving completion messages for the commands from the filesystem server to the client driver through the shared memory channel.
-
公开(公告)号:US11385817B2
公开(公告)日:2022-07-12
申请号:US17028312
申请日:2020-09-22
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Vamsi Gunturu , Junlong Gao , Ilya Languev , Petr Vandrovec , Maxime Austruy , Ilia Sokolinski , Satish Pudi
IPC: G06F3/06 , G06F12/1018
Abstract: The present disclosure is related to methods, systems, and machine-readable media for supporting deduplication in object storage using subset hashes. A plurality of hashes of a plurality of blocks of a plurality of log segments can be received from a software defined data center, wherein each block corresponds to a respective logical address. Each of the plurality of logical addresses can be associated with a respective sequentially-allocated chunk identifier in a logical map. A subset hash comprising a hash of a subset of the plurality of blocks can be determined that corresponds to a contiguous range of the plurality of logical addresses. A search of a hash map for the subset hash can be performed to determine if the subset hash is a duplicate. The subset of the plurality of blocks can be deduplicated responsive to a determination that the subset hash is a duplicate.
-
公开(公告)号:US20220066882A1
公开(公告)日:2022-03-03
申请号:US17002577
申请日:2020-08-25
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Vamsi Gunturu , Junlong Gao , Petr Vandrovec , Ilya Languev , Maxime Austruy , Ilia Sokolinski , Satish Pudi
Abstract: Techniques for tiering data to a cold storage tier of a cloud object storage platform are provided. In one set of embodiments, a computer system can identify one or more old snapshots of a data set that reside in a first storage tier of the cloud object storage platform, where the one or more old snapshots are snapshots that are unlikely to be deleted from the cloud object storage platform within a period of N days. The computer system can further, for each snapshot in the one or more old snapshots: identify one or more data blocks in the snapshot that are superseded by a more recent snapshot in the one or more old snapshots; write the one or more data blocks to a second (i.e., cold) storage tier of the cloud object storage platform that has a lower storage cost than the first storage tier; and cause the one or more data blocks to be deleted from the first storage tier.
-
公开(公告)号:US11221944B1
公开(公告)日:2022-01-11
申请号:US17002667
申请日:2020-08-25
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Vamsi Gunturu , Junlong Gao , Ilya Languev , Petr Vandrovec , Maxime Austruy , Ilia Sokolinski , Satish Pudi
IPC: G06F12/12 , G06F12/02 , G06F12/109
Abstract: A method for managing metadata for data stored in a cloud storage is provided. The method receives, at a first of a plurality of metadata servers, information associated with an object stored in the cloud storage, the information comprising a plurality of LBAs for where the object is stored. Each metadata server allocates contiguous chunk IDs for a group of objects. The method generates a new chunk ID for the object, which is a combination of a unique fixed value and a monotonically incrementing local value associated with each LBA, such that a first LBA is mapped to a first chunk ID having a first local value and a next LBA is mapped to a second chunk ID having the first local value incremented as a second local value. The method stores the new chunk ID and other metadata in one or more tables stored in a metadata storage.
-
公开(公告)号:US11500819B2
公开(公告)日:2022-11-15
申请号:US17028405
申请日:2020-09-22
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Vamsi Gunturu , Junlong Gao , Maxime Austruy , Petr Vandrovec , Ilya Languev , Ilia Sokolinski , Satish Pudi
IPC: G06F7/00 , G06F16/174 , G06F16/14 , G06F16/13
Abstract: The present disclosure is related to methods, systems, and machine-readable media for supporting deduplication in file storage using file chunk hashes. A hash of a chunk of a log segment can be received from a software defined data center. A chunk identifier can be associated with the hash in a hash map that stores associations between sequentially-allocated chunk identifiers and hashes. The chunk identifier can be associated with a logical address corresponding to the chunk of the log segment in a logical map that stores associations between the sequentially-allocated chunk identifiers and logical addresses. A search of the hash map can be performed to determine if the chunk is a duplicate, and the chunk can be deduplicated responsive to a determination that the chunk is a duplicate.
-
公开(公告)号:US11314440B1
公开(公告)日:2022-04-26
申请号:US17072904
申请日:2020-10-16
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Vamsidhar Gunturu , Junlong Gao , Ilya Languev , Petr Vandrovec , Maxime Austruy , Ilia Sokolinski , Satish Pudi
Abstract: Techniques for the increased efficiency of storing data objects storage in the object storage of a software designed data center (SDDC) are provided. The techniques include the efficient storage of data, while enabling snapshots of each updating of the data. The snapshots of the data may be efficiently recovered via the techniques. Difference-level mappings for each snapshot are encoded in compact self-balancing data trees included in the object's metadata. The metadata mappings include mappings between various address spaces employed by the SDDC, as well as the address spaces employed by data stores that store the data on physical medium. Because the metadata is efficiently structured, the metadata for an object may be cached for quick lookups during data access and/or snapshot recovery. The techniques also provide low-latency recovery and/or system rollback in the event of any failure in the SDDC.
-
公开(公告)号:US20220091765A1
公开(公告)日:2022-03-24
申请号:US17028312
申请日:2020-09-22
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Vamsi Gunturu , Junlong Gao , Ilya Languev , Petr Vandrovec , Maxime Austruy , Ilia Sokolinski , Satish Pudi
IPC: G06F3/06 , G06F12/1018
Abstract: The present disclosure is related to methods, systems, and machine-readable media for supporting deduplication in object storage using subset hashes. A plurality of hashes of a plurality of blocks of a plurality of log segments can be received from a software defined data center, wherein each block corresponds to a respective logical address. Each of the plurality of logical addresses can be associated with a respective sequentially-allocated chunk identifier in a logical map. A subset hash comprising a hash of a subset of the plurality of blocks can be determined that corresponds to a contiguous range of the plurality of logical addresses. A search of a hash map for the subset hash can be performed to determine if the subset hash is a duplicate. The subset of the plurality of blocks can be deduplicated responsive to a determination that the subset hash is a duplicate.
-
公开(公告)号:US20220066883A1
公开(公告)日:2022-03-03
申请号:US17002669
申请日:2020-08-25
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Vamsi Gunturu , Junlong Gao , Petr Vandrovec , Ilya Languev , Maxime Austruy , Ilia Sokolinski , Satish Pudi
Abstract: Techniques for recovering metadata associated with data backed up in cloud object storage are provided. In one set of embodiments, a computer system can create a snapshot of a data set, where the snapshot includes a plurality of data blocks of the data set that have been modified since the creation of a prior snapshot of the data set. The computer system can further upload the snapshot to a cloud object storage platform of a cloud infrastructure, where the snapshot is uploaded as a plurality of log segments conforming to an object format of the cloud object storage platform, and where each log segment includes one or more data blocks in the plurality of data blocks, and a set of metadata comprising, for each of the one or more data blocks, an identifier of the data set, an identifier of the snapshot, and a logical block address (LBA) of the data block. The computer system can then communicate the set of metadata to a server component running in a cloud compute and block storage platform of the cloud infrastructure.
-
公开(公告)号:US20220058094A1
公开(公告)日:2022-02-24
申请号:US16998060
申请日:2020-08-20
Applicant: VMware, Inc.
Inventor: Vamsi Gunturu , Wenguang Wang , Junlong Gao , Ilia Langouev , Petr Vandrovec , Maxime Austruy , Ilia Sokolinski , Satish Pudi
Abstract: Solutions for managing archived storage include receiving, at a first node, a snapshot comprising object data (e.g., a virtual machine disk snapshot) from a second node (e.g., a software defined data center), and storing the snapshot in a tiered structure that includes a data tier and a metadata tier. Snapshots may be used for fail-over operations and/or backups, to support disaster recovery. The data tier comprises a log-structured file system (LFS), and the metadata tier comprises a content addressable storage (CAS) identifying addresses within the LFS. The metadata tier also comprises a logical layer indicating content in the CAS. Segment cleaning of the data tier is performed using a segment usage table (SUT). Some examples include performing a fail-over operation from the second node to a third node using at least the stored snapshot for workload recovery. In some examples, the CAS comprises a log-structured merge-tree (LSM-tree).
-
-
-
-
-
-
-
-
-