-
公开(公告)号:US20240070080A1
公开(公告)日:2024-02-29
申请号:US17900642
申请日:2022-08-31
Applicant: VMware, Inc.
Inventor: Wenguang WANG , Yanxing PAN , Junlong GAO , Mounesh BADIGER , Eric KNAUFT , Vamsidhar GUNTURU
IPC: G06F12/0864
CPC classification number: G06F12/0864 , G06F2212/1016 , G06F2212/604
Abstract: The disclosure describes growing a data cache using a background hash bucket growth process. A first memory portion is allocated to the data buffer of the data cache and a second memory portion is allocated to the metadata buffer of the data cache based on the cache growth instruction. The quantity of hash buckets in the hash bucket buffer is increased and the background hash bucket growth process is initiated, wherein the process is configured to rehash hash bucket entries of the hash bucket buffer in the increased quantity of hash buckets. A data entry is stored in the data buffer using the allocated first memory portion of the data cache and metadata associated with the data entry is stored using the allocated second memory portion of the metadata buffer, wherein a hash bucket entry associated with the data entry is stored in the increased quantity of hash buckets.
-
2.
公开(公告)号:US20210064582A1
公开(公告)日:2021-03-04
申请号:US16552998
申请日:2019-08-27
Applicant: VMware, Inc.
Inventor: Wenguang WANG , Junlong GAO , Marcos K. AGUILERA , Richard P. SPILLANE , Christos KARAMANOLIS , Maxime AUSTRUY
IPC: G06F16/174 , G06F16/13 , G06F16/172
Abstract: The present disclosure provides techniques for deduplicating files. The techniques include creating a data structure that organizes metadata about chunks of files, the organization of the metadata preserving order and locality of the chunks within files. The organization of the metadata within storage blocks of storage devices matches the order of chunks within files. Upon a read or write operation to a metadata, the preservation of locality of metadata results in the likely fetching, from storage into a memory cache, metadata of subsequent and contiguous chunks. The preserved locality results in faster subsequent read and write operations of metadata, because the read and write operations are likely to be executed from memory rather than from storage.
-
公开(公告)号:US20220414064A1
公开(公告)日:2022-12-29
申请号:US17357044
申请日:2021-06-24
Applicant: VMware, Inc.
Inventor: Abhay Kumar JAIN , Sriram PATIL , Junlong GAO , Wenguang WANG
IPC: G06F16/182 , G06F16/17 , G06F16/16 , G06F16/178 , G06F16/174
Abstract: A method for managing replication of cloned files is provided. Embodiments include determining, at a source system, that a first file has been cloned to create a second file. Embodiments include sending, from the source system to a replica system, an address of the first extent and an indication that a status of the first extent has changed from non-cloned to cloned. Embodiments include changing, at the replica system, a status of a second extent associated with a replica of the first file on the replica system from non-cloned to cloned and creating a mapping of the address of the first extent to an address of the second extent on the replica system. Embodiments include creating, at the replica system, a replica of the second file comprising a reference to the address of the second extent on the replica system.
-
公开(公告)号:US20210326049A1
公开(公告)日:2021-10-21
申请号:US16853623
申请日:2020-04-20
Applicant: VMware, Inc.
Inventor: Ye ZHANG , Wenguang WANG , Sriram PATIL , Richard P. SPILLANE , Junlong GAO , Wangping HE , Zhaohui GUO , Yang YANG
Abstract: System and method for writing updated versions of a configuration data file for a distributed file system in a storage system uses a directory renaming operation to write a new updated version of the configuration data file using the latest version of the configuration data file and a target directory. After the latest version of the configuration data file is modified by a particular host computer in the storage system, the modified configuration data file is written to a temporary file. The directory naming operation is then initiated on the temporary file to change the directory for the temporary file to the target directory. If the directory renaming operation has failed, a retry is performed by the particular host computer to write the new updated version of the configuration data file using a new latest version and a new target directory.
-
公开(公告)号:US20210064580A1
公开(公告)日:2021-03-04
申请号:US16552965
申请日:2019-08-27
Applicant: VMware, Inc.
Inventor: Junlong GAO , Wenguang WANG , Marcos K. AGUILERA , Richard P. SPILLANE , Christos KARAMANOLIS , Maxime AUSTRUY
IPC: G06F16/174 , G06F16/14 , G06F16/901
Abstract: The disclosure provides techniques for deduplicating files. The techniques include, upon creating or modifying a file, placing a logical timestamp of the current logical time, within a queue associated with the directory of the file. The techniques further include placing the logical timestamp within a queue of each parent directory of the directory of the file. To determine a set of files for deduplication, the techniques disclosed herein identify files that have been modified within a logical time range. The set of files modified within a logical time is identified by traversing directories of a storage system, the directories being organized within a tree structure. If a directory's queue does not contain a timestamp that is within the logical time range, then all child directories can be skipped over for further processing, such that no files within the child directories end up being within the set of files for deduplication.
-
公开(公告)号:US20230195576A1
公开(公告)日:2023-06-22
申请号:US17644618
申请日:2021-12-16
Applicant: VMware, Inc.
Inventor: Enning XIANG , Pranay SINGH , Junlong GAO , Hardik Singh NEGI , Wenguang WANG
CPC classification number: G06F11/1407 , G06F11/1474 , G06F16/128 , G06F16/2246
Abstract: A method for resumeable snapshot deletion is provided. A method for deletion of nodes maintained in an ordered data structure for a first snapshot includes processing the nodes maintained in the ordered data structure according to a defined order, setting a node path cursor with a pointer to a node and an indication of the deletion of the node; storing the node path cursor in a persistent storage; and during processing of the nodes: detecting a failure; after the failure, checking the pointer of the node path cursor; and resuming processing of the nodes starting from the first node indicated by the pointer.
-
公开(公告)号:US20220121532A1
公开(公告)日:2022-04-21
申请号:US17072961
申请日:2020-10-16
Applicant: VMware, Inc.
Inventor: Petr VANDROVEC , Wenguang WANG , Vamsidhar GUNTURU , Junlong GAO , Ilia SOKOLINSKI , Maxime AUSTRUY
IPC: G06F11/14
Abstract: Techniques for the increased efficiency of storing data objects storage in the object storage of a software designed data center (SDDC) are provided. The techniques include the efficient storage of data, while enabling snapshots of each updating of the data. The snapshots of the data may be efficiently recovered via the techniques. Difference-level mappings for each snapshot are encoded in compact self-balancing data trees included in the object's metadata. The metadata mappings include mappings between various address spaces employed by the SDDC, as well as the address spaces employed by data stores that store the data on physical medium. Because the metadata is efficiently structured, the metadata for an object may be cached for quick lookups during data access and/or snapshot recovery. The techniques also provide low-latency recovery and/or system rollback in the event of any failure in the SDDC, including when the failure occurs while uploading a snapshot.
-
公开(公告)号:US20210141728A1
公开(公告)日:2021-05-13
申请号:US16679570
申请日:2019-11-11
Applicant: VMware, Inc.
Inventor: Wenguang WANG , Mounesh BADIGER , Abhay Kumar JAIN , Junlong GAO , Zhaohui GUO , Richard P. SPILLANE
IPC: G06F12/0842 , G06F12/0844 , G06F12/0871 , G06F12/1018 , G06F12/14
Abstract: Disclosed are a method and system for managing multi-threaded concurrent access to a cache data structure. The cache data structure includes a hash table and three queues. The hash table includes a list of elements for each hash bucket with each hash bucket containing a mutex object and elements in each of the queues containing lock objects. Multiple threads can each lock a different hash bucket to have access to the list, and multiple threads can each lock a different element in the queues. The locks permit highly concurrent access to the cache data structure without conflict. Also, atomic operations are used to obtain pointers to elements in the queues so that a thread can safely advance each pointer. Race conditions that are encountered with locking an element in the queues or entering an element into the hash table are detected, and the operation encountering the race condition is retried.
-
9.
公开(公告)号:US20220197861A1
公开(公告)日:2022-06-23
申请号:US17131155
申请日:2020-12-22
Applicant: VMware, Inc.
Inventor: Wenguang WANG , Hardik Singh NEGI , Junlong GAO , Vamsi GUNTURU
IPC: G06F16/11 , G06F16/18 , G06F16/22 , G06F16/23 , G06F16/2455
Abstract: System and method for managing snapshots of storage objects in a storage system use a consolidation operation to reduce read amplification for stored snapshots of a storage object that are stored in log segments in the storage system according to a log-structured file system as storage service objects. The consolidation operation involves identifying target log segments among the log segments that include live blocks that are associated with the latest snapshot of the storage object and determining the number of the live blocks included in each of the target log segments. Based on the number of the live blocks in each of the target log segments, candidate consolidation log segments are determined from the target log segments. The live blocks in the candidate consolidation log segments are then consolidated to new log segments, which are uploaded to the storage system as new storage service objects.
-
公开(公告)号:US20220121365A1
公开(公告)日:2022-04-21
申请号:US17072904
申请日:2020-10-16
Applicant: VMware, Inc.
Inventor: Wenguang WANG , Vamsidhar GUNTURU , Junlong GAO , Ilya LANGUEV , Petr VANDROVEC , Maxime AUSTRUY , Ilia SOKOLINSKI , Satish PUDI
Abstract: Techniques for the increased efficiency of storing data objects storage in the object storage of a software designed data center (SDDC) are provided. The techniques include the efficient storage of data, while enabling snapshots of each updating of the data. The snapshots of the data may be efficiently recovered via the techniques. Difference-level mappings for each snapshot are encoded in compact self-balancing data trees included in the object's metadata. The metadata mappings include mappings between various address spaces employed by the SDDC, as well as the address spaces employed by data stores that store the data on physical medium. Because the metadata is efficiently structured, the metadata for an object may be cached for quick lookups during data access and/or snapshot recovery. The techniques also provide low-latency recovery and/or system rollback in the event of any failure in the SDDC.
-
-
-
-
-
-
-
-
-