-
11.
公开(公告)号:US10609139B2
公开(公告)日:2020-03-31
申请号:US15674375
申请日:2017-08-10
Applicant: VMware, Inc.
Inventor: Enning Xiang , Pascal Renauld , Eric Knauft
IPC: H04L29/08 , H04L12/24 , G06F13/10 , G06F9/451 , G06F16/27 , G06F16/182 , H04L29/14 , G06F11/07 , G06F9/455
Abstract: Exemplary methods, apparatuses, and systems include a replica node storing a component of a storage object detecting that a primary coordinator for the storage object component is no longer available to serve as primary coordinator. The replica node is within a cluster of nodes storing components of the storage object. In response to detecting that the primary coordinator is no longer available, the replica node updates a first metadata entry indicating that a secondary coordinator for the storage object component is unhealthy. The replica node rejects connection requests from the secondary coordinator in response to the first metadata entry indicating that the secondary coordinator for the storage object component is unhealthy.
-
公开(公告)号:US20190303013A1
公开(公告)日:2019-10-03
申请号:US16445649
申请日:2019-06-19
Applicant: VMware, Inc.
Inventor: Enning Xiang , Eric Knauft , Pascal Renauld
IPC: G06F3/06
Abstract: Systems and techniques are described for transferring data. A described technique includes determining that a first replica of an object stored at a first host has become available to a distributed storage system after previously being unavailable to the distributed storage system. The object includes a range of memory addresses at which data of the object is stored. In response to determining that the first replica has become available, resyncing data for the first replica is obtained. The resyncing data indicates whether each range of memory addresses is synchronized at the first replica with other replicas of the object. Tracking data for the first replica is obtained. The tracking data indicates whether data stored at the range of memory addresses of the object has been modified at a second replica while the first replica was unavailable. The resyncing data is updated based on the tracking data.
-
公开(公告)号:US20190026042A1
公开(公告)日:2019-01-24
申请号:US15653249
申请日:2017-07-18
Applicant: VMware, Inc.
Inventor: Abhishek Gupta , Jorge Guerra Delgado , Sandeep Rangaswamy , Srinath Premachandran , Pascal Renauld
Abstract: Techniques for enabling deduplication-aware load balancing in a distributed storage system are provided. In one set of embodiments, a node of the distributed storage system can receive an I/O (Input/Output) request pertaining to a data block of a storage object stored on a local storage component of the node. The node can further determine whether the I/O request requires insertion of a new entry into a deduplication hash table associated with the local storage component or deletion of an existing entry from the deduplication hash table. If the I/O request requires insertion of a new hash table entry, the node can add an identifier of the data block into a probabilistic data structure associated with the local storage component, where the probabilistic data structure is configured to maintain information regarding distinct data blocks that are likely present in the local storage component. Alternatively, if the I/O request requires deletion of an existing hash table entry, the node can remove the identifier of the data block from the probabilistic data structure.
-
公开(公告)号:US20180067826A1
公开(公告)日:2018-03-08
申请号:US15810650
申请日:2017-11-13
Applicant: VMware, Inc.
Inventor: William Earl , Christos Karamanolis , Eric Knauft , Pascal Renauld
CPC classification number: G06F11/1471 , G06F16/1774 , G06F16/1815 , G06F16/2365 , G06F2009/45579 , H04L67/1097
Abstract: Embodiments of the disclosure provide techniques for updating a distributed transaction log on a previously offline resource object component using distributed transaction logs from active host computer nodes from separate RAID mirror configurations. Each component object maintains a journal (log) where distributed transactions are recorded. If a component object goes offline and subsequently returns (e.g., if the node hosting the component object reboots), the component object is marked as stale. To return the component object to an active state, a distributed resources module retrieves the journals from other resource component objects from other RAID configurations where the data is mirrored. The module filters corresponding data that is missing in the journal of the previously offline corresponding object and merges the filtered data to the journal.
-
15.
公开(公告)号:US11847033B1
公开(公告)日:2023-12-19
申请号:US17897178
申请日:2022-08-28
Applicant: VMware, Inc.
Inventor: Junlong Gao , Kevin Rayfeng Li , Wenguang Wang , Eric Knauft , Pascal Renauld , Quanxing Liu , Satish Pudi
CPC classification number: G06F11/2017 , G06F11/1662 , G06F11/2097
Abstract: System and method for managing distributed storage objects for host unavailability in a distributed storage system uses at least one of a crash indicator in a specific on-disk block and a paused object indicator for a distributed storage object to determine whether to perform data recovery for the distributed storage object. When the crash indicator is set or the paused object indicator implies that the distributed storage object is a paused object, the distributed storage object is left as a paused object without perform the data recovery for the distributed storage object. When the crash indicator is unset and the paused object indicator implies that the distributed storage object is not a paused object, the data recovery for the distributed storage object is performed.
-
16.
公开(公告)号:US11467746B2
公开(公告)日:2022-10-11
申请号:US17089605
申请日:2020-11-04
Applicant: VMware, Inc.
Inventor: Eric Knauft , Wenguang Wang , Pascal Renauld
Abstract: Techniques for issuing efficient writes to an erasure coded storage object in a distributed storage system via adaptive logging are provided. In one set of embodiments, a node of the system can receive a write request for updating one or more logical data blocks of the storage object and determine whether a size of the one or more logical data blocks meets or exceeds a threshold size. Upon determining that the size of the one or more logical data blocks meets or exceeds the threshold size, the node can allocate a segment in a capacity object of the storage object, write the one or more logical data blocks via a full stripe write to the segment, and write metadata for the one or more logical data blocks to a log record in a log of a metadata object of the storage object. The metadata written to the log record can include mappings between logical block addresses (LBAs) of the one or more logical data blocks and physical block addresses (PBAs) where the one or more logical data blocks reside in the segment.
-
17.
公开(公告)号:US20210311631A1
公开(公告)日:2021-10-07
申请号:US17089605
申请日:2020-11-04
Applicant: VMware, Inc.
Inventor: Eric Knauft , Wenguang Wang , Pascal Renauld
Abstract: Techniques for issuing efficient writes to an erasure coded storage object in a distributed storage system via adaptive logging are provided. In one set of embodiments, a node of the system can receive a write request for updating one or more logical data blocks of the storage object and determine whether a size of the one or more logical data blocks meets or exceeds a threshold size. Upon determining that the size of the one or more logical data blocks meets or exceeds the threshold size, the node can allocate a segment in a capacity object of the storage object, write the one or more logical data blocks via a full stripe write to the segment, and write metadata for the one or more logical data blocks to a log record in a log of a metadata object of the storage object. The metadata written to the log record can include mappings between logical block addresses (LBAs) of the one or more logical data blocks and physical block addresses (PBAs) where the one or more logical data blocks reside in the segment.
-
18.
公开(公告)号:US10803086B2
公开(公告)日:2020-10-13
申请号:US15660753
申请日:2017-07-26
Applicant: VMware, Inc.
Inventor: Pascal Renauld , Enning Xiang , Eric Knauft
IPC: G06F16/30 , G06F16/27 , G06F9/455 , G06F3/06 , G06F16/188
Abstract: Component objects of a virtual disk are backed by first storage nodes, which are at a primary site, and second storage nodes, which are at a secondary site. The method of resynchronizing the component objects of the virtual disk includes, at a coordinating node at the primary site, responsive to a second storage node coming back online, identifying an out-of-sync block of the second storage node, locating the out-of-sync block in an address space maintained for blocks of the virtual disk, and transmitting a resync command to a replication module of a coordinating node at the secondary site, the resync command identifying the out-of-sync block within the address space.
-
公开(公告)号:US10769036B2
公开(公告)日:2020-09-08
申请号:US15810650
申请日:2017-11-13
Applicant: VMware, Inc.
Inventor: William Earl , Christos Karamanolis , Eric Knauft , Pascal Renauld
Abstract: Embodiments of the disclosure provide techniques for updating a distributed transaction log on a previously offline resource object component using distributed transaction logs from active host computer nodes from separate RAID mirror configurations. Each component object maintains a journal (log) where distributed transactions are recorded. If a component object goes offline and subsequently returns (e.g., if the node hosting the component object reboots), the component object is marked as stale. To return the component object to an active state, a distributed resources module retrieves the journals from other resource component objects from other RAID configurations where the data is mirrored. The module filters corresponding data that is missing in the journal of the previously offline corresponding object and merges the filtered data to the journal.
-
20.
公开(公告)号:US20190215155A1
公开(公告)日:2019-07-11
申请号:US15866278
申请日:2018-01-09
Applicant: VMware, Inc.
Inventor: Wenguang Wang , Jin Zhang , Haoran Zheng , Eric Knauft , Xin Li , Pascal Renauld , Bryan Fink
Abstract: A tenant's clear text data in a multi-tenant storage system can be encrypted using the tenant's cryptographic key to produce encrypted yet compressible data (“cryptographic data”). The cryptographic data can be encrypted using a system cryptographic key that is managed by the multi-tenant storage system and then stored. Use of the system cryptographic key allows for subsequent maintenance activities such as deduplication and compression to be performed on data stored in the multi-tenant storage system without having to access any of the tenants' cryptographic keys.
-
-
-
-
-
-
-
-
-