-
61.
Publication No.: US11467746B2
Publication date: 2022-10-11
Application No.: US17089605
Filing date: 2020-11-04
Applicant: VMware, Inc.
Inventor: Eric Knauft, Wenguang Wang, Pascal Renauld
Abstract: Techniques for issuing efficient writes to an erasure coded storage object in a distributed storage system via adaptive logging are provided. In one set of embodiments, a node of the system can receive a write request for updating one or more logical data blocks of the storage object and determine whether a size of the one or more logical data blocks meets or exceeds a threshold size. Upon determining that the size of the one or more logical data blocks meets or exceeds the threshold size, the node can allocate a segment in a capacity object of the storage object, write the one or more logical data blocks via a full stripe write to the segment, and write metadata for the one or more logical data blocks to a log record in a log of a metadata object of the storage object. The metadata written to the log record can include mappings between logical block addresses (LBAs) of the one or more logical data blocks and physical block addresses (PBAs) where the one or more logical data blocks reside in the segment.
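A minimal sketch of the threshold decision described in this abstract, under stated assumptions: the class `StorageObject`, the constant `THRESHOLD_BLOCKS`, and the `(segment, offset)` form of a PBA are hypothetical names chosen for illustration, and the abstract does not say what happens below the threshold, so that path only returns a marker.

```python
from dataclasses import dataclass, field

THRESHOLD_BLOCKS = 4  # hypothetical threshold, expressed in logical blocks


@dataclass
class StorageObject:
    # Stand-ins for the capacity object and the metadata object's log.
    capacity_segments: list = field(default_factory=list)
    metadata_log: list = field(default_factory=list)

    def write(self, lba_start, blocks):
        """Apply a write of consecutive logical blocks starting at lba_start."""
        if len(blocks) >= THRESHOLD_BLOCKS:
            # Allocate a new segment and write all blocks to it in one go
            # (standing in for the full stripe write).
            seg_id = len(self.capacity_segments)
            self.capacity_segments.append(list(blocks))
            # Record LBA -> PBA mappings in a single log record.
            mappings = {lba_start + i: (seg_id, i) for i in range(len(blocks))}
            self.metadata_log.append({"lba_to_pba": mappings})
            return "full_stripe"
        # The abstract does not describe the below-threshold path.
        return "below_threshold"
```

With a threshold of four blocks, a four-block write at LBA 100 produces one segment and one log record mapping LBAs 100 through 103 to offsets 0 through 3 of that segment.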
-
Publication No.: US11422904B2
Publication date: 2022-08-23
Application No.: US17106004
Filing date: 2020-11-27
Applicant: VMware, Inc.
Inventor: Enning Xiang, Ojan Thornycroft, Varun Shah, Eric Knauft, Asit Desai, Yiqi Xu
Abstract: The disclosure herein describes placing delta components of a base component in target fault domains. One or more delta components are generated. When a first fault domain that lacks a sibling component of the base component is identified, the first fault domain is selected as a single delta target fault domain and a single delta component is placed on the single delta target fault domain. When a second fault domain that includes a first sibling component of the base component is identified and a third fault domain that includes a second sibling component of the base component is identified, the second fault domain and the third fault domain are selected as a first double delta target fault domain and a second double delta target fault domain, and a first double delta component and a second double delta component are placed on the first and second double delta target fault domains.
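The selection rule in this abstract can be sketched roughly as follows. The function name `choose_delta_targets` and the input shape (a mapping from candidate fault domain name to whether it already holds a sibling component) are assumptions made for illustration; the abstract does not say how candidates are ordered or what happens when neither case applies.

```python
def choose_delta_targets(fault_domains):
    """Pick target fault domains for delta components.

    fault_domains: dict mapping a candidate fault domain name to True if it
    already holds a sibling of the base component, else False (hypothetical
    input shape).
    """
    without_sibling = [name for name, has in fault_domains.items() if not has]
    if without_sibling:
        # A domain lacking a sibling exists: place a single delta there.
        return ("single", [without_sibling[0]])
    with_sibling = [name for name, has in fault_domains.items() if has]
    if len(with_sibling) >= 2:
        # Otherwise use two sibling-holding domains for a pair of deltas.
        return ("double", with_sibling[:2])
    # Case not covered by the abstract.
    return ("none", [])
```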
-
Publication No.: US11307932B2
Publication date: 2022-04-19
Application No.: US16925145
Filing date: 2020-07-09
Applicant: VMware, Inc.
Inventor: Wenguang Wang, Vamsi Gunturu, Eric Knauft
Abstract: Writing to a storage system with data striping includes storing blocks of data in local memory until one or more full-stripe write operations can be performed, thus reducing write amplification on the data striped storage system. Crash recovery information includes storing the data associated metadata to respective persistent storage devices. Metadata associated with data from several clients is combined into fixed-size data blocks and stored on the respective persistent storage device.
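As a rough illustration of the buffering idea only, the sketch below holds blocks in memory and emits them in full-stripe groups. The class `StripeBuffer` and its fields are hypothetical, and the crash-recovery metadata handling mentioned in the abstract is deliberately left out.

```python
class StripeBuffer:
    """Accumulate blocks in memory and flush only whole stripes."""

    def __init__(self, stripe_blocks):
        self.stripe_blocks = stripe_blocks  # data blocks per full stripe
        self.pending = []                   # blocks waiting in local memory
        self.stripes_written = []           # stands in for the striped storage

    def write(self, block):
        self.pending.append(block)
        # Flush as many full stripes as the buffer currently holds.
        while len(self.pending) >= self.stripe_blocks:
            stripe = self.pending[:self.stripe_blocks]
            del self.pending[:self.stripe_blocks]
            self.stripes_written.append(stripe)
```

Writing seven blocks into a buffer with a three-block stripe yields two full-stripe writes and leaves one block pending, so no partial stripe is ever written.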
-
Publication No.: US20220012131A1
Publication date: 2022-01-13
Application No.: US16925145
Filing date: 2020-07-09
Applicant: VMware, Inc.
Inventor: Wenguang Wang, Vamsi Gunturu, Eric Knauft
Abstract: Writing to a storage system with data striping includes storing blocks of data in local memory until one or more full-stripe write operations can be performed, thus reducing write amplification on the data striped storage system. Crash recovery information includes storing the data associated metadata to respective persistent storage devices. Metadata associated with data from several clients is combined into fixed-size data blocks and stored on the respective persistent storage device.
-
65.
Publication No.: US20210311631A1
Publication date: 2021-10-07
Application No.: US17089605
Filing date: 2020-11-04
Applicant: VMware, Inc.
Inventor: Eric Knauft, Wenguang Wang, Pascal Renauld
Abstract: Techniques for issuing efficient writes to an erasure coded storage object in a distributed storage system via adaptive logging are provided. In one set of embodiments, a node of the system can receive a write request for updating one or more logical data blocks of the storage object and determine whether a size of the one or more logical data blocks meets or exceeds a threshold size. Upon determining that the size of the one or more logical data blocks meets or exceeds the threshold size, the node can allocate a segment in a capacity object of the storage object, write the one or more logical data blocks via a full stripe write to the segment, and write metadata for the one or more logical data blocks to a log record in a log of a metadata object of the storage object. The metadata written to the log record can include mappings between logical block addresses (LBAs) of the one or more logical data blocks and physical block addresses (PBAs) where the one or more logical data blocks reside in the segment.
-
Publication No.: US10965739B2
Publication date: 2021-03-30
Application No.: US15947313
Filing date: 2018-04-06
Applicant: VMware, Inc.
Inventor: Enning Xiang, Eric Knauft, Yiqi Xu, Xiaochuan Shen
Abstract: Computer system and method for managing storage requests in a distributed storage system uses congestion signals associated with storage requests, which are generated based on congestion at local storage of the computer system that supports a virtual storage area network. The storage requests are differentiated between a first class of storage requests and at least one other class of storage requests. For a storage request of the first class of storage requests, an actual ratio of a current average bandwidth of the first class of storage requests to a current average bandwidth of a second class of storage requests is calculated and compared with an expected ratio. The congestion signal associated with the storage request is then adjusted and transmitted to at least one source of storage requests for storage request fairness control.
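The ratio comparison in this abstract might look like the sketch below. The abstract does not state in which direction or by how much the congestion signal is adjusted, so the function `adjust_congestion`, the `step` parameter, the 0.0 to 1.0 signal range, and the rule "raise the signal when the first class exceeds its expected share" are all assumptions.

```python
def adjust_congestion(signal, bw_first, bw_second, expected_ratio, step=0.25):
    """Return an adjusted congestion signal for a first-class request.

    signal: current congestion signal in [0.0, 1.0] (assumed range)
    bw_first, bw_second: current average bandwidth of each request class
    expected_ratio: target ratio of first-class to second-class bandwidth
    """
    if bw_second <= 0:
        return signal  # ratio undefined; leave the signal unchanged
    actual_ratio = bw_first / bw_second
    if actual_ratio > expected_ratio:
        # First class is over its share: push back harder (assumed policy).
        return min(1.0, signal + step)
    if actual_ratio < expected_ratio:
        # First class is under its share: ease off (assumed policy).
        return max(0.0, signal - step)
    return signal
```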
-
67.
Publication No.: US10803086B2
Publication date: 2020-10-13
Application No.: US15660753
Filing date: 2017-07-26
Applicant: VMware, Inc.
Inventor: Pascal Renauld, Enning Xiang, Eric Knauft
IPC: G06F16/30, G06F16/27, G06F9/455, G06F3/06, G06F16/188
Abstract: Component objects of a virtual disk are backed by first storage nodes, which are at a primary site, and second storage nodes, which are at a secondary site. The method of resynchronizing the component objects of the virtual disk includes, at a coordinating node at the primary site, responsive to a second storage node coming back online, identifying an out-of-sync block of the second storage node, locating the out-of-sync block in an address space maintained for blocks of the virtual disk, and transmitting a resync command to a replication module of a coordinating node at the secondary site, the resync command identifying the out-of-sync block within the address space.
-
Publication No.: US10769036B2
Publication date: 2020-09-08
Application No.: US15810650
Filing date: 2017-11-13
Applicant: VMware, Inc.
Inventor: William Earl, Christos Karamanolis, Eric Knauft, Pascal Renauld
Abstract: Embodiments of the disclosure provide techniques for updating a distributed transaction log on a previously offline resource object component using distributed transaction logs from active host computer nodes from separate RAID mirror configurations. Each component object maintains a journal (log) where distributed transactions are recorded. If a component object goes offline and subsequently returns (e.g., if the node hosting the component object reboots), the component object is marked as stale. To return the component object to an active state, a distributed resources module retrieves the journals from other resource component objects from other RAID configurations where the data is mirrored. The module filters corresponding data that is missing in the journal of the previously offline corresponding object and merges the filtered data to the journal.
-
Publication No.: US10445144B1
Publication date: 2019-10-15
Application No.: US15955900
Filing date: 2018-04-18
Applicant: VMware, Inc.
Inventor: Enning Xiang, Eric Knauft, Vishal Kher
Abstract: Embodiments include estimating a workload of a resynchronization task for a data component in a logical storage system. Embodiments include detecting a connection from the data component, wherein the data component was previously disconnected. Embodiments further include determining a layout of the data storage system. Embodiments further include estimating the workload of the resynchronization task based on the layout. If the layout indicates that the data component has an active mirror component, embodiments include estimating the workload to be a used capacity of the active mirror component. If the layout indicates that the data component does not have an active mirror component and that the data component has a stale mirror component, embodiments include estimating the workload to be a sum of a used capacity of the stale mirror component and an estimated size of missed input/output operations. One or more actions may be performed based on the estimate.
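The two estimation cases in this abstract reduce to a small function. This is only a sketch: the name `estimate_resync_workload`, the use of `None` to mean "no such mirror", and the byte units are assumptions, and the abstract does not cover the case where neither kind of mirror exists.

```python
from typing import Optional


def estimate_resync_workload(
    active_mirror_used: Optional[int],
    stale_mirror_used: Optional[int],
    missed_io_estimate: int = 0,
) -> Optional[int]:
    """Estimate resync work (in bytes) for a reconnected data component."""
    if active_mirror_used is not None:
        # An active mirror exists: workload is its used capacity.
        return active_mirror_used
    if stale_mirror_used is not None:
        # Only a stale mirror exists: add the estimated size of missed I/O.
        return stale_mirror_used + missed_io_estimate
    # Neither case applies; the abstract does not define an estimate here.
    return None
```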
-
70.
Publication No.: US20190215155A1
Publication date: 2019-07-11
Application No.: US15866278
Filing date: 2018-01-09
Applicant: VMware, Inc.
Inventor: Wenguang Wang, Jin Zhang, Haoran Zheng, Eric Knauft, Xin Li, Pascal Renauld, Bryan Fink
Abstract: A tenant's clear text data in a multi-tenant storage system can be encrypted using the tenant's cryptographic key to produce encrypted yet compressible data (“cryptographic data”). The cryptographic data can be encrypted using a system cryptographic key that is managed by the multi-tenant storage system and then stored. Use of the system cryptographic key allows for subsequent maintenance activities such as deduplication and compression to be performed on data stored in the multi-tenant storage system without having to access any of the tenants' cryptographic keys.
-