-
公开(公告)号:US20150058306A1
公开(公告)日:2015-02-26
申请号:US14469526
申请日:2014-08-26
Applicant: VMware, Inc.
Inventor: William EARL , Christos KARAMANOLIS , Eric KNAUFT , Pascal RENAULD
CPC classification number: G06F11/1471 , G06F17/30171 , G06F17/30191 , G06F17/30371 , G06F2009/45579 , H04L67/1097
Abstract: Embodiments of the disclosure provide techniques for updating a distributed transaction log on a previously offline resource object component using distributed transaction logs from active host computer nodes from separate RAID mirror configurations. Each component object maintains a journal (log) where distributed transactions are recorded. If a component object goes offline and subsequently returns (e.g., if the node hosting the component object reboots), the component object is marked as stale. To return the component object to an active state, a distributed resources module retrieves the journals from other resource component objects from other RAID configurations where the data is mirrored. The module filters corresponding data that is missing in the journal of the previously offline corresponding object and merges the filtered data to the journal.
Abstract translation: 本公开的实施例提供了使用来自分离的RAID镜像配置的活动主机计算机节点的分布式事务日志更新先前脱机的资源对象组件上的分布式事务日志的技术。 每个组件对象都维护记录分布式事务的日志(日志)。 如果组件对象脱机并随后返回(例如,如果托管组件对象的节点重新启动),组件对象将被标记为过时。 要将组件对象返回到活动状态,分布式资源模块从其他数据镜像的其他RAID配置中检索来自其他资源组件对象的日志。 该模块将先前脱机对应对象的日记中缺少的相应数据过滤,并将过滤后的数据合并到日志中。
-
公开(公告)号:US20230281180A1
公开(公告)日:2023-09-07
申请号:US18144797
申请日:2023-05-08
Applicant: VMware, Inc.
Inventor: Hardik Singh NEGI , Wenguang WANG , Eric KNAUFT
IPC: G06F16/22 , G06F16/28 , G06F16/2455 , G06F16/23
CPC classification number: G06F16/2246 , G06F16/288 , G06F16/24552 , G06F16/2343
Abstract: A method for modifying key-value pairs of a B+ tree is provided. The method receives a request to modify a particular key-value pair. Each node of the tree has a modification number. The method traverses a path on the tree from the root node toward the particular node. The traversing includes upon reaching a parent node of the path, acquiring a shared lock on both the parent node and a child node one level below the parent node. Upon determining that the child node is the particular node, the method stores the modification number of the particular node, releases the shared lock on the particular node, compares a current modification number of the node with its stored number, and acquires an exclusive lock on the node if the numbers are the same. The method increments the current modification number of the node and modifies it while in the exclusive lock.
-
公开(公告)号:US20230064693A1
公开(公告)日:2023-03-02
申请号:US17410673
申请日:2021-08-24
Applicant: VMware, Inc.
Inventor: Wenguang WANG , Long YANG , Maxime AUSTRUY , Matthew B. AMDUR , Eric KNAUFT
IPC: G06F12/0804 , G06F16/22 , G06F12/1009
Abstract: The disclosure herein describes storing data using a capacity data storage tier and a smaller performance data storage tier. The capacity data storage tier includes capacity data storage hardware configured to store log-structured leaf pages (LLPs), and the performance data storage tier includes performance data storage hardware. A virtual address table (VAT) includes a set of virtual address entries referencing the LLPs. A tree-structured index includes index nodes referencing the set of virtual address entries of the VAT. Data to be stored is received, and at least a first portion of metadata associated with the received data is stored in the LLPs using the VAT, and at least a second portion of metadata associated with the received data is stored in the performance data storage tier. The architecture reduces space usage of the performance data storage tier.
-
34.
公开(公告)号:US20230018790A1
公开(公告)日:2023-01-19
申请号:US17377128
申请日:2021-07-15
Applicant: VMware, Inc.
Inventor: Yiqi XU , Eric KNAUFT , Enning XIANG , Ojan THORNYCROFT , Asit DESAI , Varun SHAH
Abstract: The disclosure herein describes enhancing data durability of a base component of a data object using an unplanned delta component during transient fault unavailability. A base component of a data object becoming unavailable due to a transient fault is detected. A delta component associated with the base component is generated, wherein the delta component includes unwritten storage space with an address space and a tracking bitmap including a plurality of bits associated with data blocks of the address space of the delta component. The stale LSN with which the base component is associated is assigned to the delta component and the delta component is synchronized with an active component of the data object based on the assigned stale LSN. The delta component records write I/O targeted for the base component and, based on detecting the base component becoming available, the base component is synchronized with the delta component.
-
公开(公告)号:US20210382858A1
公开(公告)日:2021-12-09
申请号:US16894663
申请日:2020-06-05
Applicant: VMware, Inc.
Inventor: Wenguang WANG , Vamsidhar GUNTURU , Eric KNAUFT , Pascal RENAULD
IPC: G06F16/18 , G06F16/182 , G06F16/17 , G06F16/22 , G06F16/2457 , G06F12/02 , G06F12/0817
Abstract: Techniques for efficiently storing client data blocks on a distributed-computing system are provided. The system includes a fast performance tier and a large capacity tier. The capacity tier stores the client data blocks in erasure encoded data stripes. The performance tier stores logical map data including an address map indicating a correspondence between logical addresses associated with a first layer of the system and physical addresses associated with a second layer. A method includes receiving a request to include additional client data blocks in the client blocks. The request indicates logical addresses for additional blocks. Corresponding physical addresses for additional block are determined. Each additional block is stored at the physical address. Additional logical map data is stored in the performance tier. Storing the additional logical map data includes updating the address map to indicate the correspondence between the logical addresses and the physical addresses for the additional blocks.
-
公开(公告)号:US20210357291A1
公开(公告)日:2021-11-18
申请号:US16875838
申请日:2020-05-15
Applicant: VMware, Inc.
Inventor: Enning XIANG , Ojan THORNYCROFT , Yiqi XU , Zhihao YAO , Eric KNAUFT
Abstract: The disclosure herein describes placing a delta component of a base component in a target fault domain. A delta component associated with a base component is generated. The generation includes selecting a first fault domain as a target fault domain for the delta component based on the first fault domain including a witness component associated with the distributed data object of the base component. Otherwise, the generation includes selecting a second fault domain as the target fault domain based on the second fault domain including at least one data component that includes a different address space than the base component. Otherwise, the generation includes selecting a third fault domain as the target fault domain based on the third fault domain being unused. Then, the delta component is placed on the target fault domain, whereby data durability of the distributed data object is enhanced, and available fault domains are preserved.
-
公开(公告)号:US20210349793A1
公开(公告)日:2021-11-11
申请号:US16870861
申请日:2020-05-08
Applicant: VMware, Inc.
Inventor: Wenguang WANG , Vamsi GUNTURU , Enning XIANG , Eric KNAUFT
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for resynchronizing data in a storage system. One of the methods includes determining that a particular disk of a capacity object of a storage system was offline for an interval of time, wherein the capacity object comprises a plurality of segments, and wherein the storage system comprises a segment usage table identifying a linked list of particular segments of the capacity object that are in use; determining a time point at which the particular disk went offline; determining one or more first segments of the capacity object that were modified after the time point, wherein determining one or more first segments comprises determining each segment of the segment usage table having a transaction ID that is larger than the time point; and resynchronizing, for each first segment, a portion of the particular disk corresponding to the first segment.
-
公开(公告)号:US20210349790A1
公开(公告)日:2021-11-11
申请号:US16870801
申请日:2020-05-08
Applicant: VMware, Inc.
Inventor: Wenguang WANG , Enning XIANG , Vamsi GUNTURU , Eric KNAUFT , Pascal RENAULD
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for resynchronizing data in a storage system. One of the methods includes determining that a particular primary disk of a capacity object of a storage system has failed, wherein the capacity comprises a plurality of segments, and wherein the each segment comprises: a plurality of primary columns each corresponding to a respective primary disk of the capacity object, and a plurality of parity columns each corresponding to a respective parity disk of the capacity object; and resynchronizing, for each segment of one or more segments of the capacity object, the primary column of the segment corresponding to the particular primary disk using i) the primary columns of the segment corresponding to each other primary disk of the capacity object, ii) one or more parity columns of the segment, and iii) the column summaries of the segment.
-
39.
公开(公告)号:US20210294502A1
公开(公告)日:2021-09-23
申请号:US16827692
申请日:2020-03-23
Applicant: VMware, Inc.
Inventor: Wenguang WANG , Eric KNAUFT , Vamsi GUNTURU , Pascal RENAULD
Abstract: A method for encrypting data in one or more data blocks is provided. The method receives a first data block to be written to a physical storage that includes one or more physical disks. The method applies a first random tweak to data indicative of the first data block to generate a first encrypted data block, and writes the first encrypted data block and the first random tweak to a first physical block of the physical storage. The method receives a second data block to be written to the physical storage. The method then applies a second random tweak, different than the first random tweak, to data indicative of the second data block to generate a second encrypted data block, and writes the second encrypted data block and the second random tweak to a second physical block of the physical storage.
-
公开(公告)号:US20190324807A1
公开(公告)日:2019-10-24
申请号:US15955900
申请日:2018-04-18
Applicant: VMware, Inc.
Inventor: Enning XIANG , Eric KNAUFT , Vishal KHER
Abstract: Embodiments include estimating a workload of a resynchronization task for a data component in a logical storage system. Embodiments include detecting a connection from the data component, wherein the data component was previously disconnected. Embodiments further include determining a layout of the data storage system. Embodiments further include estimating the workload of the resynchronization task based on the layout. If the layout indicates that the data component has an active mirror component, embodiments include estimating the workload to be a used capacity of the active mirror component. If the layout indicates that the data component does not have an active mirror component and that the data component has a stale mirror component, embodiments include estimating the workload to be a sum of a used capacity of the stale mirror component and an estimated size of missed input/output operations. One or more actions may be performed based on the estimate.
-
-
-
-
-
-
-
-
-