HASH-BASED DATA TRANSFER IN DISTRIBUTED DEDUPLICATION STORAGE SYSTEMS

    Publication No.: US20190026303A1

    Publication Date: 2019-01-24

    Application No.: US15652572

    Filing Date: 2017-07-18

    Applicant: VMware, Inc.

    Abstract: Exemplary methods, apparatuses, and systems include a destination node receiving, from a source node, a hash of data stored by the source node and designated for transfer from the source node to the destination node. In response to determining the hash received from the source node maps to data stored by the destination node, the destination node reads the data into a write buffer and transmits an indication to the source node that the data is present at the destination node. The destination node reads the data from the write buffer and, in response to determining the data read from the write buffer remains unchanged as stored by the destination node, increases a reference count for the data as stored by the destination node.
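
The destination-side flow in this abstract can be illustrated with a minimal sketch. Everything named here (`DestinationNode`, `receive_hash`, `commit`, `receive_data`, the use of SHA-256, and the in-memory dictionaries) is a hypothetical stand-in chosen for illustration, not the implementation described in the application.

```python
import hashlib


class DestinationNode:
    """Hypothetical destination node: a content-addressed store with reference counts."""

    def __init__(self):
        self.store = {}         # hash -> data bytes
        self.refcount = {}      # hash -> reference count
        self.write_buffer = {}  # hash -> copy of the data read when the hash arrived

    def receive_data(self, data):
        """Full transfer: store the data, or add a reference if it is already stored."""
        digest = hashlib.sha256(data).hexdigest()
        if digest in self.store:
            self.refcount[digest] += 1
        else:
            self.store[digest] = data
            self.refcount[digest] = 1
        return digest

    def receive_hash(self, digest):
        """Hash-only transfer: return True if the data is already present here."""
        if digest in self.store:
            # Read the data into the write buffer before answering the source.
            self.write_buffer[digest] = self.store[digest]
            return True
        return False

    def commit(self, digest):
        """Add a reference only if the stored data still matches the buffered copy."""
        buffered = self.write_buffer.pop(digest)
        if self.store.get(digest) == buffered:
            self.refcount[digest] += 1
            return True
        return False  # data changed or was removed in the meantime


dest = DestinationNode()
digest = dest.receive_data(b"block-0")  # first transfer sends the full data
if dest.receive_hash(digest):           # second transfer sends only the hash
    dest.commit(digest)
print(dest.refcount[digest])            # prints 2
```

The recheck in `commit` is the point of the write buffer in this sketch: if the stored data changes between the hash lookup and the reference-count update, the count is left alone and the caller would have to fall back to a full transfer.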

    TRADING OFF CACHE SPACE AND WRITE AMPLIFICATION FOR Bε-TREES

    Publication No.: US20200233801A1

    Publication Date: 2020-07-23

    Application No.: US16252488

    Filing Date: 2019-01-18

    Applicant: VMware, Inc.

    Abstract: Certain aspects provide systems and methods for performing an operation on a Bε-tree. A method comprises writing a message associated with the operation to a first slot in a first buffer of a first non-leaf node of the Bε-tree in an append-only manner, wherein a first filter associated with the first slot is used for query operations associated with the first slot. The method further comprises determining that the first buffer is full and, upon determining to flush the message to a non-leaf child node, flushing the message in an append-only manner to a second slot in a second buffer of the non-leaf child node, wherein a second filter associated with the second slot is used for query operations associated with the second slot. The method further comprises, upon determining to flush the message to a leaf node, flushing the message to the leaf node in a sorted manner.
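
A rough sketch of the buffering scheme follows. It assumes a single child per non-leaf node (a real Bε-tree routes messages to one of many children by pivot key) and uses a Python `set` as a stand-in for the per-slot filter, which might be a Bloom filter in practice. The class names `Slot`, `NonLeaf`, and `Leaf` are illustrative only.

```python
import bisect


class Slot:
    """One append-only slot of a node buffer, with its own membership filter."""

    def __init__(self):
        self.messages = []   # kept in arrival order, never re-sorted
        self.filter = set()  # stand-in for a per-slot filter

    def append(self, key, value):
        self.messages.append((key, value))
        self.filter.add(key)


class Leaf:
    """Leaf node: flushed messages are merged in sorted key order."""

    def __init__(self):
        self.keys = []
        self.values = []

    def accept(self, messages):
        for key, value in messages:
            i = bisect.bisect_left(self.keys, key)
            if i < len(self.keys) and self.keys[i] == key:
                self.values[i] = value
            else:
                self.keys.insert(i, key)
                self.values.insert(i, value)

    def query(self, key):
        i = bisect.bisect_left(self.keys, key)
        if i < len(self.keys) and self.keys[i] == key:
            return self.values[i]
        return None


class NonLeaf:
    """Non-leaf node with a slotted, append-only buffer and one child (simplification)."""

    def __init__(self, capacity, child):
        self.capacity = capacity
        self.child = child
        self.slots = [Slot()]

    def size(self):
        return sum(len(slot.messages) for slot in self.slots)

    def write(self, key, value):
        self.slots[-1].append(key, value)
        if self.size() >= self.capacity:
            self.flush()

    def accept(self, messages):
        slot = Slot()  # flushed messages land in a fresh slot, still append-only
        for key, value in messages:
            slot.append(key, value)
        self.slots.append(slot)
        if self.size() >= self.capacity:
            self.flush()

    def flush(self):
        for slot in self.slots:  # oldest slot first, so newer messages win below
            self.child.accept(slot.messages)
        self.slots = [Slot()]

    def query(self, key):
        for slot in reversed(self.slots):  # newest slot first
            if key in slot.filter:         # the filter lets us skip whole slots
                for k, v in reversed(slot.messages):
                    if k == key:
                        return v
        return self.child.query(key)


leaf = Leaf()
middle = NonLeaf(capacity=4, child=leaf)
root = NonLeaf(capacity=2, child=middle)
for k, v in [("b", 1), ("a", 2), ("c", 3), ("b", 9)]:
    root.write(k, v)
```

In this sketch only the leaf pays the cost of sorting; the non-leaf buffers stay unsorted, and the per-slot filters are what keep point queries from scanning every slot.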

    RANGE LOOKUP OPERATIONS FOR Bε-TREES USING UPDATE MESSAGES

    Publication No.: US20190294709A1

    Publication Date: 2019-09-26

    Application No.: US15927019

    Filing Date: 2018-03-20

    Applicant: VMware, Inc.

    Abstract: Exemplary methods, apparatuses, and systems include a file system process inserting a first key/value pair and a second key/value pair into a first tree. The second key is a duplicate of the first key and the value of the second key/value pair is an operation changing the value. In response to a request for a range of key/value pairs, the process reads the second key/value pair and inserts it in a second tree. The process reads the first pair and determines, while inserting the first pair in the second tree, that the second key is a duplicate of the first key. The file system process determines an updated value of the first value by applying the operation in the second value to the first value. The file system process updates the second key/value pair in the second tree with the updated value and returns the requested range of key/value pairs.
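
The duplicate-key resolution described above can be sketched as follows. This is an illustration under stated assumptions: the first tree is modeled as a list of pairs already in read order (newest pair for a key first), the second tree is modeled as a dictionary, and an update message is modeled as a Python callable. The names `range_lookup` and `compose` are hypothetical.

```python
def compose(newer, older):
    """Chain two update operations so the older one is applied first."""
    return lambda value: newer(older(value))


def range_lookup(pairs, lo, hi):
    """Resolve update messages while collecting keys in [lo, hi].

    pairs: (key, value) tuples in read order, newest pair for a key first.
    A callable value is an update operation; anything else is a plain value.
    """
    second_tree = {}  # stand-in for the second tree
    for key, value in pairs:
        if not (lo <= key <= hi):
            continue
        if key not in second_tree:
            second_tree[key] = value
            continue
        pending = second_tree[key]  # duplicate key detected during insertion
        if callable(pending):
            if callable(value):
                second_tree[key] = compose(pending, value)
            else:
                second_tree[key] = pending(value)  # apply operation to the older value
        # If pending is already a plain value, older pairs for this key are stale.
    return sorted(second_tree.items())


pairs = [
    ("k2", lambda v: v + 5),  # second pair: an operation changing the value
    ("k1", 10),
    ("k2", 1),                # first pair: the original value for k2
    ("k9", 7),                # outside the requested range
]
result = range_lookup(pairs, "k1", "k5")
print(result)  # prints [('k1', 10), ('k2', 6)]
```

An operation that never meets a plain value inside the range stays unresolved in this sketch; how such a case is handled is not covered by the abstract.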

    BULK-LOAD FOR B-TREES

    Publication No.: US20200293506A1

    Publication Date: 2020-09-17

    Application No.: US16353535

    Filing Date: 2019-03-14

    Applicant: VMware, Inc.

    Abstract: Embodiments described herein are related to bulk loading data into a B-tree. Embodiments include generating a first leaf node of a B-tree by allocating a first page for the first leaf node from a leaf page queue comprising a first plurality of sequential pages; and writing one or more tuples to the first page allocated for the first leaf node. Embodiments further include generating a parent node for the first leaf node and a second leaf node of the B-tree by allocating a third page for the parent node from a parent page queue comprising a second plurality of sequential pages, the parent node comprising a first indication of the first leaf node and a second indication of the second leaf node, the first indication and the second indication stored in the third page allocated for the parent.
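
A minimal sketch of bulk loading from two page queues is shown below. It assumes the input tuples are already sorted by key, models pages as dictionary entries keyed by page number, and uses the same fanout for leaves and parents. The function name `bulk_load` and the page layout are hypothetical.

```python
from collections import deque


def bulk_load(sorted_tuples, fanout, leaf_pages, parent_pages):
    """Build a B-tree bottom-up, drawing leaves and parents from separate page queues.

    sorted_tuples: non-empty list of (key, value) tuples sorted by key.
    leaf_pages, parent_pages: deques of page numbers, each a sequential run.
    Returns (root_page, pages), where pages maps page number -> node contents.
    """
    if not sorted_tuples:
        raise ValueError("nothing to load")
    if fanout < 2:
        raise ValueError("fanout must be at least 2")

    pages = {}
    level = []  # (smallest key, page number) for each node on the current level

    # Leaves: each one takes the next sequential page from the leaf queue.
    for i in range(0, len(sorted_tuples), fanout):
        chunk = sorted_tuples[i:i + fanout]
        page = leaf_pages.popleft()
        pages[page] = {"type": "leaf", "tuples": chunk}
        level.append((chunk[0][0], page))

    # Parents: each one takes the next sequential page from the parent queue
    # and stores an indication (key, page number) of every child.
    while len(level) > 1:
        next_level = []
        for i in range(0, len(level), fanout):
            children = level[i:i + fanout]
            page = parent_pages.popleft()
            pages[page] = {"type": "parent", "children": children}
            next_level.append((children[0][0], page))
        level = next_level

    return level[0][1], pages


tuples = [(k, str(k)) for k in range(5)]
root, pages = bulk_load(tuples, fanout=2,
                        leaf_pages=deque(range(100, 110)),
                        parent_pages=deque(range(200, 210)))
print(root)  # prints 202
```

Keeping the two queues separate means leaf pages end up contiguous with each other on storage, which is one plausible reason for the split; the abstract itself does not state the motivation.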

    OPTIMAL SNAPSHOT DELETION

    Publication No.: US20190311047A1

    Publication Date: 2019-10-10

    Application No.: US15947072

    Filing Date: 2018-04-06

    Applicant: VMware, Inc.

    Abstract: Embodiments described herein involve improved management of snapshots of a file system. Embodiments include copying a first root node of a first snapshot to a second snapshot, the second snapshot referencing other nodes of the first snapshot. Embodiments further include incrementing reference counts of the other nodes of the first snapshot. Embodiments further include adding a storage address of the first root node to a list. Embodiments further include, each time that a copy on write operation is performed for a node of the other nodes, adding a storage address of the node to the list and decrementing the reference count of the node. Embodiments further include iterating through the list and, for each storage address in the list, decrementing the reference count of the node corresponding to the storage address and, if the reference count of the node reaches zero, freeing storage space at the storage address.
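
The bookkeeping steps can be traced with a small sketch that follows the abstract literally, step by step. It is only an illustration: storage addresses are plain integers, nodes carry no contents, and the assumption that the list is walked when the first snapshot is deleted is ours, since the abstract does not say what triggers the iteration. The class `SnapshotStore` and its method names are hypothetical.

```python
class SnapshotStore:
    """Hypothetical reference-count bookkeeping for one snapshot and its successor."""

    def __init__(self, root_addr, other_addrs):
        self.root_addr = root_addr
        self.other_addrs = list(other_addrs)
        self.refcount = {addr: 1 for addr in [root_addr, *other_addrs]}
        self.pending = []  # the list of storage addresses from the abstract
        self.freed = []    # addresses whose storage space has been freed

    def take_snapshot(self, new_root_addr):
        """Copy the root to a second snapshot that shares all other nodes."""
        self.refcount[new_root_addr] = 1
        for addr in self.other_addrs:
            self.refcount[addr] += 1
        self.pending.append(self.root_addr)

    def copy_on_write(self, addr, new_addr):
        """A shared node is copied: record its address and drop one reference."""
        self.refcount[new_addr] = 1
        self.pending.append(addr)
        self.refcount[addr] -= 1

    def delete_first_snapshot(self):
        """Walk the list (assumed trigger), decrementing and freeing at zero."""
        for addr in self.pending:
            self.refcount[addr] -= 1
            if self.refcount[addr] == 0:
                del self.refcount[addr]
                self.freed.append(addr)
        self.pending.clear()


store = SnapshotStore(root_addr=100, other_addrs=[101, 102])
store.take_snapshot(new_root_addr=200)
store.copy_on_write(addr=101, new_addr=201)
store.delete_first_snapshot()
print(store.freed)  # prints [100, 101]
```

Only addresses on the list are visited at deletion time, so the cost of deleting a snapshot in this sketch scales with the number of nodes that were copied rather than with the size of the whole tree.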
