EFFICIENT WRITE-BACK FOR JOURNAL TRUNCATION
    1.
    发明公开

    公开(公告)号:US20240078179A1

    公开(公告)日:2024-03-07

    申请号:US17929197

    申请日:2022-09-01

    Applicant: VMware, Inc.

    CPC classification number: G06F12/0804 G06F12/0882

    Abstract: A method for efficient write-back for journal truncation is provided. A method includes maintaining a journal in a memory of a computing system including a plurality of records. Each record indicates a transaction associated with one or more pages in an ordered data structure and maintaining a dirty list including an entry for each page indicated by a record in the journal. Each entry in the dirty list includes a respective first log sequence number (LSN) associated with a least recent record of the plurality of records that indicates the page and a respective second LSN associated with a most recent record of the plurality of records that indicates the page. The method includes determining to truncate the journal. The method includes identifying one or more records, of the plurality of records, from the journal to write back to a disk, where the identifying is based on the dirty list.

    Fast algorithm to find file system difference for deduplication

    公开(公告)号:US11775484B2

    公开(公告)日:2023-10-03

    申请号:US16552965

    申请日:2019-08-27

    Applicant: VMware, Inc.

    CPC classification number: G06F16/1752 G06F16/152 G06F16/9027

    Abstract: The disclosure provides techniques for deduplicating files. The techniques include, upon creating or modifying a file, placing a logical timestamp of the current logical time, within a queue associated with the directory of the file. The techniques further include placing the logical timestamp within a queue of each parent directory of the directory of the file. To determine a set of files for deduplication, the techniques disclosed herein identify files that have been modified within a logical time range. The set of files modified within a logical time is identified by traversing directories of a storage system, the directories being organized within a tree structure. If a directory's queue does not contain a timestamp that is within the logical time range, then all child directories can be skipped over for further processing, such that no files within the child directories end up being within the set of files for deduplication.

    Optimizing file access statistics collection

    公开(公告)号:US11755537B2

    公开(公告)日:2023-09-12

    申请号:US17202342

    申请日:2021-03-15

    Applicant: VMware, Inc.

    CPC classification number: G06F16/1734 G06F21/6218 G06N5/04 G06N20/00

    Abstract: Optimizing file access includes a process for identifying a file access event for a first accessed file, and incrementing a first access counter in an access list in a memory, which also includes access counters for other accessed files. The process further includes exporting the first access counter to a performance monitoring dashboard, or exporting to a storage allocator and, based on the value, moving the first accessed file between a first storage and a second storage. The process also includes determining whether the value of the first access counter meets a first threshold, or a sum of values of the access counters for the other accessed files meets a second threshold. Based on meeting the first threshold or meeting the second threshold, the process includes persisting the access counters on a storage media. The access counters also provide security monitoring (e.g., identifying excessive file access).

    SYSTEM AND METHOD FOR DELETING PARENT SNAPSHOTS OF RUNNING POINTS OF STORAGE OBJECTS USING EXCLUSIVE NODE LISTS OF THE PARENT SNAPSHOTS

    公开(公告)号:US20230281084A1

    公开(公告)日:2023-09-07

    申请号:US17684177

    申请日:2022-03-01

    Applicant: VMware, Inc.

    CPC classification number: G06F11/1453 G06F2201/84

    Abstract: System and method for deleting parent snapshots of running points of storage objects stored in a storage system, in response to a request to delete a parent snapshot of a running point of a storage object stored in the storage system, traverses a subtree of a B tree that corresponds to a logical map of the parent snapshot to find nodes of the subtree that are exclusively owned by the parent snapshot, which are added to an exclusive node list of the parent snapshot. The minimum node ownership value of the running point is then changed to the minimum node ownership value of the parent snapshot so that any node of the subtree of the B tree with a node ownership value equal to or greater than the changed minimum node ownership value is deemed to be owned by the running point. The nodes of the subtree of the B tree that are found in the exclusive node list of the parent snapshot are then deleted.

    Data compression in a multi-layer distributed datastore

    公开(公告)号:US11687236B2

    公开(公告)日:2023-06-27

    申请号:US17181476

    申请日:2021-02-22

    Applicant: VMware, Inc.

    Abstract: A method for compressing is provided. The method including receiving a block of data to store on at least one physical disk; determining whether to store the data in a data log as uncompressed or compressed data based on a determined size of resulting compressed data. When the method determines to store the data as compressed, compressing the data and storing the compressed data in at least one sector in the data log. Otherwise, the method stores the data, uncompressed, in a plurality of sectors in the data log. The method generates a one or more state bits indicating (i) whether the data is stored as uncompressed or compressed, and (ii) if the data is stored as compressed, a size of the compressed data. The method then stores the one or more state bits in an entry of a logical map table associated with an LBA that corresponds to the data block.

    EFFICIENT JOURNAL LOG RECORD FOR COPY-ON-WRITE B+ TREE OPERATION

    公开(公告)号:US20230177069A1

    公开(公告)日:2023-06-08

    申请号:US17643268

    申请日:2021-12-08

    Applicant: VMware, Inc.

    CPC classification number: G06F16/27 G06F16/2246

    Abstract: A method for copy on write (COW) operations generally includes receiving a write request to a first node in an ordered data structure and updating a write ahead log record associated with COW operation with, instead of the content of the first node, a physical disk address of a second node owned by the run point in the ordered data structure that is a parent node of the first node, a pointer to the first node in the second node, a physical disk address of the first node, and a physical disk address of the third node. A metadata table record for a snapshot that owns the first node may be updated with a log sequence number (LSN) of the COW operation. A method for deleting a snapshot includes determining whether the COW operation recorded in the WAL record for the LSN is completed before deleting the snapshot.

    MANAGING RESOURCE DISTRIBUTION IN GLOBAL AND LOCAL POOLS BASED ON A FLUSH THRESHOLD

    公开(公告)号:US20220382591A1

    公开(公告)日:2022-12-01

    申请号:US17332133

    申请日:2021-05-27

    Applicant: VMware, Inc.

    Abstract: The disclosure herein describes management of distribution of resources between a global pool and an associated plurality of local pools using a flush threshold. A request for resources is received at the global pool from a local pool, the request indicating a requested quantity of resources. Based on the received request, it is determined that available resources in the global pool are below a flush threshold of the global pool. Based on this determination, flush instructions are sent to the local pools, wherein the flush instructions instruct each local pool to release unused resources (e.g., available to be released) to the global pool. Based on the available resources of the global pool then exceeding the requested quantity of resources and/or the flush threshold, resources of the global pool are allocated to the requesting local pool, whereby the local pool is enabled to use the allocated resources.

Patent Agency Ranking