Shrinking segment cleaning algorithm in an object storage

    公开(公告)号:US11435935B2

    公开(公告)日:2022-09-06

    申请号:US17100663

    申请日:2020-11-20

    Applicant: VMware, Inc.

    Abstract: A method for cleaning an object storage having a plurality of segments is provided. Each segment includes an identifier through which the segment is accessed. The method identifies a first segment in the plurality of segments. The first segment includes a first identifier and a first size. The method determines that a utilization ratio for the first segment is below a threshold. As a result, the method generates a second segment from the first segment, such that the second segment includes a second identifier that is the same as the first identifier and a second size that is smaller than the first size. The method then writes the second segment to the object storage.

    PROVIDING BLOCK SIZE COMPATIBILITY WITH A STORAGE FILTER
    3.
    发明申请
    PROVIDING BLOCK SIZE COMPATIBILITY WITH A STORAGE FILTER 审中-公开
    使用存储过滤器提供块大小兼容性

    公开(公告)号:US20160350010A1

    公开(公告)日:2016-12-01

    申请号:US14726598

    申请日:2015-05-31

    Applicant: VMware, Inc.

    Abstract: Examples provide input and output request block size compatibility. A storage filter converts input and output (IO) requests associated with a first data block size into modified IO requests compatible with a data storage organized in a second data block size where the first data block size is different than the first data block size. The storage filter translates read IO requests for a smaller block size into modified read requests for a data storage organized with a larger data block size. Write IO requests for smaller block size are converted into modified write IO requests for larger data block size data storage. The storage filter also converts read IO requests generated for larger block size into smaller block size read IO requests. Likewise, the storage filter also translates write IO requests corresponding to larger data block size into modified write IO requests of smaller block size.

    Abstract translation: 示例提供输入和输出请求块大小兼容性。 存储过滤器将与第一数据块大小相关联的输入和输出(IO)请求转换为与第一数据块大小不同于第一数据块大小的第二数据块大小组织的数据存储兼容的修改的IO请求。 存储过滤器将用于较小块大小的读IO请求转换为以较大数据块大小组织的数据存储的修改读请求。 将较小块大小的IO请求转换为更大的数据块大小数据存储的写入IO请求。 存储过滤器还将为较大块大小生成的读IO请求转换为较小块大小的读IO请求。 同样,存储过滤器也将对应于较大数据块大小的写入IO请求转换为更小块大小的修改写入IO请求。

    Log-structured formats for managing archived storage of objects

    公开(公告)号:US11436102B2

    公开(公告)日:2022-09-06

    申请号:US16998060

    申请日:2020-08-20

    Applicant: VMware, Inc.

    Abstract: Solutions for managing archived storage include receiving, at a first node, a snapshot comprising object data (e.g., a virtual machine disk snapshot) from a second node (e.g., a software defined data center), and storing the snapshot in a tiered structure that includes a data tier and a metadata tier. Snapshots may be used for fail-over operations and/or backups, to support disaster recovery. The data tier comprises a log-structured file system (LFS), and the metadata tier comprises a content addressable storage (CAS) identifying addresses within the LFS. The metadata tier also comprises a logical layer indicating content in the CAS. Segment cleaning of the data tier is performed using a segment usage table (SUT). Some examples include performing a fail-over operation from the second node to a third node using at least the stored snapshot for workload recovery. In some examples, the CAS comprises a log-structured merge-tree (LSM-tree).

    Tiering data to a cold storage tier of cloud object storage

    公开(公告)号:US11288129B2

    公开(公告)日:2022-03-29

    申请号:US17002577

    申请日:2020-08-25

    Applicant: VMware, Inc.

    Abstract: Techniques for tiering data to a cold storage tier of a cloud object storage platform are provided. In one set of embodiments, a computer system can identify one or more old snapshots of a data set that reside in a first storage tier of the cloud object storage platform, where the one or more old snapshots are snapshots that are unlikely to be deleted from the cloud object storage platform within a period of N days. The computer system can further, for each snapshot in the one or more old snapshots: identify one or more data blocks in the snapshot that are superseded by a more recent snapshot in the one or more old snapshots; write the one or more data blocks to a second (i.e., cold) storage tier of the cloud object storage platform that has a lower storage cost than the first storage tier; and cause the one or more data blocks to be deleted from the first storage tier.

    SUPPORTING DEDUPLICATION IN FILE STORAGE USING FILE CHUNK HASHES

    公开(公告)号:US20220092033A1

    公开(公告)日:2022-03-24

    申请号:US17028405

    申请日:2020-09-22

    Applicant: VMware, Inc.

    Abstract: The present disclosure is related to methods, systems, and machine-readable media for supporting deduplication in file storage using file chunk hashes. A hash of a chunk of a log segment can be received from a software defined data center. A chunk identifier can be associated with the hash in a hash map that stores associations between sequentially-allocated chunk identifiers and hashes. The chunk identifier can be associated with a logical address corresponding to the chunk of the log segment in a logical map that stores associations between the sequentially-allocated chunk identifiers and logical addresses. A search of the hash map can be performed to determine if the chunk is a duplicate, and the chunk can be deduplicated responsive to a determination that the chunk is a duplicate.

    OPTIMIZED NETWORKING THREAD ASSIGNMENT

    公开(公告)号:US20220350647A1

    公开(公告)日:2022-11-03

    申请号:US17244242

    申请日:2021-04-29

    Applicant: VMware, Inc.

    Abstract: Some embodiments provide a method for scheduling networking threads associated with a data compute node (DCN) executing at a host computer. When a virtual networking device is instantiated for the DCN, the method assigns the virtual networking device to a particular non-uniform memory access (NUMA) node of multiple NUMA nodes associated with the DCN. Based on the assignment of the virtual networking device to the particular NUMA node, the method assigns networking threads associated with the DCN to the same particular NUMA node and provides information to the DCN regarding the particular NUMA node in order for the DCN to assign a thread associated with an application executing on the DCN to the same particular NUMA node.

    Supporting deduplication in object storage using subset hashes

    公开(公告)号:US11385817B2

    公开(公告)日:2022-07-12

    申请号:US17028312

    申请日:2020-09-22

    Applicant: VMware, Inc.

    Abstract: The present disclosure is related to methods, systems, and machine-readable media for supporting deduplication in object storage using subset hashes. A plurality of hashes of a plurality of blocks of a plurality of log segments can be received from a software defined data center, wherein each block corresponds to a respective logical address. Each of the plurality of logical addresses can be associated with a respective sequentially-allocated chunk identifier in a logical map. A subset hash comprising a hash of a subset of the plurality of blocks can be determined that corresponds to a contiguous range of the plurality of logical addresses. A search of a hash map for the subset hash can be performed to determine if the subset hash is a duplicate. The subset of the plurality of blocks can be deduplicated responsive to a determination that the subset hash is a duplicate.

Patent Agency Ranking