INTELLIGENT GARBAGE COLLECTION BASED ON CONTENT SIMILARITY

    Publication number: US20240402934A1

    Publication date: 2024-12-05

    Application number: US18799925

    Application date: 2024-08-09

    Abstract: A storage system performs garbage collection, with data compression, in storage memory. The system obtains hash results from data segments. The system determines similarity of content of data segments, based on the hash results. The system performs data compression of live data of two or more data segments that have similarity of content meeting a similarity threshold. The system writes the compressed live data of the two or more data segments into the storage memory.
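The flow in this abstract can be illustrated with a minimal sketch. The abstract does not say how the hash results are computed or compared, so the fixed-size chunk hashing, the Jaccard similarity metric, the greedy grouping, the `zlib` compressor, and all function names below are assumptions made for illustration only. Each input is taken to be the live data already extracted from one segment.

```python
import hashlib
import zlib


def chunk_hashes(data: bytes, chunk_size: int = 64) -> set[bytes]:
    """Hash fixed-size chunks of a segment (assumed form of 'hash results')."""
    return {
        hashlib.sha256(data[i:i + chunk_size]).digest()
        for i in range(0, len(data), chunk_size)
    }


def similarity(a: set[bytes], b: set[bytes]) -> float:
    """Jaccard similarity of two hash sets (an assumed similarity metric)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def collect(segments: list[bytes], threshold: float = 0.5) -> list[bytes]:
    """Group segments whose similarity to a group's first member meets the
    threshold, then compress the live data of each group together."""
    hashes = [chunk_hashes(s) for s in segments]
    groups: list[list[int]] = []
    for i in range(len(segments)):
        for group in groups:
            if similarity(hashes[group[0]], hashes[i]) >= threshold:
                group.append(i)
                break
        else:
            # No sufficiently similar group exists, so start a new one.
            groups.append([i])
    return [zlib.compress(b"".join(segments[i] for i in g)) for g in groups]


# Usage: two near-identical segments are compressed together, a third is not.
seg_a = bytes(range(256)) * 4
seg_b = seg_a[:960] + b"\xff" * 64
seg_c = b"\x00" * 1024
compressed = collect([seg_a, seg_b, seg_c])
```

Compressing similar segments in one stream lets the compressor reuse matches across segments, which is the presumed benefit of grouping by similarity before writing the data back.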

    Prioritizing I/O Operations Directed To Replicating Datasets

    Publication number: US20240378216A1

    Publication date: 2024-11-14

    Application number: US18779292

    Application date: 2024-07-22

    Abstract: Providing Quality of Service (QOS) for replicating datasets including: receiving, by a target data repository from a source data repository, a checkpoint describing one or more updates to one or more datasets stored in the source data repository and the target data repository; adding, by the target data repository, the checkpoint to a first queue for checkpoints directed to one or more volumes in the target data repository, wherein the first queue is included in a plurality of queues for the target data repository; selecting, by the target data repository, one or more queues from the plurality of queues; and servicing an operation from each of the selected one or more queues.
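The queueing steps in this abstract can be sketched as follows. The abstract does not specify how queues are selected, so the policy here (the first non-empty queues in declaration order, up to a per-round budget), the queue names, and the `TargetRepository` class are all hypothetical.

```python
from collections import deque


class TargetRepository:
    """Toy target data repository holding a plurality of operation queues."""

    def __init__(self, queue_names):
        self.queues = {name: deque() for name in queue_names}
        # The first queue in the abstract: checkpoints received from the source.
        self.queues.setdefault("checkpoints", deque())

    def receive_checkpoint(self, checkpoint):
        """Add a checkpoint received from the source repository to its queue."""
        self.queues["checkpoints"].append(checkpoint)

    def submit(self, queue_name, operation):
        """Add any other operation to the named queue."""
        self.queues[queue_name].append(operation)

    def select_queues(self, budget):
        """Assumed policy: take the first `budget` non-empty queues in order."""
        non_empty = [name for name, q in self.queues.items() if q]
        return non_empty[:budget]

    def service_round(self, budget=2):
        """Service one operation from each selected queue."""
        return [
            (name, self.queues[name].popleft())
            for name in self.select_queues(budget)
        ]


# Usage: host I/O and replication checkpoints share each servicing round.
repo = TargetRepository(["host_io", "checkpoints", "background"])
repo.submit("host_io", "read-1")
repo.receive_checkpoint("cp-1")
repo.submit("background", "scrub-1")
first_round = repo.service_round()
second_round = repo.service_round()
```

Servicing one operation per selected queue keeps any single class of work, such as a burst of checkpoints, from starving the others. A real selection policy would likely weigh priorities or credits rather than declaration order.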

    Optimizing Data Reduction Operations
    Document type: Invention publication

    Publication number: US20240319901A1

    Publication date: 2024-09-26

    Application number: US18732515

    Application date: 2024-06-03

    Abstract: Preparing data for deduplication including: generating, by a storage system for a compressed data block, a padded compressed data block by padding the compressed data block to conform to a fixed block size, wherein the fixed block size is greater than a size of the compressed data block; storing, in the storage system, the padded compressed data block beginning at a block boundary of a storage device in the storage system; and performing block-based deduplication on the storage system, wherein the block-based deduplication determines whether the padded compressed data block matches one or more other padded compressed data blocks stored in the storage system.
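A minimal sketch of the three steps in this abstract (pad, store at a block boundary, deduplicate by block) might look like the following. The 4096-byte block size, zero-byte padding, `zlib` compression, SHA-256 fingerprints, and the `BlockStore` class are assumptions; the abstract names none of them.

```python
import hashlib
import zlib

BLOCK_SIZE = 4096  # assumed fixed block size


def pad_compressed(data: bytes, block_size: int = BLOCK_SIZE) -> bytes:
    """Compress data and pad the result with zero bytes to the fixed block size."""
    compressed = zlib.compress(data)
    if len(compressed) >= block_size:
        # The abstract requires the fixed size to exceed the compressed size.
        raise ValueError("compressed block does not fit in one fixed-size block")
    return compressed + b"\x00" * (block_size - len(compressed))


class BlockStore:
    """Toy storage device that only ever writes whole, aligned blocks."""

    def __init__(self, block_size: int = BLOCK_SIZE):
        self.block_size = block_size
        self.device = bytearray()
        self.index: dict[bytes, int] = {}  # block fingerprint -> device offset

    def write(self, data: bytes) -> int:
        """Store data as a padded compressed block and return its offset.
        An identical padded block already on the device is reused."""
        block = pad_compressed(data, self.block_size)
        digest = hashlib.sha256(block).digest()
        if digest in self.index:
            return self.index[digest]
        offset = len(self.device)  # always a multiple of block_size
        self.device += block
        self.index[digest] = offset
        return offset

    def read(self, offset: int) -> bytes:
        """Decompress the block at offset; the padding is left in unused_data."""
        block = bytes(self.device[offset:offset + self.block_size])
        return zlib.decompressobj().decompress(block)


# Usage: the third write duplicates the first and consumes no new space.
store = BlockStore()
off_1 = store.write(b"hello" * 100)
off_2 = store.write(b"world" * 100)
off_3 = store.write(b"hello" * 100)
```

Because every padded block has the same size and starts on a block boundary, a block-based deduplication pass can compare whole blocks directly instead of searching for variable-length compressed data at arbitrary offsets.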

    DATA REBUILD INDEPENDENT OF ERROR DETECTION
    Document type: Invention publication

    Publication number: US20240160540A1

    Publication date: 2024-05-16

    Application number: US18514317

    Application date: 2023-11-20

    CPC classification number: G06F11/2056 G06F11/1076 G06F11/1092 G06F11/1096

    Abstract: A method for proactively rebuilding user data in a plurality of storage nodes of a storage cluster in a single chassis is provided. The method includes distributing user data and metadata throughout the plurality of storage nodes such that the plurality of storage nodes can read the user data, using erasure coding, despite loss of two of the plurality of storage nodes. The method includes determining to rebuild the user data for one of the plurality of storage nodes in the absence of an error condition. The method includes rebuilding the user data for the one of the plurality of storage nodes. A plurality of storage nodes within a single chassis that can proactively rebuild the user data stored within the storage nodes is also provided.

    Replication Utilizing Cloud-Based Storage Systems

    Publication number: US20230353635A1

    Publication date: 2023-11-02

    Application number: US18349293

    Application date: 2023-07-10

    CPC classification number: H04L67/1097

    Abstract: Synchronously replicating a dataset across cloud-based storage systems, including adding a cloud-based storage system to a set of storage systems that the dataset is synchronously replicated across, where access operations are applied to the dataset equivalently through all storage systems in the set, all storage systems in the set store a separate copy of the dataset, and operations to modify the dataset performed and completed through any of the storage systems in the set are reflected in access operations to read the dataset, the cloud-based storage system including one or more cloud computing instances executing a storage controller application, a virtual drive layer that includes one or more cloud computing instances with local storage for storing at least a portion of the dataset as block data, and an object storage layer for storing at least a portion of the dataset as object data.
