SELECTIVE UTILIZATION OF PROCESSOR CORES WHILE REBUILDING DATA PREVIOUSLY STORED ON A FAILED DATA STORAGE DRIVE

    公开(公告)号:US20220100621A1

    公开(公告)日:2022-03-31

    申请号:US17038534

    申请日:2020-09-30

    IPC分类号: G06F11/20 G06F13/16

    摘要: In response to failure of a data storage drive, data previously stored on the failed drive is rebuilt on a replacement data storage drive by executing drive rebuild jobs on a first set of processor cores. While the data previously stored on the failed data storage drive is being rebuilt, a second set of processor cores is used to process host I/O (Input/Output) requests. The host I/O requests processed by the second set of processor cores may be a first set of host I/O requests processed during the data rebuild. When a current total number of outstanding host I/O requests exceeds a threshold maximum number of outstanding host I/O requests, at least one processor core in the first set of processor cores may be used to process a second set of host I/O requests while the data previously stored on the failed data storage drive is being rebuilt.

    Techniques for selectively activating and deactivating entropy computation

    公开(公告)号:US11221991B2

    公开(公告)日:2022-01-11

    申请号:US16174791

    申请日:2018-10-30

    摘要: Techniques for data processing may include: receiving a data chunk of the data set; determining, in accordance with criteria including a compressibility ratio for the data set and a cost ratio of compression computation cost and entropy computation cost, whether to activate or deactivate entropy computation for the data set, wherein the compressibility ratio is ratio of a number of compressible data chunks of the data set and a number of uncompressible data chunks of the data set; and responsive to determining to activate entropy computation for the data set, performing first processing comprising: determining an entropy value for the data chunk; and determining, in accordance with the entropy value for the data chunk, whether to compress the data chunk.

    Coordinated reclaiming of data storage space

    公开(公告)号:US11221948B2

    公开(公告)日:2022-01-11

    申请号:US16663507

    申请日:2019-10-25

    IPC分类号: G06F12/02

    摘要: Coordinating a reclaiming of data storage space among processing nodes of a data storage system includes, by a first node in response to an event, performing local cleanup of first-node mapping data and issuing a request message to a second node, the request message identifying the data storage space to be reclaimed. The event may be a notification from a RAID component rebuilding a RAID volume. In response to the request message, the second node performs local cleanup of second-node mapping data and issues a completion message to the first node indicating that the second node has completed its part of the reclaiming of the data storage space. The first node responds to the completion message by marking the data storage space as being reclaimed and signaling to a source of the event that the data storage space is available for new use.

    Sharing blocks of non-volatile data storage to support cache flushes in a multi-node data storage system

    公开(公告)号:US11216195B1

    公开(公告)日:2022-01-04

    申请号:US16945383

    申请日:2020-07-31

    摘要: An allocation history is maintained for each node in a multi-node data storage system. The allocation history for each node indicates sets of physical blocks previously allocated to the node to store dirty pages of user data flushed from a cache of the node. The allocation history indicates non-quarantined sets of physical blocks that are currently being used to provide non-volatile data storage to cache flush operations, and quarantined sets of physical blocks not currently being used to provide non-volatile data storage to cache flush operations. Allocation is prevented to any node of any set of physical blocks that is indicated as non-quarantined by any of the allocation histories, and of any set of physical blocks that is indicated as quarantined by any of the allocation histories.

    Cache retention for inline deduplication based on number of physical blocks with common fingerprints among multiple cache entries

    公开(公告)号:US11210230B2

    公开(公告)日:2021-12-28

    申请号:US16863433

    申请日:2020-04-30

    摘要: Techniques are provided for inline deduplication based on a number of physical blocks having common fingerprints among multiple entries of a buffer cache. One method comprises storing input/output operations in a first cache comprising a plurality of entries each corresponding to a physical storage entity comprising a plurality of physical blocks. A given entry is maintained in the first cache based on a first number of physical blocks of the given entry having a duplicate fingerprint with at least one physical block of another entry in the first cache. A second number can be determined of the physical blocks of each entry having a fingerprint in a second cache, and a first ratio is determined for two entries in the first cache using the second number and the first number. A comparison of the first ratios can be performed to sort and possibly evict entries in the first cache based on the comparison.

    MANAGING DATAPATH VALIDATION ON PER-TRANSACTION BASIS

    公开(公告)号:US20210374126A1

    公开(公告)日:2021-12-02

    申请号:US16887697

    申请日:2020-05-29

    IPC分类号: G06F16/23 G06F12/121

    摘要: A technique for managing a datapath of a data storage system includes receiving a request to access target data and creating a transaction that includes multiple datapath elements in a cache, where the datapath elements are used for accessing the target data. In response to detecting that one of the datapath elements is invalid, the technique further includes processing the transaction in a rescue mode. The rescue mode attempts to replace each invalid datapath element of the transaction with a valid version thereof obtained from elsewhere in the data storage system. The technique further includes committing the transaction as processed in the rescue mode.

    System and method for inline tiering of write data

    公开(公告)号:US11176034B2

    公开(公告)日:2021-11-16

    申请号:US16592588

    申请日:2019-10-03

    IPC分类号: G06F12/02 G06F12/123

    摘要: A method, computer program product, and computer system for receiving, by a computing device, new data to write to a leaf. At least two timestamps of the leaf may be examined. It may be determined whether a time interval between the at least two timestamps of the leaf is greater than an age threshold. The new data may be written to a first tier storage device when the time interval between the at least two timestamps of the leaf is less than the age threshold; The new data may be written to a second tier storage device when the time interval between the at least two timestamps of the leaf is greater than the age threshold.