SYSTEM AND METHOD FOR INLINE TIERING OF WRITE DATA

    公开(公告)号:US20210103520A1

    公开(公告)日:2021-04-08

    申请号:US16592588

    申请日:2019-10-03

    IPC分类号: G06F12/02 G06F12/123

    摘要: A method, computer program product, and computer system for receiving, by a computing device, new data to write to a leaf. At least two timestamps of the leaf may be examined. It may be determined whether a time interval between the at least two timestamps of the leaf is greater than an age threshold. The new data may be written to a first tier storage device when the time interval between the at least two timestamps of the leaf is less than the age threshold; The new data may be written to a second tier storage device when the time interval between the at least two timestamps of the leaf is greater than the age threshold.

    Optimizing space utilization by retaining metadata in cache

    公开(公告)号:US10970221B2

    公开(公告)日:2021-04-06

    申请号:US16398670

    申请日:2019-04-30

    摘要: A technique for managing data received into a cache operates in cycles. To process a current batch of compressed blocks during a current cache processing cycle, a storage system obtains a new mapping structure and a new segment of contiguous storage space. If the system can place some of the current batch of compressed blocks into previously-allocated space, the system does so and partially populates the new mapping structure with entries for mapping the other compressed blocks that were not placed. The system then asserts a hold on the new mapping structure, so that the mapping structure is retained in cache at the end of the current cache processing cycle, and more completely populates the new mapping structure with entries for other compressed blocks during a later cache processing cycle before releasing the hold.

    MULTI-GENERATIONAL VIRTUAL BLOCK COMPACTION

    公开(公告)号:US20210064236A1

    公开(公告)日:2021-03-04

    申请号:US16458496

    申请日:2019-07-01

    IPC分类号: G06F3/06

    摘要: A method, system, and computer program product for multi-generational virtual block compaction comprising identifying a first virtual block, the first virtual block being associated with a first generation number, determining a second virtual block as an appropriate target for live information in the first virtual block, creating an association between the second virtual block and the first virtual block, updating the live information in the first virtual block to be associated with the second virtual block, updating a generation information mapping associated with the first virtual block; and associating a second generation number with the first virtual block.

    Providing data deduplication in a data storage system with parallelized computation of crypto-digests for blocks of host I/O data

    公开(公告)号:US10936228B2

    公开(公告)日:2021-03-02

    申请号:US16450390

    申请日:2019-06-24

    摘要: In response to a cache flush event indicating that host data accumulated in a cache of a storage processor of a data storage system is to be flushed to a lower deck file system, an aggregation set of blocks is formed within the cache, and a digest calculation group is selected from within the aggregation set. Hardware vector processing logic is caused to simultaneously calculate crypto-digests from the blocks in the digest calculation group. If one of the resulting crypto-digests matches a previously generated crypto-digest, deduplication is performed that i) causes the lower deck file system to indicate the block of data from which the previously generated crypto-digest was generated and ii) discards the block that corresponds to the matching crypto-digest. Objects required by a digest generation component may be allocated in a just in time manner to avoid having to manage a pool of pre-allocated objects.

    SYSTEM AND METHOD FOR PRIORITY ORCHESTRATION AND SCHEDULING OF DIFFERENT BACKGROUND OPERATIONS

    公开(公告)号:US20210034409A1

    公开(公告)日:2021-02-04

    申请号:US16530839

    申请日:2019-08-02

    IPC分类号: G06F9/48 G06F3/06

    摘要: A method, computer program product, and computer system for receiving an indication about an amount of background IOs a background scheduler is capable of performing on a plurality of applications, wherein the indication may be based upon, at least in part, one of host IO latency and a rate of the host IO latency being at one of a plurality of levels. One or more applications of the plurality of applications on which to perform the background IOs may be determined. The background IO applications to be performed on the one or more applications of the plurality of applications may be scheduled based upon, at least in part, one of the host IO latency and the rate of the host IO latency being at one of the plurality of levels.

    Method, apparatus and computer program product for managing data storage

    公开(公告)号:US10896152B2

    公开(公告)日:2021-01-19

    申请号:US16054216

    申请日:2018-08-03

    IPC分类号: G06F16/174

    摘要: There is disclosed techniques for managing data storage. In one embodiment, the techniques comprise determining whether a part of a file system block is associated with deduplication. The techniques also comprise updating a deduplication record of the file system block, based on the said determination, to indicate a deduplication status in connection with the part such that the updated deduplication record describes for multiple occurrences of one or more parts of the file system block whether the respective one or more parts are associated with deduplication.

    DATA REDUCTION IMPROVEMENT USING AGGREGATED MACHINE LEARNING

    公开(公告)号:US20200379659A1

    公开(公告)日:2020-12-03

    申请号:US16426152

    申请日:2019-05-30

    IPC分类号: G06F3/06 G06N3/04

    摘要: A method system, and computer program product for improving data reduction using aggregate machine learning systems comprising receiving, by an aggregating machine learning system from one or more machine learning systems associated with a set of one or more storage arrays, a first set of output parameters indicative of performance metrics for the set of the one or more storage arrays, aggregating, by the aggregating machine learning system, the first set of output parameters, resulting in a second set of output parameters, and sending, from the aggregating machine learning system, at least one member of the second set of output parameters as an input to at least one of the one or more machine learning systems associated with the set of the one or more storage arrays.

    USING MACHINE LEARNING TO DETECT SYSTEM CHANGES

    公开(公告)号:US20200349047A1

    公开(公告)日:2020-11-05

    申请号:US16398477

    申请日:2019-04-30

    IPC分类号: G06F11/32 G06N3/04 G06N3/08

    摘要: Techniques for detecting state changes in a system may include receiving a first neural network that is trained to detect when the system transitions into a first resulting state, wherein the system transitions into at least a first intermediate state prior to transitioning into the final resulting state; training the first neural network using a first plurality of inputs denoting the system in the first intermediate state; obtaining a plurality of sets of internal state information of the first neural network, each set of the plurality of sets denoting an internal state of the first neural network at a different point in time after the first neural network has processed at least a portion of the first plurality of inputs; and training a second neural network, using the plurality of sets of internal state information, to detect the first intermediate state.