Techniques for scavenging of free provisioned blocks

    公开(公告)号:US11204703B2

    公开(公告)日:2021-12-21

    申请号:US16708515

    申请日:2019-12-10

    IPC分类号: G06F12/00 G06F3/06 G06F16/13

    摘要: Techniques for scavenging blocks may include: determining, in accordance with a selected option, a set of candidate upper deck file systems, wherein at least a first of the candidate upper deck file systems has storage allocated from at least one block of a lower deck file system; and performing, in accordance with the selected option, scavenging of the set of candidate upper deck file systems to attempt to free blocks of the lower deck file system. Scavenging may include issuing a request to perform hole punching of a backed free block of the first candidate upper deck file system, wherein the backed free block has first provisioned storage that is associated with a block of the lower deck file system. The selected option may be one of multiple options each specifying a different candidate set of upper deck file systems upon which hole punching is performed when selected.

    Optmizing data deduplication
    22.
    发明授权

    公开(公告)号:US11112987B2

    公开(公告)日:2021-09-07

    申请号:US16392913

    申请日:2019-04-24

    IPC分类号: G06F3/06

    摘要: Techniques for processing data may include: receiving a candidate block; performing partial deduplication processing of the candidate block; receiving a second candidate block subsequent to performing partial deduplication processing for the candidate block; and performing first processing to determine whether to perform promotion processing for the entry, The partial deduplication processing may include: partially deduplicating at least one sub-block of the candidate block; and creating an entry in a deduplication database for the candidate block, wherein the entry includes a digest of the candidate block and the entry denotes a potential target block having the digest, and wherein the entry includes a counter that tracks a number of missed full block deduplications between the potential target block and subsequently processed candidate blocks. The promotion processing promotes the potential target block, having the first digest of the entry, to a new target block.

    DIRECT INPUT/OUTPUT PATH TO COMPRESSED DATA

    公开(公告)号:US20210117328A1

    公开(公告)日:2021-04-22

    申请号:US16656222

    申请日:2019-10-17

    摘要: Techniques for providing a direct IO path to compressed data on storage media of a storage system. The techniques include triggering a transaction cache to perform a flush operation for updating mapping metadata for a storage object containing the compressed data. Having updated the mapping metadata for the storage object, the techniques further include issuing, by a copier module, an IO read request for the compressed data of the storage object to a namespace layer, which issues the IO read request to a mapping layer. The techniques further include forwarding the IO read request to a logical layer of the mapping layer, bypassing the transaction cache. The techniques further include reading, by the logical layer, the compressed data of the storage object from the storage media, and providing, via the mapping layer and the namespace layer, the compressed data to the copier module for transfer to a destination storage system.

    DEEP DATA-COMPRESSION
    26.
    发明申请

    公开(公告)号:US20210117132A1

    公开(公告)日:2021-04-22

    申请号:US16656046

    申请日:2019-10-17

    IPC分类号: G06F3/06 G06F12/02

    摘要: Host data stored in one or more source physical extents of non-volatile data storage is identified as valid and determined to be infrequently written by host I/O requests, and is therefore compressed to generate a highly compressed version of the valid host data. The highly compressed version is then stored into at least one target physical extent. The valid host data may be initially compressed before it is stored in the source physical extent(s), and may be re-compressed to generate the highly compressed version. If the valid host data is also infrequently read, it may be recompressed using larger blocks of host data than were used to perform the initial compression. The performance tier of the target physical extent may be different from (e.g. lower than) the performance tier of the source physical extent. The technology may be embodied in a background process such as a garbage collector.

    Managing data storage in storage systems

    公开(公告)号:US10922027B2

    公开(公告)日:2021-02-16

    申请号:US16179256

    申请日:2018-11-02

    摘要: There is disclosed techniques for use in managing data storage in storage systems. For example, in one embodiment, there is disclosed a method comprising receiving a request to store data of a data object in a storage system. The method also comprising determining that at least a portion of the data is to be stored in an uncompressed format in the storage system in response to receiving the request. The method also comprising storing at least a portion of the data in the uncompressed format in an allocation unit of a segment in the storage system such that the stored data in the uncompressed format emulates stored data in a compressed format based on the said determination.

    Adaptive compression mode
    30.
    发明授权

    公开(公告)号:US10852993B2

    公开(公告)日:2020-12-01

    申请号:US16261906

    申请日:2019-01-30

    IPC分类号: G06F12/00 G06F3/06

    摘要: Techniques for data processing may include: receiving, at a data storage system, a write operation that writes a data chunk; selecting, in accordance with one or more criteria related to load of at least one of the data storage system and the hardware device, a compression level to be used in connection with compressing the data chunk; issuing a request to a hardware device to perform compression processing that compresses the data chunk in accordance with the compression level selected and generates a compressed form of the data chunk; and storing the compressed form of the data chunk on a non-volatile storage device. The criteria may include any one or more of processor workload of the data storage system, write cache level, response time of the hardware device, and read load.