Back-reference data structure for a deduplication storage system

    公开(公告)号:US12130707B2

    公开(公告)日:2024-10-29

    申请号:US18185202

    申请日:2023-03-16

    IPC分类号: G06F16/215 G06F11/14

    CPC分类号: G06F11/1453 G06F2201/84

    摘要: Example implementations relate to deduplication operations in a storage system. An example includes generating a housekeeping work map to delete a backup item stored in a deduplication storage system; selecting a first work entry of the housekeeping work map, where the first work entry identifies a first container index and a first manifest; in response to a selection of the first work entry, loading the first container index into the memory, the first container index comprising a back-reference data structure; identifying, in the back-reference data structure, a back-reference entry indexed to the first manifest; determining, using the back-reference entry indexed to the first manifest, a first set of data units included in the first manifest and that are indexed in the first container index; and decrementing, in the first container index, a set of reference counts for the determined first set of data units.

    MIGRATION OF JOURNAL GROUPS IN A STORAGE SYSTEM

    公开(公告)号:US20240028465A1

    公开(公告)日:2024-01-25

    申请号:US17814575

    申请日:2022-07-25

    IPC分类号: G06F11/14 G06F3/06

    摘要: Example implementations relate to metadata operations in a storage system. An example includes receiving, by a storage controller of a deduplication storage system, a plurality of data streams to be stored in persistent storage of the deduplication storage system; identifying, by the storage controller, a set of journals in a first journal group that are modified during a first backup process; determining, by the storage controller, a count of the set of journals that are modified during the first backup process; comparing, by the storage controller, the determined count to a migration threshold; and migrating, by the storage controller, at least one journal of the set of journals to a second journal group based at least on a comparison of the determined count to the migration threshold.

    DATA INDEX FOR DEDUPLICATION STORAGE SYSTEM
    3.
    发明公开

    公开(公告)号:US20230259488A1

    公开(公告)日:2023-08-17

    申请号:US17583617

    申请日:2022-01-25

    IPC分类号: G06F16/11 G06F16/13 G06F16/16

    摘要: Example implementations relate to metadata operations in a storage system. An example includes a machine-readable medium storing instructions that upon execution cause a processor to: receive a data stream to be stored in persistent storage of a deduplication storage system; store data units of the data stream in a container entity group object according to arrival time, where the data units of the container entity group object are referenced by a plurality of container indexes; generate a data index to list each container index that references at least one data unit included in the container entity group object; and in response to a determination that the total size of the container entity group object exceeds the threshold size, transfer the container entity group object from memory to the persistent storage.

    JOURNAL GROUPS FOR METADATA HOUSEKEEPING OPERATION

    公开(公告)号:US20230237048A1

    公开(公告)日:2023-07-27

    申请号:US17586131

    申请日:2022-01-27

    IPC分类号: G06F16/23 G06F16/28

    CPC分类号: G06F16/2379 G06F16/285

    摘要: Example implementations relate to metadata operations in a storage system. An example includes detecting, by a storage controller of a deduplication storage system, a housekeeping operation to perform updates to data of a set of container indexes of stored deduplicated data; in response to a detection of the housekeeping operation, the storage controller identifying, among the set of container indexes, a plurality of container indexes associated with a particular journal group, the particular journal group comprising a plurality of journals to store updates to data of the plurality of container indexes, respectively; and maintaining, by the storage controller, the particular journal group loaded in memory until all of the updates of the housekeeping operation to the plurality of container indexes have been stored in the respective journals of the particular journal group.

    Acquisition of object names for portion index objects

    公开(公告)号:US10365974B2

    公开(公告)日:2019-07-30

    申请号:US15267899

    申请日:2016-09-16

    摘要: Examples include the acquisition of objects names for portion index objects. Some examples include acquisition, from a remote object storage system, of a list of object names for a plurality of portion index objects, stored in the remote object storage system. In some examples, for each of the portion index objects, the acquired object name includes an identifier of an associated deduplicated backup item and information identifying a data range of the associated deduplicated backup item that is represented by metadata of the portion index object.

    HOUSEKEEPING WORK MAP FOR A DEDUPLICATION STORAGE SYSTEM

    公开(公告)号:US20240311363A1

    公开(公告)日:2024-09-19

    申请号:US18185184

    申请日:2023-03-16

    IPC分类号: G06F16/23 G06F16/22

    CPC分类号: G06F16/2365 G06F16/2272

    摘要: Example implementations relate to deduplication operations in a storage system. An example includes initiating a housekeeping job to delete a backup item stored in a deduplication storage system; identifying a set of container indexes affected by the housekeeping job, the set of container indexes including at least a first container index; in response to a determination that a housekeeping work map includes a first work entry associated with the first container index, determining an ingest count of new data units added to a first container during an ingest of the backup item into the deduplication storage system, the first container associated with the first container index; and adding the determined ingest count to the cumulative counter included in the first work entry associated with the first container index.

    Migration of journal groups in a storage system

    公开(公告)号:US11940882B2

    公开(公告)日:2024-03-26

    申请号:US17814575

    申请日:2022-07-25

    IPC分类号: G06F11/14 G06F3/06

    摘要: Example implementations relate to metadata operations in a storage system. An example includes receiving, by a storage controller of a deduplication storage system, a plurality of data streams to be stored in persistent storage of the deduplication storage system; identifying, by the storage controller, a set of journals in a first journal group that are modified during a first backup process; determining, by the storage controller, a count of the set of journals that are modified during the first backup process; comparing, by the storage controller, the determined count to a migration threshold; and migrating, by the storage controller, at least one journal of the set of journals to a second journal group based at least on a comparison of the determined count to the migration threshold.

    MATCHING OPERATION FOR A DEDUPLICATION STORAGE SYSTEM

    公开(公告)号:US20240037078A1

    公开(公告)日:2024-02-01

    申请号:US17814943

    申请日:2022-07-26

    IPC分类号: G06F16/215 G06F16/245

    CPC分类号: G06F16/215 G06F16/245

    摘要: Example implementations relate to metadata operations in a storage system. An example includes generating, by a storage controller of a deduplication storage system, a candidate list of container indexes for matching operations of a received data segment, each container index in the candidate list having an associated match cost; identifying, by the storage controller, a journal group associated with a first container index listed in the candidate list; reducing, by the storage controller, a match cost associated with the first container index in response to a determination that the identified journal group is in a modified state; and performing, by the storage controller, the matching operations of the received data segment based at least on the reduced match cost of the first container index.

    RESOURCE ALLOCATION FOR SYNTHETIC BACKUPS

    公开(公告)号:US20230098965A1

    公开(公告)日:2023-03-30

    申请号:US17448968

    申请日:2021-09-27

    摘要: Example implementations relate to metadata operations in a storage system. An example storage system includes a machine-readable storage storing instructions executable by a processor to determine to generate a synthetic full backup based on data stream representations of a plurality of data streams. The instructions are also executable to, in response to a determination to generate the synthetic full backup, create a logical group including the data stream representations. The instructions are also executable to specify a cache resource allocation for the logical group, and generate the synthetic full backup from data stream representations using an amount of a cache resource limited by the cache resource allocation for the logical group.