PRUNING DATA SEGMENTS STORED IN CLOUD STORAGE TO RECLAIM CLOUD STORAGE SPACE

    公开(公告)号:US20230153010A1

    公开(公告)日:2023-05-18

    申请号:US17526927

    申请日:2021-11-15

    CPC classification number: G06F3/0652 G06F3/0604 G06F3/0644 G06F3/067

    Abstract: An information management system uses cloud storage resources store secondary copies of primary data created by client computing devices managed by a storage manager. Deduplication operations are performed on the secondary copies, which results in chunk metadata indices that allow for tracking and faster retrieval of the deduplicated secondary copies. The chunk metadata indices may reference data segments of the deduplicated secondary copies. The data segments may be stored in, and across, one or more sub-files. As the secondary copies are aged out from the cloud storage resources, data segments are identified as being orphaned or non-orphaned. Data segments that are orphaned are pruned to remove their corresponding sub-files from the cloud storage resources, where the sub-files are replaced with new sub-files that do not contain the orphaned data segments.

Patent Agency Ranking