HASHING A DATA SET WITH MULTIPLE HASH ENGINES

    Publication No.: US20220245112A1

    Publication Date: 2022-08-04

    Application No.: US17165915

    Filing Date: 2021-02-02

    Abstract: A system for calculating a fingerprint across a data set by identifying a data set to hash, the data set comprising a set of data blocks, generating, by a first hash engine, a first hash for each data block in the set of data blocks within the data set, and generating, by a second hash engine, a second hash for each data block in the set of data blocks within the data set.
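    The per-block hashing described in this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the abstract does not say which hash algorithms the engines use, how large a block is, or how the per-block hashes are combined into a fingerprint. SHA-256 and BLAKE2b stand in for the two engines, and `BLOCK_SIZE`, `split_blocks`, and `hash_with_two_engines` are hypothetical names chosen for this example.

```python
import hashlib

# Hypothetical block size; the abstract does not specify one.
BLOCK_SIZE = 4096


def split_blocks(data: bytes, block_size: int = BLOCK_SIZE) -> list:
    """Split the data set into fixed-size blocks (the last one may be shorter)."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def hash_with_two_engines(data: bytes, block_size: int = BLOCK_SIZE) -> list:
    """Return one (first_hash, second_hash) pair per data block.

    Assumption: SHA-256 plays the role of the first hash engine and
    BLAKE2b (16-byte digest) the role of the second. Both are software
    stand-ins for whatever engines the system actually uses.
    """
    results = []
    for block in split_blocks(data, block_size):
        first = hashlib.sha256(block).hexdigest()
        second = hashlib.blake2b(block, digest_size=16).hexdigest()
        results.append((first, second))
    return results
```

    Calling `hash_with_two_engines(data)` yields one pair of hashes per block, so identical blocks produce identical pairs under both engines.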

    HASHING WITH DIFFERING HASH SIZE AND COMPRESSION SIZE

    Publication No.: US20220245097A1

    Publication Date: 2022-08-04

    Application No.: US17165910

    Filing Date: 2021-02-02

    Abstract: A system for hashing a data set by identifying a data set to deduplicate based on a hash block size and to compress based on a compression block size, where the hash block size is smaller than the compression block size, defining a set of data blocks within the data set based on the hash block size, generating a hash for each data block in the set of data blocks within the data set, deduplicating a data block in the data set based on a respective hash for the data block, and compressing the data set based on the compression block size.
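    The relationship between the two block sizes can be sketched as follows. This is an illustrative assumption rather than the patented method: the abstract does not fix the hash algorithm, the compressor, the concrete sizes, or the exact order of the steps. Here SHA-256 and zlib are stand-ins, deduplication runs before compression, and `HASH_BLOCK_SIZE`, `COMPRESSION_BLOCK_SIZE`, and `dedupe_then_compress` are hypothetical names.

```python
import hashlib
import zlib

# Hypothetical sizes; the abstract only requires hash size < compression size.
HASH_BLOCK_SIZE = 4096
COMPRESSION_BLOCK_SIZE = 16384


def dedupe_then_compress(
    data: bytes,
    hash_block_size: int = HASH_BLOCK_SIZE,
    compression_block_size: int = COMPRESSION_BLOCK_SIZE,
):
    """Deduplicate at the hash block size, then compress at the larger size.

    Returns (order, compressed): `order` lists the digest of every original
    block in sequence, and `compressed` holds the zlib-compressed chunks of
    the stream of unique blocks.
    """
    if hash_block_size >= compression_block_size:
        raise ValueError("hash block size must be smaller than compression block size")

    seen = set()
    order = []
    unique = bytearray()
    for i in range(0, len(data), hash_block_size):
        block = data[i:i + hash_block_size]
        digest = hashlib.sha256(block).digest()  # assumed hash algorithm
        order.append(digest)
        if digest not in seen:  # keep only the first copy of each block
            seen.add(digest)
            unique += block

    # Compress the deduplicated stream in larger, compression-sized chunks.
    compressed = [
        zlib.compress(bytes(unique[i:i + compression_block_size]))
        for i in range(0, len(unique), compression_block_size)
    ]
    return order, compressed
```

    A smaller hash block gives deduplication a finer granularity for spotting repeated data, while a larger compression block gives the compressor more context per chunk.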
