Abstract:
A data deduplication method includes receiving an overwrite request sent by an external device, where the overwrite request carries a data block and a first address into which the data block is to be stored, determining whether an overwrite quantity of the first address exceeds a first threshold within a time period [t1, t2], where both t1 and t2 are time points, and t2 is later than t1, and when the overwrite quantity of the first address exceeds the first threshold within the time period [t1, t2], skipping performing a deduplication operation on the data block or when the overwrite quantity of the first address does not exceed the first threshold within the time period [t1, t2], performing a deduplication operation on the data block.
Abstract:
A data deduplication method includes receiving an overwrite request sent by an external device, where the overwrite request carries a data block and a first address into which the data block is to be stored, determining whether an overwrite quantity of the first address exceeds a first threshold within a time period [t1, t2], where both t1 and t2 are time points, and t2 is later than t1, and when the overwrite quantity of the first address exceeds the first threshold within the time period [t1, t2], skipping performing a deduplication operation on the data block or when the overwrite quantity of the first address does not exceed the first threshold within the time period [t1, t2], performing a deduplication operation on the data block.
Abstract:
A checkpoint reclaim method in a copy-on-write (COW) file system includes: obtaining, according to a checkpoint reclaim instruction, M data blocks allocated by the file system between a moment of a previous checkpoint reclaim and a moment of a current checkpoint reclaim, and the M data blocks are data blocks allocated for at least one of a checkpoint or a snapshot generated between the moment of the previous checkpoint reclaim and the moment of the current checkpoint reclaim; performing an addition operation with a fixed step on a reference count of a data block that needs to be reserved in the M data blocks, and determining, in the M data blocks, a first data block for reclaiming; determining, in N data blocks allocated for at least one of a checkpoint or a snapshot reserved at the moment of the previous checkpoint reclaim, a second data block for reclaiming.
Abstract:
A data deduplication method and apparatus are provided. The method includes receiving an overwrite request sent by an external device, where the overwrite request carries a data block and a first address into which the data block is to be stored; determining whether an overwrite quantity of the first address exceeds a first threshold within a time period [t1, t2], where both t1 and t2 are time points, and t2 is later than t1; and when the overwrite quantity of the first address exceeds the first threshold within the time period [t1, t2], skipping performing a deduplication operation on the data block; or when the overwrite quantity of the first address does not exceed the first threshold within the time period [t1, t2], performing a deduplication operation on the data block. Therefore, a large quantity of computing resources of a storage server are saved, and impact brought by a deduplication operation on storage server performance is also decreased.
Abstract:
A data deduplication method and apparatus are provided. The method includes receiving an overwrite request sent by an external device, where the overwrite request carries a data block and a first address into which the data block is to be stored; determining whether an overwrite quantity of the first address exceeds a first threshold within a time period [t1, t2], where both t1 and t2 are time points, and t2 is later than t1; and when the overwrite quantity of the first address exceeds the first threshold within the time period [t1, t2], skipping performing a deduplication operation on the data block; or when the overwrite quantity of the first address does not exceed the first threshold within the time period [t1, t2], performing a deduplication operation on the data block. Therefore, a large quantity of computing resources of a storage server are saved, and impact brought by a deduplication operation on storage server performance is also decreased.