Abstract:
A deduplication-based remote replication method and apparatus are provided for a system that includes a primary end device and a disaster recovery end device, both of which store a first snapshot. The primary end device obtains a second snapshot of the primary end device. When the fingerprint of a first data block among the data blocks added in the second snapshot differs from the fingerprints of the data blocks in the first snapshot, the primary end device sends the first data block, the fingerprint of the first data block, and metadata of the added data blocks to the disaster recovery end device.
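The fingerprint comparison described in this abstract can be illustrated with a minimal sketch. It is not the claimed implementation: the snapshot layout (a mapping from block address to block bytes), the use of SHA-256 as the fingerprint, and the names `fingerprint` and `plan_replication` are all assumptions made for illustration.

```python
import hashlib

def fingerprint(block):
    """Content fingerprint of a data block (SHA-256 is an assumption)."""
    return hashlib.sha256(block).hexdigest()

def plan_replication(first_snapshot, second_snapshot):
    """Decide what the primary end would send to the disaster recovery end.

    Both snapshots are modeled as {block_address: block_bytes}, a hypothetical
    layout. Returns (payload, metadata):
      payload  - {fingerprint: block_bytes} for blocks whose fingerprint does
                 not occur in the first snapshot
      metadata - [(address, fingerprint)] for every added block
    """
    # Fingerprints both ends already hold, since both store the first snapshot.
    known = {fingerprint(b) for b in first_snapshot.values()}
    # Added blocks: addresses that are new or whose content changed.
    added = {addr: blk for addr, blk in second_snapshot.items()
             if first_snapshot.get(addr) != blk}
    payload = {}
    metadata = []
    for addr, blk in sorted(added.items()):
        fp = fingerprint(blk)
        metadata.append((addr, fp))
        if fp not in known:
            # Only block data with an unseen fingerprint crosses the link.
            payload[fp] = blk
    return payload, metadata

first = {0: b"aaaa", 1: b"bbbb"}
second = {0: b"aaaa", 1: b"cccc", 2: b"bbbb", 3: b"dddd"}
payload, metadata = plan_replication(first, second)
```

In this example the block at address 2 is new to the second snapshot, but its content already exists in the first snapshot, so only its metadata entry would be sent.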
Abstract:
A data storage method and apparatus are provided for a distributed storage system that includes a computing node and a plurality of storage nodes. The computing node writes N data slices and M check slices into R storage nodes in each storage node group, which improves the reliability and stability of data in a data center.
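The abstract does not say how the check slices are computed or how slices map to nodes. The sketch below is therefore only one possible reading: it assumes M = 1 with a simple XOR parity slice (a stand-in for whatever coding scheme is actually used) and a round-robin placement over the nodes of each group. The function names and the group layout are hypothetical.

```python
from functools import reduce

def xor_columns(slices):
    """XOR equal-length byte strings together, column by column."""
    return bytes(reduce(lambda a, b: a ^ b, col) for col in zip(*slices))

def make_slices(data, n):
    """Split data into n equal-length data slices (zero-padded) plus one
    XOR check slice. A single parity slice (M = 1) is an assumption."""
    size = -(-len(data) // n)  # ceiling division
    padded = data.ljust(size * n, b"\0")
    data_slices = [padded[i * size:(i + 1) * size] for i in range(n)]
    return data_slices, [xor_columns(data_slices)]

def place(slices, groups):
    """Write every slice into each group, round-robin over that group's nodes.

    groups is {group_name: [node_name, ...]}. If a group has fewer nodes than
    slices, a node holds several slices, and losing that node may exceed what
    a single check slice can repair.
    """
    layout = {}
    for group_name, nodes in groups.items():
        for i, s in enumerate(slices):
            layout.setdefault((group_name, nodes[i % len(nodes)]), []).append(s)
    return layout

def recover_one(slices_with_gap):
    """Rebuild the single missing slice (marked None) from the survivors."""
    survivors = [s for s in slices_with_gap if s is not None]
    return xor_columns(survivors)

data_slices, check_slices = make_slices(b"hello world!", 3)
all_slices = data_slices + check_slices
layout = place(all_slices, {"g1": ["n1", "n2", "n3", "n4"], "g2": ["m1", "m2"]})
```

Because every group receives the full set of N + M slices in this sketch, the data remains readable from one group even if another group is unavailable.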
Abstract:
Embodiments of the present invention provide a data processing method and apparatus. When a data hash value in a currently received data stream is found to exceed a preset first threshold, part or all of the data in the data stream is stored directly without deduplication. This prevents the data in the data stream from being scattered across a plurality of storage areas; instead, the part or all of the data is stored in one storage area in a centralized manner, which effectively improves the overall deduplication rate, particularly when a large amount of data is stored.
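A literal reading of the threshold rule above can be sketched as follows. The abstract does not define how the hash value is computed or compared, so the truncated SHA-256 integer, the per-chunk comparison, and the names `chunk_hash` and `store_stream` are assumptions made for illustration only.

```python
import hashlib

def chunk_hash(chunk):
    """Hash a chunk to an integer in [0, 2**32); truncated SHA-256 is an assumption."""
    return int.from_bytes(hashlib.sha256(chunk).digest()[:4], "big")

def store_stream(chunks, index, area, threshold):
    """Store a stream of chunks into one storage area (a list).

    Chunks are deduplicated against index (hash -> chunk) until some chunk's
    hash exceeds threshold. From that chunk onward the rest of the stream is
    appended directly to area with no deduplication lookup, so it stays
    together in one place. Returns the number of chunks written to area.
    """
    bypass = False
    written = 0
    for chunk in chunks:
        h = chunk_hash(chunk)
        if not bypass and h > threshold:
            bypass = True  # first threshold exceeded: stop deduplicating
        if bypass or h not in index:
            area.append(chunk)
            written += 1
            index.setdefault(h, chunk)
    return written

# Threshold never exceeded: the repeated chunk is deduplicated.
dedup_count = store_stream([b"a", b"a", b"b"], {}, [], threshold=2**32)
# Threshold always exceeded: every chunk is stored directly.
direct_count = store_stream([b"a", b"a", b"b"], {}, [], threshold=-1)
```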
Abstract:
A checkpoint reclaim method in a copy-on-write (COW) file system includes: obtaining, according to a checkpoint reclaim instruction, M data blocks allocated by the file system between the moment of a previous checkpoint reclaim and the moment of a current checkpoint reclaim, where the M data blocks are allocated for at least one of a checkpoint or a snapshot generated between those two moments; performing an addition operation with a fixed step on the reference count of each data block among the M data blocks that needs to be reserved, and determining, among the M data blocks, a first data block for reclaiming; and determining, among N data blocks allocated for at least one of a checkpoint or a snapshot reserved at the moment of the previous checkpoint reclaim, a second data block for reclaiming.
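The step that handles the M newly allocated data blocks can be sketched as below. This covers only that step, not the handling of the N older blocks, which the abstract does not detail. The step size of 1, the rule that a block is reclaimable when its count stays at zero, one increment per retaining checkpoint or snapshot, and the name `reclaim_new_blocks` are all assumptions.

```python
STEP = 1  # fixed increment; the actual step size is not given in the abstract

def reclaim_new_blocks(new_blocks, retained_refs, refcount):
    """Pick reclaimable blocks among those allocated since the previous reclaim.

    new_blocks    - ids of the M blocks allocated since the previous reclaim
    retained_refs - one set of block ids per checkpoint or snapshot being kept
    refcount      - dict block_id -> int, updated in place
    Returns the sorted ids of new blocks that no retained checkpoint or
    snapshot references.
    """
    new_blocks = set(new_blocks)
    for refs in retained_refs:
        # Each retained checkpoint or snapshot adds STEP to the blocks it needs.
        for blk in refs & new_blocks:
            refcount[blk] = refcount.get(blk, 0) + STEP
    reclaimable = sorted(b for b in new_blocks if refcount.get(b, 0) == 0)
    for b in reclaimable:
        refcount.pop(b, None)
    return reclaimable

refcount = {}
freed = reclaim_new_blocks([10, 11, 12, 13], [{10, 12}, {12, 99}], refcount)
```

Here blocks 11 and 13 are referenced by no retained checkpoint or snapshot, so they are the candidates for reclaiming, while block 12 ends with a count of 2 because two retained sets reference it.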