-
Publication No.: US20190108102A1
Publication date: 2019-04-11
Application No.: US16212219
Application date: 2018-12-06
Applicant: Huawei Technologies Co., Ltd.
Inventor: Chengwei ZHANG, Chuanshuai YU, Zongquan ZHANG
CPC classification number: G06F11/1469, G06F3/0608, G06F3/0619, G06F3/064, G06F3/065, G06F11/14, G06F11/1451, G06F11/1453, G06F11/1458, G06F11/2094, H03M7/46, H03M13/15
Abstract: Embodiments of the present disclosure provide a solution for data backup and recovery in a storage system. When a source device in the storage system backs up to a backup-end device a data block written after a snapshot Sn, the source device performs a logical operation, such as exclusive-NOR or exclusive-OR, on the written data block and the original version of that block recorded in the snapshot Sn, and then compresses the resulting data block. This improves the compression ratio of the data block, which reduces the amount of data sent to the backup-end device and saves transmission bandwidth. The solution may also be applied to data recovery in a storage system.
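The XOR-then-compress step described in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the 4 KiB block size, and the use of `zlib` as the compressor are assumptions made for illustration, and the sketch assumes the backup-end device already holds the snapshot Sn copy of the block.

```python
import os
import zlib


def xor_blocks(a: bytes, b: bytes) -> bytes:
    """Bytewise XOR of two equal-length blocks."""
    if len(a) != len(b):
        raise ValueError("blocks must have the same length")
    return bytes(x ^ y for x, y in zip(a, b))


def encode_for_backup(new_block: bytes, snapshot_block: bytes) -> bytes:
    """Source side: XOR the written block with its snapshot copy, then compress."""
    return zlib.compress(xor_blocks(new_block, snapshot_block))


def decode_on_backup(payload: bytes, snapshot_block: bytes) -> bytes:
    """Backup side: decompress, then XOR with the same snapshot copy."""
    return xor_blocks(zlib.decompress(payload), snapshot_block)


# Hypothetical 4 KiB block in which only 16 bytes changed after snapshot Sn.
snapshot_block = os.urandom(4096)
changed = bytearray(snapshot_block)
changed[100:116] = bytes(b ^ 0xFF for b in snapshot_block[100:116])
new_block = bytes(changed)

payload = encode_for_backup(new_block, snapshot_block)  # XOR, then compress
direct = zlib.compress(new_block)                       # compress without XOR

assert decode_on_backup(payload, snapshot_block) == new_block
print(len(payload), len(direct))
```

Because the XOR result is zero wherever the block is unchanged, it compresses far better than the raw block, which here is random data and barely compresses at all.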
-
Publication No.: US20140189237A1
Publication date: 2014-07-03
Application No.: US14140945
Application date: 2013-12-26
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Yanhui ZHONG, Zongquan ZHANG
CPC classification number: G06F3/0641, G06F3/0608, G06F3/061, G06F3/0619, G06F3/067, G06F3/0673, G06F12/0868, G06F12/0875, G06F2212/452
Abstract: Embodiments of the present invention provide a data processing method and apparatus. When a data hash value in a currently received data stream is found to exceed a preset first threshold, part or all of the data in the stream is stored directly without deduplication. This prevents the data in the stream from being scattered across a plurality of storage areas; instead, that data is stored together in one storage area, which effectively improves the overall deduplication rate, particularly when a large amount of data is stored.
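The threshold check can be sketched roughly as follows. The abstract does not say how the value compared against the first threshold is computed, so this sketch takes it as an already-computed number, `stream_value`. The `Store` class, its fields, and the choice to bypass deduplication for the whole stream (rather than part of it) are hypothetical.

```python
import hashlib
from typing import Dict, List, Tuple

Location = Tuple[int, int]  # (storage area index, slot within that area)


class Store:
    def __init__(self) -> None:
        self.areas: List[List[bytes]] = []   # each storage area holds chunks
        self.index: Dict[str, Location] = {}  # fingerprint -> stored location

    def write_stream(self, chunks: List[bytes], stream_value: int,
                     first_threshold: int) -> List[Location]:
        """Store one stream; return where each of its chunks can be read from."""
        area_id = len(self.areas)
        self.areas.append([])  # one storage area per incoming stream
        # Assumption: stream_value is the per-stream value named in the abstract.
        bypass = stream_value > first_threshold
        refs: List[Location] = []
        for chunk in chunks:
            fp = hashlib.sha256(chunk).hexdigest()
            if not bypass and fp in self.index:
                # Normal path: reference the existing copy in another area.
                refs.append(self.index[fp])
                continue
            # Bypass path or new chunk: keep the data together in this area.
            self.areas[area_id].append(chunk)
            loc = (area_id, len(self.areas[area_id]) - 1)
            self.index.setdefault(fp, loc)
            refs.append(loc)
        return refs


store = Store()
first = store.write_stream([b"a", b"b"], stream_value=0, first_threshold=5)
deduped = store.write_stream([b"a", b"c"], stream_value=0, first_threshold=5)
direct = store.write_stream([b"a", b"c"], stream_value=9, first_threshold=5)
print(first, deduped, direct)
```

In the third call the value exceeds the threshold, so both chunks land in a single storage area even though copies already exist elsewhere.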
-
Publication No.: US20180267896A1
Publication date: 2018-09-20
Application No.: US15959273
Application date: 2018-04-22
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Zongquan ZHANG, Chengwei ZHANG
CPC classification number: G06F12/0253, G06F3/0608, G06F3/061, G06F3/0641, G06F3/067, G06F3/0671, G06F16/00, G06F16/1748, G06F2212/1044, H03M7/3091
Abstract: The present disclosure is directed to solutions for performing deduplication by a storage device. Following the duplicate data locality principle, non-duplicate data blocks whose logical addresses are contiguous are stored at contiguous physical addresses in logical-address order, and the fingerprints of those blocks are likewise stored at contiguous physical addresses in the same order. In addition, a mapping is established from the logical address of one data block among those blocks to an aggregation address.
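A rough Python sketch of this layout follows. The abstract does not define "aggregation address"; the sketch assumes it identifies a container that holds one run of logically contiguous non-duplicate blocks together with their fingerprints. The names `Aggregate` and `DedupStore`, and the use of list positions to stand in for contiguous physical addresses, are hypothetical.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class Aggregate:
    first_lba: int
    blocks: List[bytes] = field(default_factory=list)        # data, in LBA order
    fingerprints: List[bytes] = field(default_factory=list)  # same order as blocks


class DedupStore:
    def __init__(self) -> None:
        self.aggregates: List[Aggregate] = []              # index = aggregation address
        self.fp_index: Dict[bytes, Tuple[int, int]] = {}   # fingerprint -> (aggregate, offset)
        self.lba_map: Dict[int, Tuple[int, int]] = {}      # every LBA -> (aggregate, offset)
        self.lba_to_aggregate: Dict[int, int] = {}         # one LBA per run -> aggregation address

    def write(self, start_lba: int, blocks: List[bytes]) -> None:
        run: Optional[Aggregate] = None
        for i, block in enumerate(blocks):
            lba = start_lba + i
            fp = hashlib.sha256(block).digest()
            if fp in self.fp_index:
                # Duplicate: reference the stored copy; the contiguous run ends here.
                self.lba_map[lba] = self.fp_index[fp]
                run = None
                continue
            if run is None:
                # Start a new run and map its first LBA to the aggregation address.
                run = Aggregate(first_lba=lba)
                self.aggregates.append(run)
                self.lba_to_aggregate[lba] = len(self.aggregates) - 1
            agg_addr = len(self.aggregates) - 1  # the open run is always the last aggregate
            run.blocks.append(block)
            run.fingerprints.append(fp)
            loc = (agg_addr, len(run.blocks) - 1)
            self.fp_index[fp] = loc
            self.lba_map[lba] = loc

    def read(self, lba: int) -> bytes:
        agg_addr, offset = self.lba_map[lba]
        return self.aggregates[agg_addr].blocks[offset]


store = DedupStore()
store.write(10, [b"A", b"B", b"C"])  # one run of three non-duplicate blocks
store.write(20, [b"D", b"B", b"E"])  # the duplicate b"B" splits this write into two runs
print(store.lba_to_aggregate)
```

Keeping each run's fingerprints in the same order as its data means that one lookup through the aggregation address can fetch the fingerprints of neighbouring blocks together, which is where the locality principle would pay off.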
-