-
公开(公告)号:US20230409525A1
公开(公告)日:2023-12-21
申请号:US18462217
申请日:2023-09-06
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Asaf Yeger , Assaf Natanzon , Michael Gutman , Shmoolik Yosub , Shahar Salzman , David Segal , Idan Zach
CPC classification number: G06F16/137 , G06F16/1727
Abstract: A method of data management in a data storage system including a plurality of data storage units, the method includes dividing each file in the data storage system into a plurality of blocks having a common size. The method further includes generating a hash value for each block using a common hash algorithm. The method further includes identifying a plurality of similar files in different data storage units, based on a comparison of the hashes for each file. The method further includes copying one or more of the identified similar files, such that similar files are stored in a single data storage unit and for one or more of the copied files, generating a link to a new location of the file. The method provides an efficient, effective, and adequate reduction of the primary storage as well as the secondary storage space.
-
公开(公告)号:US20230418497A1
公开(公告)日:2023-12-28
申请号:US18463604
申请日:2023-09-08
Applicant: Huawei Technologies Co., Ltd.
Inventor: Ovad Somech , Assaf Natanzon , Idan Zach , Aviv Kuvent
IPC: G06F3/06
CPC classification number: G06F3/0641 , G06F3/067 , G06F3/0604
Abstract: A memory controller is for receiving incoming data and storing and/or transmitting the incoming data utilizing deduplication. The memory controller determines a second hash corresponding to one or more first hashes. The second hash is a sparse hash and the first hashes are strong hashes. The memory controller populates a sparse index with the second hash. The sparse index associates the second hash with the corresponding first hashes. The memory controller determines the second hash based on a probability that the second hash corresponds to a plurality of first hashes using a probabilistic method.
-
公开(公告)号:US20230409222A1
公开(公告)日:2023-12-21
申请号:US18461261
申请日:2023-09-05
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Ovad Somech , Assaf Natanzon , Idan Zach , Aviv Kuvent , Yair Toaff , Elizabeth Firman , David Spinadel
IPC: G06F3/06
CPC classification number: G06F3/064 , G06F3/0626 , G06F3/0671
Abstract: A computer-implemented method for indexing a data item in a data storage system includes: dividing the data item into one or more large blocks; dividing each large block into one or more small blocks; calculating a strong hash value for each of the small blocks and storing a list of strong hash values with a pointer to a location of the large block; from the list of strong hash values calculated for each large block, selecting one or more representative hash values for the large block; and compiling a sparse index including an entry for each large block. Each entry is based on the representative hash values and a pointer to the list of strong hash values for each large block.
-
-