-
Publication number: US20230409222A1
Publication date: 2023-12-21
Application number: US18461261
Application date: 2023-09-05
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Ovad Somech, Assaf Natanzon, Idan Zach, Aviv Kuvent, Yair Toaff, Elizabeth Firman, David Spinadel
IPC: G06F3/06
CPC classification number: G06F3/064, G06F3/0626, G06F3/0671
Abstract: A computer-implemented method for indexing a data item in a data storage system includes: dividing the data item into one or more large blocks; dividing each large block into one or more small blocks; calculating a strong hash value for each of the small blocks and storing a list of strong hash values with a pointer to a location of the large block; from the list of strong hash values calculated for each large block, selecting one or more representative hash values for the large block; and compiling a sparse index including an entry for each large block. Each entry is based on the representative hash values and a pointer to the list of strong hash values for each large block.
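The indexing steps in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The block sizes, the use of SHA-256 as the strong hash, the rule of taking the numerically smallest hashes as representatives, and every name below (`build_index`, `LARGE_BLOCK`, `SMALL_BLOCK`, `REPS_PER_BLOCK`) are assumptions made for illustration; the abstract does not specify any of them.

```python
import hashlib

# Assumed parameters; the abstract gives no concrete values.
LARGE_BLOCK = 4096      # bytes per large block
SMALL_BLOCK = 512       # bytes per small block
REPS_PER_BLOCK = 2      # representative hashes kept per large block


def strong_hash(data: bytes) -> bytes:
    # SHA-256 stands in for "a strong hash"; the abstract names no algorithm.
    return hashlib.sha256(data).digest()


def build_index(data: bytes):
    """Return (hash_lists, sparse_index) for a data item.

    hash_lists[i] is (offset of large block i, list of its small-block hashes).
    sparse_index maps a representative hash to the ids of the hash lists
    (one id per large block) that selected it.
    """
    hash_lists = []
    sparse_index = {}
    for offset in range(0, len(data), LARGE_BLOCK):
        large = data[offset:offset + LARGE_BLOCK]
        hashes = [
            strong_hash(large[i:i + SMALL_BLOCK])
            for i in range(0, len(large), SMALL_BLOCK)
        ]
        list_id = len(hash_lists)
        hash_lists.append((offset, hashes))
        # Assumed selection rule: the smallest distinct hash values.
        for rep in sorted(set(hashes))[:REPS_PER_BLOCK]:
            sparse_index.setdefault(rep, []).append(list_id)
    return hash_lists, sparse_index
```

Under these assumptions the sparse index holds at most `REPS_PER_BLOCK` keys per large block, so it stays much smaller than the full lists of strong hashes it points to.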
-
Publication number: US20230418497A1
Publication date: 2023-12-28
Application number: US18463604
Application date: 2023-09-08
Applicant: Huawei Technologies Co., Ltd.
Inventor: Ovad Somech, Assaf Natanzon, Idan Zach, Aviv Kuvent
IPC: G06F3/06
CPC classification number: G06F3/0641, G06F3/067, G06F3/0604
Abstract: A memory controller is for receiving incoming data and storing and/or transmitting the incoming data utilizing deduplication. The memory controller determines a second hash corresponding to one or more first hashes. The second hash is a sparse hash and the first hashes are strong hashes. The memory controller populates a sparse index with the second hash. The sparse index associates the second hash with the corresponding first hashes. The memory controller determines the second hash based on a probability that the second hash corresponds to a plurality of first hashes using a probabilistic method.
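One way to picture the relationship between sparse hashes and strong hashes is the Python sketch below. It is an illustration only. The abstract does not say which probabilistic method is used, so the sampling rule here (keep a strong hash as a sparse hash when its low bits are zero) is an assumption, as are SHA-256 and all names (`SparseIndex`, `is_sampled`, `SAMPLE_BITS`).

```python
import hashlib

# Assumed: roughly 1 in 2**SAMPLE_BITS strong hashes becomes a sparse hash.
SAMPLE_BITS = 2


def strong_hash(chunk: bytes) -> bytes:
    # SHA-256 stands in for the "first" (strong) hash.
    return hashlib.sha256(chunk).digest()


def is_sampled(h: bytes) -> bool:
    # Hypothetical sampling rule: low SAMPLE_BITS bits of the last byte are zero.
    return h[-1] & ((1 << SAMPLE_BITS) - 1) == 0


class SparseIndex:
    def __init__(self):
        # sparse hash -> set of strong hashes associated with it
        self.entries = {}

    def populate(self, strong_hashes):
        """Pick sparse hashes from a group of strong hashes and record
        the association; returns the sparse hashes that were chosen."""
        sampled = [h for h in strong_hashes if is_sampled(h)]
        for s in sampled:
            self.entries.setdefault(s, set()).update(strong_hashes)
        return sampled

    def candidates(self, strong_hashes):
        """Return known strong hashes reachable through any incoming hash
        that is also a sparse hash in the index."""
        found = set()
        for h in strong_hashes:
            found |= self.entries.get(h, set())
        return found
```

In this sketch, incoming data would be chunked and hashed with `strong_hash`, and `candidates` would return the previously seen strong hashes against which the new chunks are compared for deduplication.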
-