Publication No.: US20230409222A1
Publication date: 2023-12-21
Application No.: US18461261
Filing date: 2023-09-05
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Ovad Somech, Assaf Natanzon, Idan Zach, Aviv Kuvent, Yair Toaff, Elizabeth Firman, David Spinadel
IPC: G06F3/06
CPC classification number: G06F3/064, G06F3/0626, G06F3/0671
Abstract: A computer-implemented method for indexing a data item in a data storage system includes: dividing the data item into one or more large blocks; dividing each large block into one or more small blocks; calculating a strong hash value for each of the small blocks and storing a list of strong hash values with a pointer to a location of the large block; from the list of strong hash values calculated for each large block, selecting one or more representative hash values for the large block; and compiling a sparse index including an entry for each large block. Each entry is based on the representative hash values and a pointer to the list of strong hash values for each large block.
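
The indexing steps in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the fixed block sizes, the use of SHA-256 as the strong hash, the rule of picking the numerically smallest hashes as representatives, and all names (`split`, `build_sparse_index`, `LARGE_BLOCK_SIZE`, `SMALL_BLOCK_SIZE`, `NUM_REPRESENTATIVES`) are assumptions made for illustration, since the abstract does not specify them.

```python
import hashlib

# All three constants are illustrative assumptions, not values from the patent.
LARGE_BLOCK_SIZE = 4096
SMALL_BLOCK_SIZE = 512
NUM_REPRESENTATIVES = 2


def split(data: bytes, size: int) -> list[bytes]:
    """Cut data into consecutive chunks of at most `size` bytes."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def build_sparse_index(data: bytes):
    """Return (hash_lists, sparse_index) for one data item.

    hash_lists[k] is (offset of large block k, list of strong hashes of its
    small blocks). sparse_index maps each representative hash to k, which
    here plays the role of the pointer to that large block's hash list.
    """
    hash_lists = []
    sparse_index = {}
    for n, large in enumerate(split(data, LARGE_BLOCK_SIZE)):
        # Strong hash (assumed: SHA-256) for every small block.
        hashes = [hashlib.sha256(s).digest()
                  for s in split(large, SMALL_BLOCK_SIZE)]
        hash_lists.append((n * LARGE_BLOCK_SIZE, hashes))
        # Assumed selection rule: the smallest distinct hash values.
        for rep in sorted(set(hashes))[:NUM_REPRESENTATIVES]:
            # Keep the first large block seen for a given representative.
            sparse_index.setdefault(rep, n)
    return hash_lists, sparse_index


# Deterministic sample item: 10240 bytes with no repeated small blocks.
sample = b"".join(i.to_bytes(4, "big") for i in range(2560))
hash_lists, sparse_index = build_sparse_index(sample)
```

Under these assumptions, an incoming small block would be checked by hashing it, looking the hash up in `sparse_index`, and, on a hit, scanning the full hash list of the referenced large block for further matches. The index stays sparse because only a few hashes per large block are kept in it.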