-
Publication No.: US10437817B2
Publication date: 2019-10-08
Application No.: US16113151
Filing date: 2018-08-27
Applicant: Huawei Technologies Co., Ltd.
Inventor: Michael Hirsch, Yair Toaff, Yehonatan David
Abstract: A system for segmenting an input data stream, comprising a processor adapted to split an input data stream into a plurality of data sub-streams such that each of the plurality of data sub-streams has an overlapping portion with a consecutive data sub-stream of the plurality of data sub-streams, concurrently create a plurality of segmented data sub-streams by segmenting the plurality of data sub-streams, each in one of a plurality of processing pipelines of the processor, and join the plurality of segmented data sub-streams to create a segmented data stream by synchronizing a sequencing of each of the plurality of segmented data sub-streams according to one or more overlapping segments in the overlapping portion of each two consecutive data sub-streams of the plurality of data sub-streams.
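The split, segment, and join flow described in this abstract can be illustrated with a minimal sketch. It is not the patented method: it assumes a toy boundary rule (the sum of the last `WINDOW` bytes modulo `DIVISOR`) that depends only on local content, so the cut points found in the overlapping portions agree between neighbours and the join reduces to a de-duplicating union. The names `WINDOW`, `DIVISOR`, `cut_points`, `segment_parallel`, `step`, and `overlap` are all hypothetical, and Python threads stand in for the processing pipelines.

```python
from concurrent.futures import ThreadPoolExecutor

WINDOW = 4    # assumed: bytes examined per boundary decision
DIVISOR = 8   # assumed: cut when the window's byte sum is divisible by this


def cut_points(data, start, end):
    """Absolute cut offsets p whose window data[p-WINDOW:p] lies inside [start, end)."""
    cuts = set()
    for p in range(start + WINDOW, end + 1):
        if sum(data[p - WINDOW:p]) % DIVISOR == 0:  # toy hash, for illustration only
            cuts.add(p)
    return cuts


def to_segments(data, cuts):
    """Turn a set of cut offsets into the ordered list of segments."""
    segments, prev = [], 0
    for c in sorted(cuts | {len(data)}):
        if c > prev:
            segments.append(data[prev:c])
            prev = c
    return segments


def segment_sequential(data):
    return to_segments(data, cut_points(data, 0, len(data)))


def segment_parallel(data, step, overlap, workers=4):
    """Segment overlapping sub-streams concurrently, then join on shared cut points."""
    if step <= 0 or overlap < WINDOW:
        raise ValueError("need step > 0 and overlap >= WINDOW")
    # Each sub-stream extends `overlap` bytes into its successor.
    ranges = [(s, min(len(data), s + step + overlap))
              for s in range(0, len(data), step)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        parts = list(pool.map(lambda r: cut_points(data, r[0], r[1]), ranges))
    # Cut points inside an overlap are reported by both neighbours;
    # the union keeps each one once, which aligns the sub-stream sequences.
    return to_segments(data, set().union(*parts))
```

With minimum or maximum segment sizes, a cut would also depend on the previous cut, and the join would have to locate a matching segment in the overlap rather than take a plain union; that case is outside this sketch.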
-
Publication No.: US20230409222A1
Publication date: 2023-12-21
Application No.: US18461261
Filing date: 2023-09-05
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Ovad Somech, Assaf Natanzon, Idan Zach, Aviv Kuvent, Yair Toaff, Elizabeth Firman, David Spinadel
IPC: G06F3/06
CPC classification number: G06F3/064, G06F3/0626, G06F3/0671
Abstract: A computer-implemented method for indexing a data item in a data storage system includes: dividing the data item into one or more large blocks; dividing each large block into one or more small blocks; calculating a strong hash value for each of the small blocks and storing a list of strong hash values with a pointer to a location of the large block; from the list of strong hash values calculated for each large block, selecting one or more representative hash values for the large block; and compiling a sparse index including an entry for each large block. Each entry is based on the representative hash values and a pointer to the list of strong hash values for each large block.
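The indexing steps in this abstract can be sketched as follows. The block sizes, the use of SHA-256 as the strong hash, and the rule of taking the numerically smallest hashes as representatives are assumptions made for illustration; the abstract does not specify them. `LARGE`, `SMALL`, `REPS`, and `build_index` are hypothetical names.

```python
import hashlib

LARGE = 64   # assumed large-block size in bytes
SMALL = 16   # assumed small-block size in bytes
REPS = 2     # assumed number of representative hashes per large block


def build_index(data):
    """Return (hash_lists, sparse_index) for a data item.

    hash_lists[i] is (offset of large block i, strong hashes of its small blocks).
    sparse_index maps a representative hash to the position of its hash list.
    """
    hash_lists = []
    sparse_index = {}
    for offset in range(0, len(data), LARGE):
        block = data[offset:offset + LARGE]
        hashes = [hashlib.sha256(block[j:j + SMALL]).digest()
                  for j in range(0, len(block), SMALL)]
        hash_lists.append((offset, hashes))
        # Assumed selection rule: the REPS smallest distinct hash values.
        for rep in sorted(set(hashes))[:REPS]:
            sparse_index.setdefault(rep, len(hash_lists) - 1)
    return hash_lists, sparse_index
```

A lookup would hash the small blocks of incoming data, probe `sparse_index` with them, and on a hit load the full hash list of the matching large block for an exact comparison.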
-
Publication No.: US11656991B2
Publication date: 2023-05-23
Application No.: US17567675
Filing date: 2022-01-03
Applicant: Huawei Technologies Co., Ltd.
Inventor: Aviv Kuvent, Yair Toaff
IPC: G06F12/00, G06F12/0837, G06F12/0842
CPC classification number: G06F12/0837, G06F12/0842, G06F2212/608
Abstract: An information processing device comprises: a memory comprising a cache for storing information related to an object from a plurality of objects, and a summary structure configured to store a summary for the object; a volume configured to store a merge file including the plurality of objects, and a set of dump-files, each dump-file being associated with a specific cache-dump operation of the cache; and a processor configured to assign, to the cache, a first identifier; perform a cache-dump operation based on generating a dump-file associated with the first identifier and storing the information related to the object from the cache to the generated dump-file; and assign, to the cache, a second identifier, wherein the second identifier is larger than the first identifier.
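The identifier handling around a cache-dump operation can be sketched in a few lines. This is a simplified illustration only: the dump-files are modelled as an in-memory dictionary rather than files on a volume, the summary structure and merge file are left out, and clearing the cache after a dump is an assumption the abstract does not state. `DumpingCache` and its members are hypothetical names.

```python
class DumpingCache:
    """Cache whose dumps are tagged with strictly increasing identifiers."""

    def __init__(self):
        self.cache_id = 0      # first identifier assigned to the cache
        self.cache = {}        # object key -> information related to the object
        self.dump_files = {}   # identifier -> dumped contents (stand-in for files)

    def put(self, key, info):
        self.cache[key] = info

    def dump(self):
        """Write the cache to a dump-file tagged with the current identifier."""
        dumped_id = self.cache_id
        self.dump_files[dumped_id] = dict(self.cache)
        self.cache.clear()              # assumption: the cache restarts empty
        self.cache_id = dumped_id + 1   # second identifier is larger than the first
        return dumped_id
```

Because the identifiers only grow, comparing two dump-file identifiers is enough to tell which dump holds the more recent information for an object.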
-
Publication No.: US11507539B2
Publication date: 2022-11-22
Application No.: US16800743
Filing date: 2020-02-25
Applicant: Huawei Technologies Co., Ltd.
Inventor: Michael Hirsch, Yehonatan David, Yair Toaff
IPC: G06F16/174, G06F16/16, G06F16/182, G06F16/14
Abstract: An apparatus stores received data blocks as deduplicated data blocks. The apparatus is configured to: maintain a plurality of containers, where a reference to a container is unique within the apparatus and each container includes one or more data segments and segment metadata for each data segment, the segment metadata including a segment identifier and a segment reference, where the segment identifier is unique within the container and the segment reference is unique within the apparatus; and maintain a plurality of deduplicated data blocks storing received data blocks, where each deduplicated data block includes a plurality of identified container references, where a container reference identifier is unique within the deduplicated data block, and an ordered list of one or more segment indicators.
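The two levels of identifiers in this abstract (apparatus-wide versus local to a container or a block) can be made concrete with a small sketch. Several details are assumptions: a SHA-256 digest serves as the apparatus-wide segment reference, containers are numbered sequentially, and a segment indicator is taken to be a pair of (block-local container identifier, segment identifier). `Store`, `write_block`, and `read_block` are hypothetical names.

```python
import hashlib


class Store:
    def __init__(self, container_capacity=4):
        self.capacity = container_capacity
        self.containers = {}      # container ref -> {segment id: (segment ref, data)}
        self.by_segment_ref = {}  # segment ref -> (container ref, segment id)

    def _place(self, segment):
        """Store a segment once and return its (container ref, segment id)."""
        seg_ref = hashlib.sha256(segment).hexdigest()  # assumed global reference
        if seg_ref not in self.by_segment_ref:
            cref = len(self.containers) - 1
            if cref < 0 or len(self.containers[cref]) >= self.capacity:
                cref += 1                      # open a new container
                self.containers[cref] = {}
            seg_id = len(self.containers[cref])  # unique within the container only
            self.containers[cref][seg_id] = (seg_ref, segment)
            self.by_segment_ref[seg_ref] = (cref, seg_id)
        return self.by_segment_ref[seg_ref]

    def write_block(self, segments):
        """Return a deduplicated block: (container refs, ordered segment indicators)."""
        local_ids = {}   # container ref -> identifier unique within this block
        indicators = []
        for segment in segments:
            cref, seg_id = self._place(segment)
            local = local_ids.setdefault(cref, len(local_ids))
            indicators.append((local, seg_id))
        return list(local_ids), indicators  # list position is the local identifier

    def read_block(self, block):
        container_refs, indicators = block
        return [self.containers[container_refs[local]][seg_id][1]
                for local, seg_id in indicators]
```

Keeping the indicators local to the block means each one needs only two small integers, while the full container reference is stored once per block.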
-
Publication No.: US10459961B2
Publication date: 2019-10-29
Application No.: US15667267
Filing date: 2017-08-02
Applicant: HUAWEI TECHNOLOGIES CO., LTD.
Inventor: Yehonatan David, Yair Toaff, Michael Hirsch
Abstract: A system for segmenting an input data stream using vector processing, comprising a processor adapted to repeat the following steps throughout an input data stream to create a segmented data stream consisting of a plurality of segments: apply a rolling sequence over a sequence of consecutive data items of an input data stream, the rolling sequence including a subset of consecutive data items of the sequence; calculate concurrently a plurality of partial hash values, each by one of a plurality of processing pipelines of the processor, each for a respective one of a plurality of partial rolling sequences each including evenly spaced data items of the subset; determine compliance of each of the plurality of partial hash values with one or more respective partial segmentation criteria; and designate the sequence as a variable size segment when at least some of the partial hash values comply with the respective partial segmentation criteria.
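The idea of hashing evenly spaced items in separate lanes can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the partial hash is a plain byte sum, the criterion is a bit-mask test, all lanes must comply, and the window is recomputed at each position instead of being updated incrementally. `LANES`, `WINDOW`, `MASK`, and the function names are hypothetical; on real hardware each lane would map to a SIMD lane or pipeline.

```python
LANES = 4     # assumed number of partial rolling sequences
WINDOW = 16   # assumed rolling-sequence length in bytes
MASK = 0x3    # assumed criterion: partial hash & MASK == 0


def partial_hashes(window):
    """One toy hash per lane; lane k covers items k, k+LANES, k+2*LANES, ..."""
    return [sum(window[lane::LANES]) & 0xFF for lane in range(LANES)]


def is_cut(window, required=LANES):
    """True when at least `required` partial hashes meet the criterion."""
    compliant = sum(1 for h in partial_hashes(window) if h & MASK == 0)
    return compliant >= required


def segment(data):
    """Split data into variable-size segments that end where is_cut holds."""
    segments, start = [], 0
    for end in range(WINDOW, len(data) + 1):
        if end - start >= WINDOW and is_cut(data[end - WINDOW:end]):
            segments.append(data[start:end])
            start = end
    if start < len(data):
        segments.append(data[start:])   # trailing remainder
    return segments
```

For example, `partial_hashes(bytes(range(16)))` groups the bytes as (0, 4, 8, 12), (1, 5, 9, 13), and so on, one group per lane. Lowering `required` models the "at least some" wording of the abstract.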