Range-based data deduplication using a hash table with entries replaced based on address alignment information

Invention Grant

US09921773B2 Range-based data deduplication using a hash table with entries replaced based on address alignment information (In force)

Patent Title: Range-based data deduplication using a hash table with entries replaced based on address alignment information
Application No.: US14743520

Application Date: 2015-06-18
Publication No.: US09921773B2

Publication Date: 2018-03-20
Inventor: Ivan Georgiev
Applicant: Citrix Systems, Inc.
Applicant Address: US FL Fort Lauderdale
Assignee: Citrix Systems, Inc.
Current Assignee: Citrix Systems, Inc.
Current Assignee Address: US FL Fort Lauderdale
Agency: BainwoodHuang
Main IPC: G06F3/06
IPC: G06F3/06 ; G06F17/30

Abstract:

Deduplicated data storage is provided by presenting a virtual volume mapped by a translation table to a physical volume of a physical data storage system. The translation table maps sets of ranges of duplicate data blocks of the virtual volume to corresponding individual ranges of shared data blocks of the physical volume. A hash table for identifying duplicate data is indexed by a portion of a hash value calculated from newly written data blocks, and has entries each identifying an address alignment of the corresponding data block. In operation, existing entries are replaced with new entries for colliding data blocks having better address alignment, promoting wider address-space separation of the entries. Upon occurrence of a hit in the hash table, for a given data block in a range of newly written data blocks, data blocks of the range are compared to corresponding blocks in a range identified by the hit to maximize a size of a region to be identified by the translation table as duplicate data.
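
The two mechanisms described in the abstract, alignment-based replacement of colliding hash table entries and extension of a hit into the largest matching range, can be illustrated with a minimal sketch. It is not the patented implementation. The names `alignment`, `DedupHashTable`, and `extend_match` are hypothetical, and several details are assumptions made for illustration: alignment is measured as the number of trailing zero bits in a block address, SHA-256 stands in for the unspecified hash function, the full digest is kept in each entry to confirm a hit, and the physical volume is modeled as a Python list of blocks.

```python
import hashlib
from typing import List, Optional, Tuple


def alignment(addr: int) -> int:
    """Assumed alignment measure: trailing zero bits of a block address."""
    if addr == 0:
        return 64  # assumption: treat address 0 as maximally aligned
    return (addr & -addr).bit_length() - 1


class DedupHashTable:
    """One entry per index; the index is a portion of the block's hash value."""

    def __init__(self, index_bits: int = 16) -> None:
        self.index_bits = index_bits
        # index -> (full digest, block address, alignment of that address)
        self.slots = {}

    def _index(self, digest: bytes) -> int:
        # Use the top `index_bits` bits of the first 32 bits of the digest.
        return int.from_bytes(digest[:4], "big") >> (32 - self.index_bits)

    def insert(self, data: bytes, addr: int) -> bool:
        """Store an entry; on collision keep whichever is better aligned."""
        digest = hashlib.sha256(data).digest()
        idx = self._index(digest)
        new_align = alignment(addr)
        old = self.slots.get(idx)
        if old is None or new_align > old[2]:
            self.slots[idx] = (digest, addr, new_align)
            return True
        return False

    def lookup(self, data: bytes) -> Optional[int]:
        """Return the address of a stored block with the same digest, if any."""
        digest = hashlib.sha256(data).digest()
        entry = self.slots.get(self._index(digest))
        if entry is not None and entry[0] == digest:
            return entry[1]
        return None


def extend_match(
    new_blocks: List[bytes], hit_i: int, volume: List[bytes], hit_addr: int
) -> Tuple[int, int, int]:
    """Grow a single-block hit backward and forward while blocks match.

    Returns (start index in new_blocks, start address in volume, length).
    """
    back = 0
    while (
        hit_i - back - 1 >= 0
        and hit_addr - back - 1 >= 0
        and new_blocks[hit_i - back - 1] == volume[hit_addr - back - 1]
    ):
        back += 1
    fwd = 0
    while (
        hit_i + fwd + 1 < len(new_blocks)
        and hit_addr + fwd + 1 < len(volume)
        and new_blocks[hit_i + fwd + 1] == volume[hit_addr + fwd + 1]
    ):
        fwd += 1
    return hit_i - back, hit_addr - back, back + fwd + 1
```

In this sketch, preferring better-aligned addresses on collision tends to spread the surviving entries across the address space, so a single hit can anchor a long run found by `extend_match`. The resulting (start, address, length) triple is what a translation table would record as one duplicate range.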

Published/granted documents

US20150370495A1 RANGE-BASED DATA DEDUPLICATION, publication date: 2015-12-24

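IPC classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models: see G06N)
G06F3/00	Input arrangements for transferring data to be processed into a form capable of being handled by the computer; output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
G06F3/06	. Digital input from, or digital output to, record carriers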