Invention Application
US20150205816A1 SYSTEM AND METHOD FOR ORGANIZING DATA TO FACILITATE DATA DEDUPLICATION
审中-公开
用于组织数据以促进数据重复的系统和方法
- Patent Title: SYSTEM AND METHOD FOR ORGANIZING DATA TO FACILITATE DATA DEDUPLICATION
- Patent Title (中): 用于组织数据以促进数据重复的系统和方法
-
Application No.: US14552292Application Date: 2014-11-24
-
Publication No.: US20150205816A1Publication Date: 2015-07-23
- Inventor: Subramanian Periyagaram , Rahul Khona , Dnyaneshwar Pawar , Sandeep Yadav
- Applicant: NetApp, Inc.
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
A technique for organizing data to facilitate data deduplication includes dividing a block-based set of data into multiple “chunks”, where the chunk boundaries are independent of the block boundaries (due to the hashing algorithm). Metadata of the data set, such as block pointers for locating the data, are stored in a tree structure that includes multiple levels, each of which includes at least one node. The lowest level of the tree includes multiple nodes that each contain chunk metadata relating to the chunks of the data set. In each node of the lowest level of the buffer tree, the chunk metadata contained therein identifies at least one of the chunks. The chunks (user-level data) are stored in one or more system files that are separate from the buffer tree and not visible to the user.
Information query