SYSTEM AND METHOD FOR ORGANIZING DATA TO FACILITATE DATA DEDUPLICATION
    1.
    发明申请
    SYSTEM AND METHOD FOR ORGANIZING DATA TO FACILITATE DATA DEDUPLICATION 审中-公开
    用于组织数据以促进数据重复的系统和方法

    公开(公告)号:US20150205816A1

    公开(公告)日:2015-07-23

    申请号:US14552292

    申请日:2014-11-24

    申请人: NetApp, Inc.

    IPC分类号: G06F17/30

    摘要: A technique for organizing data to facilitate data deduplication includes dividing a block-based set of data into multiple “chunks”, where the chunk boundaries are independent of the block boundaries (due to the hashing algorithm). Metadata of the data set, such as block pointers for locating the data, are stored in a tree structure that includes multiple levels, each of which includes at least one node. The lowest level of the tree includes multiple nodes that each contain chunk metadata relating to the chunks of the data set. In each node of the lowest level of the buffer tree, the chunk metadata contained therein identifies at least one of the chunks. The chunks (user-level data) are stored in one or more system files that are separate from the buffer tree and not visible to the user.

    摘要翻译: 用于组织数据以便于重复数据删除的技术包括将基于块的数据集划分成多个“块”,其中块边界独立于块边界(由于散列算法)。 数据集的元数据(例如用于定位数据的块指针)被存储在包括多个级别的树结构中,每个级别包括至少一个节点。 树的最低级别包括多个节点,每个节点包含与数据集块大小相关的块元数据。 在缓冲树的最低级别的每个节点中,其中包含的块元数据标识至少一个块。 块(用户级数据)存储在与缓冲树分开的一个或多个系统文件中,并且对用户不可见。