SYSTEM AND METHOD OF SEARCHING FOR DUPLICATE DATA
    2.
    Type: invention patent application
    Legal status: in force

    Publication No.: US20090228484A1

    Publication date: 2009-09-10

    Application No.: US12130514

    Filing date: 2008-05-30

    IPC classification: G06F17/30 G06F7/02

    CPC classification: G06F11/1453 G06F17/30153

    Abstract: A computer-implemented method and system obtain current signatures of data chunks and perform a proximity search of a library of previous signatures as a function of the likely location of the corresponding data chunks. A full search of the library of previous signatures is also performed for those current signatures not found in the proximity search.
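
The two-stage lookup described in this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the SHA-256 signature, the `window` parameter, the rule that the likely location of a chunk is one past the previous match, and the names `signature` and `find_duplicates` are all assumptions made for the example.

```python
import hashlib

def signature(chunk: bytes) -> str:
    # Assumption: a SHA-256 digest stands in for the chunk signature.
    return hashlib.sha256(chunk).hexdigest()

def find_duplicates(current_chunks, previous_signatures, window=1):
    """Map current chunk index -> index of a matching previous signature."""
    matches = {}
    missed = []   # (index, signature) pairs not found by the proximity search
    hint = 0      # assumed likely location of the next chunk in the library

    # Stage 1: proximity search around the likely location.
    for i, chunk in enumerate(current_chunks):
        sig = signature(chunk)
        lo = max(0, hint - window)
        hi = min(len(previous_signatures), hint + window + 1)
        pos = next((j for j in range(lo, hi) if previous_signatures[j] == sig), None)
        if pos is None:
            missed.append((i, sig))
            hint += 1          # assume the stream simply advanced by one chunk
        else:
            matches[i] = pos
            hint = pos + 1     # the next chunk likely follows the match

    # Stage 2: full search of the library, only for the misses.
    if missed:
        full_index = {}
        for j, sig in enumerate(previous_signatures):
            full_index.setdefault(sig, j)
        for i, sig in missed:
            if sig in full_index:
                matches[i] = full_index[sig]
    return matches

previous = [signature(c) for c in (b"a", b"b", b"c", b"d", b"e")]
result = find_duplicates([b"a", b"b", b"x", b"d", b"e", b"a"], previous)
```

In this run the chunks that keep their relative order are resolved by the cheap proximity scan, the relocated `b"a"` is resolved only by the full search, and `b"x"` is reported as new data by its absence from `result`.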


    Duplicate backup data identification and consolidation
    3.
    Type: granted invention patent
    Legal status: in force

    Publication No.: US08504528B2

    Publication date: 2013-08-06

    Application No.: US12614765

    Filing date: 2009-11-09

    IPC classification: G06F7/00 G06F17/00 G06F17/30

    CPC classification: G06F11/1453 G06F2201/83

    Abstract: The various embodiments herein operate to identify, consolidate, and reduce redundant backup data storage. One embodiment includes storing data blocks and first signatures of data chunks of each stored data block, the first signature of each data chunk including a reference to a storage location of the data chunk within a stored data block, the stored data blocks including data blocks of previous and recent backup sessions. Some embodiments further include storing second signatures in a second signature repository, where the second signatures are calculated based on determined boundaries of the first signatures from previous backup sessions. At least one of the second signatures is calculated based on at least two first signatures and, in some embodiments, on 32 to 64 first signatures. Some embodiments may identify data chunks of the recent backup session that were already present in the stored data blocks prior to the recent backup session.
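
A rough sketch of the two signature levels described in this abstract is given below. It is illustrative only. The abstract does not say how the boundaries between groups of first signatures are determined, so the fixed groups of 32 used here are an assumption, as are the SHA-256 digests, the 4-byte chunk size, and the names `FirstSignature`, `first_signatures`, `second_signatures`, and `matched_runs`.

```python
import hashlib
from dataclasses import dataclass

@dataclass(frozen=True)
class FirstSignature:
    digest: bytes   # hash of one data chunk
    block_id: int   # stored data block that holds the chunk
    offset: int     # location of the chunk inside that block

def first_signatures(block_id, block, chunk_size=4):
    # Assumption: fixed-size chunks; each signature keeps its storage location.
    return [
        FirstSignature(hashlib.sha256(block[o:o + chunk_size]).digest(), block_id, o)
        for o in range(0, len(block), chunk_size)
    ]

def second_signatures(firsts, group_size=32):
    """Map each second signature to the first signature that starts its group."""
    repo = {}
    # Assumption: group boundaries fall every `group_size` first signatures.
    for start in range(0, len(firsts) - group_size + 1, group_size):
        group = firsts[start:start + group_size]
        digest = hashlib.sha256(b"".join(s.digest for s in group)).digest()
        repo.setdefault(digest, group[0])
    return repo

def matched_runs(previous_repo, recent_firsts, group_size=32):
    """Return (recent_offset, previous_location) for each group already stored."""
    recent_repo = second_signatures(recent_firsts, group_size)
    return [
        (recent_start.offset, previous_repo[digest])
        for digest, recent_start in recent_repo.items()
        if digest in previous_repo
    ]

# Previous session: 256 bytes -> 64 chunks -> two groups of 32 first signatures.
previous = first_signatures(1, bytes(range(256)))
# Recent session: identical except for the final byte, so only group 1 matches.
recent = first_signatures(2, bytes(range(255)) + b"\x00")
runs = matched_runs(second_signatures(previous), recent)
```

One lookup in the second signature repository confirms a run of 32 chunks at once. Chunks in groups that do not match would still need to be compared by their individual first signatures.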


    DUPLICATE BACKUP DATA IDENTIFICATION AND CONSOLIDATION
    4.
    Type: invention patent application
    Legal status: in force

    Publication No.: US20110113013A1

    Publication date: 2011-05-12

    Application No.: US12614765

    Filing date: 2009-11-09

    IPC classification: G06F12/16 G06F17/30 G06F12/00

    CPC classification: G06F11/1453 G06F2201/83

    Abstract: The various embodiments herein operate to identify, consolidate, and reduce redundant backup data storage. One embodiment includes storing data blocks and first signatures of data chunks of each stored data block, the first signature of each data chunk including a reference to a storage location of the data chunk within a stored data block, the stored data blocks including data blocks of previous and recent backup sessions. Some embodiments further include storing second signatures in a second signature repository, where the second signatures are calculated based on determined boundaries of the first signatures from previous backup sessions. At least one of the second signatures is calculated based on at least two first signatures and, in some embodiments, on 32 to 64 first signatures. Some embodiments may identify data chunks of the recent backup session that were already present in the stored data blocks prior to the recent backup session.


    INTEGRATING CLIENT AND SERVER DEDUPLICATION SYSTEMS
    5.
    Type: invention patent application
    Legal status: under examination (published)

    Publication No.: US20120011101A1

    Publication date: 2012-01-12

    Application No.: US12834616

    Filing date: 2010-07-12

    IPC classification: G06F17/30 G06F15/16 H04L9/00

    CPC classification: H04L69/04 H04L63/12

    Abstract: According to one embodiment of the present invention, a method for integrating client and server deduplication systems may be provided. In this method, a first hash set of a previous backup session may be received from a server. The first hash set may comprise a plurality of cryptographic values generated using a plurality of data blocks of a first data set of a client. A second hash set may be generated using a plurality of data blocks of a second data set of the client. A deduplicated data set may be generated by the client according to the first hash set and the second hash set and may comprise a plurality of non-redundant data blocks of the second data set. The second hash set and the deduplicated data set may be transmitted to the server.
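
The client-side step in this abstract might look roughly like the sketch below. It is a hedged illustration rather than the claimed method: SHA-256 as the cryptographic value, the in-memory data structures, and the names `block_hash` and `client_backup` are assumptions, and the transfer to and from the server is left out.

```python
import hashlib

def block_hash(block: bytes) -> str:
    # Assumption: SHA-256 stands in for the cryptographic value of a block.
    return hashlib.sha256(block).hexdigest()

def client_backup(first_hash_set, blocks):
    """Build the second hash set and the deduplicated data set on the client.

    first_hash_set: hashes of the previous backup session, as received
                    from the server.
    blocks:         data blocks of the client's current (second) data set.
    """
    second_hash_set = []   # one hash per block, in order, for the server
    deduplicated = {}      # hash -> block, only blocks the server lacks
    for block in blocks:
        h = block_hash(block)
        second_hash_set.append(h)
        # Skip blocks known from the previous session or already queued.
        if h not in first_hash_set and h not in deduplicated:
            deduplicated[h] = block
    return second_hash_set, deduplicated

# Previous session held blocks "a" and "b"; the current one holds "a", "c", "c".
first_hash_set = {block_hash(b"a"), block_hash(b"b")}
second_hash_set, deduplicated = client_backup(first_hash_set, [b"a", b"c", b"c"])
```

Only `b"c"` would be transmitted, once, together with the three-entry second hash set. The server could then rebuild the full data set from the hashes and the blocks it already stores.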
