Multi-Level Deduplication
    6.
    Patent application
    Multi-Level Deduplication (in force)

    Publication No.: US20160239511A1

    Publication date: 2016-08-18

    Application No.: US14625112

    Filing date: 2015-02-18

    IPC classification: G06F17/30

    Abstract: A method, a system, and a computer-implemented method for performing multi-level deduplication of data are disclosed. A zone stamp is generated for each zone in a plurality of zones contained in at least one data stream. The zone stamp is compared to another zone stamp. The zone stamp and another zone stamp represent zones in the plurality of zones. The comparison is performed for zones at corresponding zone levels based on a determination that a zone stamp of a zone of a preceding zone level is not similar to another zone stamp of another preceding zone level. The zone at the preceding zone level includes at least one zone of a next zone level having a size smaller than or equal to a size of the zone of the preceding zone level. The zone and another zone are deduplicated based on a determination that the zone stamp is similar to another zone stamp.
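The level-by-level comparison described in this abstract can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration and is not taken from the patent: the stamp function (`zone_stamp`, the k smallest SHA-1 digests of byte shingles), the similarity test (`similar`, a Jaccard threshold), the positional pairing of sub-zones, and the halving rule in `split`. The sketch only shows the control flow: compare stamps at one level, deduplicate on a match, and otherwise descend to the smaller zones of the next level.

```python
import hashlib

def zone_stamp(data: bytes, k: int = 8, shingle: int = 4) -> frozenset:
    """Hypothetical stamp: the k smallest SHA-1 digests of overlapping byte shingles."""
    digests = {
        hashlib.sha1(data[i:i + shingle]).digest()
        for i in range(max(1, len(data) - shingle + 1))
    }
    return frozenset(sorted(digests)[:k])

def similar(a: frozenset, b: frozenset, threshold: float = 0.5) -> bool:
    """Assumed similarity test: Jaccard overlap of the two stamps."""
    union = a | b
    return bool(union) and len(a & b) / len(union) >= threshold

def split(data: bytes, parts: int = 2) -> list:
    """Cut a zone into next-level zones that are no larger than the parent."""
    step = max(1, -(-len(data) // parts))  # ceiling division
    return [data[i:i + step] for i in range(0, len(data), step)]

def match_zones(new: bytes, old: bytes, level: int = 0, max_level: int = 2) -> list:
    """Return (level, new_zone, old_zone) triples that would be deduplicated."""
    if similar(zone_stamp(new), zone_stamp(old)):
        return [(level, new, old)]          # stamps similar: deduplicate here
    if level == max_level:
        return []                           # no finer level left to try
    matches = []
    # Stamps differ at this level, so compare zones of the next level.
    for sub_new, sub_old in zip(split(new), split(old)):
        matches += match_zones(sub_new, sub_old, level + 1, max_level)
    return matches
```

Under these assumptions, two streams that share only their first half fail the level-0 comparison, and the shared half is picked up at level 1.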

    Adaptive scheduled periodic caching
    7.
    Granted patent
    Adaptive scheduled periodic caching (in force)

    Publication No.: US09223812B2

    Publication date: 2015-12-29

    Application No.: US14084136

    Filing date: 2013-11-19

    IPC classification: G06F17/30

    Abstract: A system, a method, and a computer program product for adaptive scheduled periodic caching are disclosed. A data stream is received. The data stream contains a plurality of versions of data arranged in a plurality of data clusters. Each data cluster includes an anchor version having a plurality of versions of data dependent on the anchor version. A size of each anchor version of each data cluster is determined. A number of versions of data dependent on each anchor version is also determined. For each anchor version, a ratio of the determined number of dependent versions of data to the determined size of each anchor is computed. At least one anchor version for storing in a memory location is selected based on the computed ratio.
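The ratio-based selection in this abstract lends itself to a short worked example. The Python sketch below computes, for each anchor version, the number of dependent versions divided by the anchor size, then picks anchors in descending ratio order. The `Anchor` type, the `select_anchors` function, the fixed `capacity_bytes` budget, and the greedy fill are assumptions added for illustration. The abstract states only that selection is based on the computed ratio.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Anchor:
    name: str
    size_bytes: int   # assumed to be positive
    dependents: int   # number of versions that depend on this anchor

    @property
    def ratio(self) -> float:
        """Dependent versions per byte of anchor, as described in the abstract."""
        return self.dependents / self.size_bytes

def select_anchors(anchors, capacity_bytes: int) -> list:
    """Greedily cache the anchors with the highest ratio that still fit (assumed policy)."""
    chosen, used = [], 0
    for anchor in sorted(anchors, key=lambda a: a.ratio, reverse=True):
        if used + anchor.size_bytes <= capacity_bytes:
            chosen.append(anchor)
            used += anchor.size_bytes
    return chosen

anchors = [
    Anchor("A", size_bytes=100, dependents=10),  # ratio 0.10
    Anchor("B", size_bytes=50, dependents=10),   # ratio 0.20
    Anchor("C", size_bytes=200, dependents=4),   # ratio 0.02
]
cached = [a.name for a in select_anchors(anchors, capacity_bytes=150)]
```

With a 150-byte budget, `cached` holds B and then A. C has the lowest ratio and no longer fits.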

    ADAPTIVE SCHEDULED PERIODIC CACHING
    8.
    Patent application
    ADAPTIVE SCHEDULED PERIODIC CACHING (in force)

    Publication No.: US20140143219A1

    Publication date: 2014-05-22

    Application No.: US14084136

    Filing date: 2013-11-19

    IPC classification: G06F17/30

    Abstract: A system, a method, and a computer program product for adaptive scheduled periodic caching are disclosed. A data stream is received. The data stream contains a plurality of versions of data arranged in a plurality of data clusters. Each data cluster includes an anchor version having a plurality of versions of data dependent on the anchor version. A size of each anchor version of each data cluster is determined. A number of versions of data dependent on each anchor version is also determined. For each anchor version, a ratio of the determined number of dependent versions of data to the determined size of each anchor is computed. At least one anchor version for storing in a memory location is selected based on the computed ratio.

    Scalable Grid Deduplication
    10.
    Patent application
    Scalable Grid Deduplication (under examination, published)

    Publication No.: US20160253351A1

    Publication date: 2016-09-01

    Application No.: US14633366

    Filing date: 2015-02-27

    IPC classification: G06F17/30 H04L29/08

    Abstract: A system, a method, and a computer program product for performing deduplication of data using a scalable deduplication grid are disclosed. A listing of a plurality of zone stamps is generated, where each zone stamp represents a zone in the plurality of zones in a data stream. The listing contains a logical arrangement of the plurality of zone stamps obtained from each storage location and being accessible by a plurality of servers. A first zone stamp in the listing is compared to a second zone stamp in the listing. The first and second zones are delta-compressed based on a determination that the first zone stamp is substantially similar to the second zone stamp. A server is selected to perform the comparison and delta-compression.
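As a rough illustration of the flow in this abstract, the Python sketch below keeps a shared listing of zone stamps, looks for a listed stamp that is similar to a new zone's stamp, selects a server, and delta-compresses the new zone against the matching one. Nothing here is from the patent: the chunk-set stamp (`make_stamp`), the Jaccard threshold, the least-loaded server rule, and the use of a zlib preset dictionary as a stand-in for delta compression are all assumptions, and zone payloads are inlined in the listing only to keep the example short.

```python
import zlib
from dataclasses import dataclass

@dataclass(frozen=True)
class Entry:
    stamp: frozenset  # hypothetical zone stamp
    location: str     # storage location that holds the zone
    zone: bytes       # zone payload, inlined here for brevity

def make_stamp(data: bytes, width: int = 4) -> frozenset:
    """Hypothetical stamp: the set of fixed-width chunks in the zone."""
    return frozenset(data[i:i + width] for i in range(0, len(data), width))

def similar(a: frozenset, b: frozenset, threshold: float = 0.75) -> bool:
    """Assumed test for 'substantially similar': Jaccard overlap of the stamps."""
    union = a | b
    return bool(union) and len(a & b) / len(union) >= threshold

def delta_compress(zone: bytes, reference: bytes) -> bytes:
    """Stand-in for delta compression: deflate with the reference as preset dictionary."""
    comp = zlib.compressobj(zdict=reference)
    return comp.compress(zone) + comp.flush()

def delta_decompress(delta: bytes, reference: bytes) -> bytes:
    decomp = zlib.decompressobj(zdict=reference)
    return decomp.decompress(delta) + decomp.flush()

def dedup_on_grid(new_zone: bytes, listing: list, server_load: dict):
    """Return (server, location, delta) for the first similar entry, else None."""
    stamp = make_stamp(new_zone)
    for entry in listing:
        if similar(stamp, entry.stamp):
            server = min(server_load, key=server_load.get)  # assumed: least-loaded server
            return server, entry.location, delta_compress(new_zone, entry.zone)
    return None

reference = b"0123456789abcdef" * 8
listing = [Entry(make_stamp(reference), "site-1", reference)]
new_zone = reference + b"XYZ!"
result = dedup_on_grid(new_zone, listing, {"server-a": 3, "server-b": 1})
```

Here `result` names `server-b` and `site-1`, and `delta_decompress(result[2], reference)` restores `new_zone`. A zone with no similar stamp in the listing returns `None`.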
