Garbage collection aware deduplication
    3.
    发明授权
    Garbage collection aware deduplication 有权
    垃圾收集感知重复数据删除

    公开(公告)号:US09563632B2

    公开(公告)日:2017-02-07

    申请号:US13555534

    申请日:2012-07-23

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30129

    摘要: Mechanisms are provided for improving the efficiency of garbage collection in a deduplication system by intelligently managing storage of deduplication segments. When a duplicate segment is identified, a reference count for an already maintained segment is incremented only if the already maintained segment has the same lifecycle as the identified duplicate segment. In some instances, an already maintained segment is assumed to have the same lifecycle if it is not stale or the age is not significantly different from the age of the newly identified duplicate. If the already maintained segment is has a different lifecycle, the new segment is stored again even though duplicates are already maintained.

    摘要翻译: 提供了通过智能管理重复数据删除段的存储来提高重复数据删除系统中垃圾收集效率的机制。 当识别出重复段时,仅当已维护的段与所标识的重复段具有相同的生命周期时,已经维护段的引用计数才会递增。 在某些情况下,假设已维护的段具有相同的生命周期,如果它不是过时的,或者年龄与新识别的副本的年龄没有显着差异。 如果已维护的段具有不同的生命周期,即使已经维护了重复项,新的段也将被再次存储。

    GARBAGE COLLECTION AWARE DEDUPLICATION
    5.
    发明申请
    GARBAGE COLLECTION AWARE DEDUPLICATION 有权
    GARBAGE收藏品评估

    公开(公告)号:US20140025644A1

    公开(公告)日:2014-01-23

    申请号:US13555534

    申请日:2012-07-23

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30129

    摘要: Mechanisms are provided for improving the efficiency of garbage collection in a deduplication system by intelligently managing storage of deduplication segments. When a duplicate segment is identified, a reference count for an already maintained segment is incremented only if the already maintained segment has the same lifecycle as the identified duplicate segment. In some instances, an already maintained segment is assumed to have the same lifecycle if it is not stale or the age is not significantly different from the age of the newly identified duplicate. If the already maintained segment is has a different lifecycle, the new segment is stored again even though duplicates are already maintained.

    摘要翻译: 提供了通过智能管理重复数据删除段的存储来提高重复数据删除系统中垃圾收集效率的机制。 当识别出重复段时,仅当已维护的段与所标识的重复段具有相同的生命周期时,已经维护段的引用计数才会递增。 在某些情况下,假设已维护的段具有相同的生命周期,如果它不是过时的,或者年龄与新识别的副本的年龄没有显着差异。 如果已维护的段具有不同的生命周期,即使已经维护了重复项,新的段也将被再次存储。

    Methods and apparatus for efficient compression and deduplication
    7.
    发明授权
    Methods and apparatus for efficient compression and deduplication 有权
    用于高效压缩和重复数据删除的方法和设备

    公开(公告)号:US08423520B2

    公开(公告)日:2013-04-16

    申请号:US12624354

    申请日:2009-11-23

    IPC分类号: G06F17/30

    摘要: Mechanisms are provided for performing efficient compression and deduplication of data segments. Compression algorithms are learning algorithms that perform better when data segments are large. Deduplication algorithms, however, perform better when data segments are small, as more duplicate small segments are likely to exist. As an optimizer is processing and storing data segments, the optimizer applies the same compression context to compress multiple individual deduplicated data segments as though they are one segment. By compressing deduplicated data segments together within the same context, data reduction can be improved for both deduplication and compression. Mechanisms are applied to compensate for possible performance degradation.

    摘要翻译: 提供了用于执行数据段的有效压缩和重复数据删除的机制。 压缩算法是在数据段较大时表现更好的学习算法。 然而,当数据段很小时,重复数据删除算法的表现会更好,因为可能存在更多重复的小段。 作为优化器处理和存储数据段,优化器应用相同的压缩上下文来压缩多个单独的重复数据删除的数据段,就像它们是一个段一样。 通过在相同的上下文中将重复数据删除的数据段压缩在一起,可以对重复数据删除和压缩进行数据缩减。 应用机制来补偿可能的性能下降。

    METHODS AND APPARATUS FOR ACTIVE OPTIMIZATION OF DATA
    8.
    发明申请
    METHODS AND APPARATUS FOR ACTIVE OPTIMIZATION OF DATA 有权
    有效优化数据的方法和装置

    公开(公告)号:US20110270810A1

    公开(公告)日:2011-11-03

    申请号:US13041282

    申请日:2011-03-04

    IPC分类号: G06F17/30

    摘要: Techniques and mechanisms are provided to support live file optimization. Active I/O access to an optimization target is monitored during optimization. Active files need not be taken offline or made unavailable to an application during optimization and retain the ability to support file operations such as read, write, unlink, and truncate while an optimization engine performs deduplication and/or compression on active file ranges.

    摘要翻译: 提供技术和机制来支持实时文件优化。 在优化期间监控对优化目标的活动I / O访问。 在优化期间,活动文件不需要脱机或不可用于应用程序,并且在优化引擎对活动文件范围执行重复数据删除和/或压缩时,可以保留支持文件操作(如读取,写入,取消链接和截断)的能力。

    Methods and apparatus for active optimization of data
    9.
    发明授权
    Methods and apparatus for active optimization of data 有权
    数据主动优化的方法和装置

    公开(公告)号:US08849773B2

    公开(公告)日:2014-09-30

    申请号:US13041282

    申请日:2011-03-04

    IPC分类号: G06F17/30

    摘要: Techniques and mechanisms are provided to support live file optimization. Active I/O access to an optimization target is monitored during optimization. Active files need not be taken offline or made unavailable to an application during optimization and retain the ability to support file operations such as read, write, unlink, and truncate while an optimization engine performs deduplication and/or compression on active file ranges.

    摘要翻译: 提供技术和机制来支持实时文件优化。 在优化期间监控对优化目标的活动I / O访问。 在优化期间,活动文件不需要脱机或不可用于应用程序,并且在优化引擎对活动文件范围执行重复数据删除和/或压缩时,可以保留支持文件操作(如读取,写入,取消链接和截断)的能力。

    Accelerated deduplication
    10.
    发明授权
    Accelerated deduplication 有权
    加速重复数据删除

    公开(公告)号:US08521705B2

    公开(公告)日:2013-08-27

    申请号:US13309455

    申请日:2011-12-01

    IPC分类号: G06F17/00

    摘要: Mechanisms are provided for accelerated data deduplication. A data stream is received an input interface and maintained in memory. Chunk boundaries are detected and chunk fingerprints are calculated using a deduplication accelerator while a processor maintains a state machine. A deduplication dictionary is accessed using a chunk fingerprint to determine if the associated data chunk has previously been written to persistent memory. If the data chunk has previously been written, reference counts may be updated but the data chunk need not be stored again. Otherwise, datastore suitcases, filemaps, and the deduplication dictionary may be updated to reflect storage of the data chunk. Direct memory access (DMA) addresses are provided to directly transfer a chunk to an output interface as needed.

    摘要翻译: 为加速重复数据删除提供了机制。 数据流被接收到输入接口并保存在存储器中。 检测到块边界,并在处理器维护状态机时使用重复数据消除加速器计算块指纹。 使用块指纹来访问重复数据删除字典,以确定关联的数据块是否先前已写入持久存储器。 如果先前写过数据块,则可能会更新引用计数,但是不需要再次存储数据块。 否则,可能会更新数据存储手提箱,文件映射和重复数据删除字典,以反映数据块的存储。 提供直接存储器访问(DMA)地址以根据需要将块直接传输到输出接口。