File aware block level deduplication
    1.
    Granted patent
    File aware block level deduplication (in force)

    Publication No.: US08510275B2

    Publication date: 2013-08-13

    Application No.: US12858258

    Filing date: 2010-08-17

    IPC classification: G06F17/00

    Abstract: A system provides file aware block level deduplication in a system having multiple clients connected to a storage subsystem over a network such as an Internet Protocol (IP) network. The system includes client components and storage subsystem components. Client components include a walker that traverses the namespace looking for files that meet the criteria for optimization, a file system daemon that rehydrates the files, and a filter driver that watches all operations going to the file system. Storage subsystem components include an optimizer resident on the nodes of the storage subsystem. The optimizer can use idle processor cycles to perform optimization. Sub-file compression can be performed at the storage subsystem.
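The walker described in this abstract can be illustrated with a minimal sketch. The selection criteria used here (a minimum file size and a list of already-compressed extensions) and the name `walk_candidates` are assumptions made for illustration; the abstract does not say which criteria the patented system applies.

```python
import os


def walk_candidates(root, min_size=4096, skip_exts=(".gz", ".zip", ".jpg")):
    """Yield paths under `root` that meet hypothetical optimization criteria."""
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            # Assumed rule: do not follow symbolic links.
            if os.path.islink(path):
                continue
            # Assumed rule: already-compressed formats are poor candidates.
            if name.lower().endswith(skip_exts):
                continue
            # Assumed rule: very small files are not worth optimizing.
            if os.path.getsize(path) >= min_size:
                yield path
```

A caller would hand each yielded path to an optimizer; in the system described above that optimizer runs on the storage subsystem nodes rather than on the client.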

    FILE AWARE BLOCK LEVEL DEDUPLICATION
    2.
    Patent application
    FILE AWARE BLOCK LEVEL DEDUPLICATION (in force)

    Publication No.: US20110071989A1

    Publication date: 2011-03-24

    Application No.: US12858258

    Filing date: 2010-08-17

    IPC classification: G06F17/30

    Abstract: A system provides file aware block level deduplication in a system having multiple clients connected to a storage subsystem over a network such as an Internet Protocol (IP) network. The system includes client components and storage subsystem components. Client components include a walker that traverses the namespace looking for files that meet the criteria for optimization, a file system daemon that rehydrates the files, and a filter driver that watches all operations going to the file system. Storage subsystem components include an optimizer resident on the nodes of the storage subsystem. The optimizer can use idle processor cycles to perform optimization. Sub-file compression can be performed at the storage subsystem.

    Methods and apparatus for efficient compression and deduplication
    5.
    Granted patent
    Methods and apparatus for efficient compression and deduplication (in force)

    Publication No.: US08423520B2

    Publication date: 2013-04-16

    Application No.: US12624354

    Filing date: 2009-11-23

    IPC classification: G06F17/30

    Abstract: Mechanisms are provided for performing efficient compression and deduplication of data segments. Compression algorithms are learning algorithms that perform better when data segments are large. Deduplication algorithms, however, perform better when data segments are small, as more duplicate small segments are likely to exist. As an optimizer is processing and storing data segments, the optimizer applies the same compression context to compress multiple individual deduplicated data segments as though they are one segment. By compressing deduplicated data segments together within the same context, data reduction can be improved for both deduplication and compression. Mechanisms are applied to compensate for possible performance degradation.
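The idea of compressing several small deduplicated segments within one compression context can be sketched with Python's standard `zlib` module. This is an illustration under stated assumptions, not the patented mechanism: the use of DEFLATE, the sync-flush delimiting of segments, and the function names are all choices made for this example.

```python
import zlib


def compress_independently(segments):
    """Baseline: every segment gets a fresh compression context."""
    return [zlib.compress(s) for s in segments]


def compress_shared_context(segments):
    """Compress segments one by one while reusing a single context.

    A sync flush after each segment ends its output on a byte boundary,
    so segment boundaries are kept, while the compressor's history
    window carries over to the next segment.
    """
    comp = zlib.compressobj()
    pieces = [comp.compress(s) + comp.flush(zlib.Z_SYNC_FLUSH) for s in segments]
    pieces.append(comp.flush())  # stream trailer
    return pieces


def decompress_shared_context(pieces):
    """Recover the original segments; pieces must be fed in order."""
    decomp = zlib.decompressobj()
    segments = [decomp.decompress(p) for p in pieces[:-1]]
    decomp.decompress(pieces[-1])  # consume the trailer
    return segments
```

The trade-off in this sketch is that a segment can only be read after the pieces before it in the same context have been decompressed, which is one kind of performance cost that a real system would have to bound, for example by limiting how many segments share a context.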

    METHODS AND APPARATUS FOR ACTIVE OPTIMIZATION OF DATA
    6.
    Patent application
    METHODS AND APPARATUS FOR ACTIVE OPTIMIZATION OF DATA (in force)

    Publication No.: US20110270810A1

    Publication date: 2011-11-03

    Application No.: US13041282

    Filing date: 2011-03-04

    IPC classification: G06F17/30

    Abstract: Techniques and mechanisms are provided to support live file optimization. Active I/O access to an optimization target is monitored during optimization. Active files need not be taken offline or made unavailable to an application during optimization and retain the ability to support file operations such as read, write, unlink, and truncate while an optimization engine performs deduplication and/or compression on active file ranges.
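One way to picture optimization of a file that stays online is an optimistic check: the optimizer notes a per-range write counter, does its work, and discards the result if the range was written in the meantime. The sketch below assumes this scheme purely for illustration; the abstract only states that active I/O is monitored, and the names `ActiveFile`, `optimize_range`, and the `during` hook are hypothetical.

```python
import zlib


class ActiveFile:
    """In-memory stand-in for a file that stays writable during optimization."""

    def __init__(self, data, range_size=4):
        self.data = bytearray(data)
        self.range_size = range_size
        self.versions = {}  # range index -> number of writes seen

    def write(self, offset, payload):
        self.data[offset:offset + len(payload)] = payload
        first = offset // self.range_size
        last = (offset + len(payload) - 1) // self.range_size
        for r in range(first, last + 1):
            self.versions[r] = self.versions.get(r, 0) + 1


def optimize_range(f, r, during=None):
    """Compress range `r`; return None if it was written while we worked."""
    seen = f.versions.get(r, 0)
    start = r * f.range_size
    snapshot = bytes(f.data[start:start + f.range_size])
    compressed = zlib.compress(snapshot)
    if during is not None:
        during()  # stands in for application I/O arriving mid-optimization
    if f.versions.get(r, 0) != seen:
        return None  # stale result: the caller can retry this range later
    return compressed
```

In this sketch the application's write is never blocked; only the optimizer's result is thrown away when the two overlap.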

    Garbage collection aware deduplication
    7.
    Granted patent
    Garbage collection aware deduplication (in force)

    Publication No.: US09563632B2

    Publication date: 2017-02-07

    Application No.: US13555534

    Filing date: 2012-07-23

    IPC classification: G06F17/30

    CPC classification: G06F17/30129

    Abstract: Mechanisms are provided for improving the efficiency of garbage collection in a deduplication system by intelligently managing storage of deduplication segments. When a duplicate segment is identified, a reference count for an already maintained segment is incremented only if the already maintained segment has the same lifecycle as the identified duplicate segment. In some instances, an already maintained segment is assumed to have the same lifecycle if it is not stale or the age is not significantly different from the age of the newly identified duplicate. If the already maintained segment has a different lifecycle, the new segment is stored again even though duplicates are already maintained.
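The reference-count rule in this abstract can be sketched as follows. The lifecycle test used here (creation times within a fixed `max_age_gap`) is an assumption standing in for whatever staleness or age comparison the patented system uses, and `GcAwareStore` and `StoredSegment` are hypothetical names.

```python
import hashlib
from dataclasses import dataclass


@dataclass
class StoredSegment:
    data: bytes
    created_at: float
    refcount: int = 1


class GcAwareStore:
    def __init__(self, max_age_gap=3600.0):
        self.max_age_gap = max_age_gap  # assumed lifecycle threshold, in seconds
        self.by_fingerprint = {}        # fingerprint -> list of stored copies

    def write(self, data, now):
        fingerprint = hashlib.sha256(data).hexdigest()
        copies = self.by_fingerprint.setdefault(fingerprint, [])
        for seg in copies:
            # Share the stored copy only if it is close in age to the new one.
            if now - seg.created_at <= self.max_age_gap:
                seg.refcount += 1
                return seg
        # Duplicate content with a different lifecycle: store it again.
        seg = StoredSegment(data, now)
        copies.append(seg)
        return seg
```

Storing a second copy costs space, but segments that were written together tend to be released together, so whole regions can be reclaimed without one long-lived reference pinning an old segment.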

    Methods and apparatus for active optimization of data
    8.
    Granted patent
    Methods and apparatus for active optimization of data (in force)

    Publication No.: US08849773B2

    Publication date: 2014-09-30

    Application No.: US13041282

    Filing date: 2011-03-04

    IPC classification: G06F17/30

    Abstract: Techniques and mechanisms are provided to support live file optimization. Active I/O access to an optimization target is monitored during optimization. Active files need not be taken offline or made unavailable to an application during optimization and retain the ability to support file operations such as read, write, unlink, and truncate while an optimization engine performs deduplication and/or compression on active file ranges.

    Accelerated deduplication
    9.
    Granted patent
    Accelerated deduplication (in force)

    Publication No.: US08521705B2

    Publication date: 2013-08-27

    Application No.: US13309455

    Filing date: 2011-12-01

    IPC classification: G06F17/00

    Abstract: Mechanisms are provided for accelerated data deduplication. A data stream is received at an input interface and maintained in memory. Chunk boundaries are detected and chunk fingerprints are calculated using a deduplication accelerator while a processor maintains a state machine. A deduplication dictionary is accessed using a chunk fingerprint to determine if the associated data chunk has previously been written to persistent memory. If the data chunk has previously been written, reference counts may be updated but the data chunk need not be stored again. Otherwise, datastore suitcases, filemaps, and the deduplication dictionary may be updated to reflect storage of the data chunk. Direct memory access (DMA) addresses are provided to directly transfer a chunk to an output interface as needed.
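The dictionary lookup flow in this abstract can be sketched in software. The sketch assumes a toy content-defined boundary rule and SHA-256 fingerprints; the hardware accelerator, state machine, and DMA transfers are not modeled, and `chunk_boundaries` and `DedupStore` are hypothetical names. The `datastore` dictionary is a simplified stand-in for the datastore suitcases mentioned above.

```python
import hashlib


def chunk_boundaries(data, mask=0x3F, min_size=16):
    """Toy content-defined chunking: cut where a running hash has its low bits zero."""
    start, h = 0, 0
    for i, byte in enumerate(data):
        h = ((h << 1) + byte) & 0xFFFFFFFF
        if i + 1 - start >= min_size and (h & mask) == 0:
            yield data[start:i + 1]
            start, h = i + 1, 0
    if start < len(data):
        yield data[start:]


class DedupStore:
    def __init__(self):
        self.dictionary = {}  # fingerprint -> reference count
        self.datastore = {}   # fingerprint -> chunk bytes
        self.filemaps = {}    # file name -> ordered list of fingerprints

    def write(self, name, data):
        fingerprints = []
        for chunk in chunk_boundaries(data):
            fp = hashlib.sha256(chunk).digest()
            if fp in self.dictionary:
                self.dictionary[fp] += 1    # seen before: only bump the count
            else:
                self.dictionary[fp] = 1     # new chunk: store it once
                self.datastore[fp] = chunk
            fingerprints.append(fp)
        self.filemaps[name] = fingerprints

    def read(self, name):
        return b"".join(self.datastore[fp] for fp in self.filemaps[name])
```

Writing the same stream under a second name adds a filemap and raises reference counts but stores no new chunk bytes.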

    Active file instant cloning
    10.
    Granted patent
    Active file instant cloning (in force)

    Publication No.: US08396843B2

    Publication date: 2013-03-12

    Application No.: US13076271

    Filing date: 2011-03-30

    IPC classification: G06F17/00

    Abstract: Techniques and mechanisms are provided to instantly clone active files including active optimized files. When a new instance of an active file is created, a new stub is generated in the user namespace and a block map file is cloned. The block map file includes the same offsets and location pointers that existed in the original block map file. No user file data needs to be copied. If the cloned file is later modified, the behavior can be the same as what happens when a de-duplicated file is modified.
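Cloning by copying only the block map can be sketched as follows. The in-memory `BlockStore`, the per-block reference counts, and the copy-on-write behavior in `write_block` are assumptions chosen to illustrate the idea; the abstract does not specify how modification of a cloned or deduplicated file is handled.

```python
from dataclasses import dataclass, field


@dataclass
class BlockStore:
    blocks: dict = field(default_factory=dict)     # location -> bytes
    refcounts: dict = field(default_factory=dict)  # location -> reference count
    next_loc: int = 0

    def put(self, data):
        loc = self.next_loc
        self.next_loc += 1
        self.blocks[loc] = data
        self.refcounts[loc] = 1
        return loc


class Namespace:
    def __init__(self):
        self.store = BlockStore()
        self.block_maps = {}  # stub name -> {offset: location}

    def create(self, name, blocks):
        self.block_maps[name] = {off: self.store.put(b) for off, b in enumerate(blocks)}

    def clone(self, src, dst):
        # New stub plus a copy of the block map: same offsets, same locations.
        bmap = dict(self.block_maps[src])
        for loc in bmap.values():
            self.store.refcounts[loc] += 1
        self.block_maps[dst] = bmap

    def write_block(self, name, offset, data):
        # Assumed copy-on-write: repoint this file only, leave shared data alone.
        bmap = self.block_maps[name]
        old = bmap.get(offset)
        if old is not None:
            self.store.refcounts[old] -= 1
        bmap[offset] = self.store.put(data)

    def read(self, name):
        bmap = self.block_maps[name]
        return [self.store.blocks[bmap[off]] for off in sorted(bmap)]
```

In this sketch a clone costs one map copy regardless of file size, and the two files diverge only at the offsets that are later written.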
