Efficient content meta-data collection and trace generation from deduplicated storage

    公开(公告)号:US08631052B1

    公开(公告)日:2014-01-14

    申请号:US13335746

    申请日:2011-12-22

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30156

    摘要: The method and apparatus collect file recipes from deduplicated data storage systems, the file recipes consist of a list of fingerprints of data chunks of a file. Detailed meta-data for each unique data chunk is also collected. In an offline process, research and analysis can be performed on either the meta-data itself or on a reconstruction of a full trace of meta-data constructed by matching recipe fingerprints to the corresponding meta-data. The method and system can generate the full meta-data trace efficiently in an on-line or off-line process. Typical deduplicated storage systems achieve 10× or higher deduplication rates, and the meta-data collection is faster than processing all of the original files and produces compact meta-data that is smaller to store.

    Efficient content meta-data collection and trace generation from deduplicated storage
    6.
    发明授权
    Efficient content meta-data collection and trace generation from deduplicated storage 有权
    从重复数据删除的存储中高效内容元数据收集和跟踪生成

    公开(公告)号:US08667032B1

    公开(公告)日:2014-03-04

    申请号:US13335750

    申请日:2011-12-22

    IPC分类号: G06F7/00 G06F17/30

    摘要: The method and apparatus collect file recipes from deduplicated data storage systems, the file recipes consist of a list of fingerprints of data chunks of a file. Detailed meta-data for each unique data chunk is also collected. In an offline process, research and analysis can be performed on either the meta-data itself or on a reconstruction of a full trace of meta-data constructed by matching recipe fingerprints to the corresponding meta-data. The method and system can generate the full meta-data trace efficiently in an on-line or off-line process. Typical deduplicated storage systems achieve 10× or higher deduplication rates, and the meta-data collection is faster than processing all of the original files and produces compact meta-data that is smaller to store.

    摘要翻译: 该方法和设备从重复数据删除的数据存储系统收集文件配方,文件配方由文件数据块指纹列表组成。 还收集了每个唯一数据块的详细元数据。 在离线过程中,可以对元数据本身进行研究和分析,也可以对通过将配方指纹与对应的元数据进行匹配而构建的完整的元数据轨迹进行重构。 该方法和系统可以在线或离线过程中有效地生成完整的元数据跟踪。 典型的重复数据删除存储系统实现10倍或更高的重复数据删除率,元数据收集比处理所有原始文件更快,并生成较小存储的紧凑型元数据。

    State-based directing of segments in a multinode deduplicated storage system
    7.
    发明授权
    State-based directing of segments in a multinode deduplicated storage system 有权
    在多节点重复数据删除的存储系统中基于状态的段指导

    公开(公告)号:US08751448B1

    公开(公告)日:2014-06-10

    申请号:US12653313

    申请日:2009-12-11

    IPC分类号: G06F7/00 G06F17/00

    摘要: A system for directing for storage includes a processor and a memory. The processor is configured to determine a segment overlap for each of a plurality of nodes. The processor is further configured to determine a selected node of the plurality of nodes based at least in part on the segment overlap for each of the plurality of nodes and based at least in part on a selection criteria. The memory is coupled to the processor and configured to provide the processor with instructions.

    摘要翻译: 用于引导存储的系统包括处理器和存储器。 处理器被配置为确定多个节点中的每一个的段重叠。 处理器还被配置为至少部分地基于多个节点中的每个节点的段重叠来确定多个节点中的选定节点,并且至少部分地基于选择标准。 存储器耦合到处理器并且被配置为向处理器提供指令。