Granted invention patent
- Patent title: Efficient content meta-data collection and trace generation from deduplicated storage
- Application number: US13335746
- Filing date: 2011-12-22
- Publication number: US08631052B1
- Publication date: 2014-01-14
- Inventors: Philip Shilane, Grant Wallace, Frederick Douglis
- Applicants: Philip Shilane, Grant Wallace, Frederick Douglis
- Applicant address: Hopkinton, MA, US
- Assignee: EMC Corporation
- Current assignee: EMC Corporation
- Current assignee address: Hopkinton, MA, US
- Agent: Blakely, Sokoloff, Taylor & Zafman, LLP
- Main classification: G06F7/00
- IPC classification: G06F7/00; G06F17/30
Abstract:
The method and apparatus collect file recipes from deduplicated data storage systems. Each file recipe consists of a list of fingerprints of the data chunks of a file. Detailed meta-data for each unique data chunk is also collected. In an offline process, research and analysis can be performed either on the meta-data itself or on a full meta-data trace reconstructed by matching recipe fingerprints to the corresponding chunk meta-data. The method and system can generate the full meta-data trace efficiently in an on-line or off-line process. Typical deduplicated storage systems achieve deduplication rates of 10× or higher, so the meta-data collection is faster than processing all of the original files and produces compact meta-data that takes less space to store.
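The trace reconstruction described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `ChunkMeta`, `build_trace`, the fingerprint strings, the file paths, and the meta-data fields (`size`, `compressed_size`) are all hypothetical, chosen only to show how recipe fingerprints might be joined against per-chunk meta-data that is stored once per unique chunk.

```python
from dataclasses import dataclass
from typing import Dict, Iterator, List, Tuple


@dataclass(frozen=True)
class ChunkMeta:
    """Hypothetical per-chunk meta-data, collected once per unique chunk."""
    size: int             # uncompressed chunk size in bytes
    compressed_size: int  # size after local compression, in bytes


def build_trace(
    recipes: Dict[str, List[str]],
    chunk_meta: Dict[str, ChunkMeta],
) -> Iterator[Tuple[str, int, str, ChunkMeta]]:
    """Expand file recipes into a full trace.

    Yields one (file path, byte offset, fingerprint, meta-data) record per
    chunk reference, looking each fingerprint up in the unique-chunk table.
    """
    for path, fingerprints in recipes.items():
        offset = 0
        for fp in fingerprints:
            meta = chunk_meta[fp]  # match the recipe fingerprint to its meta-data
            yield path, offset, fp, meta
            offset += meta.size


# Illustrative data (assumed): two unique chunks referenced by two files.
chunk_meta = {
    "fp_a": ChunkMeta(size=4096, compressed_size=2100),
    "fp_b": ChunkMeta(size=8192, compressed_size=3000),
}
recipes = {
    "/backup/mon.img": ["fp_a", "fp_b", "fp_a"],
    "/backup/tue.img": ["fp_a", "fp_b"],
}

trace = list(build_trace(recipes, chunk_meta))
logical_bytes = sum(meta.size for _, _, _, meta in trace)
unique_bytes = sum(meta.size for meta in chunk_meta.values())
dedup_ratio = logical_bytes / unique_bytes
```

In this sketch the collected data holds two meta-data records plus five fingerprint references, while the reconstructed trace describes every chunk of every file. The higher the deduplication ratio, the smaller the unique-chunk table is relative to the full trace, which is the source of the compactness the abstract claims.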