Granted invention patent
- Patent title: Storage reports duplicate file detection
- Patent title (Chinese): 存储报告重复文件检测
- Application number: US11206710
- Filing date: 2005-08-17
- Publication number: US07401080B2
- Publication date: 2008-07-15
- Inventors: James R. Benton, Ran Kalach, Paul Adrian Oltean, Georgi M. Matev
- Applicants: James R. Benton, Ran Kalach, Paul Adrian Oltean, Georgi M. Matev
- Applicant address: Redmond, WA, US
- Assignee: Microsoft Corporation
- Current assignee: Microsoft Corporation
- Current assignee address: Redmond, WA, US
- Agency: Workman Nydegger
- Main classification: G06F12/06
- IPC classification: G06F12/06
Abstract:
Described is a storage reports duplicate file detector that operates by receiving file records during a first scan of file system metadata. The detector computes a hash based on attributes in the record, and maintains the hash value in association with information that indicates whether a hash value corresponds to more than one file. In one implementation, the information corresponds to the amount of space wasted by duplication. The information is used to determine which hash values correspond to groups of potentially duplicate files, and eliminate non-duplicates. A second scan locates file information for each of the potentially duplicate files, and the file information is then used to determine which groups of potentially duplicate files are actually duplicate files.
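The two-scan flow described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the `FileRecord` fields, the choice of name and size as the hashed attributes, the use of BLAKE2b, and all function names are hypothetical. The first scan keeps, per hash value, only the bytes wasted by repeats; the second scan revisits the records whose hash showed waste and confirms duplicates by comparing the attributes themselves.

```python
import hashlib
from collections import defaultdict
from typing import NamedTuple


class FileRecord(NamedTuple):
    # Hypothetical metadata record; the abstract does not name the attributes.
    path: str
    name: str
    size: int


def attr_hash(rec: FileRecord) -> bytes:
    # Assumption: hash the lower-cased file name and the size.
    key = f"{rec.name.lower()}|{rec.size}".encode("utf-8")
    return hashlib.blake2b(key, digest_size=8).digest()


def first_scan(records):
    """Return the hash values that may correspond to more than one file."""
    wasted = {}  # hash value -> bytes wasted by repeats of that hash
    for rec in records:
        h = attr_hash(rec)
        if h in wasted:
            wasted[h] += rec.size  # every repeat wastes its full size
        else:
            wasted[h] = 0          # the first file with this hash wastes nothing
    # Hashes with no waste are non-duplicates and are eliminated here.
    # Side effect of this sketch: zero-byte files never become candidates.
    return {h for h, w in wasted.items() if w > 0}


def second_scan(records, candidates):
    """Group candidate files by their real attributes to rule out collisions."""
    groups = defaultdict(list)
    for rec in records:
        if attr_hash(rec) in candidates:
            groups[(rec.name.lower(), rec.size)].append(rec.path)
    return [sorted(paths) for paths in groups.values() if len(paths) > 1]


def find_duplicates(records):
    records = list(records)  # the sketch scans an in-memory list twice
    return second_scan(records, first_scan(records))
```

Storing only a small per-hash value in the first scan keeps memory use independent of path lengths; full file information is gathered only for the candidates that survive it.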
Published/granted documents
- US20070043757A1 Storage reports duplicate file detection (publication/grant date: 2007-02-22)