发明申请
- 专利标题: Method and apparatus for data redundancy elimination at the block level
- 专利标题(中): 在块级消除数据冗余的方法和装置
-
申请号: US10737213申请日: 2003-12-16
-
公开(公告)号: US20050131939A1公开(公告)日: 2005-06-16
- 发明人: Frederick Douglis , Purushottam Kulkarni , Jason LaVoie , John Tracey
- 申请人: Frederick Douglis , Purushottam Kulkarni , Jason LaVoie , John Tracey
- 申请人地址: US NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: US NY Armonk
- 主分类号: G06F17/00
- IPC分类号: G06F17/00
摘要:
A redundancy elimination mechanism is provided, which applies aspects of duplicate block elimination and delta encoding at the block level. The redundancy elimination mechanism divides file objects into content-defined blocks or “chunks.” Identical chunks are suppressed. The redundancy elimination mechanism also performs resemblance detection on remaining chunks to identify chunks with sufficient redundancy to benefit from delta encoding of individual chunks. Any remaining chunks that do not benefit from delta encoding are compressed. Resemblance detection is optimized by merging groups of fingerprints into super fingerprints. This merging can be constructed to ensure that if two objects have a single super fingerprint in common, they are extremely likely to be substantially similar.
公开/授权文献
信息查询