Invention Grant
- Patent Title: Method for finding the longest common subsequences between files with applications to differential compression
- Patent Title (中): 找到文件与应用于差分压缩的最长公共子序列的方法
- Application No.: US10904732
- Application Date: 2004-11-24
- Publication No.: US07487169B2
- Publication Date: 2009-02-03
- Inventor: Ramesh Chandra Agarwal
- Applicant: Ramesh Chandra Agarwal
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent: Marc D. McSwain
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/00 ; G06F12/00 ; G06F17/30

Abstract:
A differential compression method and computer program product combine hash value techniques and suffix array techniques. The invention finds the best matches for every offset of the version file, with respect to a certain granularity and above a certain length threshold. The invention has two variations, depending on the choice of block size. If the block size is kept fixed, the compression performance of the invention is similar to that of the greedy algorithm, without the expensive space and time requirements. If the block size is varied linearly with the reference file size, the invention can run in linear time and constant space. It has been shown empirically that the invention performs better than certain known differential compression algorithms in terms of compression and speed.
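
For illustration only, the general idea of block-granular matching between a reference file and a version file can be sketched as below. This is not the patented method: a plain hash table of block offsets stands in for the suffix array over block hash values, and the function names (`build_block_index`, `encode_delta`, `decode_delta`), the `("copy", offset, length)` / `("add", data)` instruction format, and the default block size are all assumptions made for the example. The block size plays the role of both the granularity and the minimum match length.

```python
from collections import defaultdict


def build_block_index(reference: bytes, block_size: int) -> dict:
    """Map the hash of each aligned reference block to its start offsets."""
    index = defaultdict(list)
    for start in range(0, len(reference) - block_size + 1, block_size):
        index[hash(reference[start:start + block_size])].append(start)
    return index


def match_length(reference: bytes, r: int, version: bytes, v: int) -> int:
    """Count how many bytes agree starting at reference[r] and version[v]."""
    n = 0
    while (r + n < len(reference) and v + n < len(version)
           and reference[r + n] == version[v + n]):
        n += 1
    return n


def encode_delta(reference: bytes, version: bytes, block_size: int = 4) -> list:
    """Encode version as copy/add instructions against reference."""
    index = build_block_index(reference, block_size)
    ops = []
    literal = bytearray()
    v = 0
    while v < len(version):
        best_len, best_off = 0, -1
        block = version[v:v + block_size]
        if len(block) == block_size:
            # Candidates share a block hash; bytes are compared to verify
            # them, so hash collisions cannot produce a wrong copy.
            for r in index.get(hash(block), ()):
                n = match_length(reference, r, version, v)
                if n > best_len:
                    best_len, best_off = n, r
        if best_len >= block_size:  # length threshold (assumed = block size)
            if literal:
                ops.append(("add", bytes(literal)))
                literal.clear()
            ops.append(("copy", best_off, best_len))
            v += best_len
        else:
            literal.append(version[v])
            v += 1
    if literal:
        ops.append(("add", bytes(literal)))
    return ops


def decode_delta(reference: bytes, ops: list) -> bytes:
    """Rebuild the version file from the reference and the instructions."""
    out = bytearray()
    for op in ops:
        if op[0] == "copy":
            _, offset, length = op
            out += reference[offset:offset + length]
        else:
            out += op[1]
    return bytes(out)
```

In this sketch a larger block size shrinks the index, since fewer reference blocks are hashed, at the cost of missing matches shorter than one block. That is the same trade-off the abstract describes between the fixed and the linearly scaled block size.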