Compressed Differential Encoding for Semi-Static Compressors

The Compressed Differential Encoding is introduced, i.e., delta encoding directly in two specified compressed files without decompressing. In this paper, we investigate the case where the two given files are compressed using WBTC, and formulate the theoretical framework for modeling delta encoding of compressed files. In put into practice, even though working on the compressed versions in processing time proportional to the compressed files, our target file may be noticeably smaller than the corresponding WBTC form.

[1]  Darrell D. E. Long,et al.  Efficient distributed backup and restore with delta compression , 1997 .

[2]  G.G. Langdon,et al.  Data compression , 1988, IEEE Potentials.

[3]  Ian H. Witten,et al.  Managing gigabytes , 1994 .

[4]  Ramesh C. Agarwal,et al.  An approximation to the greedy algorithm for differential compression of very large files , 2004, Data Compression Conference, 2004. Proceedings. DCC 2004.

[5]  Peter Weiner,et al.  Linear Pattern Matching Algorithms , 1973, SWAT.

[6]  Terry A. Welch,et al.  A Technique for High-Performance Data Compression , 1984, Computer.

[7]  Wojciech Rytter,et al.  Almost-optimal fully LZW-compressed pattern matching , 1999, Proceedings DCC'99 Data Compression Conference (Cat. No. PR00096).

[8]  Dana Shapira,et al.  In Place Differential File Compression , 2005, Comput. J..

[9]  Alistair Moffat,et al.  Word‐based text compression , 1989, Softw. Pract. Exp..

[10]  Gonzalo Navarro,et al.  A General Practical Approach to Pattern Matching over Ziv-Lempel Compressed Text , 1999, CPM.

[11]  Randal C. Burns,et al.  In-place reconstruction of delta compressed files , 1998, PODC '98.

[12]  Mikkel Thorup,et al.  String Matching in Lempel—Ziv Compressed Strings , 1998, Algorithmica.

[13]  Ayumi Shinohara,et al.  Multiple pattern matching in LZW compressed text , 1998, Proceedings DCC '98 Data Compression Conference (Cat. No.98TB100225).

[14]  Ricardo A. Baeza-Yates,et al.  Adding Compression to Block Addressing Inverted Indexes , 2000, Information Retrieval.

[15]  Ricardo A. Baeza-Yates,et al.  Compression: A Key for Next-Generation Text Retrieval Systems , 2000, Computer.

[16]  Shmuel Tomi Klein,et al.  Compressed Delta Encoding for LZSS Encoded Files , 2007, 2007 Data Compression Conference (DCC'07).

[17]  J. Storer An Introduction to Data Structures and Algorithms , 2002, Birkhäuser Boston.

[18]  Paul Heckel,et al.  A technique for isolating differences between files , 1978, CACM.

[19]  Ricardo A. Baeza-Yates,et al.  Fast and flexible word searching on compressed text , 2000, TOIS.

[20]  Walter F. Tichy,et al.  Delta algorithms: an empirical analysis , 1998, TSEM.

[21]  Walter F. Tichy,et al.  The string-to-string correction problem with block moves , 1984, TOCS.

[22]  Ronald Fagin,et al.  Compactly encoding unstructured inputs with differential compression , 2002, JACM.

[23]  Michael Factor,et al.  Software compression in the client/server environment , 2001, Proceedings DCC 2001. Data Compression Conference.