One attempt of a compression algorithm using the BWT

In 1994 Burrows and Wheeler [5] described a universal data compression algorithm (BW-algorithm, for short) which achieved compression rates that were close to the best known compression rates. Due to it’s simplicity, the algorithm can be implemented with relatively low complexity. Fenwick [8] described ideas to improve the efficiency (i.e. the compression rate) and complexity of the BW-algorithm. He also discusses relationships of the algorithm with other compression methods. Schindler [14] proposed a Burrows and Wheeler Transformation (BWT, for short) that is based on a limited ordering. This speeds up the algorithm for compression, but slows it down for decompression and slightly decreases the efficiency. Larsson [10] describes the relationship of the BWT with suffix and context trees. Sadakane [13] suggests a method to compute the BWT faster, and compares it to other methods. Recently Balkenhol and Kurtz [2] gave a thorough analysis of the BWT from an information theoretic point of view. They described implementation techniques for data compression algorithms based on the BWT, and developed a program with a better compression rate. In [4] these previous results on the BW-algorithm are improved. Based on the context tree model, the authors consider the specific statistical properties of the data at the output of the BWT. They describe six important properties. These considerations lead to modifications of the coding method, which in turn improve the coding efficiency. Further improvements related to the sorting are presented in [3, 11, 12].

[1]  John G. Cleary,et al.  Unbounded length contexts for PPM , 1995, Proceedings DCC '95 Data Compression Conference.

[2]  Bernhard Balkenhol,et al.  Universal Data Compression Based on the Burrows-Wheeler Transformation: Theory and Practice , 2000, IEEE Trans. Computers.

[3]  Bernhard Balkenhol,et al.  Space Efficient Linear Time Computation of the Burrows and Wheeler-Transformation , 2000 .

[4]  Kunihiko Sadakane,et al.  A fast algorithm for making suffix arrays and for Burrows-Wheeler transformation , 1998, Proceedings DCC '98 Data Compression Conference (Cat. No.98TB100225).

[5]  M. Schindler,et al.  A fast block-sorting algorithm for lossless data compression , 1997, Proceedings DCC '97. Data Compression Conference.

[6]  P.A.J. Volf,et al.  The switching method: elaborations , 1998 .

[7]  N. Jesper Larsson,et al.  The context trees of block sorting compression , 1998, Proceedings DCC '98 Data Compression Conference (Cat. No.98TB100225).

[8]  Jan Åberg A Universal Source Coding Perspective on PPM , 1999 .

[9]  N. Jesper Larsson Structures of String Matching and Data Compression , 1999 .

[10]  Stephen R. Tate,et al.  Higher compression from the Burrows-Wheeler transform by modified sorting , 1998, Proceedings DCC '98 Data Compression Conference (Cat. No.98TB100225).

[11]  Bernhard Balkenhol,et al.  Modifications of the Burrows and Wheeler data compression algorithm , 1999, Proceedings DCC'99 Data Compression Conference (Cat. No. PR00096).

[12]  Frans M. J. Willems,et al.  Universal data compression and repetition times , 1989, IEEE Trans. Inf. Theory.

[13]  D. J. Wheeler,et al.  A Block-sorting Lossless Data Compression Algorithm , 1994 .

[14]  Yuri M. Shtarkov Universal Coding of Non-Prefix Context Tree Sources , 2000 .