Bounded size dictionary compression: SC^k-completeness and NC algorithms

We study the parallel complexity of a bounded size dictionary version of the LZ2 compression algorithm that uses the LRU deletion heuristic. The unbounded version was shown to be P-complete. When the size of the dictionary is O(log^k n), the problem of computing the LZ2 compression is shown to be hard for SC^k, the class of problems solvable simultaneously in polynomial time and O(log^k n) space. We also introduce a variation of this heuristic for which the problem is SC^k-complete (with the original heuristic, the problem belongs to SC^{k+1}). By virtue of these results, we argue that there are no practical parallel algorithms for LZ2 compression with the LRU deletion heuristic or with any other heuristic that deletes dictionary elements in a continuous way. For simpler heuristics (SWAP, RESTART, FREEZE), practical parallel algorithms are given.
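
To make the two simplest heuristics named above concrete, the following is a minimal sequential sketch of an LZ2-style (LZW-like) greedy parse with a bounded dictionary. It is an illustration under stated assumptions, not the paper's parallel algorithm: the function name `lz2_compress`, its parameters, and the exact restart policy (clear the dictionary when it is full, without adding the pending entry) are hypothetical choices made here. LRU and SWAP are not covered.

```python
def lz2_compress(text, alphabet, max_size, heuristic="FREEZE"):
    """Sequential LZ2-style parse with a bounded dictionary (illustrative sketch).

    Assumptions (not taken from the paper): the dictionary starts with the
    single characters of `alphabet`; after each greedy match, the match
    extended by the next input character is added while there is room.
    heuristic="FREEZE":  once the dictionary is full, it is never changed.
    heuristic="RESTART": once the dictionary is full, it is reset to the
                         alphabet and learning starts over.
    """
    if heuristic not in ("FREEZE", "RESTART"):
        raise ValueError("heuristic must be 'FREEZE' or 'RESTART'")
    if max_size < len(alphabet):
        raise ValueError("max_size must be at least the alphabet size")

    def fresh_dictionary():
        return {ch: index for index, ch in enumerate(alphabet)}

    dictionary = fresh_dictionary()
    output = []
    i = 0
    while i < len(text):
        # Greedy match: extend while the longer prefix is still an entry.
        # This works because the dictionary stays prefix-closed.
        j = i + 1
        while j < len(text) and text[i:j + 1] in dictionary:
            j += 1
        # A KeyError here means text contains a character outside alphabet.
        output.append(dictionary[text[i:j]])

        if j < len(text):  # there is a next character to extend the match with
            if len(dictionary) < max_size:
                dictionary[text[i:j + 1]] = len(dictionary)
            elif heuristic == "RESTART":
                dictionary = fresh_dictionary()
            # FREEZE: dictionary is full, leave it unchanged.
        i = j
    return output


frozen = lz2_compress("abababab", "ab", max_size=4, heuristic="FREEZE")
restarted = lz2_compress("abababab", "ab", max_size=4, heuristic="RESTART")
```

With a dictionary of four entries, FREEZE keeps reusing the entry for "ab" once the dictionary fills, while RESTART discards what was learned and parses the remaining input from the bare alphabet again. Both heuristics change the dictionary only at isolated points, in contrast to LRU, which may delete an entry at every step once the dictionary is full.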
