Optimal Alphabet Partitioning for Semi-Adaptive Coding

Practical applications that employ entropy coding for large alphabets often partition the alphabet set into two or more layers and encode each symbol by using some suitable prefix coding for each layer. In this paper we formulate the problem of optimal alphabet partitioning for the design of a two layer semi-adaptive code and give a solution based on dynamic programming. However, the complexity of the dynamic programming approach can be quite prohibitive for a long sequence and a very large alphabet size. Hence, we give a simple greedy heuristic algorithm whose running time is linear in the number of symbols being encoded, irrespective of the underlying alphabet size. We give experimental results that demonstrate the fact that superior prefix coding schemes for large alphabets can be designed using our approach as opposed to the typically ad-hoc partitioning approach applied in the literature. ∗dchen01@utopia.poly.edu. Research supported by NSF Grant ACI-0118915. †yjc@poly.edu. Research supported in part by NSF Grant ACI-0118915, NSF CAREER Grant CCR-0093373, and NSF ITR Grant CCR-0081964. ‡memon@poly.edu. Research supported in part by NSF Grant ACI-0118915 and NSF Grant CCR-0208678. §xwu@poly.edu. Research supported in part by NSF Grant CCR-0208678.

[1]  Yunwei Jia,et al.  Universal lossless coding of sources with large and unbounded alphabets , 2000, 2000 IEEE International Symposium on Information Theory (Cat. No.00CH37060).

[2]  Ahmed H. Tewfik,et al.  Arithmetic coding with dual symbol sets and its performance analysis , 1999, IEEE Trans. Image Process..

[3]  William A. Pearlman,et al.  Lapped orthogonal transform coding by amplitude and group partitioning , 1999, Optics & Photonics.

[4]  Alistair Moffat,et al.  Efficient Construction of Minimum-Redundancy Codes for Large Alphabets , 1998, IEEE Trans. Inf. Theory.

[5]  William A. Pearlman,et al.  Low-complexity waveform coding via alphabet and sample-set partitioning , 1997, Proceedings of IEEE International Symposium on Information Theory.

[6]  Joan L. Mitchell,et al.  JPEG: Still Image Data Compression Standard , 1992 .

[7]  Jeffrey Scott Vitter,et al.  Design and analysis of dynamic Huffman codes , 1987, JACM.

[8]  Donald E. Knuth,et al.  Dynamic Huffman Coding , 1985, J. Algorithms.

[9]  Jorma Rissanen,et al.  Universal coding, information, prediction, and estimation , 1984, IEEE Trans. Inf. Theory.

[10]  Ian H. Witten,et al.  Data Compression Using Adaptive Coding and Partial String Matching , 1984, IEEE Trans. Commun..

[11]  David A. Huffman,et al.  A method for the construction of minimum-redundancy codes , 1952, Proceedings of the IRE.