Cache issues with JPEG2000 wavelet lifting

In this paper, we have a close look at the runtime performance of the intra-component transform employed in the reference implementations of the JPEG2000 image coding standard. Typically, wavelet lifting is used to obtain a wavelet decomposition of the source image in a computationally efficient way. However, so far no attention has been paid to the impact of the CPU's memory cache on the overall performance. We propose two simple techniques that dramatically reduce the number of cache misses and cut column filtering runtime by a factor of 10. Theoretical estimates as well as experimental results on a number of hardware platforms show the effectivity of our approach.

[1]  William A. Pearlman,et al.  A new, fast, and efficient image codec based on set partitioning in hierarchical trees , 1996, IEEE Trans. Circuits Syst. Video Technol..

[2]  Donald Yeung,et al.  Evaluating the impact of memory system performance on software prefetching and locality optimizations , 2001, ICS '01.

[3]  Aleksandar Milenkovic,et al.  Achieving high performance in bus-based shared-memory multiprocessors , 2000, IEEE Concurr..

[4]  Touradj Ebrahimi,et al.  JPEG2000: the new still picture compression standard , 2000, MULTIMEDIA '00.

[5]  Hiroshi Nakamura,et al.  Improving cache Performance Through Tiling and Data Alignment , 1997, IRREGULAR.

[6]  Touradj Ebrahimi,et al.  A study of JPEG 2000 still image coding versus other standards , 2000, 2000 10th European Signal Processing Conference.

[7]  Faouzi Kossentini,et al.  Reversible integer-to-integer wavelet transforms for image compression: performance evaluation and analysis , 2000, IEEE Trans. Image Process..

[8]  Wim Sweldens,et al.  The lifting scheme: a construction of second generation wavelets , 1998 .

[9]  T. Ebrahimi,et al.  JPEG 2000 : The Next Generation Still Image Compression Standard , 2000 .

[10]  Jerome M. Shapiro,et al.  Embedded image coding using zerotrees of wavelet coefficients , 1993, IEEE Trans. Signal Process..

[11]  Michel Barlaud,et al.  Image coding using wavelet transform , 1992, IEEE Trans. Image Process..

[12]  Kathryn S. McKinley,et al.  Tile size selection using cache organization and data layout , 1995, PLDI '95.

[13]  David S. Taubman,et al.  High performance scalable image compression with EBCOT , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[14]  A. Robert Calderbank,et al.  Lossless image compression using integer to integer wavelet transforms , 1997, Proceedings of International Conference on Image Processing.

[15]  M. Charrier,et al.  JPEG2000, the next millennium compression standard for still images , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[16]  Andreas Uhl,et al.  Hardware and Software Aspects for 3-D Wavelet Decomposition on Shared Memory MIMD Computers , 1999, ACPC.