High performance JPEG 2000 and MPEG-4 VTC on SMPs using OpenMP

JPEG 2000 and MPEG-4 Visual Texture Coding (VTC) are both wavelet-based and state of the art in still image coding. In this paper we show sequential as well as parallel strategies for speeding up two selected implementations of MPEG-4 VTC and JPEG 2000 using the popular shared memory programming paradigm OpenMP. Furthermore, we discuss the sequential and parallel performance of the improved versions and compare the efficiency of both algorithms.

[1]  Andreas Uhl,et al.  Optimization of 3-D Wavelet Decomposition on Multiprocessors , 2000 .

[2]  Rohit Chandra,et al.  Parallel programming in openMP , 2000 .

[3]  Gerd Heber,et al.  Load adaptive algorithms and implementations for the 2D discrete wavelet transform on fine-grain multithreaded architectures , 1999, Proceedings 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing. IPPS/SPDP 1999.

[4]  Wim Sweldens,et al.  The lifting scheme: a construction of second generation wavelets , 1998 .

[5]  Jian Lu Parallelizing Mallat Algorithm for 2-D Wavelet Transforms , 1993, Inf. Process. Lett..

[6]  Touradj Ebrahimi,et al.  A study of JPEG 2000 still image coding versus other standards , 2000, 2000 10th European Signal Processing Conference.

[7]  David S. Taubman,et al.  High performance scalable image compression with EBCOT. , 2000, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society.

[8]  Elias S. Manolakos,et al.  On the Scalability of 2-D Discrete Wavelet Transform Algorithms , 1997, Multidimens. Syst. Signal Process..

[9]  Andreas Uhl,et al.  Performance issues in MPEG-4 VTC image coding , 2003, Visual Communications and Image Processing.

[10]  Monica S. Lam,et al.  A data locality optimizing algorithm , 1991, PLDI '91.

[11]  Ping-Sing Tsai,et al.  JPEG: Still Image Compression Standard , 2005 .

[12]  Reinhold Orglmeister,et al.  Parallel image processing using a Pentium-based shared-memory multiprocessor system , 1997, Optics & Photonics.

[13]  William A. Pearlman,et al.  A new, fast, and efficient image codec based on set partitioning in hierarchical trees , 1996, IEEE Trans. Circuits Syst. Video Technol..

[14]  Andreas Uhl,et al.  Wavelet packet zerotree image coding on multicomputers , 2002, Proceedings 10th Euromicro Workshop on Parallel, Distributed and Network-based Processing.

[15]  Jerome M. Shapiro,et al.  Embedded image coding using zerotrees of wavelet coefficients , 1993, IEEE Trans. Signal Process..

[16]  Gregory W. Cook,et al.  An Investigation of Scalable SIMD I/O Techniques with Application to Parallel JPEG Compression , 1995, J. Parallel Distributed Comput..

[17]  Tor Sørevik,et al.  PARALLEL WAVELET-BASED COMPRESSION OF TWO-DIMENSIONAL DATA , 2000 .

[18]  Iraj Sodagar,et al.  Scalable wavelet coding for synthetic/natural hybrid images , 1999, IEEE Trans. Circuits Syst. Video Technol..

[19]  Linda Yang,et al.  Coarse-Grained Parallel Algorithms for Multi-Dimensional Wavelet Transforms , 2004, The Journal of Supercomputing.

[20]  Andreas Uhl,et al.  Motion-compensated wavelet packet zerotree video coding on multicomputers , 2003, J. Syst. Archit..

[21]  Andreas Uhl,et al.  Cache issues with JPEG2000 wavelet lifting , 2002, IS&T/SPIE Electronic Imaging.

[22]  Rade Kutil,et al.  Approaches to zerotree image and video coding on MIMD architectures , 2002, Parallel Comput..

[23]  Andreas Uhl,et al.  Parallel JPEG2000 image coding on multiprocessors , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.