Parallel POD Compression of Time-Varying Big Datasets Using m-Swap on the K Computer

Thanks to the supercomputer, more and more complicated simulations are successfully achieved. On the other hand, to analyze and understand the intrinsic properties of the big datasets from the simulations is an urgent research for scientists. However, the explosive size of the big datasets makes such kind of task difficult. Therefore, reduction of the size of the big datasets becomes an important topic, in which data compression and parallel computing are the two key techniques. In this paper, we presented a parallel data compression approach to reduce the size of time-varying big datasets. Firstly, we employ the proper orthogonal decomposition (POD) method for compression. The POD method can extract the underlying features of datasets to greatly reduce the size of big datasets. Meanwhile, the compressed datasets can be decompressed linearly. This feature can help scientists to interactively visualize big datasets for analysis. Then, we introduced a novel m-swap method to effectively parallelize the POD compression algorithm. The m-swap method can reach a high performance through fully using all parallel computing processors. In another word, no idle processors exist in the parallel compression process. Furthermore, the m-swap method can greatly reduce the cost of interprocessor communication. This is achieved by controlling the data transfer among 2m processors to obtain the best balance of computation cost of these processors. Finally, the effectiveness of our method will be demonstrated through compressing several time-varying big datasets on the K computer with ten thousands of processors.

[1]  U. Neumann Parallel volume-rendering algorithm performance on mesh-connected multicomputers , 1993, Proceedings of 1993 IEEE Parallel Rendering Symposium.

[2]  Kwan-Liu Ma,et al.  Parallel volume rendering using binary-swap compositing , 1994, IEEE Computer Graphics and Applications.

[3]  Kwan-Liu Ma,et al.  A Study of Parallel Data Compression Using Proper Orthogonal Decomposition on the K Computer , 2014, EGPGV@EuroVis.

[4]  A. Chatterjee An introduction to the proper orthogonal decomposition , 2000 .

[5]  Cécile Paris,et al.  Data Abstraction and Visualisation in Next Step: Experiences from a Government Services Delivery Trial , 2013, 2013 IEEE International Congress on Big Data.

[6]  Nikolay N. Ponomarenko,et al.  Lossy compression of images without visible distortions and its application , 2010, IEEE 10th INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS.

[7]  Antonio J. Plaza,et al.  On the Impact of Lossy Compression on Hyperspectral Image Classification and Unmixing , 2011, IEEE Geoscience and Remote Sensing Letters.

[8]  Bryan Usevitch JPEG2000 extensions for bit plane coding of floating point data , 2003, Data Compression Conference, 2003. Proceedings. DCC 2003.

[9]  Chongke Bi,et al.  2-3-4 Combination for Parallel Compression on the K Computer , 2014, 2014 IEEE Pacific Visualization Symposium.

[10]  Enrico Magli,et al.  Low-complexity lossy compression of hyperspectral images via informed quantization , 2010, 2010 IEEE International Conference on Image Processing.

[11]  Jeremy Iverson,et al.  Fast and Effective Lossy Compression Algorithms for Scientific Datasets , 2012, Euro-Par.

[12]  Tomas Akenine-Möller,et al.  Error-bounded lossy compression of floating-point color buffers using quadtree decomposition , 2009, The Visual Computer.

[13]  Martin Isenburg,et al.  Fast and Efficient Compression of Floating-Point Data , 2006, IEEE Transactions on Visualization and Computer Graphics.

[14]  Takehiro Moriya,et al.  A lossless audio compression scheme with random access property , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[15]  P. Holmes,et al.  The Proper Orthogonal Decomposition in the Analysis of Turbulent Flows , 1993 .

[16]  Jarek Rossignac,et al.  Out‐of‐core compression and decompression of large n‐dimensional scalar fields , 2003, Comput. Graph. Forum.

[17]  Martin Isenburg,et al.  Lossless compression of predicted floating-point geometry , 2005, Comput. Aided Des..

[18]  Bryan Usevitch JPEG2000 compliant lossless coding of floating point data , 2005, Data Compression Conference.

[19]  Kwan-Liu Ma,et al.  An Adaptive Prediction-Based Approach to Lossless Compression of Floating-Point Volume Data , 2012, IEEE Transactions on Visualization and Computer Graphics.

[20]  Martin Burtscher,et al.  Fast lossless compression of scientific floating-point data , 2006, Data Compression Conference (DCC'06).