High Performance Computing

Data-intensive scientific domains use data compression to reduce the storage space needed. Lossless data compression preserves the original information exactly, but in the domain of climate data it usually yields a compression factor of only 2:1. Lossy data compression can achieve much higher compression ratios, depending on the tolerable error and the precision needed. The field of lossy compression is therefore still subject to active research. From the perspective of a scientist, the particular compression algorithm does not matter; what matters is the qualitative information about the implied loss of precision in the data. With the Scientific Compression Library (SCIL), we are developing a meta-compressor that allows users to set various quantities that define the acceptable error and the expected performance behavior. The ongoing work is a preliminary stage for the design of an automatic compression algorithm selector. The task of this missing key component is to construct appropriate chains of algorithms that meet the user's requirements. This approach is a crucial step towards a scientifically safe use of much-needed lossy data compression, because it disentangles the task of determining the scientific ground characteristics of tolerable noise from the task of determining an optimal compression strategy given target noise levels and constraints. Once future algorithms are integrated into SCIL, they can be used without changes to the application code. In this paper, we describe the user interfaces and quantities, present two compression algorithms, and evaluate SCIL's ability to compress climate data. The evaluation shows that the novel algorithms are competitive with the state-of-the-art compressors ZFP and SZ, and illustrates that the best algorithm depends on user settings and data properties.
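
To make the idea of a user-defined acceptable error concrete, the following is a minimal sketch of one such quantity, an absolute tolerance, applied to a synthetic signal. It is not SCIL's interface or one of its algorithms: the function names `compress_abs_tol` and `decompress_abs_tol`, the tolerance value, and the test signal are all assumptions made for illustration. The sketch quantizes values to a grid of width twice the tolerance and then hands the integers to a general-purpose lossless compressor (`zlib`).

```python
import math
import struct
import zlib


def compress_abs_tol(values, abs_tol):
    """Hypothetical helper: quantize to a grid of width 2*abs_tol, then deflate.

    Rounding to the nearest grid point keeps every reconstructed value
    within abs_tol of the original (up to floating-point rounding).
    """
    step = 2.0 * abs_tol
    quantized = [round(v / step) for v in values]
    # Assumes the quantized values fit into signed 32-bit integers.
    payload = struct.pack(f"<{len(quantized)}i", *quantized)
    return zlib.compress(payload, 9)


def decompress_abs_tol(blob, abs_tol):
    """Hypothetical helper: invert compress_abs_tol up to the tolerance."""
    step = 2.0 * abs_tol
    raw = zlib.decompress(blob)
    quantized = struct.unpack(f"<{len(raw) // 4}i", raw)
    return [q * step for q in quantized]


# Synthetic, smooth signal standing in for a climate variable (assumption).
data = [15.0 + 10.0 * math.sin(i / 50.0) for i in range(10_000)]
abs_tol = 0.01  # user-chosen acceptable absolute error (assumption)

# Baseline: lossless compression of the raw 64-bit floats.
lossless = zlib.compress(struct.pack(f"<{len(data)}d", *data), 9)

# Lossy path: quantize under the tolerance, then compress losslessly.
lossy = compress_abs_tol(data, abs_tol)
restored = decompress_abs_tol(lossy, abs_tol)
max_err = max(abs(a - b) for a, b in zip(data, restored))

print(f"lossless bytes: {len(lossless)}")
print(f"lossy bytes:    {len(lossy)}")
print(f"max abs error:  {max_err:.6f} (tolerance {abs_tol})")
```

The application code states only the tolerance; how the bound is met is hidden behind the two helper functions, which is the separation of concerns the abstract describes.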
