Thoughtful Precision in Mini-Apps

Approximate computing addresses many of the identified challenges for exascale computing, leading to performance improvements that may include changes in fidelity of calculation. In this paper, we examine approximate approaches for a range of DOE-relevant computational problems run on a variety of architectures as a proxy for the wider set of exascaleclass applications.We show anticipated improvements in computational and memory performance and in power savings. We also assess application correctness when operating under conditions of reduced precision, and show that this is within acceptable bounds. Finally, we discuss the trade space between performance, power, precision and resolution for these mini-apps, and optimized solutions attained within given constraints, with positive implications for application of approximate computing to exascale-class problems.

[1]  Sparsh Mittal,et al.  A Survey of Techniques for Approximate Computing , 2016, ACM Comput. Surv..

[2]  Kipton Barros,et al.  Solving lattice QCD systems of equations using mixed precision solvers on GPUs , 2009, Comput. Phys. Commun..

[3]  James Demmel,et al.  Parallel Reproducible Summation , 2015, IEEE Transactions on Computers.

[4]  Peter Lindstrom,et al.  Fixed-Rate Compressed Floating-Point Arrays , 2014, IEEE Transactions on Visualization and Computer Graphics.

[5]  Sebastian Schöps,et al.  GPU-accelerated mixed precision algebraic multigrid preconditioners for discrete elliptic field problems , 2014 .

[6]  Ewa Deelman,et al.  The cost of doing science on the cloud: the Montage example , 2008, HiPC 2008.

[7]  Pavel Panchekha,et al.  Automatically improving accuracy for floating point expressions , 2015, PLDI.

[8]  Jonathan M. Robey,et al.  In search of numerical consistency in parallel programming , 2011, Parallel Comput..

[9]  James Demmel,et al.  IEEE Standard for Floating-Point Arithmetic , 2008 .

[10]  Viktor Kuncak,et al.  Towards a Compiler for Reals , 2014, ACM Trans. Program. Lang. Syst..

[11]  Sven Leyffer,et al.  Overcoming the power wall by exploiting inexactness and emerging COTS architectural features: Trading precision for improving application quality , 2016, 2016 29th IEEE International System-on-Chip Conference (SOCC).

[12]  Ian Briggs,et al.  Rigorous floating-point mixed-precision tuning , 2017, POPL.

[13]  James Demmel,et al.  Precimonious: Tuning assistant for floating-point precision , 2013, 2013 SC - International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[14]  Timothy C. Warburton,et al.  A GPU-accelerated continuous and discontinuous Galerkin non-hydrostatic atmospheric model , 2019, Int. J. High Perform. Comput. Appl..

[15]  Norbert Wehn,et al.  Mixed precision multilevel Monte Carlo on hybrid computing systems , 2014, 2014 IEEE Conference on Computational Intelligence for Financial Engineering & Economics (CIFEr).

[16]  Jack J. Dongarra,et al.  Mixed-precision block gram Schmidt orthogonalization , 2015, ScalA '15.

[17]  Hartwig Anzt,et al.  Energy efficiency of mixed precision iterative refinement methods using hybrid hardware platforms , 2010, Computer Science - Research and Development.

[18]  James Demmel,et al.  Floating-Point Precision Tuning Using Blame Analysis , 2016, 2016 IEEE/ACM 38th International Conference on Software Engineering (ICSE).

[19]  Jack J. Dongarra,et al.  Using Mixed Precision for Sparse Matrix Computations to Enhance the Performance while Achieving 64-bit Accuracy , 2008, TOMS.

[20]  David Defour,et al.  ExBLAS: Reproducible and Accurate BLAS Library , 2015 .

[21]  Michael O. Lam,et al.  Fine-grained floating-point precision analysis , 2018, Int. J. High Perform. Comput. Appl..

[22]  Robert Strzodka,et al.  Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations , 2007, Int. J. Parallel Emergent Distributed Syst..

[23]  Lukas Einkemmer,et al.  A mixed precision semi-Lagrangian algorithm and its performance on accelerators , 2016, 2016 International Conference on High Performance Computing & Simulation (HPCS).

[24]  James Demmel,et al.  Design, implementation and testing of extended and mixed precision BLAS , 2000, TOMS.

[25]  Michela Taufer,et al.  On the Need for Reproducible Numerical Accuracy through Intelligent Runtime Selection of Reduction Algorithms at the Extreme Scale , 2015, 2015 IEEE International Conference on Cluster Computing.

[26]  Bronis R. de Supinski,et al.  Abstract: Automatically Adapting Programs for Mixed-Precision Floating-Point Computation , 2013, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis.

[27]  John Augustine,et al.  Opportunities for energy efficient computing: A study of inexact general purpose processors for high-performance and big-data applications , 2015, 2015 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[28]  Chao Yang,et al.  Accelerating solvers for global atmospheric equations through mixed-precision data flow engine , 2013, 2013 23rd International Conference on Field programmable Logic and Applications.