Mixed-Precision Tomographic Reconstructor Computations on Hardware Accelerators

The computation of tomographic reconstructors (ToR) is at the core of a simulation framework to design the next generation of adaptive optics (AO) systems to be installed on future Extremely Large Telescopes (ELT). In fact, it is also a critical component for their operation on sky. The goals of these instruments range from the detection of the light from the most distant galaxies to the analysis of the composition of exoplanets terrestrial atmospheres. Based on advanced AO techniques, the instrument MOSAIC relies on a computational framework to filter out the Earth atmospheric turbulence and eventually enhance the images quality out of ground-based telescopes. The ToR calculation is a compute-bound operation based on the Cholesky factorization. Due to its cubical algorithmic complexity, the ToR may represent a major bottleneck for the E-ELT when scaling up the large number of wavefront sensors used in the baseline MOSAIC design. To mitigate this increasing dimensionality overhead, this paper presents the implementation of a novel mixed-precision Cholesky-based dense matrix solver on hardware accelerators. The new algorithm takes into account the data-sparse structure of the covariance matrix operator and uses the tensor cores of NVIDIA V100 GPUs to leverage performance at an unprecedented scale. To our knowledge, this is the first computational astronomy application that exploits V100's tensor cores outside of the traditional arena of artificial intelligence. Experimental results demonstrate the accuracy robustness and the high performance of the mixed-precision ToR on synthetic datasets, which paves the way for future instrument deployments on the E-ELT

[1]  Horace W. Babcock,et al.  THE POSSIBILITY OF COMPENSATING ASTRONOMICAL SEEING , 1953 .

[2]  George Z. Angeli,et al.  An overview and status of GMT active and adaptive optics , 2018, Astronomical Telescopes + Instrumentation.

[3]  Philippe Dierickx,et al.  E-ELT update of project and effect of change to 39m design , 2012, Other Conferences.

[4]  Franck Cappello,et al.  Big data and extreme-scale computing , 2018, Int. J. High Perform. Comput. Appl..

[5]  Prabhat,et al.  Exascale Deep Learning for Climate Analytics , 2018, SC18: International Conference for High Performance Computing, Networking, Storage and Analysis.

[6]  Nicholas J. Higham,et al.  Accelerating the Solution of Linear Systems by Iterative Refinement in Three Precisions , 2018, SIAM J. Sci. Comput..

[7]  David E. Keyes,et al.  Pipelining Computational Stages of the Tomographic Reconstructor for Multi-Object Adaptive Optics on a Multi-GPU System , 2014, SC14: International Conference for High Performance Computing, Networking, Storage and Analysis.

[8]  M. Puech,et al.  MOSAIC at the E-ELT: A multi-object spectrograph for astrophysics, IGM and cosmology , 2014, Astronomical Telescopes and Instrumentation.

[9]  David Kainer,et al.  Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction , 2018, SC18: International Conference for High Performance Computing, Networking, Storage and Analysis.

[10]  Malcolm Smith,et al.  Adaptive optics program at TMT , 2014, Astronomical Telescopes and Instrumentation.

[11]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[12]  Malcolm Smith,et al.  NFIRAOS adaptive optics for the Thirty Meter Telescope , 2018, Astronomical Telescopes + Instrumentation.

[13]  Pascal Jagourel,et al.  Adaptive optics prototype system for infrared astronomy: I. System description , 1990, Astronomical Telescopes and Instrumentation.

[14]  David E. Keyes,et al.  Real-Time Massively Distributed Multi-object Adaptive Optics Simulations for the European Extremely Large Telescope , 2018, 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS).

[15]  Nicholas J. Higham,et al.  Harnessing GPU Tensor Cores for Fast FP16 Arithmetic to Speed up Mixed-Precision Iterative Refinement Solvers , 2018, SC18: International Conference for High Performance Computing, Networking, Storage and Analysis.

[16]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[17]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[18]  A. Sevin,et al.  A novel fast and accurate pseudo-analytical simulation approach for MOAO , 2014, Astronomical Telescopes and Instrumentation.

[19]  J. Hart,et al.  GMTIFS: The Giant Magellan Telescope integral fields spectrograph and imager , 2016, Astronomical Telescopes + Instrumentation.

[20]  Roberto Ragazzoni,et al.  MAORY for ELT: preliminary design overview , 2018, Astronomical Telescopes + Instrumentation.

[21]  John Salvatier,et al.  Theano: A Python framework for fast computation of mathematical expressions , 2016, ArXiv.

[22]  Eric Gendron,et al.  Adaptive Optics Simulation for the World's Largest Telescope on Multicore Architectures with Multiple GPUs , 2016, PASC.

[23]  H. Ltaief,et al.  Scalable soft real-time supervisor for tomographic AO , 2018, Astronomical Telescopes + Instrumentation.

[24]  Cédric Augonnet,et al.  StarPU: a unified platform for task scheduling on heterogeneous multicore architectures , 2011, Concurr. Comput. Pract. Exp..

[25]  Julien Langou,et al.  Mixed Precision Iterative Refinement Techniques for the Solution of Dense Linear Systems , 2007, Int. J. High Perform. Comput. Appl..

[26]  Jack Dongarra,et al.  Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects , 2009 .

[27]  John Tran,et al.  cuDNN: Efficient Primitives for Deep Learning , 2014, ArXiv.

[28]  J.-L. Beuzit,et al.  COMPASS: an efficient, scalable and versatile numerical platform for the development of ELT AO systems , 2014, Astronomical Telescopes and Instrumentation.

[29]  Marc G. Genton,et al.  Geostatistical Modeling and Prediction Using Mixed Precision Tile Cholesky Factorization , 2019, 2019 IEEE 26th International Conference on High Performance Computing, Data, and Analytics (HiPC).