Neural Networks for Astronomical Data Analysis and Bayesian Inference

We present our generic neural network training algorithm, called Sky Net and the accelerated Bayesian inference algorithm, BAMBI. Sky Net combines multiple techniques already developed individually in the literature to create an efficient and robust machine-learning tool that is able to train large and deep feed-forward neural networks for use in a wide range of learning applications, such as regression, classification, density estimation, clustering and dimensionality reduction. Sky Net uses a powerful `pre-training' method, to obtain a set of network parameters close to the true global maximum of the training objective function, followed by further optimisation using an automatically-regularised variant of Newton's method that uses second-order derivative information to improve convergence, but without the need to evaluate or store the full Hessian matrix, by using a fast approximate method to calculate Hessian-vector products. This combination of methods allows for the training of complicated networks that are difficult to optimise using standard back propagation techniques. The blind accelerated multimodal Bayesian inference (BAMBI) algorithm implements the MultiNest package for nested sampling as well as the training of an artificial neural network by Sky Net to learn the likelihood function. In the case of computationally expensive likelihoods, this allows the substitution of a much more rapid approximation in order to increase significantly the speed of the analysis. Astrophysical examples are provided for both Sky Net and BAMBI.

[1]  F. Feroz,et al.  Multimodal nested sampling: an efficient and robust alternative to Markov Chain Monte Carlo methods for astronomical data analyses , 2007, 0704.3704.

[2]  Gordon T. Richards,et al.  Proceedings of the International Astronomical Union , 2014 .

[3]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[4]  W. A. Fendt,et al.  Pico: Parameters for the Impatient Cosmologist , 2006, astro-ph/0606709.

[5]  José M. Bernardo,et al.  Bayesian Statistics , 2011, International Encyclopedia of Statistical Science.

[6]  Stefan M. Wild,et al.  Bayesian Calibration and Uncertainty Analysis for Computationally Expensive Models Using Optimization and Radial Basis Function Approximation , 2008 .

[7]  D. Signorini,et al.  Neural networks , 1995, The Lancet.

[8]  Ashok N. Srivastava,et al.  Advances in Machine Learning and Data Mining for Astronomy , 2012 .

[9]  Robert B. Ash,et al.  Information Theory , 2020, The SAGE International Encyclopedia of Mass Media and Society.

[10]  Farhan Feroz,et al.  BAMBI: blind accelerated multimodal Bayesian inference , 2011, 1110.2997.

[11]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[12]  M. Hobson,et al.  cosmonet: fast cosmological parameter estimation in non-flat models using neural networks , 2007, astro-ph/0703445.

[13]  Adam Bouland,et al.  Caching and interpolated likelihoods: accelerating cosmological Monte Carlo Markov chains , 2010, 1012.5299.

[14]  Nicol N. Schraudolph,et al.  Fast Curvature Matrix-Vector Products for Second-Order Gradient Descent , 2002, Neural Computation.

[15]  Alan A. Wells,et al.  The Swift Gamma-Ray Burst Mission , 2004, astro-ph/0405233.

[16]  F. Feroz,et al.  MultiNest: an efficient and robust Bayesian inference tool for cosmology and particle physics , 2008, 0809.3437.

[17]  Salvatore Sessa,et al.  Neural neZtworks in astronomy , 2003, Neural Networks.

[18]  Canada.,et al.  Data Mining and Machine Learning in Astronomy , 2009, 0906.2173.

[19]  A. Lewis,et al.  Cosmological parameters from CMB and other data: A Monte Carlo approach , 2002, astro-ph/0205436.

[20]  A. Lewis,et al.  Efficient computation of CMB anisotropies in closed FRW models , 1999, astro-ph/9911177.

[21]  Earl Lawrence,et al.  Simulation-Aided Inference in Cosmology , 2012 .

[22]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[23]  M. Bethge,et al.  Gravitational Lensing Accuracy Testing 2010 (GREAT10) Challenge Handbook , 2010, 1009.0779.

[24]  Kurt Hornik,et al.  Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks , 1990, Neural Networks.

[25]  D. Wanderman,et al.  The luminosity function and the rate of Swift's Gamma Ray Bursts , 2009, 0912.0709.

[26]  Barak A. Pearlmutter Fast Exact Multiplication by the Hessian , 1994, Neural Computation.

[27]  John Ellis,et al.  Int. J. Mod. Phys. , 2005 .