Rotation-invariant convolutional neural networks for galaxy morphology prediction

Measuring the morphological parameters of galaxies is a key requirement for studying their formation and evolution. Surveys such as the Sloan Digital Sky Survey have resulted in the availability of very large collections of images, which have permitted population-wide analyses of galaxy morphology. Morphological analysis has traditionally been carried out mostly via visual inspection by trained experts, which is time consuming and does not scale to large (≳104) numbers of images. Although attempts have been made to build automated classification systems, these have not been able to achieve the desired level of accuracy. The Galaxy Zoo project successfully applied a crowdsourcing strategy, inviting online users to classify images by answering a series of questions. Unfortunately, even this approach does not scale well enough to keep up with the increasing availability of galaxy images. We present a deep neural network model for galaxy morphology classification which exploits translational and rotational symmetry. It was developed in the context of the Galaxy Challenge, an international competition to build the best model for morphology classification based on annotated images from the Galaxy Zoo project. For images with high agreement among the Galaxy Zoo participants, our model is able to reproduce their consensus with near-perfect accuracy (>99 per cent) for most questions. Confident model predictions are highly accurate, which makes the model suitable for filtering large collections of images and forwarding challenging images to experts for manual annotation. This approach greatly reduces the experts’ workload without affecting accuracy. The application of these algorithms to larger sets of training data will be critical for analysing results from future surveys such as the Large Synoptic Survey Telescope.

[1]  A. S. Szalay,et al.  Galaxy Zoo: the fraction of merging galaxies in the SDSS and their morphologies , 2009, 0903.4937.

[2]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[3]  Ofer Lahav,et al.  ANNz: Estimating Photometric Redshifts Using Artificial Neural Networks , 2004 .

[4]  Paolo Persi,et al.  Science with astronomical near-infrared sky surveys , 1994 .

[5]  Geoffrey E. Hinton,et al.  On the importance of initialization and momentum in deep learning , 2013, ICML.

[6]  A. Naim,et al.  Neural computation as a tool for galaxy classification: methods and examples , 1995, astro-ph/9508012.

[7]  Lior Shamir,et al.  WND-CHARM: Multi-purpose image classification using compound image transforms , 2008, Pattern Recognit. Lett..

[8]  Simone Gori,et al.  Reversal of apparent rotation in the Enigma-figure with and without motion adaptation and the effect of T-junctions , 2006, Vision Research.

[9]  C. Lintott,et al.  Galaxy Zoo: reproducing galaxy morphologies via machine learning★ , 2009, 0908.2033.

[10]  Emmanuelle Gouillart,et al.  scikit-image: image processing in Python , 2014, PeerJ.

[11]  R. C. Nichol,et al.  Galaxy Zoo: bulgeless galaxies with growing black holes , 2012, 1207.4190.

[12]  Sugata Kaviraj,et al.  Galaxy Zoo: a sample of blue early-type galaxies at low redshift , 2009, 0903.3415.

[13]  A. Naim,et al.  Automated morphological classification of APM galaxies by supervised artificial neural networks , 1995, astro-ph/9503001.

[14]  Robert C. Nichol,et al.  MegaMorph - multiwavelength measurement of galaxy structure: complete Sersic profile information from modern surveys , 2012, 1212.3332.

[15]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[16]  Jean Ponce,et al.  A Theoretical Analysis of Feature Pooling in Visual Recognition , 2010, ICML.

[17]  M. V. Rossum,et al.  In Neural Computation , 2022 .

[18]  Lior Shamir,et al.  Combining Human and Machine Learning for Morphological Analysis of Galaxy Images , 2014, ArXiv.

[19]  The bulletin of mathematical biophysics , 2005, Protoplasma.

[20]  Marc Huertas-Company,et al.  Revisiting the Hubble sequence in the SDSS DR7 spectroscopic sample: a publicly available Bayesian automated classification , 2010, 1010.3018.

[21]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[22]  Simo Särkkä,et al.  Advances in Neural Information Processing Systems 25 (NIPS 2012) , 2002 .

[23]  O. Lahav,et al.  Galaxies, Human Eyes, and Artificial Neural Networks , 1994, Science.

[24]  Nitish Srivastava,et al.  Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.

[25]  E. Bertin,et al.  SExtractor: Software for source extraction , 1996 .

[26]  C. Lintott,et al.  Galaxy Zoo: 'Hanny's Voorwerp', a quasar light echo? , 2009, 0906.5304.

[27]  Joan Bruna,et al.  Spectral Networks and Locally Connected Networks on Graphs , 2013, ICLR.

[28]  F. R. Harnden,et al.  Astronomical Data Analysis Software and Systems X , 2001 .

[29]  Razvan Pascanu,et al.  Theano: new features and speed improvements , 2012, ArXiv.

[30]  Yichuan Zhang,et al.  Advances in Neural Information Processing Systems 25 , 2012 .

[31]  Lior Shamir,et al.  Automatic morphological classification of galaxy images. , 2009, Monthly notices of the Royal Astronomical Society.

[32]  J. van Leeuwen,et al.  Neural Networks: Tricks of the Trade , 2002, Lecture Notes in Computer Science.

[33]  O. Lahav,et al.  An artificial neural network approach to the classification of galaxy spectra , 1996, astro-ph/9608073.

[34]  O. Fuentes,et al.  Machine learning and image analysis for morphological galaxy classification , 2004 .

[35]  Robert C. Nichol,et al.  Galaxy Zoo:bars in disc galaxies , 2010, 1003.0449.

[36]  Alexander S. Szalay,et al.  Galaxy Zoo: the dependence of morphology and colour on environment , 2008, 0805.2612.

[37]  Yoshua Bengio,et al.  Gradient Flow in Recurrent Nets: the Difficulty of Learning Long-Term Dependencies , 2001 .

[38]  Kunihiko Fukushima,et al.  Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[39]  C. Lintott,et al.  Galaxy Zoo 1: data release of morphological classifications for nearly 900 000 galaxies , 2010, 1007.3265.

[40]  C. Lintott,et al.  Galaxy Zoo: the large-scale spin statistics of spiral galaxies in the Sloan Digital Sky Survey , 2008, 0803.3247.

[41]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[42]  S. Okamura,et al.  Galaxy types in the Sloan Digital Sky survey using supervised artificial neural networks , 2003, astro-ph/0306390.

[43]  C. Lintott,et al.  Galaxy Zoo 2: detailed morphological classifications for 304,122 galaxies from the Sloan Digital Sky Survey , 2013, 1308.3496.

[44]  Pedro M. Domingos,et al.  Deep Symmetry Networks , 2014, NIPS.

[45]  L. Shamir,et al.  Automatic quantitative morphological analysis of interacting galaxies , 2013, Astron. Comput..

[46]  Michigan.,et al.  Estimating photometric redshifts with artificial neural networks , 2002, astro-ph/0203250.

[47]  D. Clery Galaxy evolution. Galaxy zoo volunteers share pain and glory of research. , 2011, Science.

[48]  C. Lintott,et al.  Galaxy Zoo: morphologies derived from visual inspection of galaxies from the Sloan Digital Sky Survey , 2008, 0804.4483.

[49]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[50]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[51]  Robert C. Nichol,et al.  Galaxy Zoo: an independent look at the evolution of the bar fraction over the last eight billion years from HST-COSMOS ? , 2014, 1401.3334.

[52]  Yoshua Bengio,et al.  Deep Sparse Rectifier Neural Networks , 2011, AISTATS.

[53]  C. Lintott,et al.  Galaxy Zoo: Disentangling the Environmental Dependence of Morphology and Colour ⋆ , 2008, 0811.3970.

[54]  Stéphane Mallat,et al.  Rotation, Scaling and Deformation Invariant Scattering for Texture Discrimination , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[55]  C. Lintott,et al.  Galaxy Zoo: Passive Red Spirals . , 2009, 0910.4113.

[56]  Yoshua Bengio,et al.  Practical Recommendations for Gradient-Based Training of Deep Architectures , 2012, Neural Networks: Tricks of the Trade.