Optical Wavelength Guided Self-Supervised Feature Learning For Galaxy Cluster Richness Estimate

Most galaxies in the nearby Universe are gravitationally bound to a cluster or group of galaxies. Their optical contents, such as optical richness, are crucial for understanding the co-evolution of galaxies and large-scale structures in modern astronomy and cosmology. The determination of optical richness can be challenging. We propose a self-supervised approach for estimating optical richness from multi-band optical images. The method uses the data properties of the multi-band optical images for pre-training, which enables learning feature representations from a large but unlabeled dataset. We apply the proposed method to the Sloan Digital Sky Survey. The result shows our estimate of optical richness lowers the mean absolute error and intrinsic scatter by 11.84% and 20.78%, respectively, while reducing the need for labeled training data by up to 60%. We believe the proposed method will benefit astronomy and cosmology, where a large number of unlabeled multi-band images are available, but acquiring image labels is costly.

[1]  M. Jarvis,et al.  An Application of Multi-band Forced Photometry to One Square Degree of SERVS: Accurate Photometric Redshifts and Implications for Future Science , 2017, 1704.01582.

[2]  Nathan Jacobs,et al.  Automatic Hand Skeletal Shape Estimation from Radiographs , 2018, 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[3]  Yu Zhang,et al.  Multi-Branch Attention Networks for Classifying Galaxy Clusters , 2021, 2020 25th International Conference on Pattern Recognition (ICPR).

[4]  A. Finoguenov,et al.  redMaPPer. I. ALGORITHM AND SDSS DR8 CATALOG , 2013, 1303.3562.

[5]  D. Lambas,et al.  Brightest group galaxies and the large-scale environment , 2015, 1502.01221.

[6]  Scott Workman,et al.  Learning a Dynamic Map of Visual Appearance , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[8]  J. Mohr,et al.  Constraints on Cosmological Parameters from Future Galaxy Cluster Surveys , 2000, astro-ph/0002336.

[9]  Bram van Ginneken,et al.  A survey on deep learning in medical image analysis , 2017, Medical Image Anal..

[10]  Yu Zhang,et al.  Defense-PointNet: Protecting PointNet Against Adversarial Attacks , 2019, 2019 IEEE International Conference on Big Data (Big Data).

[11]  Chris Bebek,et al.  The Dark Energy Spectroscopic Instrument (DESI) , 2019, 1907.10688.

[12]  Aniruddha R. Thakar,et al.  Sloan Digital Sky Survey IV: Mapping the Milky Way, Nearby Galaxies, and the Distant Universe , 2017, 1703.00052.

[13]  Hunter Blanton,et al.  Inconsistent Performance of Deep Learning Models on Mammogram Classification. , 2020, Journal of the American College of Radiology : JACR.

[14]  Hilo,et al.  THE ELEVENTH AND TWELFTH DATA RELEASES OF THE SLOAN DIGITAL SKY SURVEY: FINAL DATA FROM SDSS-III , 2015, 1501.00963.

[15]  F. Marinacci,et al.  A Deep Learning Approach to Galaxy Cluster X-Ray Masses , 2018, The Astrophysical Journal.

[16]  Xiaoqin Wang,et al.  Improved Trainable Calibration Method for Neural Networks on Medical Imaging Classification , 2020, ArXiv.

[17]  Y Zhang,et al.  A deep learning view of the census of galaxy clusters in IllustrisTNG , 2020, Monthly Notices of the Royal Astronomical Society.

[18]  Yaohang Li,et al.  Clinical big data and deep learning: Applications, challenges, and future outlooks , 2019, Big Data Min. Anal..

[19]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.