Revealing the Galaxy–Halo Connection through Machine Learning

Understanding the connections between galaxy stellar mass, star formation rate, and dark matter halo mass represents a key goal of the theory of galaxy formation. Cosmological simulations that include hydrodynamics, physical treatments of star formation, feedback from supernovae, and the radiative transfer of ionizing photons can capture the processes relevant for establishing these connections. The complexity of these physics can prove difficult to disentangle and obfuscate how mass-dependent trends in the galaxy population originate. Here, we train a machine-learning method called Explainable Boosting Machines (EBMs) to infer how the stellar mass and star formation rate of nearly 6 million galaxies simulated by the Cosmic Reionization on Computers project depend on the physical properties of halo mass, the peak circular velocity of the galaxy during its formation history v peak, cosmic environment, and redshift. The resulting EBM models reveal the relative importance of these properties in setting galaxy stellar mass and star formation rate, with v peak providing the most dominant contribution. Environmental properties provide substantial improvements for modeling the stellar mass and star formation rate in only ≲10% of the simulated galaxies. We also provide alternative formulations of EBM models that enable low-resolution simulations, which cannot track the interior structure of dark matter halos, to predict the stellar mass and star formation rate of galaxies computed by high-resolution simulations with detailed baryonic physics.

[1]  C. Conselice,et al.  The quenching of galaxies, bulges, and disks since cosmic noon. A machine learning approach for identifying causality in astronomical data , 2022, Astronomy & Astrophysics.

[2]  R. McGibbon,et al.  Multi-epoch machine learning 1: Unravelling nature versus nurture for galaxy formation , 2021, Monthly notices of the Royal Astronomical Society.

[3]  R. Maiolino,et al.  On the quenching of star formation in observed and simulated central galaxies: Evidence for the role of integrated AGN feedback , 2021, 2112.07672.

[4]  B. Robertson,et al.  Inferring the Thermal History of the Intergalactic Medium from the Properties of the Hydrogen and Helium Lyα Forest , 2021, The Astrophysical Journal.

[5]  V. Springel,et al.  Introducing the THESAN project: radiation-magnetohydrodynamic simulations of the epoch of reionization , 2021, 2110.00584.

[6]  Xiaohui Fan,et al.  Chasing the Tail of Cosmic Reionization with Dark Gap Statistics in the Lyα Forest over 5 < z < 6 , 2021, The Astrophysical Journal.

[7]  J. Neill,et al.  An Ancient Massive Quiescent Galaxy Found in a Gas-rich z ∼ 3 Group , 2021, The Astrophysical Journal Letters.

[8]  I. Zehavi,et al.  Predicting halo occupation and galaxy assembly bias with machine learning , 2021, Monthly Notices of the Royal Astronomical Society.

[9]  C. Baugh,et al.  A machine learning approach to mapping baryons onto dark matter haloes using the EAGLE and C-EAGLE simulations , 2021, 2106.04980.

[10]  A. Farahi,et al.  shaping the gas: understanding gas shapes in dark matter haloes with interpretable machine learning , 2020, Monthly Notices of the Royal Astronomical Society.

[11]  B. Robertson,et al.  Effects of Photoionization and Photoheating on Lyα Forest Properties from Cholla Cosmological Simulations , 2020, The Astrophysical Journal.

[12]  C. Giocoli,et al.  The stellar-to-halo mass relation over the past 12 Gyr , 2020, Astronomy & Astrophysics.

[13]  C. Avestruz,et al.  Cosmic Reionization On Computers: The Galaxy–Halo Connection between 5 ≤ z ≤ 10 , 2020, The Astrophysical Journal.

[14]  D. Narayanan,et al.  simba: Cosmological simulations with black hole growth and feedback , 2019, Monthly Notices of the Royal Astronomical Society.

[15]  R. Teyssier,et al.  Cosmic Dawn II (CoDa II): a new radiation-hydrodynamics simulation of the self-consistent coupling of galaxy formation and reionization , 2018, 1811.11192.

[16]  D. Goddard,et al.  Both starvation and outflows drive galaxy quenching , 2018, Monthly Notices of the Royal Astronomical Society.

[17]  L. Dessart,et al.  The surface abundances of red supergiants at core collapse , 2018, Monthly Notices of the Royal Astronomical Society.

[18]  Andrew P. Hearin,et al.  UniverseMachine: The correlation between galaxy growth and dark matter halo assembly from z = 0−10 , 2018, Monthly Notices of the Royal Astronomical Society.

[19]  J. Tinker,et al.  The Connection Between Galaxies and Their Dark Matter Halos , 2018, Annual Review of Astronomy and Astrophysics.

[20]  Annalisa Pillepich,et al.  Simulating galaxy formation with the IllustrisTNG model , 2017, 1703.02970.

[21]  Simon J. Mutch,et al.  SEMI-ANALYTIC GALAXY EVOLUTION (SAGE): MODEL CALIBRATION AND BASIC RESULTS , 2016, 1601.04709.

[22]  A. Hopkins,et al.  Galaxy And Mass Assembly (GAMA) : growing up in a bad neighbourhood – how do low-mass galaxies become passive? , 2015, 1511.02245.

[23]  R. Teyssier,et al.  Cosmic Dawn (CoDa): the first radiation-hydrodynamics simulation of reionization and galaxy formation in the Local Universe , 2015, 1511.00011.

[24]  Samuel W. Skillman,et al.  THE CONCENTRATION DEPENDENCE OF THE GALAXY–HALO CONNECTION: MODELING ASSEMBLY BIAS WITH ABUNDANCE MATCHING , 2015, 1510.05651.

[25]  Gregory F. Snyder,et al.  The illustris simulation: Public data release , 2015, Astron. Comput..

[26]  R. Somerville,et al.  Physical Models of Galaxy Formation in a Cosmological Framework , 2014, 1412.2712.

[27]  S. White,et al.  The EAGLE project: Simulating the evolution and assembly of galaxies and their environments , 2014, 1407.7040.

[28]  P. Shapiro,et al.  Simulating cosmic reionization: how large a volume is large enough? , 2013, 1310.7463.

[29]  H. Hoekstra,et al.  CFHTLenS: the relation between galaxy dark matter haloes and baryons from weak gravitational lensing , 2013, 1304.4265.

[30]  Andrew P. Hearin,et al.  SHAM Beyond Clustering: New Tests of Galaxy-Halo Abundance Matching with Galaxy Groups , 2012, 1210.4927.

[31]  S. More,et al.  Satellite kinematics – II. The halo mass–luminosity relation of central galaxies in SDSS , 2008, 0807.4532.

[32]  Columbia,et al.  Star Formation in AEGIS Field Galaxies since z = 1.1: The Dominance of Gradually Declining Star Formation, and the Main Sequence of Star-forming Galaxies , 2007, astro-ph/0701924.

[33]  R. Wechsler,et al.  Modeling Luminosity-dependent Galaxy Clustering through Cosmic Time , 2005, astro-ph/0512234.

[34]  Princeton University.,et al.  The Non-Parametric Model for Linking Galaxy Luminosity with Halo/Subhalo Mass: Are First Brightest Galaxies Special? , 2005, astro-ph/0701096.

[35]  R. Bender,et al.  Specific Star Formation Rates to Redshift 5 from the FORS Deep Field and the GOODS-S Field , 2005, astro-ph/0509197.

[36]  A. Szalay,et al.  Galaxy Luminosity Functions to z~1 from DEEP2 and COMBO-17: Implications for Red Galaxy Formation , 2005, astro-ph/0506044.

[37]  R. Mandelbaum,et al.  Galaxy-galaxy lensing : dissipationless simulations versus the halo model , 2004, astro-ph/0410711.

[38]  R. Nichol,et al.  The Bimodal Galaxy Color Distribution: Dependence on Luminosity and Environment , 2004, astro-ph/0406266.

[39]  J. Ostriker,et al.  Linking halo mass to galaxy luminosity , 2004, astro-ph/0402500.

[40]  J. Mohr,et al.  K-Band Properties of Galaxy Clusters and Groups: Luminosity Function, Radial Distribution, and Halo Occupation Number , 2004, astro-ph/0402308.

[41]  J. Brinkmann,et al.  The environmental dependence of the relations between stellar mass, structure, star formation and nuclear activity in galaxies , 2004, astro-ph/0402030.

[42]  J. Brinkmann,et al.  The physical properties of star-forming galaxies in the low-redshift universe , 2003, astro-ph/0311060.

[43]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[44]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[45]  Guido Rossum,et al.  Python Reference Manual , 2000 .

[46]  Y. Jing,et al.  Spatial Correlation Function and Pairwise Velocity Dispersion of Galaxies: Cold Dark Matter Models versus the Las Campanas Survey , 1997, astro-ph/9707106.

[47]  S. Yi,et al.  The Roles of Mass and Environment in the Quenching of Galaxies. II. , 2020 .