Avocado: Photometric Classification of Astronomical Transients with Gaussian Process Augmentation

Upcoming astronomical surveys such as the Large Synoptic Survey Telescope (LSST) will rely on photometric classification to identify the majority of the transients and variables that they discover. We present a set of techniques for photometric classification that can be applied even when the training set of spectroscopically-confirmed objects is heavily biased towards bright, low-redshift objects. Using Gaussian process regression to model arbitrary light curves in all bands simultaneously, we "augment" the training set by generating new versions of the original light curves covering a range of redshifts and observing conditions. We train a boosted decision tree classifier on features extracted from the augmented light curves, and we show how such a classifier can be designed to produce classifications that are independent of the redshift distributions of objects in the training sample. Our classification algorithm was the best-performing among the 1,094 models considered in the blinded phase of the Photometric LSST Astronomical Time-Series Classification Challenge (PLAsTiCC), scoring 0.468 on the organizers' logarithmic-loss metric with flat weights for all object classes in the training set, and achieving an AUC of 0.957 for classification of Type Ia supernovae. Our results suggest that spectroscopic campaigns used for training photometric classifiers should focus on typing large numbers of well-observed, intermediate redshift transients instead of attempting to type a sample of transients that is directly representative of the full dataset being classified. All of the algorithms described in this paper are implemented in the avocado software package.

[1]  Chad M. Schafer,et al.  Semi-supervised learning for photometric supernova classification★ , 2011, 1103.6034.

[2]  Prasanth H. Nair,et al.  Astropy: A community Python package for astronomy , 2013, 1307.6212.

[3]  David O. Jones,et al.  The Complete Light-curve Sample of Spectroscopically Confirmed SNe Ia from Pan-STARRS1 and Cosmological Constraints from the Combined Pantheon Sample , 2017, The Astrophysical Journal.

[4]  A. A. Mahabal,et al.  The Photometric LSST Astronomical Time-series Classification Challenge PLAsTiCC: Selection of a Performance Metric for Classification Probabilities Balancing Diverse Science Goals , 2018, The Astronomical Journal.

[5]  N. S. Philip,et al.  Results from the Supernova Photometric Classification Challenge , 2010, 1008.1024.

[6]  F. Feroz,et al.  A simple and robust method for automated photometric classification of supernovae using neural networks , 2012, 1208.1264.

[7]  Eric Jones,et al.  SciPy: Open Source Scientific Tools for Python , 2001 .

[8]  Melvin M. Varughese,et al.  PHOTOMETRIC SUPERNOVA COSMOLOGY WITH BEAMS AND SDSS-II , 2011, 1111.5328.

[9]  Larry Denneau,et al.  The Pan-STARRS wide-field optical/NIR imaging survey , 2010, Astronomical Telescopes + Instrumentation.

[10]  C. Tao,et al.  STANDARDIZING TYPE Ia SUPERNOVA ABSOLUTE MAGNITUDES USING GAUSSIAN PROCESS DATA REGRESSION , 2013, 1302.2925.

[11]  David A. van Dyk,et al.  STACCATO: a novel solution to supernova photometric classification with biased training sets , 2017, 1706.03811.

[12]  C. Baltay,et al.  Evidence of Environmental Dependencies of Type Ia Supernovae from the Nearby Supernova Factory indicated by Local H$\alpha$ , 2013, 1309.1182.

[13]  Tom Charnock,et al.  Deep Recurrent Neural Networks for Supernovae Classification , 2016, ArXiv.

[14]  Stefano Casertano,et al.  Type Ia Supernova Discoveries at z > 1 from the Hubble Space Telescope: Evidence for Past Deceleration and Constraints on Dark Energy Evolution , 2004, astro-ph/0402512.

[15]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.

[16]  W. Wong,et al.  The calculation of posterior distributions by data augmentation , 1987 .

[17]  M. Kunz,et al.  Bayesian estimation applied to multiple species , 2006, astro-ph/0611004.

[18]  A. J. Connolly,et al.  Models and Simulations for the Photometric LSST Astronomical Time Series Classification Challenge (PLAsTiCC) , 2019, Publications of the Astronomical Society of the Pacific.

[19]  J. Vanderplas Understanding the Lomb–Scargle Periodogram , 2017, 1703.09824.

[20]  S. Bailey,et al.  How to Find More Supernovae with Less Work: Object Classification Techniques for Difference Imaging , 2006, 0705.0493.

[21]  E. Ishida,et al.  Kernel PCA for Type Ia supernovae photometric classification , 2012, 1201.6676.

[22]  David O. Jones,et al.  TYPE Ia SUPERNOVA RATE MEASUREMENTS TO REDSHIFT 2.5 FROM CANDELS: SEARCHING FOR PROMPT EXPLOSIONS IN THE EARLY UNIVERSE , 2014, 1401.7978.

[23]  Richard Kessler,et al.  PHOTOMETRIC SN IA CANDIDATES FROM THE THREE-YEAR SDSS-II SN SURVEY DATA , 2022 .

[24]  S. P. Littlefair,et al.  THE ASTROPY PROJECT: BUILDING AN INCLUSIVE, OPEN-SCIENCE PROJECT AND STATUS OF THE V2.0 CORE PACKAGE , 2018 .

[25]  Adam G. Riess,et al.  THE RATE OF CORE COLLAPSE SUPERNOVAE TO REDSHIFT 2.5 FROM THE CANDELS AND CLASH SUPERNOVA SURVEYS , 2015, 1509.06574.

[26]  A. Mahabal,et al.  Optimizing spectroscopic follow-up strategies for supernova photometric classification with active learning , 2018, Monthly Notices of the Royal Astronomical Society.

[27]  Marc Chaumont,et al.  PELICAN: deeP architecturE for the LIght Curve ANalysis , 2019, Astronomy & Astrophysics.

[28]  T. Beers,et al.  Measures of location and scale for velocities in clusters of galaxies. A robust approach , 1990 .

[29]  R. Kirshner,et al.  HUBBLE RESIDUALS OF NEARBY TYPE Ia SUPERNOVAE ARE CORRELATED WITH HOST GALAXY MASSES , 2009, 0912.0929.

[30]  Bayesian Single-Epoch Photometric Classification of Supernovae , 2006, astro-ph/0610129.

[31]  M. Sullivan,et al.  THE DIFFERENCE IMAGING PIPELINE FOR THE TRANSIENT SEARCH IN THE DARK ENERGY SURVEY , 2015, 1507.05137.

[32]  Kyle Barbary,et al.  UNITY: CONFRONTING SUPERNOVA COSMOLOGY’S STATISTICAL AND SYSTEMATIC UNCERTAINTIES IN A UNIFIED BAYESIAN FRAMEWORK , 2015, 1507.01602.

[33]  Jake Vanderplas,et al.  SNANA: A Public Software Package for Supernova Analysis , 2009, 0908.4280.

[34]  J. B. Oke,et al.  ENERGY DISTRIBUTIONS, K CORRECTIONS, AND THE STEBBINS--WHITFORD EFFECT FOR GIANT ELLIPTICAL GALAXIES. , 1968 .

[35]  M. Sullivan,et al.  SUPERNOVA SIMULATIONS AND STRATEGIES FOR THE DARK ENERGY SURVEY , 2011, 1111.1969.

[36]  Richard Kessler,et al.  PHOTOMETRIC TYPE Ia SUPERNOVA CANDIDATES FROM THE THREE-YEAR SDSS-II SN SURVEY DATA , 2011, 1107.5106.

[37]  Xiao-Li Meng,et al.  The Art of Data Augmentation , 2001 .

[38]  M. Sullivan,et al.  Improved cosmological constraints from a joint analysis of the SDSS-II and SNLS supernova samples , 2014, 1401.4064.

[39]  M. Sullivan,et al.  SALT2: using distant supernovae to improve the use of type Ia supernovae as distance indicators , 2007, astro-ph/0701828.

[40]  M. Phillips,et al.  Observational Evidence from Supernovae for an Accelerating Universe and a Cosmological Constant , 1998, astro-ph/9805201.

[41]  R. Ellis,et al.  Measurements of $\Omega$ and $\Lambda$ from 42 high redshift supernovae , 1998, astro-ph/9812133.

[42]  S. Deustua,et al.  THE HUBBLE SPACE TELESCOPE CLUSTER SUPERNOVA SURVEY. V. IMPROVING THE DARK-ENERGY CONSTRAINTS ABOVE z > 1 AND BUILDING AN EARLY-TYPE-HOSTED SUPERNOVA SAMPLE , 2011, 1105.3470.

[43]  L.Wang,et al.  Improved Cosmological Constraints from New, Old, and Combined Supernova Data Sets , 2008, 0804.4142.

[44]  Leslie Greengard,et al.  Fast Direct Methods for Gaussian Processes , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  A. Riess,et al.  Measuring the Properties of Dark Energy with Photometrically Classified Pan-STARRS Supernovae. I. Systematic Uncertainty from Core-collapse Supernova Contamination , 2016, 1611.07042.

[46]  O. Lahav,et al.  PHOTOMETRIC SUPERNOVA CLASSIFICATION WITH MACHINE LEARNING , 2016, 1603.00882.

[47]  Rae. Z.H. Aliyev,et al.  Interpolation of Spatial Data , 2018, Biomedical Journal of Scientific & Technical Research.

[48]  Miguel de Val-Borro,et al.  The Astropy Project: Building an Open-science Project and Status of the v2.0 Core Package , 2018, The Astronomical Journal.

[49]  Abhijit Saha,et al.  The LSST operations simulator , 2005, Astronomical Telescopes and Instrumentation.

[50]  Taghi M. Khoshgoftaar,et al.  A survey on Image Data Augmentation for Deep Learning , 2019, Journal of Big Data.

[51]  C. Tao,et al.  SNEMO: Improved Empirical Models for Type Ia Supernovae , 2018, The Astrophysical Journal.

[52]  Mamoru Doi,et al.  The Type Ia supernovae rate with Subaru/XMM-Newton Deep Survey , 2014, 1401.7701.

[53]  C. Tao,et al.  IMPROVING COSMOLOGICAL DISTANCE MEASUREMENTS USING TWIN TYPE IA SUPERNOVAE , 2015, 1511.01102.