Modeling lightcurves for improved classification of astronomical objects

Many synoptic surveys are observing large parts of the sky multiple times. The resulting time series of light measurements, called lightcurves, provide a wonderful window to the dynamic nature of the Universe. However, there are many significant challenges in analyzing these lightcurves. We describe a modeling‐based approach using Gaussian process regression for generating critical measures for the classification of such lightcurves. This method has key advantages over other popular nonparametric regression methods in its ability to deal with censoring, a mixture of sparsely and densely sampled curves, the presence of annual gaps caused by objects not being visible throughout the year from a given position on Earth and known but variable measurement errors. We demonstrate that our approach performs better by showing it has a higher correct classification rate than past methods popular in astronomy. Finally, we provide future directions for use in sky‐surveys that are getting even bigger by the day. © 2016 Wiley Periodicals, Inc. Statistical Analysis and Data Mining: The ASA Data Science Journal, 2016

[1]  Peter B. Stetson,et al.  ON THE AUTOMATIC DETERMINATION OF LIGHT-CURVE PARAMETERS FOR CEPHEID VARIABLES , 1996 .

[2]  R. Webbink,et al.  A Catalog and Atlas of Cataclysmic Variables: The Living Edition , 2001, astro-ph/0102302.

[3]  Christina Gloeckner,et al.  Modern Applied Statistics With S , 2003 .

[4]  Kurt Hornik,et al.  kernlab - An S4 Package for Kernel Methods in R , 2004 .

[5]  R. Webbink,et al.  A Catalog and Atlas of Cataclysmic Variables: The Final Edition , 2005, astro-ph/0602278.

[6]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[7]  Kirk D. Borne,et al.  Scientific Data Mining in Astronomy , 2009, Next Generation of Data Mining.

[8]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[9]  A. J. Drake,et al.  FIRST RESULTS FROM THE CATALINA REAL-TIME TRANSIENT SURVEY , 2008, 0809.1394.

[10]  J. De Ridder,et al.  AUTOMATED CLASSIFICATION OF VARIABLE STARS IN THE ASTEROSEISMOLOGY PROGRAM OF THE KEPLER SPACE MISSION , 2010, 1001.0507.

[11]  A. A. Mahabal,et al.  The Catalina Real-Time Transient Survey (CRTS) , 2011, 1102.5004.

[12]  S. G. Djorgovski,et al.  Discovery, classification, and scientific exploration of transient events from the Catalina Real-time Transient Survey , 2011, 1111.0313.

[13]  J. Richards,et al.  ON MACHINE-LEARNED CLASSIFICATION OF VARIABLE STARS WITH SPARSE AND NOISY TIME-SERIES DATA , 2011, 1101.1959.

[14]  Hans-Georg Müller,et al.  Functional Data Analysis , 2016 .

[15]  A. Markowitz,et al.  TWELVE AND A HALF YEARS OF OBSERVATIONS OF CENTAURUS A WITH THE ROSSI X-RAY TIMING EXPLORER , 2011, 1102.5076.

[16]  Stephen R. Kane,et al.  CHARACTERIZING THE VARIABILITY OF STARS WITH EARLY-RELEASE KEPLER DATA , 2010, 1009.1840.

[17]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[18]  Ciro Donalek,et al.  Flashes in a star stream: Automated classification of astronomical transient events , 2012, 2012 IEEE 8th International Conference on E-Science.

[19]  Pavlos Protopapas,et al.  Semi-parametric Robust Event Detection for Massive Time-Domain Databases , 2013, 1301.3027.

[20]  N. Peng,et al.  Selecting quasar candidates using a support vector machine classification system , 2012 .

[21]  Adam A. Miller,et al.  ACTIVE LEARNING TO OVERCOME SAMPLE SELECTION BIAS: APPLICATION TO PHOTOMETRIC VARIABLE STAR CLASSIFICATION , 2011, 1106.2832.

[22]  E. O. Ofek,et al.  Automating Discovery and Classification of Transients and Variable Stars in the Synoptic Survey Era , 2011, 1106.5491.

[23]  S Roberts,et al.  Gaussian processes for time-series modelling , 2013, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[24]  S. G. Djorgovski,et al.  Feature selection strategies for classifying high dimensional astronomical data sets , 2013, 2013 IEEE International Conference on Big Data.

[25]  Sergey E. Koposov,et al.  THE CATALINA SURVEYS PERIODIC VARIABLE STAR CATALOG , 2014, 1405.4290.

[26]  Ciro Donalek,et al.  A novel variability-based method for quasar selection: evidence for a rest-frame ∼54 d characteristic time-scale , 2013, 1401.1785.