The classification of periodic light curves from non-survey optimized observational data through automated extraction of phase-based visual features

We present Random Forest, Support Vector Machine and Feedforward Neural Network models to classify 2519 variable star light curves. These light curves are generated from a reduction of non-survey optimized observational images gathered by wide-field cameras mounted on the Liverpool Telescope. We extract 16 features found to be highly informative in previous studies and achieve an area under the curve of 0.8495 using a feedforward neural network with 50 hidden neurons trained with stratified 10-fold cross-validation with 3 repeats. We propose using an automated visual feature extraction technique by transforming bin-averaged phase-folded light curves into image based representations. This eliminates much of the noise and the missing phase data, due to sampling defects, should have a less destructive effect on these shape features as they still remain at least partially present. There is also no need for feature engineering as the learning algorithms can learn shape features directly from the light curves. We produced a set of scaled images based on a threshold of data points in each pixel. Training on the same feedforward network, we achieve an area under the curve of 0.6348. By introducing the Period and Amplitude as features into this dataset therefore giving meaning to the dimensions of the image we show this improves to 0.7952. Our current models lack translational-invariance and the method may be better suited to specific sub-classification problems common in the variable object hierarchical multi-class problem.

[1]  R. J. Smith,et al.  STILT: System Design & Performance , 2013 .

[2]  Pavlos Protopapas,et al.  META-CLASSIFICATION FOR VARIABLE STARS , 2016, 1601.03013.

[3]  J. Scargle Studies in astronomical time series analysis. II - Statistical aspects of spectral analysis of unevenly spaced data , 1982 .

[4]  M. G. Lattanzi,et al.  GAIA: Composition, formation and evolution of the Galaxy , 2001, astro-ph/0101235.

[5]  T. Bedding,et al.  Hipparcos Period-Luminosity Relations for Mira and Semiregular variables , 1998, astro-ph/9808173.

[6]  Dhiya Al-Jumeily,et al.  A Dynamic, Modular Intelligent-Agent Framework for Astronomical Light Curve Analysis and Classification , 2016, ICIC.

[7]  C. Bailer-Jones,et al.  A package for the automated classification of periodic variable stars , 2015, 1512.01611.

[8]  E. al.,et al.  The Sloan Digital Sky Survey: Technical summary , 2000, astro-ph/0006396.

[9]  L. M. Sarro,et al.  Automated supervised classification of variable stars - I. Methodology , 2007, 0711.0703.

[10]  Laurent Eyer,et al.  Variable stars across the observational HR diagram , 2007, 0712.3797.

[11]  J. Gunn,et al.  The Sloan Digital Sky Survey , 1994, astro-ph/9412080.

[12]  A. K. Vivas,et al.  The QUEST RR Lyrae Survey: Confirmation of the Clump at 50 Kiloparsecs and Other Overdensities in the Outer Halo , 2001, astro-ph/0105135.

[13]  Gustavo A. Medrano-Cerda,et al.  The Liverpool Telescope: performance and first results , 2004, SPIE Astronomical Telescopes + Instrumentation.

[14]  J. Richards,et al.  ON MACHINE-LEARNED CLASSIFICATION OF VARIABLE STARS WITH SPARSE AND NOISY TIME-SERIES DATA , 2011, 1101.1959.

[15]  Eduardo Serrano,et al.  LSST: From Science Drivers to Reference Design and Anticipated Data Products , 2008, The Astrophysical Journal.

[16]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[17]  Pavlos Protopapas,et al.  FATS: Feature Analysis for Time Series , 2015, 1506.00010.

[18]  M. Paegert,et al.  THE EB FACTORY PROJECT. I. A FAST, NEURAL-NET-BASED, GENERAL PURPOSE LIGHT CURVE CLASSIFIER OPTIMIZED FOR ECLIPSING BINARIES , 2014, 1407.0443.

[19]  DISCORDANCE OF THE UNIFIED SCHEME WITH OBSERVED PROPERTIES OF QUASARS AND HIGH-EXCITATION GALAXIES IN THE 3CRR SAMPLE , 2013, 1306.2087.