Robust Period Estimation Using Mutual Information for Multiband Light Curves in the Synoptic Survey Era

The Large Synoptic Survey Telescope (LSST) will produce an unprecedented amount of light curves using six optical bands. Robust and efficient methods that can aggregate data from multidimensional sparsely-sampled time series are needed. In this paper we present a new method for light curve period estimation based on the quadratic mutual information (QMI). The proposed method does not assume a particular model for the light curve nor its underlying probability density and it is robust to non-Gaussian noise and outliers. By combining the QMI from several bands the true period can be estimated even when no single-band QMI yields the period. Period recovery performance as a function of average magnitude and sample size is measured using 30,000 synthetic multi-band light curves of RR Lyrae and Cepheid variables generated by the LSST Operations and Catalog simulators. The results show that aggregating information from several bands is highly beneficial in LSST sparsely-sampled time series, obtaining an absolute increase in period recovery rate up to 50%. We also show that the QMI is more robust to noise and light curve length (sample size) than the multiband generalizations of the Lomb Scargle and Analysis of Variance periodograms, recovering the true period in 10-30% more cases than its competitors. A python package containing efficient Cython implementations of the QMI and other methods is provided.

[1]  P. Bartholdi,et al.  VARIABLE STARS : WHICH NYQUIST FREQUENCY? , 1999 .

[2]  J. Scargle Studies in astronomical time series analysis. II - Statistical aspects of spectral analysis of unevenly spaced data , 1982 .

[3]  S. Djorgovski,et al.  Using conditional entropy to identify periodicity , 2013, 1306.6664.

[4]  Donald W. Sweeney,et al.  LSST Science Book, Version 2.0 , 2009, 0912.0201.

[5]  C. D. Kemp,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[6]  Andrew Cameron Becker,et al.  SIMULATED LSST SURVEY OF RR LYRAE STARS THROUGHOUT THE LOCAL GROUP , 2012 .

[7]  Jose C. Principe,et al.  Information Theoretic Learning - Renyi's Entropy and Kernel Perspectives , 2010, Information Theoretic Learning.

[8]  D. Clarke,et al.  String/Rope length methods using the Lafler-Kinman statistic , 2002 .

[9]  Jose C. Principe,et al.  Measures of Entropy From Data Using Infinitely Divisible Kernels , 2012, IEEE Transactions on Information Theory.

[10]  John W. Fisher,et al.  Learning from Examples with Information Theoretic Criteria , 2000, J. VLSI Signal Process..

[11]  Moon,et al.  Estimation of mutual information using kernel density estimators. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[12]  R. Stellingwerf Period determination using phase dispersion minimization , 1978 .

[13]  Zeljko Ivezic,et al.  PERIODOGRAMS FOR MULTIBAND ASTRONOMICAL TIME SERIES , 2015, 1502.01344.

[14]  Nicholas Mondrik,et al.  A MULTIBAND GENERALIZATION OF THE MULTIHARMONIC ANALYSIS OF VARIANCE PERIOD ESTIMATION ALGORITHM AND THE EFFECT OF INTER-BAND OBSERVING CADENCE ON PERIOD RECOVERY RATE , 2015, 1508.04772.

[15]  D. Palmer A FAST CHI-SQUARED TECHNIQUE FOR PERIOD SEARCH OF IRREGULARLY SAMPLED DATA , 2009, 0901.1913.

[16]  Eduardo Serrano,et al.  LSST: From Science Drivers to Reference Design and Anticipated Data Products , 2008, The Astrophysical Journal.

[17]  Abhijit Saha,et al.  The LSST operations simulator , 2005, Astronomical Telescopes and Instrumentation.

[18]  Pavlos Protopapas,et al.  Computational Intelligence Challenges and Applications on Large-Scale Astronomical Time Series Databases , 2014, IEEE Computational Intelligence Magazine.

[19]  Linhua Jiang,et al.  LIGHT CURVE TEMPLATES AND GALACTIC DISTRIBUTION OF RR LYRAE STARS FROM SLOAN DIGITAL SKY SURVEY STRIPE 82 , 2009, 0910.4611.

[20]  A. Kraskov,et al.  Estimating mutual information. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[21]  G. Jogesh Babu,et al.  Big data in astronomy , 2012 .

[22]  Pavlos Protopapas,et al.  A NOVEL, FULLY AUTOMATED PIPELINE FOR PERIOD ESTIMATION IN THE EROS 2 DATA SET , 2014, ArXiv.

[23]  Andrew J. Connolly,et al.  An end-to-end simulation framework for the Large Synoptic Survey Telescope , 2014, Astronomical Telescopes and Instrumentation.

[24]  M. Zechmeister,et al.  The generalised Lomb-Scargle periodogram. A new formalism for the floating-mean and Keplerian periodograms , 2009, 0901.2573.

[25]  Robert M. Gray,et al.  Entropy and Information Theory -2/E. , 2014 .

[26]  G. Fasano,et al.  A multidimensional version of the Kolmogorov–Smirnov test , 1987 .

[27]  Ciro Donalek,et al.  A comparison of period finding algorithms , 2013, 1307.2209.

[28]  Frederic Pont,et al.  The effect of red noise on planetary transit detection , 2006, astro-ph/0608597.

[29]  Gutti Jogesh Babu,et al.  Statistical Challenges in Modern Astronomy , 1992 .

[30]  S. Saigal,et al.  Relative performance of mutual information estimation methods for quantifying the dependence among short and noisy data. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[31]  J. Richards,et al.  ON MACHINE-LEARNED CLASSIFICATION OF VARIABLE STARS WITH SPARSE AND NOISY TIME-SERIES DATA , 2011, 1101.1959.

[32]  S. R. Jammalamadaka,et al.  Topics in Circular Statistics , 2001 .

[33]  Dirk P. Kroese,et al.  Kernel density estimation via diffusion , 2010, 1011.2602.

[34]  Shay Zucker Detection of Periodicity Based on Independence Tests – II. Improved Serial Independence Measure , 2016 .

[35]  Pavlos Protopapas,et al.  An Information Theoretic Algorithm for Finding Periodicities in Stellar Light Curves , 2012, IEEE Transactions on Signal Processing.

[36]  A. Schwarzenberg-Czerny Fast and Statistically Optimal Period Search in Uneven Sampled Observations , 1996 .