SIDRA: a blind algorithm for signal detection in photometric surveys

We present the Signal Detection using Random-Forest Algorithm (SIDRA), a detection and classification algorithm based on the Random Forest machine-learning technique. The goal of this paper is to show the power of SIDRA for quick and accurate signal detection and classification. We first assess the method's performance on simulated light curves and then apply it to a subset of the Kepler space mission catalogue. The simulations cover five classes of light curves: CONSTANT (constant light curves), TRANSIT (transiting exoplanets), VARIABLE (variable stars), MLENS (microlensing events) and EB (eclipsing binaries). The algorithm uses four features to classify the light curves. The training sample contains 5000 light curves (1000 from each class) and the test sample contains 50000 random light curves. The total SIDRA success ratio is $\geq 90\%$. Furthermore, with a decision probability of 60$\%$, the success ratio reaches 95 to 100$\%$ for the CONSTANT, VARIABLE, EB and MLENS classes and 92$\%$ for the TRANSIT class. Because the TRANSIT class is the one that fails most often, we run a simultaneous fit using SIDRA and a Box Least Squares (BLS) based algorithm to search for transiting exoplanets. As a result, our algorithm detects 7.5$\%$ more planets than a classic BLS algorithm, with better results for light curves with lower signal-to-noise ratios. SIDRA recovers 98$\%$ of the planet candidates in the Kepler sample and fails for 7$\%$ of the false-alarm subset. SIDRA promises to be useful for developing a detection algorithm and/or classifier for large photometric surveys such as the future TESS and PLATO exoplanet space missions.
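
The role of the decision probability and the per-class success ratio can be illustrated with a small sketch. It is not the SIDRA implementation: it assumes that the decision probability is the fraction of trees in the forest voting for the winning class, and that a light curve whose winning fraction falls below the threshold is left unclassified. The names `classify_with_threshold`, `success_ratio` and the `UNCLASSIFIED` label are hypothetical and do not come from the paper.

```python
from collections import Counter

# Class labels as named in the abstract.
CLASSES = ("CONSTANT", "TRANSIT", "VARIABLE", "MLENS", "EB")


def classify_with_threshold(tree_votes, threshold=0.6):
    """Combine per-tree votes into one decision.

    Assumption (not from the paper): the decision probability is the
    fraction of trees voting for the most common class. Below the
    threshold the light curve is returned as "UNCLASSIFIED".
    """
    if not tree_votes:
        raise ValueError("tree_votes must contain at least one vote")
    label, n_votes = Counter(tree_votes).most_common(1)[0]
    probability = n_votes / len(tree_votes)
    if probability >= threshold:
        return label, probability
    return "UNCLASSIFIED", probability


def success_ratio(true_labels, predicted_labels):
    """Fraction of correctly labelled light curves for each true class."""
    totals = Counter(true_labels)
    hits = Counter(
        t for t, p in zip(true_labels, predicted_labels) if t == p
    )
    return {cls: hits[cls] / totals[cls] for cls in totals}


if __name__ == "__main__":
    # Hypothetical forest of 10 trees voting on a single light curve.
    votes = ["TRANSIT"] * 7 + ["EB"] * 3
    print(classify_with_threshold(votes))
```

Under this assumption, raising the threshold trades completeness for purity: fewer light curves receive a label, but those that do were agreed on by a larger share of the trees.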
