Data mining for gravitationally lensed quasars

Gravitationally lensed (GL) quasars are brighter than their unlensed counterparts and produce images with distinctive morphological signatures. Past searches and target selection algorithms, in particular the Sloan Quasar Lens Search (SQLS), have relied on basic morphological criteria, which were applied to samples of bright, spectroscopically confirmed quasars. The SQLS techniques are not sufficient for searching into new surveys (e.g. DES, PS1, LSST), because spectroscopic information is not readily available and the large data volume requires higher purity in target/candidate selection. We carry out a systematic exploration of machine learning techniques and demonstrate that a two step strategy can be highly effective. In the first step we use catalog-level information ($griz$+WISE magnitudes, second moments) to preselect targets, using artificial neural networks. The accepted targets are then inspected with pixel-by-pixel pattern recognition algorithms (Gradient-Boosted Trees), to form a final set of candidates. The results from this procedure can be used to further refine the simpler SQLS algorithms, with a twofold (or threefold) gain in purity and the same (or $80\%$) completeness at target-selection stage, or a purity of $70\%$ and a completeness of $60\%$ after the candidate-selection step. Simpler photometric searches in $griz$+WISE based on colour cuts would provide samples with $7\%$ purity or less. Our technique is extremely fast, as a list of candidates can be obtained from a stage III experiment (e.g. DES catalog/database) in {a few} CPU hours. The techniques are easily extendable to Stage IV experiments like LSST with the addition of time domain information.

[1]  Improved Cosmological Constraints from Gravitational Lens Statistics , 2004, astro-ph/0401138.

[2]  Probing the Coevolution of Supermassive Black Holes and Galaxies Using Gravitationally Lensed Quasar Hosts , 2006, astro-ph/0603248.

[3]  Yann Le Du,et al.  Lightcurve Classification in Massive Variability Surveys , 2003 .

[4]  Nicholas A. Walton,et al.  gs-tec: the Gaia spectrophotometry transient events classifier , 2014, 1404.7150.

[5]  N. Wyn Evans,et al.  Light-curve classification in massive variability surveys — I. Microlensing , 2002, astro-ph/0211121.

[6]  R. J. Brunner,et al.  Exhausting the Information: Novel Bayesian Combination of Photometric Redshift PDFs , 2014, 1403.0044.

[7]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[8]  Psu,et al.  Probing the Dust-to-Gas Ratio of z > 0 Galaxies through Gravitational Lenses , 2005, astro-ph/0507392.

[9]  M. W. Auger,et al.  Detection of substructure with adaptive optics integral field spectroscopy of the gravitational lens B1422+231 , 2014 .

[10]  U. Oklahoma,et al.  THE OPTICAL, ULTRAVIOLET, AND X-RAY STRUCTURE OF THE QUASAR HE 0435−1223 , 2011, 1112.0027.

[11]  P. Madau,et al.  Compound Gravitational Lensing as a Probe of Dark Matter Substructure within Galaxy Halos , 2001, astro-ph/0108224.

[12]  Gregory Dobler,et al.  SELECTING QUASARS BY THEIR INTRINSIC VARIABILITY , 2010, 1002.2642.

[13]  J. Friedman Stochastic gradient boosting , 2002 .

[14]  C. Rusu,et al.  The stellar and dark matter distributions in elliptical galaxies from the ensemble of strong gravitational lenses , 2013, 1309.5408.

[15]  E. Ishida,et al.  Kernel PCA for Type Ia supernovae photometric classification , 2012, 1201.6676.

[16]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[17]  Martin G. Cohen,et al.  THE WIDE-FIELD INFRARED SURVEY EXPLORER (WISE): MISSION DESCRIPTION AND INITIAL ON-ORBIT PERFORMANCE , 2010, 1008.0031.

[18]  Ucsb,et al.  Gravitationally lensed quasars and supernovae in future wide-field optical imaging surveys , 2010, 1001.2037.

[19]  E. al.,et al.  The Sloan Digital Sky Survey: Technical summary , 2000, astro-ph/0006396.

[20]  C. Kochanek,et al.  A Simple Method to Find All Lensed Quasars , 2005, astro-ph/0512031.

[21]  M. G. Lattanzi,et al.  GAIA: Composition, formation and evolution of the Galaxy , 2001, astro-ph/0101235.

[22]  C. Kochanek,et al.  Inspiration , 2012 .

[23]  P. Schneider,et al.  Evidence for substructure in lens galaxies , 1997, astro-ph/9707187.

[24]  J. Frieman,et al.  THE SLOAN DIGITAL SKY SURVEY QUASAR LENS SEARCH. V. FINAL CATALOG FROM THE SEVENTH DATA RELEASE , 2012, 1203.1087.

[25]  Statistics of magnification perturbations by substructure in the cold dark matter cosmological model , 2005, astro-ph/0506573.

[26]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[27]  J. Brinkmann,et al.  Determining the Lensing Fraction of SDSS Quasars: Methods and Results from the Early Data Release , 2003, astro-ph/0301464.

[28]  F. Courbin,et al.  Gravitational lensing : an astrophysical tool , 2002 .

[29]  A. Szalay,et al.  The Sloan Digital Sky Survey Quasar Survey: Quasar Luminosity Function from Data Release 3 , 2006, astro-ph/0601434.

[30]  Edwin L. Turner,et al.  The Sloan Digital Sky Survey Quasar Lens Search. I. Candidate Selection Algorithm , 2006 .

[31]  Ucsd,et al.  Direct Detection of Cold Dark Matter Substructure , 2001, astro-ph/0111456.

[32]  K. Abazajian,et al.  THE SEVENTH DATA RELEASE OF THE SLOAN DIGITAL SKY SURVEY , 2008, 0812.0649.

[33]  Donald W. Sweeney,et al.  Large Synoptic Survey Telescope: From Science Drivers to Reference Design , 2008 .

[34]  B. Pindor Discovering Gravitational Lenses through Measurements of Their Time Delays , 2005, astro-ph/0501518.

[35]  B. C. Kelly,et al.  MORPHOLOGICAL CLASSIFICATION OF GALAXIES BY SHAPELET DECOMPOSITION IN THE SLOAN DIGITAL SKY SURVEY. II. MULTIWAVELENGTH CLASSIFICATION , 2005 .

[36]  G. Meylan,et al.  Microlensing of the broad line region in 17 lensed quasars , 2012, 1206.0731.

[37]  T. Schrabback,et al.  AUTOMATED DETECTION OF GALAXY-SCALE GRAVITATIONAL LENSES IN HIGH-RESOLUTION IMAGING DATA , 2008, 0805.1469.

[38]  Canada.,et al.  Data Mining and Machine Learning in Astronomy , 2009, 0906.2173.

[39]  G. Meylan,et al.  COSMOLOGY FROM GRAVITATIONAL LENS TIME DELAYS AND PLANCK DATA , 2013, 1306.4732.

[40]  V. A. Menegatto,et al.  Eigenvalues of Integral Operators Defined by Smooth Positive Definite Kernels , 2009 .