Probabilistic multi-catalogue positional cross-match

We lay the foundations of a statistical framework for multi-catalogue cross-correlation and cross-identification based on explicit, simplified catalogue models. A proper identification process should rely on both astrometric and photometric data. Under some conditions, the astrometric and photometric parts can be processed separately and merged a posteriori to provide a single global probability of identification. The present paper addresses almost exclusively the astrometric part and specifies the proper probabilities to be merged with photometric likelihoods. To select matching candidates in n catalogues, we used the chi (or, equivalently, the chi-square) test with 2(n-1) degrees of freedom. We thus call this cross-match a chi-match. In order to use Bayes' formula, we considered exhaustive sets of hypotheses based on combinatorial analysis. The volume of the chi-test acceptance domain, a 2(n-1)-dimensional ellipsoid, is used to estimate the expected numbers of spurious associations. We derived priors for those numbers using a frequentist approach relying on simple geometrical considerations. Likelihoods are based on standard Rayleigh, chi and Poisson distributions that we normalized over the chi-test acceptance domain. We validated our theoretical results by generating and cross-matching synthetic catalogues. The results do not depend on the order in which the catalogues are cross-correlated. We applied the formalism described in the present paper to build the multi-wavelength catalogues used for the science cases of the ARCHES (Astronomical Resource Cross-matching for High Energy Studies) project. Our cross-matching engine is publicly available through a multi-purpose web interface. In the longer term, we plan to integrate this tool into the CDS XMatch Service.
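The candidate-selection step can be illustrated with a minimal sketch. It is not the ARCHES engine: it assumes circular Gaussian positional errors, positions already projected onto a local tangent plane, and a hypothetical completeness threshold; the function names `chi2_sf_even_dof` and `chi_match` are invented for this illustration. Under those assumptions, the weighted scatter of n positions around their weighted mean follows a chi-square distribution with 2(n-1) degrees of freedom, and because that number is even the tail probability has a closed form.

```python
import math


def chi2_sf_even_dof(x, dof):
    """Tail probability P(X > x) of a chi-square variable with an even dof.

    For dof = 2k the closed form is exp(-x/2) * sum_{j<k} (x/2)^j / j!.
    """
    if dof <= 0 or dof % 2:
        raise ValueError("dof must be a positive even integer")
    half = x / 2.0
    term, total = 1.0, 0.0
    for j in range(dof // 2):
        total += term
        term *= half / (j + 1)
    return math.exp(-half) * total


def chi_match(positions, sigmas, completeness=0.9973):
    """Test whether n positions are compatible with a single source.

    positions    -- list of (x, y) tangent-plane coordinates, one per catalogue
    sigmas       -- circular 1-sigma positional errors, same units (assumption)
    completeness -- fraction of true associations to retain (assumed value)
    Returns (chi2, dof, accepted).
    """
    n = len(positions)
    if n < 2 or len(sigmas) != n:
        raise ValueError("need at least two positions and one sigma per position")
    weights = [1.0 / s ** 2 for s in sigmas]
    wsum = sum(weights)
    # Inverse-variance weighted mean position.
    mx = sum(w * x for w, (x, _) in zip(weights, positions)) / wsum
    my = sum(w * y for w, (_, y) in zip(weights, positions)) / wsum
    # Weighted squared distances to the mean: chi-square with 2(n-1) dof.
    chi2 = sum(w * ((x - mx) ** 2 + (y - my) ** 2)
               for w, (x, y) in zip(weights, positions))
    dof = 2 * (n - 1)
    accepted = chi2_sf_even_dof(chi2, dof) >= 1.0 - completeness
    return chi2, dof, accepted
```

For two catalogues the statistic reduces to the squared separation divided by the sum of the two variances, so `chi_match([(0.0, 0.0), (1.0, 0.0)], [1.0, 1.0])` yields a chi-square of 0.5 with 2 degrees of freedom. Because the statistic is a symmetric sum over the catalogues, its value does not depend on the order in which the positions are supplied.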
