A new method for unveiling open clusters in Gaia

Context. The publication of the Gaia Data Release 2 (Gaia DR2) opens a new era in astronomy. It includes precise astrometric data (positions, proper motions, and parallaxes) for more than 1.3 billion sources, mostly stars. Analysing such a vast amount of new data requires data-mining techniques and machine-learning algorithms.

Aims. A prime application of such techniques is the search for open clusters (OCs): groups of stars that were born together and move together, located in the Galactic disc. Our aim is to develop a method that explores the data space automatically, with minimal manual intervention.

Methods. We explore the performance of a density-based clustering algorithm, DBSCAN, to find clusters in the data, combined with a supervised learning method, an artificial neural network (ANN), to automatically distinguish between real OCs and statistical clusters.

Results. The development and implementation of this method in a five-dimensional space (l, b, ϖ, μα*, μδ) with the Tycho-Gaia Astrometric Solution (TGAS) data, followed by validation using Gaia DR2 data, lead to the proposal of a set of new nearby OCs.

Conclusions. We have developed a method to find OCs in astrometric data, designed to be applied to the full Gaia DR2 archive.
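The density-based clustering step described under Methods can be illustrated with a minimal sketch. It is not the authors' pipeline: it uses scikit-learn's DBSCAN on synthetic data, and the sample sizes, the injected cluster, the standardisation step, and the values of `eps` and `min_samples` are all assumptions chosen for illustration. The ANN classification step is not shown.

```python
import numpy as np
from sklearn.cluster import DBSCAN
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Hypothetical field stars spread over (l, b, parallax, pmra*, pmdec).
n_field = 2000
field = np.column_stack([
    rng.uniform(0.0, 10.0, n_field),   # l [deg]
    rng.uniform(-5.0, 5.0, n_field),   # b [deg]
    rng.uniform(0.5, 5.0, n_field),    # parallax [mas]
    rng.normal(0.0, 10.0, n_field),    # pmra* [mas/yr]
    rng.normal(0.0, 10.0, n_field),    # pmdec [mas/yr]
])

# Hypothetical compact group injected into the field (illustrative values only).
n_members = 60
cluster = rng.normal(
    loc=[5.0, 0.0, 3.0, 4.0, -6.0],
    scale=[0.15, 0.15, 0.10, 0.30, 0.30],
    size=(n_members, 5),
)

# Standardise each dimension so one distance threshold applies to all five
# (an assumption of this sketch, needed because the units differ).
X = StandardScaler().fit_transform(np.vstack([field, cluster]))

# eps and min_samples are placeholder values, not tuned ones.
labels = DBSCAN(eps=0.3, min_samples=8).fit_predict(X)

# DBSCAN marks noise with the label -1; every other label is a candidate group.
n_clusters = len(set(labels) - {-1})
member_labels = labels[n_field:]
recovered = member_labels[member_labels != -1]
print(f"candidate groups: {n_clusters}")
print(f"injected members assigned to a group: {recovered.size} of {n_members}")
```

In a real application each candidate group would then be passed to a classifier that separates genuine OCs from statistical clusters, which is the role the abstract assigns to the ANN.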
