Random forest algorithm for classification of multiwavelength data

We introduced a decision tree method called Random Forests for multi-wavelength data classification. The data were adopted from different databases, including the Sloan Digital Sky Survey (SDSS) Data Release five, USNO, FIRST and ROSAT. We then studied the discrimination of quasars from stars and the classification of quasars, stars and galaxies with the sample from optical and radio bands and with that from optical and X-ray bands. Moreover, feature selection and feature weighting based on Random Forests were investigated. The performances based on different input patterns were compared. The experimental results show that the random forest method is an effective method for astronomical object classification and can be applied to other classification problems faced in astronomy. In addition, Random Forests will show its superiorities due to its own merits, e.g. classification, feature selection, feature weighting as well as outlier detection.

[1]  O. Lahav,et al.  Morphological Classification of galaxies by Artificial Neural Networks , 1992 .

[2]  Yong-Heng Zhao,et al.  Learning Vector Quantization for Classifying Astronomical Objects , 2003 .

[3]  L. Sodré,et al.  Spectral classification of galaxies , 1994, astro-ph/9411080.

[4]  Yong-Heng Zhao,et al.  A Comparison of BBN, ADTree and MLP in separating Quasars from Large Survey Catalogues , 2007 .

[5]  Y. Zhao,et al.  Comparison of decision tree methods for finding active objects , 2007, 0708.4274.

[6]  A. Chilingarian,et al.  Implementation of the Random Forest method for the Imaging Atmospheric Cherenkov Telescope MAGIC , 2007, 0709.3719.

[7]  A. Adams,et al.  Hubble classification of galaxies using neural networks , 1994 .

[8]  E. al.,et al.  The Sloan Digital Sky Survey: Technical summary , 2000, astro-ph/0006396.

[9]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[10]  D. Schlegel,et al.  Maps of Dust Infrared Emission for Use in Estimation of Reddening and Cosmic Microwave Background Radiation Foregrounds , 1998 .

[11]  D. Schlegel,et al.  Maps of Dust IR Emission for Use in Estimation of Reddening and CMBR Foregrounds , 1997, astro-ph/9710327.

[12]  Yanxia Zhang,et al.  Automated clustering algorithms for classification of astronomical objects , 2004, astro-ph/0403431.

[13]  Gutti Jogesh Babu,et al.  Statistical Challenges of Astronomy , 2003 .

[14]  S. Bailey,et al.  How to Find More Supernovae with Less Work: Object Classification Techniques for Difference Imaging , 2006, 0705.0493.

[15]  Yong-Heng Zhao,et al.  Classification in Multidimensional Parameter Space: Methods and Examples , 2003 .

[16]  Richard L. White,et al.  A Catalog of 1.4 GHz Radio Sources from the FIRST Survey , 1997 .