Principal Component Analysis Neural Network Hybrid Classification Approach for Galaxies Images

This article presents an automatic hybrid approach for galaxies images classification based on principal component analysis (PCA) neural network and moment-based features extraction algorithms. The proposed approach is consisted of four phases; namely image denoising, feature extraction, reduct generation, and classification phases. For the denoising phase, noise pixels are removed from input images, then input galaxy image is normalized to a uniform scale and Hu seven invariant moment algorithm is applied to reduce the dimensionality of the feature space during the feature extraction phase. Subsequently, for reduct generation phase, attributes in the information system table that is more important to the knowledge is generated as a subset of attributes. Rough set is used as feature reduction approach. The subset of attributed, which is called a reduct, is fully characterizing the knowledge in the database. Finally, during the classification phase, principal component analysis neural network algorithm is utilized for classifying the input galaxies images into one of four obtained source catalogue types. Experimental results showed that combining PCA and rough set as feature reduction techniques along with invariant moments for feature extraction provided better classification results than having no rough set feature reduction technique applied. It is also concluded that a small set of features is sufficient to classify galaxy images and provide a fast classification.

[1]  O. Fuentes,et al.  Machine learning and image analysis for morphological galaxy classification , 2004 .

[2]  Lakhmi C. Jain,et al.  Knowledge-Based Intelligent Information and Engineering Systems , 2004, Lecture Notes in Computer Science.

[3]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[4]  Z. Pawlak Rough Sets: Theoretical Aspects of Reasoning about Data , 1991 .

[5]  L. Polkowski Rough Sets: Mathematical Foundations , 2013 .

[6]  C. Lintott,et al.  Galaxy Zoo: reproducing galaxy morphologies via machine learning★ , 2009, 0908.2033.

[7]  William W. Hsieh,et al.  Nonlinear principal component analysis by neural networks , 2001 .

[8]  Ofer Lahav Artificial Neural Networks as a Tool for Galaxy Classification , 1996 .

[9]  European Southern Observatory,et al.  CASSOWARY 20: a wide separation Einstein Cross identified with the X-shooter spectrograph , 2009, Monthly Notices of the Royal Astronomical Society.

[10]  Olac Fuentes,et al.  Automated Classification of Galaxy Images , 2004, KES.

[11]  C Donlan,et al.  Nonword repetition as a behavioural marker for inherited language impairment: evidence from a twin study. , 1996, Journal of child psychology and psychiatry, and allied disciplines.

[12]  Mohamed A. Mohamed,et al.  Automated Classification of Galaxies Using Transformed Domain Features , 2010 .

[13]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[14]  Hossein Karimi,et al.  A new Approach for Detecting Intrusions Based on the PCA Neural Networks , 2012 .

[15]  David G. Stork,et al.  Pattern Classification , 1973 .