Extracting features by interpolating and down-sampling for Galaxy and QSO spectrum classification

For celestial spectra are vectors in a several-thousand-dimensional space with a mass of redundancy and usually contaminated with various noises, feature extraction is an essential procedure in automatic spectra processing. We investigated the feature extraction problem for Quasar and Galaxy spectra classification. The available methods in literature can be loosely classified into the following types: principal component analysis (PCA), wavelet transform, artificial neural networks (ANN), Rough Set and Bayesian mixed model. In this work, by analyzing the traditional feature extraction methods, we introduced a novel feature analysis framework STP (Space Transformation and Partition) and proposed a novel feature extraction method EFCD (Extracting features by curve-fitting and down-sampling). Researches also show that it is sufficient to extract features in some cases, not necessary to use the sophisticated methods, which is usually more complex in computation. The proposed EFCD method is evaluated by recognizing Galaxy and QSO spectra, which is disturbed by red shift and representative in automatic spectra classification research. The results of this work are helpful to gain novel insight into the traditional feature extraction methods and design more efficient spectrum classifying schemes.

[1]  Stephen Kent Sloan Digital Sky Survey , 1994 .

[2]  K. Abazajian et al. The Third Data Release of the Sloan Digital Sky Survey , 2004 .

[3]  Zhanyi Hu,et al.  A New Automated Classification Technique of Galaxy Spectra with Z<1.2 Based on PCA-ODP , 2002, SPIE Astronomical Telescopes + Instrumentation.

[4]  Eli Shechtman,et al.  In defense of Nearest-Neighbor based image classification , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Xin Xu,et al.  [Studies of spectra classification based on kernel covering algorithm]. , 2007, Guang pu xue yu guang pu fen xi = Guang pu.

[6]  Chu Yao-quan Scientific projects of LAMOST , 2007 .

[7]  K. Abazajian et al. The Second Data Release of the Sloan Digital Sky Survey , 2004 .

[8]  Mao-Yong Cao,et al.  Rejecting Outliers Based on Correspondence Manifold: Rejecting Outliers Based on Correspondence Manifold , 2009 .

[9]  Xiao-Ming Li,et al.  Rejecting Outliers Based on Correspondence Manifold , 2009 .

[10]  Wei Ping Radar Transmitter Identification Based on the Nearest Distance , 2005 .

[11]  Matthew Colless,et al.  First results from the 2dF Galaxy Redshift Survey. , 1998, astro-ph/9804079.

[12]  Yong-Heng Zhao,et al.  Automated Spectral Classification of Broad-line and Narrow-line Active Galactic Nuclei Based on the K-nearest Neighbor Method , 2007 .

[13]  Ann I. Zabludoff,et al.  Spectral classification of galaxies along the hubble sequence , 1995 .

[14]  Thomas A. Sebring,et al.  Spectroscopic survey telescope project , 1994, Astronomical Telescopes and Instrumentation.

[15]  Yong-Heng Zhao,et al.  [Celestial spectrum flux standardization for classification]. , 2007, Guang pu xue yu guang pu fen xi = Guang pu.

[16]  David G. Lowe,et al.  Shape indexing using approximate nearest-neighbour search in high-dimensional spaces , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[17]  Zhao Yong-heng,et al.  Supervised feature extraction based on FDA and galaxy spectra classification , 2007 .

[18]  Yong-Heng Zhao,et al.  [Supervised feature extraction based on FDA and galaxy spectra classification]. , 2007, Guang pu xue yu guang pu fen xi = Guang pu.

[19]  Zhanyi Hu,et al.  Rejecting Mismatches by Correspondence Function , 2010, International Journal of Computer Vision.

[20]  Zhao Yong-heng,et al.  Spectra classification based on generalized discriminant analysis , 2006 .

[21]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[22]  A. Szalay,et al.  Spectral classification of galaxies: An Orthogonal approach , 1994, astro-ph/9411044.

[23]  David Chiu,et al.  BOOK REVIEW: "PATTERN CLASSIFICATION", R. O. DUDA, P. E. HART and D. G. STORK, Second Edition , 2001 .

[24]  A-Li Luo,et al.  [Automated recognition of quasars based on adaptive radial basis function neural networks]. , 2006, Guang pu xue yu guang pu fen xi = Guang pu.

[25]  Gaspar Galaz,et al.  Spectral Classification of Galaxies with z ≲ 0.5 , 1997 .

[26]  L. Miller,et al.  The 2dF QSO Redshift Survey - 10K@2K! , 2001 .

[27]  Yongheng Zhao Observatory Control System of the LAMOST , 2000, Astronomical Telescopes and Instrumentation.

[28]  Xinpan Li,et al.  Spectral Preprocess and Its Influence on Galaxy and Quasar Classification , 2007 .

[29]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[30]  Hao Wang-shen Research on Mechanical Fault Diagnosis Technology Based on Nearest Neighbor Data Description Method , 2009 .

[31]  Zhao Yong-heng,et al.  RVM supervised feature extraction and Seyfert spectra classification , 2009 .

[32]  Ping Guo,et al.  Automated Separation of Stars and Normal Galaxies Based on Statistical Mixture Modeling with RBF Neural Networks , 2003 .