Locally linear embedding: dimension reduction of massive protostellar spectra

We present the results of applying locally linear embedding (LLE) to reduce the dimensionality of dereddened, continuum-subtracted near-infrared spectra, using a combination of models and real spectra of massive protostars selected from the Red MSX Source survey database. A brief comparison is also made, using the same set of spectra, with two other dimension reduction techniques, principal component analysis (PCA) and Isomap, as well as with a more advanced form of LLE, Hessian locally linear embedding. We find that, whilst LLE certainly has its limitations, it significantly outperforms both PCA and Isomap in classifying spectra based on the presence or absence of emission lines, and that it provides a valuable tool for the classification and analysis of large spectral data sets.
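
As a rough illustration of the kind of comparison described above, the following minimal sketch runs PCA, Isomap, standard LLE, and Hessian LLE on the same set of spectra. It is not the authors' pipeline: it assumes scikit-learn as the implementation, and the spectra are synthetic stand-ins (noise plus an optional Gaussian emission line near Br-gamma at 2.166 microns). The grid size, noise level, line width, neighbour count, and all variable names are hypothetical choices made for the example only.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.manifold import Isomap, LocallyLinearEmbedding

rng = np.random.default_rng(0)

# Hypothetical stand-in data: 120 continuum-subtracted spectra on a
# 200-pixel K-band wavelength grid (values chosen for illustration only).
n_spectra, n_pixels = 120, 200
wavelength = np.linspace(2.0, 2.4, n_pixels)  # microns

# Gaussian emission-line profile centred on Br-gamma (2.166 microns).
line_profile = np.exp(-0.5 * ((wavelength - 2.166) / 0.005) ** 2)

# Roughly half of the spectra carry the line, with varying strength;
# the rest are pure noise around a zero (subtracted) continuum.
has_line = rng.random(n_spectra) < 0.5
amplitude = np.where(has_line, rng.uniform(0.2, 1.0, n_spectra), 0.0)
noise = rng.normal(0.0, 0.05, (n_spectra, n_pixels))
spectra = amplitude[:, None] * line_profile + noise

# Hessian LLE requires n_neighbors > n_components * (n_components + 3) / 2,
# i.e. more than 5 neighbours for a 2-D embedding.
n_neighbors, n_components = 12, 2

reducers = {
    "PCA": PCA(n_components=n_components),
    "Isomap": Isomap(n_neighbors=n_neighbors, n_components=n_components),
    "LLE": LocallyLinearEmbedding(
        n_neighbors=n_neighbors, n_components=n_components,
        method="standard", eigen_solver="dense",
    ),
    "Hessian LLE": LocallyLinearEmbedding(
        n_neighbors=n_neighbors, n_components=n_components,
        method="hessian", eigen_solver="dense",
    ),
}

# Project the same spectra with each technique.
embeddings = {name: r.fit_transform(spectra) for name, r in reducers.items()}

for name, emb in embeddings.items():
    print(name, emb.shape)
```

Plotting each two-dimensional embedding coloured by `has_line` is one way to inspect how well a given technique separates spectra with and without the emission line. The outcome on synthetic data of this kind says nothing about the results reported for the real spectra.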
