A comparison between subset selection and L1 regularisation with an application in spectroscopy

article i nfo A reflectance spectrum measures the reflectance of a material at hundreds or thousands of wavelengths. It provides chemical information about the material. Many rock samples actually contain a mixture of minerals. Because of the differing chemical compositions of the component minerals, the spectra of the rock sample often enable us to "unmix" the spectra to identify their mineral components. This is done with the aid of a fairly large library of pure spectra and a relatively simple linear mixture model, but with non-negativity con- straints on some of the coefficients in the model. For many years, we have used full subset selection methods to identify the composition of millions of samples. There are several difficulties with this approach, in partic- ular: (i) identifying the composition of large numbers of samples can be relatively slow, and (ii) estimating the number of components in a mixture is not as reliable as we would like, because both the deterministic and stochastic parts of our model are only approximations to reality, and hence classical statistical methods for deciding on the order of a linear model (e.g. F tests, AIC) do not work very well. Hence, ad hoc methods have had to be developed. In the hope of overcoming these difficulties, we have investigated the use of L1 regularisation as an alternative, because it is a convex optimisation problem and therefore there are efficient methods for finding the unique optimum. Moreover, it is straightforward to carry out L1 regularisation incor- porating non-negativity constraints on some of the coefficients. Unfortunately, L1 regularisation does not work as well as full subset selection does. We briefly discuss a possible hybrid approach.

[1]  W Y Zhang,et al.  Discussion on `Sure independence screening for ultra-high dimensional feature space' by Fan, J and Lv, J. , 2008 .

[2]  Seung-Jean Kim,et al.  Hyperspectral Image Unmixing via Alternating Projected Subgradients , 2007, 2007 Conference Record of the Forty-First Asilomar Conference on Signals, Systems and Computers.

[3]  A. Atkinson Subset Selection in Regression , 1992 .

[4]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[5]  R. Barnes,et al.  Standard Normal Variate Transformation and De-Trending of Near-Infrared Diffuse Reflectance Spectra , 1989 .

[6]  R. Tibshirani,et al.  Penalized Discriminant Analysis , 1995 .

[7]  Stanley Osher,et al.  L1 unmixing and its application to hyperspectral image enhancement , 2009, Defense + Commercial Sensing.

[8]  G. Wahba Spline models for observational data , 1990 .

[9]  G. McLachlan Discriminant Analysis and Statistical Pattern Recognition , 1992 .

[10]  Antonio J. Plaza,et al.  Sparse Unmixing of Hyperspectral Data , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[11]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[12]  H. Zou The Adaptive Lasso and Its Oracle Properties , 2006 .

[13]  Mark Berman,et al.  An Unmixing Algorithm Based on a Large Library of Shortwave Infrared Spectra , 2011 .

[14]  D. Roberts,et al.  Hierarchical Multiple Endmember Spectral Mixture Analysis (MESMA) of hyperspectral imagery for urban environments , 2009 .

[15]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[16]  Stephen J. Wright,et al.  Numerical Optimization , 2018, Fundamental Statistical Inference.

[17]  Jianqing Fan,et al.  Sure independence screening for ultrahigh dimensional feature space , 2006, math/0612857.

[18]  Qian Du,et al.  Foreword to the Special Issue on Spectral Unmixing of Remotely Sensed Data , 2011 .

[19]  T. Kemper,et al.  A new tool for variable multiple endmember spectral mixture analysis (VMESMA) , 2005 .

[20]  D. Roberts,et al.  Endmember selection for multiple endmember spectral mixture analysis using endmember average RMSE , 2003 .

[21]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[22]  Margaret E. Gardner,et al.  Mapping Chaparral in the Santa Monica Mountains Using Multiple Endmember Spectral Mixture Models , 1998 .

[23]  R. J. Barnes,et al.  Correction to the Description of Standard Normal Variate (SNV) and De-Trend (DT) Transformations in Practical Spectroscopy with Applications in Food and Beverage Analysis—2nd Edition , 1993 .

[24]  Yichao Wu,et al.  Ultrahigh Dimensional Feature Selection: Beyond The Linear Model , 2009, J. Mach. Learn. Res..

[25]  Nirmal Keshava,et al.  A Survey of Spectral Unmixing Algorithms , 2003 .

[26]  David R. Thompson,et al.  Sparse superpixel unmixing for exploratory analysis of CRISM hyperspectral images , 2009, 2009 First Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing.

[27]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.