EndNet: Sparse AutoEncoder Network for Endmember Extraction and Hyperspectral Unmixing

Data acquired from multichannel sensors are a highly valuable asset to interpret the environment for a variety of remote sensing applications. However, low spatial resolution is a critical limitation for previous sensors, and the constituent materials of a scene can be mixed in different fractions due to their spatial interactions. Spectral unmixing is a technique that allows us to obtain the material spectral signatures and their fractions from hyperspectral data. In this paper, we propose a novel endmember extraction and hyperspectral unmixing scheme, so-called EndNet, that is based on a two-staged autoencoder network. This well-known structure is completely enhanced and restructured by introducing additional layers and a projection metric [i.e., spectral angle distance (SAD) instead of inner product] to achieve an optimum solution. Moreover, we present a novel loss function that is composed of a Kullback–Leibler divergence term with SAD similarity and additional penalty terms to improve the sparsity of the estimates. These modifications enable us to set the common properties of endmembers, such as nonlinearity and sparsity for autoencoder networks. Finally, due to the stochastic-gradient-based approach, the method is scalable for large-scale data and it can be accelerated on graphical processing units. To demonstrate the superiority of our proposed method, we conduct extensive experiments on several well-known data sets. The results confirm that the proposed method considerably improves the performance compared to the state-of-the-art techniques in the literature.

[1]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[2]  Fuan Tsai,et al.  Derivative analysis of hyperspectral data , 1996, Remote Sensing.

[3]  Antonio J. Plaza,et al.  On the use of small training sets for neural network-based characterization of mixed pixels in remotely sensed hyperspectral images , 2009, Pattern Recognit..

[4]  Alfred O. Hero,et al.  Joint Bayesian Endmember Extraction and Linear Unmixing for Hyperspectral Imagery , 2009, IEEE Transactions on Signal Processing.

[5]  Sida I. Wang,et al.  Dropout Training as Adaptive Regularization , 2013, NIPS.

[6]  J. Boardman,et al.  Geometric mixture analysis of imaging spectrometry data , 1994, Proceedings of IGARSS '94 - 1994 IEEE International Geoscience and Remote Sensing Symposium.

[7]  Nicolas Dobigeon,et al.  Nonlinear Hyperspectral Unmixing With Robust Nonnegative Matrix Factorization , 2014, IEEE Transactions on Image Processing.

[8]  D. Roberts,et al.  A comparison of error metrics and constraints for multiple endmember spectral mixture analysis and spectral angle mapper , 2004 .

[9]  Chong-Yung Chi,et al.  Nonnegative Least-Correlated Component Analysis for Separation of Dependent Sources by Volume Maximization , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Simon J. Hook,et al.  HYDROTHERMAL FORMATION OF CLAY-CARBONATE ALTERATION ASSEMBLAGES IN THE , 2010, 1402.1150.

[11]  José M. Bioucas-Dias,et al.  Vertex component analysis: a fast algorithm to unmix hyperspectral data , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[12]  Jie Chen,et al.  Nonlinear Unmixing of Hyperspectral Data Based on a Linear-Mixture/Nonlinear-Fluctuation Model , 2013, IEEE Transactions on Signal Processing.

[13]  José M. Bioucas-Dias,et al.  Minimum Volume Simplex Analysis: A Fast Algorithm to Unmix Hyperspectral Data , 2008, IGARSS 2008 - 2008 IEEE International Geoscience and Remote Sensing Symposium.

[14]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[15]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[16]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[17]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[18]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[19]  Jon Atli Benediktsson,et al.  Evaluation of Kernels for Multiclass Classification of Hyperspectral Remote Sensing Data , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[20]  Antonio J. Plaza,et al.  Joint linear/nonlinear spectral unmixing of hyperspectral image data , 2007, 2007 IEEE International Geoscience and Remote Sensing Symposium.

[21]  Jean-Yves Tourneret,et al.  Bilinear models for nonlinear unmixing of hyperspectral images , 2011, 2011 3rd Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing (WHISPERS).

[22]  Chein-I Chang,et al.  Hyperspectral image classification and dimensionality reduction: an orthogonal subspace projection approach , 1994, IEEE Trans. Geosci. Remote. Sens..

[23]  B. Hapke Bidirectional reflectance spectroscopy: 1. Theory , 1981 .

[24]  Chein-I Chang,et al.  Fully constrained least squares linear spectral mixture analysis method for material quantification in hyperspectral imagery , 2001, IEEE Trans. Geosci. Remote. Sens..

[25]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[26]  Victor Haertel,et al.  Spectral linear mixing model in low spatial resolution image data , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[27]  Pascal Vincent,et al.  Contractive Auto-Encoders: Explicit Invariance During Feature Extraction , 2011, ICML.

[28]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[29]  Rob Heylen,et al.  Non-Linear Spectral Unmixing by Geodesic Simplex Volume Maximization , 2011, IEEE Journal of Selected Topics in Signal Processing.

[30]  Rob Heylen,et al.  A Multilinear Mixing Model for Nonlinear Spectral Unmixing , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[31]  Paul D. Gader,et al.  Nonlinear Unmixing by Using Different Metrics in a Linear Unmixing Chain , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[32]  Antonio J. Plaza,et al.  Collaborative Sparse Regression for Hyperspectral Unmixing , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[33]  Gustavo Camps-Valls,et al.  Composite kernels for hyperspectral image classification , 2006, IEEE Geoscience and Remote Sensing Letters.

[34]  Jean-Yves Tourneret,et al.  Nonlinear unmixing of hyperspectral images using a generalized bilinear model , 2011, 2011 IEEE Statistical Signal Processing Workshop (SSP).

[35]  Nathan S. Netanyahu,et al.  A Stepwise Analytical Projected Gradient Descent Search for Hyperspectral Unmixing and Its Code Vectorization , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[36]  Chein-I Chang,et al.  A quantitative and comparative analysis of linear and nonlinear spectral mixture models using radial basis function neural networks , 2001, IEEE Trans. Geosci. Remote. Sens..

[37]  W. Verstraeten,et al.  Nonlinear Hyperspectral Mixture Analysis for tree cover estimates in orchards , 2009 .

[38]  Thomas L. Ainsworth,et al.  Improved Manifold Coordinate Representations of Large-Scale Hyperspectral Scenes , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[39]  Andrea Vedaldi,et al.  Instance Normalization: The Missing Ingredient for Fast Stylization , 2016, ArXiv.

[40]  Jianfeng Zhan,et al.  Cosine Normalization: Using Cosine Similarity Instead of Dot Product in Neural Networks , 2017, ICANN.

[41]  David A. Clausi,et al.  Hyperspectral Image Classification With Limited Labeled Training Samples Using Enhanced Ensemble Learning and Conditional Random Fields , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[42]  John R. Hershey,et al.  Approximating the Kullback Leibler Divergence Between Gaussian Mixture Models , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[43]  Chunhui Zhao,et al.  Multilayer Unmixing for Hyperspectral Imagery With Fast Kernel Archetypal Analysis , 2016, IEEE Geoscience and Remote Sensing Letters.

[44]  Manuel Graña,et al.  Hyperspectral image nonlinear unmixing and reconstruction by ELM regression ensemble , 2016, Neurocomputing.

[45]  Johannes R. Sveinsson,et al.  Neural network hyperspectral unmixing with spectral information divergence objective , 2017, 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[46]  Hairong Qi,et al.  Endmember Extraction From Highly Mixed Data Using Minimum Volume Constrained Nonnegative Matrix Factorization , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[47]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[48]  Amit Banerjee,et al.  A comparison of kernel functions for intimate mixture models , 2009, 2009 First Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing.

[49]  Chun Yang,et al.  A Kernel Spectral Angle Mapper algorithm for remote sensing image classification , 2013, 2013 6th International Congress on Image and Signal Processing (CISP).

[50]  Rob Heylen,et al.  Fully Constrained Least Squares Spectral Unmixing by Simplex Projection , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[51]  Imed Riadh Farah,et al.  A New Method to Change Illumination Effect Reduction Based on Spectral Angle Constraint for Hyperspectral Image Unmixing , 2011, IEEE Geoscience and Remote Sensing Letters.

[52]  Xia Xu,et al.  R-VCANet: A New Deep-Learning-Based Hyperspectral Image Classification Method , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[53]  Fred A. Kruse,et al.  Comparison of AVIRIS and Hyperion for Hyperspectral Mineral Mapping , 2002 .

[54]  Chein-I Chang,et al.  Multispectral and hyperspectral image analysis with convex cones , 1999, IEEE Trans. Geosci. Remote. Sens..

[55]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[56]  Alfonso Fernández-Manso,et al.  Spectral unmixing , 2012 .

[57]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[58]  Gaofeng Meng,et al.  Spectral Unmixing via Data-Guided Sparsity , 2014, IEEE Transactions on Image Processing.

[59]  Alfred O. Hero,et al.  Nonlinear Unmixing of Hyperspectral Images: Models and Algorithms , 2013, IEEE Signal Processing Magazine.

[60]  Joseph N. Wilson,et al.  Using physics-based macroscopic and microscopic mixture models for hyperspectral pixel unmixing , 2012, Defense + Commercial Sensing.

[61]  Adrian J. Brown Spectral curve fitting for automatic hyperspectral data analysis , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[62]  Liangpei Zhang,et al.  A Hybrid Automatic Endmember Extraction Algorithm Based on a Local Window , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[63]  Paul D. Gader,et al.  Sparsity Promoting Iterated Constrained Endmember Detection in Hyperspectral Imagery , 2007, IEEE Geoscience and Remote Sensing Letters.

[64]  Tapani Raiko,et al.  Semi-supervised Learning with Ladder Networks , 2015, NIPS.

[65]  Gustavo Camps-Valls,et al.  Kernel spectral angle mapper , 2016 .

[66]  Jun Zhou,et al.  Hyperspectral Unmixing via $L_{1/2}$ Sparsity-Constrained Nonnegative Matrix Factorization , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[67]  Rui Guo,et al.  Hyperspectral image unmixing using autoencoder cascade , 2015, 2015 7th Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing (WHISPERS).

[68]  Antonio J. Plaza,et al.  Hyperspectral Unmixing Overview: Geometrical, Statistical, and Sparse Regression-Based Approaches , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[69]  Ying Wang,et al.  Structured Sparse Method for Hyperspectral Unmixing , 2014, ArXiv.

[70]  Jean-Yves Tourneret,et al.  Supervised Nonlinear Spectral Unmixing Using a Postnonlinear Mixing Model for Hyperspectral Imagery , 2012, IEEE Transactions on Image Processing.

[71]  R. Singer,et al.  Mars - Large scale mixing of bright and dark surface materials and implications for analysis of spectral reflectance , 1979 .

[72]  Paul D. Gader,et al.  A Spatial Compositional Model for Linear Unmixing and Endmember Uncertainty Estimation , 2015, IEEE Transactions on Image Processing.

[73]  Marc'Aurelio Ranzato,et al.  Fast Inference in Sparse Coding Algorithms with Applications to Object Recognition , 2010, ArXiv.

[74]  Roland Memisevic,et al.  Zero-bias autoencoders and the benefits of co-adapting features , 2014, ICLR.

[75]  Peter Strobl,et al.  HySens-DAIS/ROSIS Imaging Spectrometers at DLR , 2002, Remote Sensing.

[76]  木股 雅章,et al.  SPIE's International Symposium on Optical Science, Engineering, and Instrumentation報告 , 1998 .

[77]  Paul D. Gader,et al.  Piecewise Convex Multiple-Model Endmember Detection and Spectral Unmixing , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[78]  Xuelong Li,et al.  Manifold Regularized Sparse NMF for Hyperspectral Unmixing , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[79]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[80]  Mario Winter,et al.  N-FINDR: an algorithm for fast autonomous spectral end-member determination in hyperspectral data , 1999, Optics & Photonics.

[81]  Emrecan Bati,et al.  Hyperspectral anomaly detection method based on auto-encoder , 2015, SPIE Remote Sensing.

[82]  Sepp Hochreiter,et al.  Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) , 2015, ICLR.

[83]  Brendan J. Frey,et al.  k-Sparse Autoencoders , 2013, ICLR.

[84]  Tianqi Chen,et al.  Empirical Evaluation of Rectified Activations in Convolutional Network , 2015, ArXiv.

[85]  Antonio J. Plaza,et al.  Sparse Unmixing of Hyperspectral Data , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[86]  Léon Bottou,et al.  Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.