Pretraining for Hyperspectral Convolutional Neural Network Classification

Convolutional neural networks (CNNs) have been shown to be a powerful tool for image classification. Recently, they have been adopted into the remote sensing community with applications in material classification from hyperspectral images. However, CNNs are time-consuming to train and often require large amounts of labeled training data. The widespread use of CNNs in the image processing and computer vision communities has been facilitated by the networks that have already been trained on large amounts of data. These pretrained networks can be used to initialize networks for new tasks. This transfer of knowledge makes it far less time-consuming to train a new classifier and reduces the need for a large labeled data set. This concept of transfer learning has not yet been fully explored by those using CNNs to train material classifiers from hyperspectral data. This paper provides an insight into training hyperspectral CNN classifiers by transferring knowledge from well labeled data sets to data sets that are less well labeled. It is shown that these CNNs can transfer between completely different domains and sensing platforms, and still improve classification performance. The application of this work is in the training of material classifiers of data acquired from field-based platforms, by transferring knowledge from publicly accessible airborne data sets. Factors, such as training set size, CNN architectures, and the impact of filter width and wavelength interval, are studied.

[1]  D. Paulus,et al.  Hyperspectral Imaging or Victim Detection with Rescue Robots , 2008, 2008 IEEE International Workshop on Safety, Security and Rescue Robotics.

[2]  Ronald Kemker,et al.  Self-Taught Feature Learning for Hyperspectral Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[3]  Xuelong Li,et al.  Locality Adaptive Discriminant Analysis for Spectral–Spatial Classification of Hyperspectral Images , 2017, IEEE Geoscience and Remote Sensing Letters.

[4]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[5]  Rajat Raina,et al.  Self-taught learning: transfer learning from unlabeled data , 2007, ICML '07.

[6]  Freek D. van der Meer,et al.  Mapping of heavy metal pollution in stream sediments using combined geochemistry, field spectroscopy, and hyperspectral remote sensing: A case study of the Rodalquilar mining area, SE Spain , 2008 .

[7]  Richard J. Murphy,et al.  Automated Multi-class Classification of Remotely Sensed Hyperspectral Imagery Via Gaussian Processes with a Non-stationary Covariance Function , 2016, Mathematical Geosciences.

[8]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Richard J. Murphy,et al.  A geological perception system for autonomous mining , 2012, 2012 IEEE International Conference on Robotics and Automation.

[10]  Sildomar T. Monteiro,et al.  Mapping Layers of Clay in a Vertical Geological Surface Using Hyperspectral Imagery: Variability in Parameters of SWIR Absorption Features under Different Conditions of Illumination , 2014, Remote. Sens..

[11]  Shutao Li,et al.  From Subpixel to Superpixel: A Novel Fusion Framework for Hyperspectral Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[12]  Dong Yu,et al.  Exploring convolutional neural network structures and optimization techniques for speech recognition , 2013, INTERSPEECH.

[13]  Yoshua Bengio,et al.  Convolutional networks for images, speech, and time series , 1998 .

[14]  Sen Jia,et al.  Convolutional neural networks for hyperspectral image classification , 2017, Neurocomputing.

[15]  Yoshua Bengio,et al.  Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.

[16]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[17]  Jonathan Cheung-Wai Chan,et al.  Hyperspectral image classification using two-channel deep convolutional neural network , 2016, 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[18]  Uwe Stilla,et al.  Deep Learning Earth Observation Classification Using ImageNet Pretrained Networks , 2016, IEEE Geoscience and Remote Sensing Letters.

[19]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[20]  Bin Wang,et al.  Deep Convolutional networks with superpixel segmentation for hyperspectral image classification , 2016, 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[21]  Yoshua Bengio,et al.  Domain Adaptation for Large-Scale Sentiment Classification: A Deep Learning Approach , 2011, ICML.

[22]  Sildomar T. Monteiro,et al.  Evaluating Classification Techniques for Mapping Vertical Geology Using Field-Based Hyperspectral Sensors , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[23]  Jonathan Cheung-Wai Chan,et al.  Learning and Transferring Deep Joint Spectral–Spatial Features for Hyperspectral Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[24]  Richard J. Murphy,et al.  Unsupervised feature learning for illumination robustness , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[25]  Shihong Du,et al.  Spectral–Spatial Feature Extraction for Hyperspectral Image Classification: A Dimension Reduction and Deep Learning Approach , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[26]  Stan Matwin,et al.  Addressing the Curse of Imbalanced Training Sets: One-Sided Selection , 1997, ICML.

[27]  Yoshua Bengio,et al.  Deep Learning of Representations for Unsupervised and Transfer Learning , 2011, ICML Unsupervised and Transfer Learning.

[28]  Xiao Xiang Zhu,et al.  A Self-Improving Convolution Neural Network for the Classification of Hyperspectral Data , 2016, IEEE Geoscience and Remote Sensing Letters.

[29]  Lorenzo Bruzzone,et al.  Classification of hyperspectral remote sensing images with support vector machines , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[30]  Qi Wang,et al.  Salient Band Selection for Hyperspectral Image Classification via Manifold Ranking , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[31]  Pao-Ta Yu,et al.  A Nonparametric Feature Extraction and Its Application to Nearest Neighbor Classification for Hyperspectral Image Data , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[32]  Richard J. Murphy,et al.  Hyperspectral CNN Classification with Limited Training Samples , 2016, BMVC.

[33]  Kuntal Kumar Pal,et al.  Preprocessing for image classification by convolutional neural networks , 2016, 2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT).

[34]  Jun Zhou,et al.  CRF learning with CNN features for hyperspectral image segmentation , 2016, 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[35]  Yi Shen,et al.  Convolutional neural network based classification for hyperspectral data , 2016, 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[36]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Michael I. Jordan,et al.  Learning Transferable Features with Deep Adaptation Networks , 2015, ICML.

[38]  Qian Du,et al.  Hyperspectral Image Classification Using Deep Pixel-Pair Features , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[39]  Rishi Ramakrishnan,et al.  Shadow compensation for outdoor perception , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[40]  S. Ustin,et al.  Building spectral libraries for wetlands land cover classification and hyperspectral remote sensing. , 2009, Journal of Environmental Management.

[41]  Ivan Laptev,et al.  Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[42]  Carlo Gatta,et al.  Unsupervised Deep Feature Extraction for Remote Sensing Image Classification , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[43]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[44]  Fred A. Kruse,et al.  Comparison of airborne hyperspectral data and EO-1 Hyperion for mineral mapping , 2003, IEEE Trans. Geosci. Remote. Sens..

[45]  Houbing Song,et al.  Feature selection and multiple kernel boosting framework based on PSO with mutation mechanism for hyperspectral classification , 2017, Neurocomputing.

[46]  David Bergström,et al.  Hyperspectral image analysis using deep learning — A review , 2016, 2016 Sixth International Conference on Image Processing Theory, Tools and Applications (IPTA).

[47]  Qi Wang,et al.  Hyperspectral Image Classification via Multitask Joint Sparse Representation and Stepwise MRF Optimization , 2016, IEEE Transactions on Cybernetics.

[48]  Alexander Wendel,et al.  Self-supervised weed detection in vegetable crops using ground based hyperspectral imaging , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[49]  Lianlei Lin,et al.  Using CNN to Classify Hyperspectral Data Based on Spatial-spectral Information , 2017 .

[50]  Stefan B. Williams,et al.  Convolutional neural networks for passive monitoring of a shallow water environment using a single sensor , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[51]  Wei Li,et al.  Transferred Deep Learning for Anomaly Detection in Hyperspectral Imagery , 2017, IEEE Geoscience and Remote Sensing Letters.

[52]  Richard J. Murphy,et al.  A Physics-Based Deep Learning Approach to Shadow Invariant Representations of Hyperspectral Images , 2018, IEEE Transactions on Image Processing.

[53]  Sildomar T. Monteiro,et al.  Mapping the distribution of ferric iron minerals on a vertical mine face using derivative analysis of hyperspectral imagery (430-970 nm) , 2013 .

[54]  Yoshua Bengio,et al.  Exploring Strategies for Training Deep Neural Networks , 2009, J. Mach. Learn. Res..

[55]  Qian Du,et al.  Integrating spectral and spatial information into deep convolutional Neural Networks for hyperspectral classification , 2016, 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[56]  Alexei A. Efros,et al.  What makes ImageNet good for transfer learning? , 2016, ArXiv.

[57]  Bo Du,et al.  A Novel Semisupervised Active-Learning Algorithm for Hyperspectral Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[58]  A F Goetz,et al.  Imaging Spectrometry for Earth Remote Sensing , 1985, Science.

[59]  Xiuping Jia,et al.  Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[60]  Yoshua Bengio,et al.  How transferable are features in deep neural networks? , 2014, NIPS.

[61]  Fan Zhang,et al.  Deep Convolutional Neural Networks for Hyperspectral Image Classification , 2015, J. Sensors.