GCN: GPU-Based Cube CNN Framework for Hyperspectral Image Classification

Hyperspectral image classification has been proved significant in remote sensing field. Traditional classification methods have meet bottlenecks due to the lack of remote sensing background knowledge or high dimensionality. Deep learning based methods, such as deep convolutional neural network (CNN), can effectively extract high level features from raw data. But the training of deep CNN is rather time-consuming. The general purpose graphic processing units (GPUs) have been considered as one of the most common co-processors that can help accelerate deep learning applications. In this paper we propose a GPU-based Cube CNN (GCN) framework for hyperspectral image classification. First, a Parallel Neighbor Pixels Extraction (PNPE) algorithm is designed to enable the framework directly loading raw hyperspectral image into GPU's global memory, and extracting samples into data cube. Then, based on the peculiarity of hyperspectral image and cube convolution, we propose a novel Cube CNN-to-GPU mapping mechanism that transfers the training of Cube CNN to GPU effectively. Finally, the mini-batch gradient descent(MBGD) algorithm is improved with Computing United Device Architecture(CUDA) multi-streaming technique, which further speeds up network training in GCN framework. Experiments on KSC dataset, PU dataset and SA dataset show that, compared with state-of-art framework Caffe, we achieve up to 83% and 67% reduction in network training time without losing accuracy, when using SGD (Stochastic Gradient Descent) and MBGD algorithm respectively. Experiments across different GPUs show the same performance trend, which demonstrates the good extendibility of GCN framework.

[1]  Tianshi Chen,et al.  ShiDianNao: Shifting vision processing closer to the sensor , 2015, 2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA).

[2]  Haokui Zhang,et al.  Spectral-spatial classification of hyperspectral imagery using a dual-channel convolutional neural network , 2017 .

[3]  Jon Atli Benediktsson,et al.  Multiple Feature Learning for Hyperspectral Image Classification , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[4]  Ute Beyer,et al.  Remote Sensing And Image Interpretation , 2016 .

[5]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[6]  Jon Atli Benediktsson,et al.  Advances in Hyperspectral Image Classification: Earth Monitoring with Statistical Learning Methods , 2013, IEEE Signal Processing Magazine.

[7]  Mariana Belgiu,et al.  Random forest in remote sensing: A review of applications and future directions , 2016 .

[8]  Qi Li,et al.  Hyperspectral Imagery Classification Using Sparse Representations of Convolutional Neural Network Features , 2016, Remote. Sens..

[9]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[10]  Xuelong Li,et al.  Spectral-Spatial Constraint Hyperspectral Image Classification , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[11]  Chalavadi Krishna Mohan,et al.  Human action recognition using genetic algorithms and convolutional neural networks , 2016, Pattern Recognit..

[12]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[13]  Erich Elsen,et al.  Deep Speech: Scaling up end-to-end speech recognition , 2014, ArXiv.

[14]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[15]  Jun Li,et al.  Improved hyperspectral image classification by active learning using pre-designed mixed pixels , 2016, Pattern Recognit..

[16]  Razvan Pascanu,et al.  Theano: A CPU and GPU Math Compiler in Python , 2010, SciPy.

[17]  Shanjun Mao,et al.  Spectral–spatial classification of hyperspectral images using deep convolutional neural networks , 2015 .

[18]  John Tran,et al.  cuDNN: Efficient Primitives for Deep Learning , 2014, ArXiv.

[19]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[20]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[21]  Matthew D. Zeiler ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[22]  Lorenzo Bruzzone,et al.  Kernel-based methods for hyperspectral image classification , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[23]  Tao Wang,et al.  Deep learning with COTS HPC systems , 2013, ICML.

[24]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[25]  Tara N. Sainath,et al.  Deep Convolutional Neural Networks for Large-scale Speech Tasks , 2015, Neural Networks.

[26]  Huchuan Lu,et al.  Hyperspectral Image Classification via JCR and SVM Models With Decision Fusion , 2016, IEEE Geoscience and Remote Sensing Letters.

[27]  Maoguo Gong,et al.  Unsupervised Band Selection Based on Evolutionary Multiobjective Optimization for Hyperspectral Images , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[28]  Tao Li,et al.  PE-TLD: Parallel Extended Tracking-Learning-Detection for Multi-target Tracking , 2015, ICA3PP.

[29]  Jia Wang,et al.  DaDianNao: A Machine-Learning Supercomputer , 2014, 2014 47th Annual IEEE/ACM International Symposium on Microarchitecture.

[30]  J. Chanussot,et al.  Hyperspectral Remote Sensing Data Analysis and Future Challenges , 2013, IEEE Geoscience and Remote Sensing Magazine.

[31]  Tao Li,et al.  Cube-CNN-SVM: A Novel Hyperspectral Image Classification Method , 2016, 2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI).

[32]  Sandra Lowe,et al.  Classification Methods For Remotely Sensed Data , 2016 .

[33]  Yi Shen,et al.  Low-rank group inspired dictionary learning for hyperspectral image classification , 2016, Signal Processing.