FusioNet: A two-stream convolutional neural network for urban scene classification using PolSAR and hyperspectral data

Urban Scene classification using single source data is massively studied in remote sensing field. However, single source only provides one certain perspective of the complicated urban scene while the fusion of multimodal dataset can provide complementary knowledge. We aim at fusing the spectrum information of the hyperspectral image and the scattering mechanisms of PolSAR data for urban scene classification. For the joint usage of the two data sets, a simple concatenation would lead to extraction of insufficient information and weakens the influence of the lower dimensional data. In this work, the end-to-end convolutional neural network is utilized to automatically learn how to effectively extract features and to fuse the hyperspectral image and the PolSAR data. More specifically, we propose a novel two-stream convolutional network architecture. It creates identical but separated convolutional stream for each data. Subsequently, the two streams are merged with comparable numbers of dimensionality within the fusion layer. This architecture ensures the effectively extraction of informative features from both data for the classification purpose and the fusion of the two data in a balanced manner. Experimental results suggest significantly superior performance of the proposed framework, while comparing to other existing fusion methods. To our knowledge, it is the first time that deep convolutional neural network accomplishes the fusion of hyperspectral image and SAR data.

[1]  Xiao Xiang Zhu,et al.  Object based fusion of polarimetric SAR and hyperspectral imaging for land use classification , 2016, 2016 8th Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing (WHISPERS).

[2]  Lichao Mou,et al.  Learning a Transferable Change Rule from a Recurrent Neural Network for Land Cover Change Detection , 2016, Remote. Sens..

[3]  Cuiping Shi,et al.  Classification-oriented hyperspectral and PolSAR images synergic processing , 2013, 2013 IEEE International Geoscience and Remote Sensing Symposium - IGARSS.

[4]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[5]  Xiaoqiang Lu,et al.  Scene Recognition by Manifold Regularized Deep Learning Architecture , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[6]  Alexandre Jouan,et al.  Land use mapping with evidential fusion of features extracted from polarimetric synthetic aperture radar and hyperspectral imagery , 2004, Inf. Fusion.

[7]  Nicolas H. Younan,et al.  Fusion of synthetic aperture radar and hyperspectral imagery to detect impacts of oil spill in Gulf of Mexico , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[8]  D. Roberts,et al.  Urban tree species mapping using hyperspectral and lidar data fusion , 2014 .

[9]  Gabriele Moser,et al.  Multimodal Classification of Remote Sensing Images: A Review and Future Directions , 2015, Proceedings of the IEEE.