Scale fusion light CNN for hyperspectral face recognition with knowledge distillation and attention mechanism

Hyperspectral imaging technology, combining traditional imaging and spectroscopy technologies to simultaneously acquire spatial and spectral information, is deemed to be an intuitive medium for robust face recognition. However, the intrinsic structure of hyperspectral images is more complicated than ordinary gray-scale or RGB images, how to fully explore the discriminant and correlation features with only a limited number of hyperspectral samples for deep learning training has not been well studied. In response to these problems, this paper proposes an end-to-end multiscale fusion lightweight convolution neural network (CNN) framework for hyperspectral face recognition, termed as the features fusion with channel attention network (FFANet). Firstly, to capture richer subtle details, we introduce Second-Order Efficient Channel Attention (SECA) as the variant of Efficient Channel Attention (ECA) into the framework. The difference from ECA is that SECA can extract the second-order information of each channel to improve the network’s feature extraction ability and is more suitable for the complexity of hyperspectral data. Secondly, we further fuse multiscale information to yield a comprehensive and discriminative representation learning. Finally, the joint of Self-Supervision and Knowledge Distillation (SSKD) is exploited to train an efficient deep model, which can learn more dark knowledge from the trained teacher network. The experimental results on three benchmark hyperspectral face databases of PolyU, CMU, and UWA show that the proposed approach has achieved competitive accuracy and efficiency on the basis of significantly reducing the storage space and computation overheads. These characteristics also show its wide applicability on edge/mobile devices.

[1]  Chen Change Loy,et al.  Knowledge Distillation Meets Self-Supervision , 2020, ECCV.

[2]  Jangho Kim,et al.  Paraphrasing Complex Network: Network Compression via Factor Transfer , 2018, NeurIPS.

[3]  Nasser M. Nasrabadi,et al.  Deep Sparse Band Selection for Hyperspectral Face Recognition , 2019, Hyperspectral Image Analysis.

[4]  Jun Zhou,et al.  3D local derivative pattern for hyperspectral face recognition , 2015, 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[5]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  M. W Gardner,et al.  Artificial neural networks (the multilayer perceptron)—a review of applications in the atmospheric sciences , 1998 .

[7]  Cheng-Zhong Xu,et al.  Pay Attention to Features, Transfer Learn Faster CNNs , 2020, ICLR.

[8]  Yang Liu,et al.  MobileFaceNets: Efficient CNNs for Accurate Real-time Face Verification on Mobile Devices , 2018, CCBR.

[9]  Mohammad H. Mahoor,et al.  Going deeper in facial expression recognition using deep neural networks , 2015, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[10]  Junmo Kim,et al.  A Gift from Knowledge Distillation: Fast Optimization, Network Minimization and Transfer Learning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Jean-Fu Kiang,et al.  Comparison of CNN Algorithms on Hyperspectral Image Classification in Agricultural Lands , 2020, Sensors.

[12]  Qilong Wang,et al.  ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Xiao-Yuan Jing,et al.  Intraspectrum Discrimination and Interspectrum Correlation Analysis Deep Network for Multispectral Face Recognition , 2020, IEEE Transactions on Cybernetics.

[14]  Hongtao Liang,et al.  A novel framework based on wavelet transform and principal component for face recognition under varying illumination , 2020 .

[15]  Thirimachos Bourlai,et al.  A spectral independent approach for physiological and geometric based face recognition in the visible, middle-wave and long-wave infrared bands , 2014, Image Vis. Comput..

[16]  Bruce J. Tromberg,et al.  Comparison of Spectral-Only and Spectral/Spatial Face Recognition for Personal Identity Verification , 2009, EURASIP J. Adv. Signal Process..

[17]  Di Huang,et al.  Led3D: A Lightweight and Efficient Deep Approach to Recognizing Low-Quality 3D Faces , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Xiangyu Zhang,et al.  ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design , 2018, ECCV.

[19]  Quan Pan,et al.  Studies on Hyperspectral Face Recognition in Visible Spectrum With Feature Band Selection , 2010, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[20]  Nassir Navab,et al.  Recalibrating Fully Convolutional Networks With Spatial and Channel “Squeeze and Excitation” Blocks , 2018, IEEE Transactions on Medical Imaging.

[21]  Tieniu Tan,et al.  A Light CNN for Deep Face Representation With Noisy Labels , 2015, IEEE Transactions on Information Forensics and Security.

[22]  Xuelong Li,et al.  Person Reidentification by Minimum Classification Error-Based KISS Metric Learning , 2015, IEEE Transactions on Cybernetics.

[23]  Bing Li,et al.  Knowledge Distillation via Instance Relationship Graph , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Arif Mahmood,et al.  Hyperspectral Face Recognition using 3D-DCT and Partial Least Squares , 2013, BMVC.

[25]  Geoffrey E. Hinton,et al.  Distilling the Knowledge in a Neural Network , 2015, ArXiv.

[26]  Hasan Demirel,et al.  3D discrete wavelet transform-based feature extraction for hyperspectral face recognition , 2018, IET Biom..

[27]  Qilong Wang,et al.  Global Second-Order Pooling Convolutional Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Lin Lei,et al.  Multimodal Bilinear Fusion Network With Second-Order Attention-Based Channel Selection for Land Cover Classification , 2020, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[29]  Shengcai Liao,et al.  Learning Face Representation from Scratch , 2014, ArXiv.

[30]  Donato Malerba,et al.  Detecting salient regions in a bi-temporal hyperspectral scene by iterating clustering and classification , 2020, Applied Intelligence.

[31]  Jie Wang,et al.  Joint spectral-spatial hyperspectral image classification based on hierarchical subspace switch ensemble learning algorithm , 2018, Applied Intelligence.

[32]  Yong Wu,et al.  An Image-Level Classification Framework for Hyperspectral Image with CNNs , 2018, 2018 14th IEEE International Conference on Signal Processing (ICSP).

[33]  Xiao Chen,et al.  Advanced Variations of Two-Dimensional Principal Component Analysis for Face Recognition , 2019, Neurocomputing.

[34]  Geoffrey E. Hinton,et al.  Learning to combine foveal glimpses with a third-order Boltzmann machine , 2010, NIPS.

[35]  Arif Mahmood,et al.  Hyperspectral Face Recognition With Spatiospectral Information Fusion and PLS Regression , 2015, IEEE Transactions on Image Processing.

[36]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[37]  Stefan A. Robila,et al.  Toward hyperspectral face recognition , 2008, Electronic Imaging.

[38]  In-So Kweon,et al.  CBAM: Convolutional Block Attention Module , 2018, ECCV.

[39]  François Chollet,et al.  Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Ling Shi,et al.  Hyperspectral Face Recognition Using Block based Convolution Neural Network and AdaBoost Band Selection , 2019, 2019 6th International Conference on Systems and Informatics (ICSAI).

[41]  Zhihua Xie,et al.  Hyperspectral face recognition based on SLRC for single sample problem , 2020, International Symposium on Multispectral Image Processing and Pattern Recognition.

[42]  Gang Sun,et al.  Gather-Excite: Exploiting Feature Context in Convolutional Neural Networks , 2018, NeurIPS.

[43]  LinLin Shen,et al.  Hyperspectral face recognition using 3D Gabor wavelets , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[44]  Nikos Komodakis,et al.  Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer , 2016, ICLR.

[45]  Qiang Chen,et al.  Network In Network , 2013, ICLR.