Deep Hybrid 2-D–3-D CNN Based on Dual Second-Order Attention With Camera Spectral Sensitivity Prior for Spectral Super-Resolution

A largely ignored fact in spectral super-resolution (SSR) is that the subsistent mapping methods neglect the auxiliary prior of camera spectral sensitivity (CSS) and only pay attention to wider or deeper network framework design while ignoring to excavate the spatial and spectral dependencies among intermediate layers, hence constraining representational capability of convolutional neural networks (CNNs). To conquer these drawbacks, we propose a novel deep hybrid 2-D–3-D CNN based on dual second-order attention with CSS prior (HSACS), which can excavate sufficient spatial–spectral context information. Specifically, dual second-order attention embedded in the residual block for more powerful spatial–spectral feature representation and relation learning is composed of a brand new trainable 2-D second-order channel attention (SCA) or 3-D second-order band attention (SBA) and a structure tensor attention (STA). Concretely, the band and channel attention modules are developed to adaptively recalibrate the band-wise and interchannel features via employing second-order band or channel feature statistics for more discriminative representations. Besides, the STA is promoted to rebuild the significant high-frequency spatial details for enough spatial feature extraction. Moreover, the CSS is first employed as a superior prior to avoid its effect of SSR quality, on the strength of which the resolved RGB can be calculated naturally through the super-reconstructed hyperspectral image (HSI); then, the final loss consists of the discrepancies of RGB and the HSI as a finer constraint. Experimental results demonstrate the superiority and progressiveness of the presented approach in terms of quantitative metrics and visual effect over SOTA SSR methods.

[1]  Renwei Dian,et al.  Regularizing Hyperspectral and Multispectral Image Fusion by CNN Denoiser , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[2]  Lei Zhang,et al.  Pixel-aware Deep Function-mixture Network for Spectral Super-Resolution , 2019, AAAI.

[3]  Enhua Wu,et al.  Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Vassilis Athitsos,et al.  lambda-Net: Reconstruct Hyperspectral Images From a Snapshot Measurement , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[5]  Jungong Han,et al.  Attentive Temporal Pyramid Network for Dynamic Scene Classification , 2019, AAAI.

[6]  Qilong Wang,et al.  Global Second-Order Pooling Convolutional Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Tao Zhang,et al.  Joint Camera Spectral Sensitivity Selection and Hyperspectral Image Recovery , 2018, ECCV.

[8]  Yun Fu,et al.  Image Super-Resolution Using Very Deep Residual Channel Attention Networks , 2018, ECCV.

[9]  Jun Li,et al.  Accurate Spectral Super-resolution from Single RGB Image Using Multi-scale CNN , 2018, PRCV.

[10]  Dong Liu,et al.  HSCNN+: Advanced CNN-Based Hyperspectral Recovery from RGB Images , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[11]  Dorit Merhof,et al.  Reconstructing Spectral Images from RGB-Images Using a Convolutional Neural Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[12]  R. Timofte,et al.  NTIRE 2018 Challenge on Spectral Reconstruction from RGB Images , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[13]  Himanshu Sharma,et al.  2D-3D CNN Based Architectures for Spectral Reconstruction from RGB Images , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[14]  Imari Sato,et al.  Deeply Learned Filter Response Functions for Hyperspectral Reconstruction , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[15]  Radu Timofte,et al.  An efficient CNN for spectral reconstruction from RGB images , 2018, ArXiv.

[16]  Yunsong Li,et al.  Structure Tensor-Based Algorithm for Hyperspectral and Panchromatic Images Fusion , 2018, Remote. Sens..

[17]  Shutao Li,et al.  Deep Hyperspectral Image Sharpening , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[18]  Yoichi Sato,et al.  From RGB to Spectrum for Natural Scenes via Manifold-Based Mapping , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[19]  Dong Liu,et al.  HSCNN: CNN-Based Hyperspectral Image Recovery from Spectrally Undersampled Projections , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[20]  Radu Timofte,et al.  In Defense of Shallow Learned Spectral Reconstruction from RGB Images , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[21]  Joost van de Weijer,et al.  Adversarial Networks for Spatial Context-Aware Spectral Image Reconstruction from RGB , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[22]  Konrad Schindler,et al.  Learned Spectral Super-Resolution , 2017, ArXiv.

[23]  Qilong Wang,et al.  Is Second-Order Information Helpful for Large-Scale Visual Recognition? , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[24]  Boaz Arad,et al.  Sparse Recovery of Hyperspectral Signal from Natural RGB Images , 2016, ECCV.

[25]  Théodore Bluche,et al.  Joint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition , 2016, NIPS.

[26]  Wei Xu,et al.  Look and Think Twice: Capturing Top-Down Visual Attention with Feedback Convolutional Neural Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[27]  Yu-Wing Tai,et al.  RGB-Guided Hyperspectral Image Upsampling , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[28]  Antonio Robles-Kelly,et al.  Single Image Spectral Reconstruction for Multimedia Applications , 2015, ACM Multimedia.

[29]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[30]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[31]  Michael S. Brown,et al.  Training-Based Spectral Reconstruction from a Single RGB Image , 2014, ECCV.

[32]  Guolan Lu,et al.  Medical hyperspectral imaging: a review , 2014, Journal of biomedical optics.

[33]  Stephen Lin,et al.  A Prism-Mask System for Multispectral Video Acquisition. , 2011, IEEE transactions on pattern analysis and machine intelligence.

[34]  Ayan Chakrabarti,et al.  Statistics of real-world hyperspectral images , 2011, CVPR 2011.

[35]  Shree K. Nayar,et al.  Generalized Assorted Pixel Camera: Postcapture Control of Resolution, Dynamic Range, and Spectrum , 2010, IEEE Transactions on Image Processing.

[36]  Liang Gao,et al.  Snapshot Image Mapping Spectrometer (IMS) with high sampling density for hyperspectral microscopy , 2010, Optics express.

[37]  Xiaobai Sun,et al.  Video rate spectral imaging using a coded aperture snapshot spectral imager. , 2009, Optics express.

[38]  Ullrich Köthe,et al.  Edge and Junction Detection with an Improved Structure Tensor , 2003, DAGM-Symposium.

[39]  Nahum Gat,et al.  Imaging spectroscopy using tunable filters: a review , 2000, SPIE Defense + Commercial Sensing.

[40]  M. Descour,et al.  Computed-tomography imaging spectrometer: experimental calibration and reconstruction results. , 1995, Applied optics.

[41]  Hsieh Hou,et al.  Cubic splines for image interpolation and digital filtering , 1978 .