论文信息 - Degradation-Aware Unfolding Half-Shuffle Transformer for Spectral Compressive Imaging

Degradation-Aware Unfolding Half-Shuffle Transformer for Spectral Compressive Imaging

In coded aperture snapshot spectral compressive imaging (CASSI) systems, hyperspectral image (HSI) reconstruction methods are employed to recover the spatial-spectral signal from a compressed measurement. Among these algorithms, deep unfolding methods demonstrate promising performance but suffer from two issues. Firstly, they do not estimate the degradation patterns and ill-posedness degree from the highly related CASSI to guide the iterative learning. Secondly, they are mainly CNN-based, showing limitations in capturing long-range dependencies. In this paper, we propose a principled Degradation-Aware Unfolding Framework (DAUF) that estimates parameters from the compressed image and physical mask, and then uses these parameters to control each iteration. Moreover, we customize a novel Half-Shuffle Transformer (HST) that simultaneously captures local contents and non-local dependencies. By plugging HST into DAUF, we establish the first Transformer-based deep unfolding method, Degradation-Aware Unfolding Half-Shuffle Transformer (DAUHST), for HSI reconstruction. Experiments show that DAUHST significantly surpasses state-of-the-art methods while requiring cheaper computational and memory costs. Code and models will be released at https://github.com/caiyuanhao1998/MST

[1] Hao Zhang,et al. Recurrent Neural Networks for Snapshot Compressive Imaging , 2022, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2] L. Gool,et al. MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[3] L. Gool,et al. Coarse-to-Fine Sparse Transformer for Hyperspectral Image Reconstruction , 2022, ECCV.

[4] L. Gool,et al. HDNet: High-resolution Dual-domain Learning for Spectral Compressive Imaging , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[5] L. Gool,et al. Flow-Guided Sparse Transformer for Video Deblurring , 2022, ICML.

[6] Wenming Yang,et al. RFormer: Transformer-Based Generative Adversarial Network for Real Fundus Image Restoration on a New Clinical Benchmark , 2022, IEEE Journal of Biomedical and Health Informatics.

[7] Yongbing Zhang,et al. HerosNet: Hyperspectral Explicable Reconstruction and Optimal Sampling Deep Network for Snapshot Compressive Imaging , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[8] L. Gool,et al. Mask-guided Spectral-wise Transformer for Efficient Hyperspectral Image Reconstruction , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9] Jianmin Bao,et al. Uformer: A General U-Shaped Transformer for Image Restoration , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[10] Qi Tian,et al. Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation , 2021, ECCV Workshops.

[11] Jinli Suo,et al. Plug-and-Play Algorithms for Video Snapshot Compressive Imaging , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12] H. Pfister,et al. Learning to Generate Realistic Noisy Images via Pixel-level Noise-aware Adversarial Training , 2022, NeurIPS.

[13] Lu Yuan,et al. Dynamic DETR: End-to-End Object Detection with Dynamic Attention , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[14] Rohit Girdhar,et al. An End-to-End Transformer Model for 3D Object Detection , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[15] Xin Yuan,et al. Supplementary Material for “Self-supervised Neural Networks for Spectral Snapshot Compressive Imaging” , 2021 .

[16] Luc Van Gool,et al. SwinIR: Image Restoration Using Swin Transformer , 2021, 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW).

[17] Tao Xiang,et al. Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[18] Xiaolong Wang,et al. Test-Time Personalization with a Transformer for Human Pose Estimation , 2021, NeurIPS.

[19] Matthijs Douze,et al. XCiT: Cross-Covariance Image Transformers , 2021, NeurIPS.

[20] L. Gool,et al. Video Super-Resolution Transformer , 2021, ArXiv.

[21] Yi Yang,et al. Associating Objects with Transformers for Video Object Segmentation , 2021, NeurIPS.

[22] Jiyang Qi,et al. You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection , 2021, NeurIPS.

[23] Anima Anandkumar,et al. SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers , 2021, NeurIPS.

[24] Cordelia Schmid,et al. Segmenter: Transformer for Semantic Segmentation , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[25] Zhuowen Tu,et al. Pose Recognition with Cascade Transformers , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[26] Shu-Tao Xia,et al. TokenPose: Learning Keypoint Tokens for Human Pose Estimation , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[27] Cordelia Schmid,et al. ViViT: A Video Vision Transformer , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[28] Chunhua Shen,et al. TFPose: Direct Human Pose Estimation with Transformers , 2021, ArXiv.

[29] G. Arce,et al. LED-based compressive spectral-temporal imaging. , 2021, Optics express.

[30] Quanfu Fan,et al. CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[31] Andreas Veit,et al. Understanding Robustness of Transformers for Image Classification , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[32] Zhengming Ding,et al. 3D Human Pose Estimation with Spatial and Temporal Transformers , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[33] Guangming Shi,et al. Deep Gaussian Scale Mixture Prior for Spectral Compressive Imaging , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[34] Aggelos K. Katsaggelos,et al. Snapshot Compressive Imaging: Theory, Algorithms, and Applications , 2021, IEEE Signal Processing Magazine.

[35] Tao Xiang,et al. Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[36] Wankou Yang,et al. TransPose: Keypoint Localization via Transformer , 2020, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[37] Wen Gao,et al. Pre-Trained Image Processing Transformer , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[38] Shensheng Han,et al. Deep plug-and-play priors for spectral snapshot compressive imaging , 2020, Photonics Research.

[39] S. Gelly,et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , 2020, ICLR.

[40] Bin Li,et al. Deformable DETR: Deformable Transformers for End-to-End Object Detection , 2020, ICLR.

[41] Stephen Lin,et al. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[42] Shaodi You,et al. Bidirectional 3D Quasi-Recurrent Neural Network for Hyperspectral Image Super-Resolution , 2021, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[43] Shirin Jalali,et al. GAP-net for Snapshot Compressive Imaging , 2020, 2012.08364.

[44] Xin Yuan,et al. End-to-End Low Cost Compressive Spectral Imaging with Spatial-Spectral Self-Attention , 2020, ECCV.

[45] Zhenming Yu,et al. Snapshot multispectral endomicroscopy. , 2020, Optics letters.

[46] Hua Huang,et al. DNU: Deep Non-Local Unrolling for Computational Spectral Imaging , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[47] Nicolas Usunier,et al. End-to-End Object Detection with Transformers , 2020, ECCV.

[48] Xin Yuan,et al. Snapshot spatial-temporal compressive imaging. , 2020, Optics letters.

[49] Qionghai Dai,et al. Plug-and-Play Algorithms for Large-Scale Snapshot Compressive Imaging , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[50] Xiangyu Zhang,et al. Learning Delicate Local Representations for Multi-Person Pose Estimation , 2020, ECCV.

[51] Xin Yuan,et al. Deep learning for video compressive sensing , 2020, APL Photonics.

[52] Vassilis Athitsos,et al. lambda-Net: Reconstruct Hyperspectral Images From a Snapshot Measurement , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[53] Zheng Shou,et al. Deep Tensor ADMM-Net for Snapshot Compressive Imaging , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[54] Ying Fu,et al. Computational Hyperspectral Imaging Based on Dimension-Discriminative Low-Rank Tensor Recovery , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[55] Ashish Vaswani,et al. Stand-Alone Self-Attention in Vision Models , 2019, NeurIPS.

[56] Ying Fu,et al. Hyperspectral Image Reconstruction Using a Deep Spatial-Spectral Prior , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[57] Xin Yuan,et al. Snapshot Compressed Sensing: Performance Bounds and Algorithms , 2018, IEEE Transactions on Information Theory.

[58] Qionghai Dai,et al. Rank Minimization for Snapshot Compressive Imaging , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[59] Xiangyu Zhang,et al. ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[60] Giljoo Nam,et al. High-quality hyperspectral reconstruction using a spectral prior , 2017, ACM Trans. Graph..

[61] Guangming Shi,et al. Adaptive Nonlocal Sparse Representation for Dual-Camera Compressive Hyperspectral Imaging , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[62] Matthew J. Hoffman,et al. Aerial Vehicle Tracking by Adaptive Fusion of Hyperspectral Likelihood Maps , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[63] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[64] Xiangtao Zheng,et al. Hyperspectral Image Superresolution by Transfer Learning , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[65] Frank Hutter,et al. SGDR: Stochastic Gradient Descent with Warm Restarts , 2016, ICLR.

[66] Pierre Alliez,et al. Recurrent Neural Networks to Correct Satellite Image Classification Maps , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[67] Stanley H. Chan,et al. Plug-and-Play ADMM for Image Restoration: Fixed-Point Convergence and Applications , 2016, IEEE Transactions on Computational Imaging.

[68] Stephen Lin,et al. Computational Snapshot Multispectral Cameras: Toward dynamic capture of the spectral world , 2016, IEEE Signal Processing Magazine.

[69] Matthew J. Hoffman,et al. Real-Time Vehicle Tracking in Aerial Video Using Hyperspectral Features , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[70] Yoichi Sato,et al. Exploiting Spectral-Spatial Correlation for Coded Hyperspectral Image Restoration , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[71] Bo Du,et al. Scene Classification via a Gradient Boosting Random Convolutional Network Framework , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[72] Xin Yuan,et al. Generalized alternating projection based total variation minimization for compressive sensing , 2015, 2016 IEEE International Conference on Image Processing (ICIP).

[73] Gonzalo R. Arce,et al. Compressive Hyperspectral Imaging via Approximate Message Passing , 2015, IEEE Journal of Selected Topics in Signal Processing.

[74] Xin Yuan,et al. Compressive Hyperspectral Imaging With Side Information , 2015, IEEE Journal of Selected Topics in Signal Processing.

[75] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[76] S. Frick,et al. Compressed Sensing , 2014, Computer Vision, A Reference Guide.

[77] Guolan Lu,et al. Medical hyperspectral imaging: a review , 2014, Journal of biomedical optics.

[78] Jon Atli Benediktsson,et al. Advances in Spectral-Spatial Classification of Hyperspectral Images , 2013, Proceedings of the IEEE.

[79] Guillermo Sapiro,et al. Coded aperture compressive temporal imaging , 2013, Optics express.

[80] Min H. Kim,et al. 3D imaging spectroscopy for measuring hyperspectral patterns on solid objects , 2012, ACM Trans. Graph..

[81] David J. Brady,et al. Multiframe image estimation for coded aperture snapshot spectral imagers. , 2010, Applied optics.

[82] Rama Chellappa,et al. Tracking via object reflectance using a hyperspectral video camera , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[83] Stephen Lin,et al. A Prism-Mask System for Multispectral Video Acquisition. , 2011, IEEE transactions on pattern analysis and machine intelligence.

[84] Xiaobai Sun,et al. Video rate spectral imaging using a coded aperture snapshot spectral imager. , 2009, Optics express.

[85] Ashwin A. Wagadarikar,et al. Single disperser design for coded aperture snapshot spectral imaging. , 2008, Applied optics.

[86] Shree K. Nayar,et al. Multispectral Imaging Using Multiplexed Illumination , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[87] M. Borengasser,et al. Hyperspectral Remote Sensing: Principles and Applications , 2007 .

[88] José M. Bioucas-Dias,et al. A New TwIST: Two-Step Iterative Shrinkage/Thresholding Algorithms for Image Restoration , 2007, IEEE Transactions on Image Processing.

[89] Mário A. T. Figueiredo,et al. Gradient Projection for Sparse Reconstruction: Application to Compressed Sensing and Other Inverse Problems , 2007, IEEE Journal of Selected Topics in Signal Processing.

[90] M E Gehm,et al. Single-shot compressive spectral imaging with a dual-disperser architecture. , 2007, Optics express.

[91] Joel A. Tropp,et al. Signal Recovery From Random Measurements Via Orthogonal Matching Pursuit , 2007, IEEE Transactions on Information Theory.

[92] Lorenzo Bruzzone,et al. Classification of hyperspectral remote sensing images with support vector machines , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[93] Bruce J. Tromberg,et al. Face recognition in hyperspectral images , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[94] S. Shapshay,et al. Detection of preinvasive cancer cells , 2000, Nature.

[95] A F Goetz,et al. Imaging Spectrometry for Earth Remote Sensing , 1985, Science.