SpikeReveal: Unlocking Temporal Sequences from Real Blurry Inputs with Spike Streams

Reconstructing a sequence of sharp images from a blurry input is crucial for understanding the captured scene, but it is highly challenging because a single image embeds only limited temporal information. Spike cameras, which sample at rates up to 40,000 Hz, have proven effective at capturing motion features and are beneficial for solving this ill-posed problem. Nonetheless, existing methods follow the supervised learning paradigm and suffer notable performance degradation when applied to real-world scenarios that diverge from the synthetic training data domain. Moreover, the quality of the reconstructed images is capped by that of the training images generated through motion-analysis interpolation, which inherently differ from the actual scene; this limits the generalization ability of these methods in real high-speed scenarios. To address these challenges, we propose the first self-supervised framework for the task of spike-guided motion deblurring. Our approach begins with the formulation of a spike-guided deblurring model that explores the theoretical relationships among spike streams, blurry images, and their corresponding sharp sequences. We then develop a self-supervised cascaded framework to alleviate the spike noise and spatial-resolution mismatch encountered in the deblurring model. With knowledge distillation and a re-blurring loss, we further design a lightweight deblurring network that generates high-quality sequences whose brightness and texture are consistent with the original input. Quantitative and qualitative experiments on our real-world and synthetic datasets with spikes validate the superior generalization of the proposed framework. Our code, data and trained models will be available at https://github.com/chenkang455/S-SDM.
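
The re-blurring idea mentioned above can be illustrated with a minimal sketch. It assumes the common formation model in which a blurry image is approximately the temporal average of the sharp frames within the exposure window, and it uses a mean absolute error as the penalty. The function names `reblur` and `reblur_loss` are hypothetical, and the exact loss used in the paper is not stated in this abstract, so this is an illustration rather than the authors' implementation.

```python
import numpy as np

def reblur(frames):
    """Average a (T, H, W) stack of sharp frames over time to synthesize a blurry image."""
    frames = np.asarray(frames, dtype=np.float64)
    return frames.mean(axis=0)

def reblur_loss(frames, blurry):
    """Mean absolute error between the re-blurred frames and the observed blurry image."""
    blurry = np.asarray(blurry, dtype=np.float64)
    return float(np.abs(reblur(frames) - blurry).mean())

# Toy scene (assumed for illustration): a bright pixel moving across a 1x4 row in 4 frames.
sharp = np.zeros((4, 1, 4))
for t in range(4):
    sharp[t, 0, t] = 1.0

blurry = reblur(sharp)                      # the motion smears the pixel over the row
loss_true = reblur_loss(sharp, blurry)      # the true sequence reproduces the blur exactly

# A static guess (first frame repeated) is sharp but inconsistent with the blurry input.
static_guess = np.repeat(sharp[:1], 4, axis=0)
loss_static = reblur_loss(static_guess, blurry)
```

In this toy case the true sequence gives a loss of zero, while the static guess is penalized, which is the sense in which such a loss ties the reconstructed sequence to the brightness of the original input without needing ground-truth sharp frames.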
