Retina-Like Visual Image Reconstruction via Spiking Neural Model

The high-sensitivity vision of primates, including humans, is mediated by a small retinal region called the fovea. As a novel bio-inspired vision sensor, spike camera mimics the fovea to record the nature scenes by continuous-time spikes instead of frame-based manner. However, reconstructing visual images from the spikes remains to be a challenge. In this paper, we design a retina-like visual image reconstruction framework, which is flexible in reconstructing full texture of natural scenes from the totally new spike data. Specifically, the proposed architecture consists of motion local excitation layer, spike refining layer and visual reconstruction layer motivated by bio-realistic leaky integrate and fire (LIF) neurons and synapse connection with spike-timing-dependent plasticity (STDP) rules. This approach may represent a major shift from conventional frame-based vision to the continuous-time retina-like vision, owning to the advantages of high temporal resolution and low power consumption. To test the performance, a spike dataset is constructed which is recorded by the spike camera. The experimental results show that the proposed approach is extremely effective in reconstructing the visual image in both normal and high speed scenes, while achieving high dynamic range and high image quality.

[1]  Tobi Delbruck,et al.  A 240 × 180 130 dB 3 µs Latency Global Shutter Spatiotemporal Vision Sensor , 2014, IEEE Journal of Solid-State Circuits.

[2]  T. Delbruck,et al.  > Replace This Line with Your Paper Identification Number (double-click Here to Edit) < 1 , 2022 .

[3]  Matthew Cook,et al.  Unsupervised learning of digit recognition using spike-timing-dependent plasticity , 2015, Front. Comput. Neurosci..

[4]  Li Xi,et al.  Autofocusing of ISAR images based on entropy minimization , 1999 .

[5]  Romain Brette,et al.  Brian 2: an intuitive and efficient neural simulator , 2019, bioRxiv.

[6]  Eugenio Culurciello,et al.  Activity-driven, event-based vision sensors , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[7]  M. Baudry Synaptic Plasticity and Learning and Memory: 15 Years of Progress , 1998, Neurobiology of Learning and Memory.

[8]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Lei Zheng,et al.  Image Noise Level Estimation by Principal Component Analysis , 2013, IEEE Transactions on Image Processing.

[10]  Philipp Häfliger,et al.  Bio-Inspired Asynchronous Pixel Event Tricolor Vision Sensor , 2014, IEEE Transactions on Biomedical Circuits and Systems.

[11]  Tiejun Huang,et al.  A Retina-Inspired Sampling Method for Visual Texture Reconstruction , 2019, 2019 IEEE International Conference on Multimedia and Expo (ICME).

[12]  Pierre Tirilly,et al.  Mastering the Output Frequency in Spiking Neural Networks , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[13]  Juan Antonio Leñero-Bardallo,et al.  On the Analysis and Detection of Flames With an Asynchronous Spiking Image Sensor , 2018, IEEE Sensors Journal.

[14]  Shoushun Chen,et al.  Live demonstration: A 768 × 640 pixels 200Meps dynamic vision sensor , 2017, 2017 IEEE International Symposium on Circuits and Systems (ISCAS).

[15]  Davide Scaramuzza,et al.  Asynchronous, Photometric Feature Tracking using Events and Frames , 2018, ECCV.

[16]  Stan Z. Li,et al.  Markov Random Field Modeling in Image Analysis , 2001, Computer Science Workbench.

[17]  G. Bi,et al.  Synaptic Modifications in Cultured Hippocampal Neurons: Dependence on Spike Timing, Synaptic Strength, and Postsynaptic Cell Type , 1998, The Journal of Neuroscience.

[18]  Nick Barnes,et al.  Continuous-time Intensity Estimation Using Event Cameras , 2018, ACCV.

[19]  Matthias Durr,et al.  Methods In Neuronal Modeling From Ions To Networks , 2016 .

[20]  Narciso García,et al.  Event-Based Vision Meets Deep Learning on Steering Prediction for Self-Driving Cars , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[21]  Vladlen Koltun,et al.  Events-To-Video: Bringing Modern Computer Vision to Event Cameras , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Vladimir Kolmogorov,et al.  What energy functions can be minimized via graph cuts? , 2002, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  E. Culurciello,et al.  A biomorphic digital image sensor , 2003, IEEE J. Solid State Circuits.

[24]  Qingjie Zhao,et al.  Blind image quality assessment by relative gradient statistics and adaboosting neural network , 2016, Signal Process. Image Commun..

[25]  A.N. Belbachir,et al.  Embedded Vision System for Real-Time Object Tracking using an Asynchronous Transient Vision Sensor , 2006, 2006 IEEE 12th Digital Signal Processing Workshop & 4th IEEE Signal Processing Education Workshop.

[26]  Daniel Matolin,et al.  An asynchronous time-based image sensor , 2008, 2008 IEEE International Symposium on Circuits and Systems.

[27]  Tobi Delbrück,et al.  A 128$\times$ 128 120 dB 15 $\mu$s Latency Asynchronous Temporal Contrast Vision Sensor , 2008, IEEE Journal of Solid-State Circuits.

[28]  J A Fessler,et al.  Mean and variance of single photon counting with deadtime. , 2000, Physics in medicine and biology.

[29]  Lina J. Karam,et al.  A No-Reference Image Blur Metric Based on the Cumulative Probability of Blur Detection (CPBD) , 2011, IEEE Transactions on Image Processing.