Paper information - PCA-RECT: An Energy-efficient Object Detection Approach for Event Cameras

We present the first purely event-based, energy-efficient approach for object detection and categorization using an event camera. Compared to traditional frame-based cameras, event cameras offer high temporal resolution (on the order of microseconds), low power consumption (a few hundred mW), and wide dynamic range (120 dB). However, event-based object recognition systems lag far behind their frame-based counterparts in accuracy. To this end, this paper presents an event-based feature extraction method that accumulates local activity across the image frame and then applies principal component analysis (PCA) to the normalized neighborhood region. Subsequently, we propose a backtracking-free k-d tree mechanism for efficient feature matching that takes advantage of the low dimensionality of the feature representation. Additionally, the proposed k-d tree mechanism allows for feature selection to obtain a lower-dimensional dictionary representation when hardware resources are too limited to implement dimensionality reduction. Consequently, the proposed system can be realized on a field-programmable gate array (FPGA) device, leading to a high performance-to-resource ratio. The proposed system is tested on real-world event-based datasets for object categorization, showing superior classification performance and relevance to state-of-the-art algorithms. Additionally, we verified the object detection method and real-time FPGA performance in lab settings under non-controlled illumination conditions with limited training data and ground-truth annotations.
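The abstract does not spell out how the backtracking-free k-d tree works, so the following is only a minimal sketch of one plausible reading: a query descends the tree once from root to leaf and keeps the closest dictionary word seen on that path, never revisiting the sibling branches. The names `Node`, `build`, `sq_dist`, and `match` are hypothetical, and the dictionary words stand in for low-dimensional feature vectors such as PCA projections.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple

Vector = Sequence[float]


@dataclass
class Node:
    index: int                      # index of the dictionary word stored here
    axis: int                       # dimension compared at this node
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def build(words: List[Vector],
          indices: Optional[List[int]] = None,
          depth: int = 0) -> Optional[Node]:
    """Build a k-d tree by median splits, cycling through the dimensions."""
    if indices is None:
        indices = list(range(len(words)))
    if not indices:
        return None
    axis = depth % len(words[0])
    indices = sorted(indices, key=lambda i: words[i][axis])
    mid = len(indices) // 2
    return Node(indices[mid], axis,
                build(words, indices[:mid], depth + 1),
                build(words, indices[mid + 1:], depth + 1))


def sq_dist(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def match(root: Optional[Node], words: List[Vector],
          query: Vector) -> Tuple[int, float]:
    """Single root-to-leaf descent: an assumed form of 'backtracking-free'.

    Returns the index and squared distance of the closest word on the path.
    The result is approximate, because untaken branches are never searched.
    """
    best_index, best_dist = -1, float("inf")
    node = root
    while node is not None:
        d = sq_dist(words[node.index], query)
        if d < best_dist:
            best_index, best_dist = node.index, d
        # One comparison per level; the other subtree is discarded for good.
        if query[node.axis] < words[node.index][node.axis]:
            node = node.left
        else:
            node = node.right
    return best_index, best_dist


if __name__ == "__main__":
    # Toy 2-D dictionary; real descriptors would come from feature extraction.
    dictionary = [(0.1, 0.9), (0.8, 0.2), (0.4, 0.5), (0.9, 0.8), (0.2, 0.1)]
    tree = build(dictionary)
    print(match(tree, dictionary, (0.85, 0.75)))
```

A single descent costs one comparison per tree level and needs no stack, which is the kind of fixed, bounded work that maps well onto hardware. The trade-off is that the returned word is not guaranteed to be the true nearest neighbor.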
