Real-Time Clustering and Multi-Target Tracking Using Event-Based Sensors

Clustering is crucial for many computer vision applications such as robust tracking, object detection and segmentation. This work presents a real-time clustering technique that takes advantage of the unique properties of event-based vision sensors. Since event-based sensors trigger events only when the intensity changes, the data is sparse, with low redundancy. Thus, our approach redefines the well-known mean-shift clustering method using asynchronous events instead of conventional frames. The potential of our approach is demonstrated in a multi-target tracking application using Kalman filters to smooth the trajectories. We evaluated our method on an existing dataset with patterns of different shapes and speeds, and a new dataset that we collected. The sensor was attached to the Baxter robot in an eye-in-hand setup monitoring real-world objects in an action manipulation task. Clustering accuracy achieved an F-measure of 0.95, reducing the computational cost by 88% compared to the frame-based method. The average error for tracking was 2.5 pixels and the clustering achieved a consistent number of clusters along time.

[1]  Yiannis Aloimonos,et al.  Contour Detection and Characterization for Asynchronous Event Sensors , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[2]  Yiannis Aloimonos,et al.  Contour Motion Estimation for Asynchronous Event-Driven Cameras , 2014, Proceedings of the IEEE.

[3]  Tobias Delbrück,et al.  Frame-free dynamic digital vision , 2008 .

[4]  Wei Wang,et al.  A multiple object tracking method using Kalman filter , 2010, The 2010 IEEE International Conference on Information and Automation.

[5]  Larry D. Hostetler,et al.  The estimation of the gradient of a density function, with applications in pattern recognition , 1975, IEEE Trans. Inf. Theory.

[6]  Matteo Munaro,et al.  Robust multiple object tracking in RGB-D camera networks , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[7]  Tobi Delbrück,et al.  The event-camera dataset and simulator: Event-based data for pose estimation, visual odometry, and SLAM , 2016, Int. J. Robotics Res..

[8]  Yiannis Aloimonos,et al.  An Experimental Study of Color-Based Segmentation Algorithms Based on the Mean-Shift Concept , 2010, ECCV.

[9]  Nitish V. Thakor,et al.  A Saccade Based Framework for Real-Time Motion Segmentation Using Event Based Vision Sensors , 2017, Front. Neurosci..

[10]  Tobi Delbrück,et al.  A USB3.0 FPGA event-based filtering and tracking framework for dynamic vision sensors , 2015, 2015 IEEE International Symposium on Circuits and Systems (ISCAS).

[11]  Ryad Benosman,et al.  Simultaneous Mosaicing and Tracking with an Event Camera , 2014, BMVC.

[12]  T. Delbruck,et al.  > Replace This Line with Your Paper Identification Number (double-click Here to Edit) < 1 , 2022 .

[13]  Long Quan,et al.  Region-based progressive stereo matching , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[14]  Yiannis Aloimonos,et al.  Bio-inspired Motion Estimation with Event-Driven Sensors , 2015, IWANN.

[15]  Young-Woo Seo,et al.  Kernel-based tracking for improving sign detection performance , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[16]  Sébastien Barré,et al.  A Motion-Based Feature for Event-Based Pattern Recognition , 2017, Front. Neurosci..

[17]  Kostas Daniilidis,et al.  Event-based feature tracking with probabilistic data association , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[18]  Ryad Benosman,et al.  Asynchronous Event-Based Multikernel Algorithm for High-Speed Visual Features Tracking , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[19]  Tobi Delbrück,et al.  A 128$\times$ 128 120 dB 15 $\mu$s Latency Asynchronous Temporal Contrast Vision Sensor , 2008, IEEE Journal of Solid-State Circuits.

[20]  Ryad Benosman,et al.  Visual Tracking Using Neuromorphic Asynchronous Event-Based Cameras , 2015, Neural Computation.

[21]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..