Real-Time Multi-Modal People Detection and Tracking of Mobile Robots with A RGB-D Sensor*

A real-time multi-modal people detection and tracking algorithm based on RGB-D information is presented in this paper. The color image and the depth image is calibrated and aligned firstly with each other. At texture level a SVM classifier based on HOG features is used to detect and identify human body in color images, while at depth level an Adaboost classifier with Haar-like features is used to detect human head in depth images respectively. With the space relationship between human body and head, a combined multi-modal cascade detector is then constructed, which can greatly improve the detection accuracy. For tracking tasks, the KCF(Kernelized Correlation Filter) algorithm is adopted to realize the real-time people tracking, in which color and depth features are both considered and used in template matching to guarantee the robustness of the tracking algorithm. The proposed algorithm is robust to object occlusion and tracking lost caused by fast movement or cross walking. Extensive experiments are carried out on a wheeled mobile robot mounted with a Kinect2 sensor, and the result shows the feasibility and effectiveness of the method.

[1]  Kai Oliver Arras,et al.  People detection in RGB-D data , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2]  Xiaogang Wang,et al.  Joint Deep Learning for Pedestrian Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[3]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Morgan Quigley,et al.  ROS: an open-source Robot Operating System , 2009, ICRA 2009.

[6]  Matteo Munaro,et al.  Tracking people within groups with RGB-D data , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  Deva Ramanan,et al.  Histograms of Sparse Codes for Object Detection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[9]  Matteo Munaro,et al.  Fast RGB-D people tracking for service robots , 2014, Auton. Robots.

[10]  Rui Caseiro,et al.  Exploiting the Circulant Structure of Tracking-by-Detection with Kernels , 2012, ECCV.

[11]  Yann LeCun,et al.  Pedestrian Detection with Unsupervised Multi-stage Feature Learning , 2012, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Rui Caseiro,et al.  High-Speed Tracking with Kernelized Correlation Filters , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Daniel Castro,et al.  Feature extraction for moving objects tracking system in indoor environments , 2004 .