Camera-light detection and ranging data fusion-based system for pedestrian detection

Abstract. A hybrid system for pedestrian detection is proposed, in which both visible camera and light detection and ranging (LIDAR) data of the same scene are used. The proposed method is achieved in two main stages. First, the 3-D LIDAR points are clustered and the resulting clusters are projected onto the visible image to get regions of interests (ROIs). A postprocessing method is applied on the resulting ROIs to keep only significant ones. Second, a hybrid feature is defined to classify the ROIs extracted from the first stage. It involves a combination of visual (camera) and range (LIDAR)-based features. The visual feature is a combination of the so-called HOG-Color, which is an extension of the classical histogram of oriented gradients (HOG) feature to the RGB color space and the local self-similarity feature. The range feature we define is called histogram of dominant silhouette orientation descriptor. It is computed from the depth values of the pixels belonging to the ROI. Random forest, support vector machine, and Adaboost classifiers have been tested together with the introduced hybrid feature and the first one is adopted as it outperforms the two other ones. The proposed method has been tested on KITTI dataset and the results are satisfactory when compared to recent state-of-the-art methods.

[1]  Abdelkaher Ait Abdelouahad,et al.  New features for wireless capsule endoscopy polyp detection , 2018, 2018 International Conference on Intelligent Systems and Computer Vision (ISCV).

[2]  Lahcen Koutti,et al.  Fast edge-based stereo matching approach for road applications , 2017, Signal Image Video Process..

[3]  Weiming Shen,et al.  A new pedestrian detection method based on combined HOG and LSS features , 2015, Neurocomputing.

[4]  Cristiano Premebida,et al.  LIDAR and vision‐based pedestrian detection system , 2009, J. Field Robotics.

[5]  Xiaogang Wang,et al.  Pedestrian detection aided by deep learning semantic tasks , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Lahcen Koutti,et al.  Fast spatio-temporal stereo matching for advanced driver assistance systems , 2016, Neurocomputing.

[7]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[9]  Fernando Puente León,et al.  Information fusion to detect and classify pedestrians using invariant features , 2011, Inf. Fusion.

[10]  Cristiano Premebida,et al.  Evaluation of Boosting-SVM and SRM-SVM cascade classifiers in laser and vision-based pedestrian detection , 2011, 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[11]  Markus Braun,et al.  Pose-RCNN: Joint object detection and pose estimation using 3D object proposals , 2016, 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC).

[12]  Bernt Schiele,et al.  Taking a deeper look at pedestrians , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Mohan M. Trivedi,et al.  Learning to Detect Vehicles by Clustering Appearance Patterns , 2015, IEEE Transactions on Intelligent Transportation Systems.

[15]  Raimondo Schettini,et al.  Local detectors and compact descriptors for visual search: A quantitative comparison , 2015, Digit. Signal Process..

[16]  Roland Siegwart,et al.  Human detection using multimodal and multidimensional features , 2008, 2008 IEEE International Conference on Robotics and Automation.

[17]  Xiaogang Wang,et al.  Multi-stage Contextual Deep Learning for Pedestrian Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[18]  Ming Yang,et al.  Regionlets for Generic Object Detection , 2013, 2013 IEEE International Conference on Computer Vision.

[19]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[20]  Shuicheng Yan,et al.  Discriminative local binary patterns for human detection in personal album , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Zhan Song,et al.  Histogram of Silhouette Direction code: An efficient HOG-based descriptor for accurate human detection , 2012, 2012 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[22]  George Bebis,et al.  Fast spatio-temporal stereo for intelligent transportation systems , 2012, Pattern Analysis and Applications.

[23]  Eli Shechtman,et al.  Matching Local Self-Similarities across Images and Videos , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Bernt Schiele,et al.  Filtered channel features for pedestrian detection , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Fernando García,et al.  Identifying and Tracking Pedestrians Based on Sensor Fusion and Motion Stability Predictions , 2010, Sensors.

[26]  Daijin Kim,et al.  Robust pedestrian detection under deformation using simple boosted features , 2017, Image Vis. Comput..

[27]  Lei Geng,et al.  Fast pedestrian detection using deformable part model and pyramid layer location , 2017, J. Electronic Imaging.

[28]  George Bebis,et al.  A real-time spatio-temporal stereo matching for road applications , 2011, 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[29]  Wei Liang,et al.  Face pose estimation with combined 2D and 3D HOG features , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[30]  Cristiano Premebida,et al.  Pedestrian detection combining RGB and dense LIDAR data , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[31]  Xiaogang Wang,et al.  Deep Learning Strong Parts for Pedestrian Detection , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[32]  Wolfram Burgard,et al.  Using Boosted Features for the Detection of People in 2D Range Data , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[33]  Mohamed El Ansari,et al.  Multisensors-based pedestrian detection system , 2016, 2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA).

[34]  Cristiano Premebida,et al.  A Multi-Target Tracking and GMM-Classifier for Intelligent Vehicles , 2006, 2006 IEEE Intelligent Transportation Systems Conference.

[35]  Chaur-Heh Hsieh,et al.  Human action recognition using silhouette histogram , 2011 .

[36]  Mohamed El Ansari,et al.  A fully automated ulcer detection system for wireless capsule endoscopy images , 2017, 2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP).

[37]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[38]  Rogério Schmidt Feris,et al.  A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection , 2016, ECCV.

[39]  Zezhi Chen,et al.  Vision-based traffic surveys in urban environments , 2016, J. Electronic Imaging.

[40]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[41]  George Bebis,et al.  Temporal consistent fast stereo matching for advanced driver assistance systems (ADAS) , 2010, 2010 IEEE Intelligent Vehicles Symposium.

[42]  Hauke Schramm,et al.  Analysis of the discriminative generalized Hough transform as a proposal generator for a deep network in automatic pedestrian and car detection , 2018, J. Electronic Imaging.

[43]  Nizar Bouguila,et al.  Finite asymmetric generalized Gaussian mixture models learning for infrared object detection , 2013, Comput. Vis. Image Underst..

[44]  Mohamed El Ansari,et al.  Multi-scale completed local binary patterns for ulcer detection in wireless capsule endoscopy images , 2018, Multimedia Tools and Applications.

[45]  A. Trémeau,et al.  Traffic sign recognition method for intelligent vehicles. , 2018, Journal of the Optical Society of America. A, Optics, image science, and vision.

[46]  Andreas Geiger,et al.  Joint 3D Estimation of Objects and Scene Layout , 2011, NIPS.

[47]  Paulo Peixoto,et al.  A Lidar and Vision-based Approach for Pedestrian and Vehicle Detection and Tracking , 2007, 2007 IEEE Intelligent Transportation Systems Conference.

[48]  Dongmei Fu,et al.  Pedestrian tracking for infrared image sequence based on trajectory manifold of spatio-temporal slice , 2016, Multimedia Tools and Applications.

[49]  Matti Pietikäinen,et al.  A comparative study of texture measures with classification based on featured distributions , 1996, Pattern Recognit..

[50]  Nuno Vasconcelos,et al.  Learning Complexity-Aware Cascades for Deep Pedestrian Detection , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[51]  Siddhartha Bhattacharyya,et al.  Hybrid soft computing approaches to content based video retrieval: A brief review , 2016, Appl. Soft Comput..

[52]  Satoshi Goto,et al.  Histogram of template for human detection , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[53]  Tomaso A. Poggio,et al.  A general framework for object detection , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[54]  António E. Ruano,et al.  Fast Line, Arc/Circle and Leg Detection from Laser Scan Data in a Player Driver , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[55]  Mohamed El Ansari,et al.  Edge points-based stereo matching approach for omnidirectional images , 2018, J. Electronic Imaging.

[56]  Abdelaziz Bensrhair,et al.  Temporal consistent real-time stereo for intelligent vehicles , 2010, Pattern Recognit. Lett..

[57]  Peter V. Gehler,et al.  Multi-View and 3D Deformable Part Models , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[58]  Andreas Geiger,et al.  Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[59]  Mohamed El Ansari,et al.  Mean shift and log-polar transform for road sign detection , 2017, Multimedia Tools and Applications.

[60]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[61]  Mohamed El Ansari,et al.  Traffic sign detection and recognition based on random forests , 2016, Appl. Soft Comput..

[62]  Luc Van Gool,et al.  Seeking the Strongest Rigid Detector , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[63]  R. Tibshirani,et al.  Additive Logistic Regression : a Statistical View ofBoostingJerome , 1998 .