DETECTION OF PERSONS IN MLS POINT CLOUDS USING IMPLICIT SHAPE MODELS

In this paper we present an approach for the detection of persons in point clouds gathered by mobile laser scanning (MLS) systems. The approach consists of a preprocessing and the actual detection. The main task of the preprocessing is to reduce the amount of data which has to be processed by the detection. To fulfill this task, the preprocessing consists of ground removal, segmentation and several filters. The detection is based on an implicit shape models (ISM) approach which is an extension to bag-of-words approaches. For this detection method, it is sufficient to work with a small amount of training data. Although in this paper we focus on the detection of persons, our approach is able to detect multiple classes of objects in point clouds. Using a parameterization of the approach which offers a good compromise between detection and runtime performance, we are able to achieve a precision of 0.68 and a recall of 0.76 while having a average runtime of 370ms per single scan rotation of the rotating head of a typical MLS sensor.

[1]  Roland Siegwart,et al.  A Layered Approach to People Detection in 3D Range Data , 2010, AAAI.

[2]  David G. Lowe,et al.  Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration , 2009, VISAPP.

[3]  Christoph Mertz,et al.  Pedestrian Detection and Tracking Using Three-dimensional LADAR Data , 2010, Int. J. Robotics Res..

[4]  Armin B. Cremers,et al.  Laser-based segment classification using a mixture of bag-of-words , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[5]  Konrad Schindler,et al.  IMPLICIT SHAPE MODELS FOR OBJECT DETECTION IN 3D POINT CLOUDS , 2012, ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences.

[6]  Luc Van Gool,et al.  Hough Transform and 3D SURF for Robust Three Dimensional Classification , 2010, ECCV.

[7]  Andrew E. Johnson,et al.  Using Spin Images for Efficient Object Recognition in Cluttered 3D Scenes , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Michael Arens,et al.  View-invariant person re-identification with an Implicit Shape Model , 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[9]  Bernt Schiele,et al.  Robust Object Detection with Interleaved Categorization and Segmentation , 2008, International Journal of Computer Vision.

[10]  Cristiano Premebida,et al.  Pedestrian detection combining RGB and dense LIDAR data , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[11]  Sebastian Scherer,et al.  VoxNet: A 3D Convolutional Neural Network for real-time object recognition , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[12]  Toby Sharp,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR.

[13]  Andrew Y. Ng,et al.  Convolutional-Recursive Deep Learning for 3D Object Classification , 2012, NIPS.