Results from a Real-time Stereo-based Pedestrian Detection System on a Moving Vehicle

This paper describes performance results from a real-time system for detecting, localizing, and tracking pedes- trians from a moving vehicle. The end-to-end system runs at 5Hz on 1024x768 imagery using standard hardware, and has been integrated and tested on multiple ground vehicles and environments. We show performance on a diverse set of ground- truthed datasets in outdoor environments with varying degrees of pedestrian density and clutter. The system can reliably detect upright pedestrians to a range of 40m in lightly cluttered urban environments. In highly cluttered urban environments, the detection rates are on par with state-of-the-art non-real- time systems (1). I. INTRODUCTION The ability for autonomous vehicles to detect and predict the motion of pedestrians or personnel in their vicinity is crit- ical to ensure that the vehicles operate safely around people. Vehicles must be able to detect people in urban and cross- country environments, including flat, uneven and multi-level terrain, with widely varying degrees of clutter, occlusion, and illumination (and ultimately for operating day or night, in all weather, and in the presence of atmospheric obscurants). To support high-speed driving, detection must be reliable to a range of 100m. The ability to detect pedestrians from a moving vehicle in a cluttered, dynamic urban environments is also applicable to automatic driver-assistance systems or smaller autonomous robots navigating in environments such as a sidewalk or marketplace. This paper describes results from a fully integrated real- time system capable of reliably detecting, localizing, and tracking upright (stationary, walking, or running) human adults at a range out to 40m from a moving platform. Our approach uses imagery and dense range data from stereo cameras for the detection, tracking, and velocity estima- tion of pedestrians. The end-to-end system runs at 5Hz on 1024x768 imagery on a standard 2.4GHz Intel Core 2 Quad processor. The ability to process this high resolution imagery enables the system to achieve better performance at long range compared to other state-of-the-art implementations. Because the system segments and classifies people based on stereo range data, it is largely invariant to the variability of pedestrians' appearance (due to different types and styles of clothing) and scale. The system also handles different viewpoints (frontal vs. side views) and poses (including The research described in this publication was carried out at the Jet Propulsion Laboratory, California Institute of Technology, with funding from the Army Research Lab (ARL) under the Robotics Collaborative Technology Alliance (RCTA) through an agreement with NASA All authors are with the Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA 91109 articulations and walking) of pedestrians, and is robust to objects being carried or worn by them. Furthermore, the system makes no assumption of a ground-plane to detect or track people, and similarly makes no assumption about the predictability of a person's motion other than a maximum velocity.

[1]  Larry Matthies,et al.  Stereo vision and rover navigation software for planetary exploration , 2002, Proceedings, IEEE Aerospace Conference.

[2]  Klaus Dietmayer,et al.  Pedestrian recognition in urban traffic using a vehicle based multilayer laserscanner , 2002, Intelligent Vehicle Symposium, 2002. IEEE.

[3]  A. Shashua,et al.  Pedestrian detection for driving assistance systems: single-frame classification and system level performance , 2004, IEEE Intelligent Vehicles Symposium, 2004.

[4]  Paul A. Viola,et al.  Detecting Pedestrians Using Patterns of Motion and Appearance , 2005, International Journal of Computer Vision.

[5]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[6]  Dariu Gavrila,et al.  Multi-cue Pedestrian Detection and Tracking from a Moving Vehicle , 2007, International Journal of Computer Vision.

[7]  Ignacio Parra,et al.  Pedestrian Detection Using SVM and Multi-Feature Combination , 2006, 2006 IEEE Intelligent Transportation Systems Conference.

[8]  Bernt Schiele,et al.  Towards Robust Pedestrian Detection in Crowded Image Sequences , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Alain Rakotomamonjy,et al.  A Pedestrian Detector Using Histograms of Oriented Gradients and a Support Vector Machine Classifier , 2007, 2007 IEEE Intelligent Transportation Systems Conference.

[10]  Luc Van Gool,et al.  Depth and Appearance for Mobile Scene Analysis , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[11]  R. Nevatia,et al.  Simultaneous Object Detection and Segmentation by Boosting Local Shape Feature based Classifier , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  W. Ritter,et al.  Detection and Tracking of Multiple Pedestrians in Automotive Applications , 2007, 2007 IEEE Intelligent Vehicles Symposium.

[13]  A. Howard,et al.  Detecting Pedestrians with Stereo Vision: Safe Operation of Autonomous Ground Vehicles in Dynamic Environments , 2007 .

[14]  Luc Van Gool,et al.  Dynamic 3D Scene Analysis from a Moving Vehicle , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  S. Muller-Schneiders,et al.  A Real Time Object Detection Approach Applied to Reliable Pedestrian Detection , 2007, 2007 IEEE Intelligent Vehicles Symposium.

[16]  Larry S. Davis,et al.  Real-Time Human Detection and Tracking from Mobile Vehicles , 2007, 2007 IEEE Intelligent Transportation Systems Conference.

[17]  Greg Mori,et al.  Detecting Pedestrians by Learning Shapelet Features , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Fatih Murat Porikli,et al.  Human Detection via Classification on Riemannian Manifolds , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  S. Nedevschi,et al.  Pedestrian Detection and Classification Based on 2D and 3D Information For Driving Assistance Systems , 2007, 2007 IEEE International Conference on Intelligent Computer Communication and Processing.

[20]  Andrew Howard,et al.  Real-time stereo visual odometry for autonomous ground vehicles , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[21]  Christoph Mertz,et al.  LADAR-based Pedestrian Detection and Tracking , 2008 .

[22]  Max Bajracharya,et al.  Detecting personnel around UGVs using stereo vision , 2008, SPIE Defense + Commercial Sensing.

[23]  Barry A. Bodt A Formal Experiment to Assess Pedestrian Detection and Tracking Technology for Unmanned Ground Systems , 2008 .

[24]  Luc Van Gool,et al.  A mobile vision system for robust multi-person tracking , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Ramakant Nevatia,et al.  Segmentation and Tracking of Multiple Humans in Crowded Environments , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Multi-sensor Detection and Tracking of Humans for Safe Operations with Unmanned Ground Vehicles , 2008 .