A Framework for Evaluating Stereo-Based Pedestrian Detection Techniques

Automated pedestrian detection, counting, and tracking have received significant attention in the computer vision community of late. As such, a variety of techniques have been investigated using both traditional 2-D computer vision techniques and, more recently, 3-D stereo information. However, to date, a quantitative assessment of the performance of stereo-based pedestrian detection has been problematic, mainly due to the lack of standard stereo-based test data and an agreed methodology for carrying out the evaluation. This has forced researchers into making subjective comparisons between competing approaches. In this paper, we propose a framework for the quantitative evaluation of a short-baseline stereo-based pedestrian detection system. We provide freely available synthetic and real-world test data and recommend a set of evaluation metrics. This allows researchers to benchmark systems, not only with respect to other stereo-based approaches, but also with more traditional 2-D approaches. In order to illustrate its usefulness, we demonstrate the application of this framework to evaluate our own recently proposed technique for pedestrian detection and tracking.

[1]  Alan F. Smeaton,et al.  Pedestrian detection in uncontrolled environments using stereo and biometric information , 2006, VSSN '06.

[2]  Richard Szeliski,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, International Journal of Computer Vision.

[3]  Michael Harville,et al.  Stereo person tracking with adaptive plan-view templates of height and occupancy statistics , 2004, Image Vis. Comput..

[4]  Dariu Gavrila,et al.  Real-time object detection for "smart" vehicles , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[5]  A. Senior,et al.  Performance Evaluation of Surveillance Systems Under Varying Conditions , 2004 .

[6]  Bernt Schiele,et al.  Pedestrian detection in crowded scenes , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7]  Jorge Batista,et al.  Tracking Pedestrians Under Occlusion Using Multiple Cameras , 2004, ICIAR.

[8]  James W. Davis,et al.  Tracking mean shift clustered point clouds for 3D surveillance , 2006, VSSN '06.

[9]  Jin Hyeong Park,et al.  Performance evaluation of object detection algorithms , 2002, Object recognition supported by user interaction for service robots.

[10]  Adam Baumberg,et al.  Learning deformable models for tracking human motion , 1996 .

[11]  Massimo Bertozzi,et al.  Stereo Vision-based approaches for Pedestrian Detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[12]  Alan F. Smeaton,et al.  Pedestrian Detection Using Stereo and Biometric Information , 2006, ICIAR.

[13]  Mubarak Shah Tracking people in presence of occlusion , 2000 .

[14]  Liang Zhao,et al.  Stereo- and neural network-based pedestrian detection , 1999, Proceedings 199 IEEE/IEEJ/JSAI International Conference on Intelligent Transportation Systems (Cat. No.99TH8383).

[15]  Mubarak Shah,et al.  A Multiview Approach to Tracking People in Crowded Scenes Using a Planar Homography Constraint , 2006, ECCV.

[16]  Philip Kelly Pedestrian detection and tracking using stereo vision techniques , 2008 .

[17]  Larry S. Davis,et al.  M2Tracker: A Multi-view Approach to Segmenting and Tracking People in a Cluttered Scene Using Region-Based Stereo , 2002, ECCV.

[18]  Yuichi Ohta,et al.  Occlusion detectable stereo-occlusion patterns in camera matrix , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  Michael Harville,et al.  Stereo person tracking with short and long term plan-view appearance models of shape and color , 2005, IEEE Conference on Advanced Video and Signal Based Surveillance, 2005..