Appearance-based Tracking of Persons with an Omnidirectional Vision Sensor

This paper addresses the problem of tracking a moving person with a single omnidirectional camera. It describes an appearance-based tracking system that uses a self-acquired appearance model and a Kalman filter to estimate the person's position. Features corresponding to "depth cues" are first extracted from the panoramic images, and an artificial neural network is then trained to estimate the person's distance from the camera. The resulting estimates are combined using a discrete Kalman filter to track the person's position over time. The ground truth required for training the neural network and for the experimental analysis was obtained from a second vision system, which uses multiple webcams and triangulation to calculate the person's true position. Experimental results show that the tracking system is accurate and reliable, and that its performance can be further improved by learning multiple person-specific appearance models.
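
To illustrate how noisy per-frame distance estimates can be smoothed by a discrete Kalman filter, the sketch below filters a one-dimensional distance signal. It is a minimal example under stated assumptions, not the system described above: the abstract gives neither the state vector nor the noise parameters, so the random-walk motion model, the class name ScalarKalman, the helper track, and the values of q and r are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass


@dataclass
class ScalarKalman:
    """One-dimensional discrete Kalman filter (hypothetical illustration)."""
    x: float  # current state estimate, e.g. distance to the person in metres
    p: float  # variance of the current estimate
    q: float  # process noise variance (assumed value, not from the paper)
    r: float  # measurement noise variance (assumed value, not from the paper)

    def step(self, z: float) -> float:
        # Predict: under a random-walk model the state is unchanged,
        # but the uncertainty grows by the process noise.
        p_pred = self.p + self.q
        # Update: blend the prediction with the measurement z
        # according to the Kalman gain k.
        k = p_pred / (p_pred + self.r)
        self.x = self.x + k * (z - self.x)
        self.p = (1.0 - k) * p_pred
        return self.x


def track(measurements, x0, p0=1.0, q=0.01, r=0.25):
    """Filter a sequence of distance measurements and return the estimates."""
    kf = ScalarKalman(x=x0, p=p0, q=q, r=r)
    return [kf.step(z) for z in measurements]


if __name__ == "__main__":
    # Stand-in for per-frame distance estimates from a learned regressor.
    noisy = [2.1, 1.9, 2.3, 2.0, 1.8, 2.2]
    print(track(noisy, x0=2.0))
```

A full tracker would typically carry a multi-dimensional state (for example, position and velocity in the plane), which replaces the scalar variances with covariance matrices while keeping the same predict and update steps.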
