Layered Representation for Pedestrian Detection and Tracking in Infrared Imagery

This paper introduces a layered representation for infrared imagery and studies its application into pedestrian detection and tracking. We present a generalized EM algorithm to decompose infrared images into background and foreground layers and study the phenomenon of polarity switch. We propose a hybrid (shape+appearance) algorithm for pedestrian detection, in which shape cue is first used to eliminate non-pedestrian moving objects and appearance cue is then used to pin down the location of pedestrians. We also formulate the problem of shot segmentation and present a graph matching-based pedestrian tracking algorithm. Experimental results with OSU Thermal Pedestrian Database are reported to demonstrate the excellent performance of our algorithms.

[1]  Noboru Yasuda,et al.  Pedestrian Detection and Tracking in Far Infrared Images , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[2]  Jean Ponce,et al.  Computer Vision: A Modern Approach , 2002 .

[3]  Xia Liu,et al.  Pedestrian detection and tracking with night vision , 2005, IEEE Transactions on Intelligent Transportation Systems.

[4]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[5]  Ali Shokoufandeh,et al.  Graph-Theoretical Methods in Computer Vision , 2000, Theoretical Aspects of Computer Science.

[6]  William Rucklidge,et al.  Efficiently Locating Objects Using the Hausdorff Distance , 1997, International Journal of Computer Vision.

[7]  James W. Davis,et al.  A Two-Stage Template Approach to Person Detection in Thermal Imagery , 2005, 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05) - Volume 1.

[8]  Swarup Medasani,et al.  Classifier Swarms for Human Detection in Infrared Imagery , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[9]  Paul A. Viola,et al.  Detecting Pedestrians Using Patterns of Motion and Appearance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[10]  Larry S. Davis,et al.  Probabilistic template based pedestrian detection in infrared videos , 2002, Intelligent Vehicle Symposium, 2002. IEEE.

[11]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[12]  Edward H. Adelson,et al.  A multiresolution spline with application to image mosaics , 1983, TOGS.

[13]  Harpreet S. Sawhney,et al.  Layered representation of motion video using robust maximum-likelihood estimation of mixture models and MDL encoding , 1995, Proceedings of IEEE International Conference on Computer Vision.

[14]  Hai Tao,et al.  Object Tracking with Bayesian Estimation of Dynamic Layer Representations , 2002, IEEE Trans. Pattern Anal. Mach. Intell..