Low Resolution Person Detection with a Moving Thermal Infrared Camera by Hot Spot Classification

In many visual surveillance applications the task of person detection and localization can be solved easier by using thermal long-wave infrared (LWIR) cameras which are less affected by changing illumination or background texture than visual-optical cameras. Especially in outdoor scenes where usually only few hot spots appear in thermal infrared imagery, humans can be detected more reliably due to their prominent infrared signature. We propose a two-stage person recognition approach for LWIR images: (1) the application of Maximally Stable Extremal Regions (MSER) to detect hot spots instead of background subtraction or sliding window and (2) the verification of the detected hot spots using a Discrete Cosine Transform (DCT) based descriptor and a modified Random Naïve Bayes (RNB) classifier. The main contributions are the novel modified RNB classifier and the generality of our method. We achieve high detection rates for several different LWIR datasets with low resolution videos in real-time. While many papers in this topic are dealing with strong constraints such as considering only one dataset, assuming a stationary camera, or detecting only moving persons, we aim at avoiding such constraints to make our approach applicable with moving platforms such as Unmanned Ground Vehicles (UGV).

[1]  Wei Li,et al.  An effective approach to pedestrian detection in thermal imagery , 2012, 2012 8th International Conference on Natural Computation.

[2]  Jordi Vitrià,et al.  Improving Naive Bayes Using Class-Conditional ICA , 2002, IBERAMIA.

[3]  Kai Ming Ting,et al.  A Study of AdaBoost with Naive Bayesian Classifiers: Weakness and Improvement , 2003, Comput. Intell..

[4]  Xin Li,et al.  Layered Representation for Pedestrian Detection and Tracking in Infrared Imagery , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[5]  Pedro M. Domingos,et al.  On the Optimality of the Simple Bayesian Classifier under Zero-One Loss , 1997, Machine Learning.

[6]  Nizar Bouguila,et al.  A nonparametric Bayesian approach for enhanced pedestrian detection and foreground segmentation , 2011, CVPR 2011 WORKSHOPS.

[7]  James W. Davis,et al.  Background-subtraction using contour-based fusion of thermal and visible imagery , 2007, Comput. Vis. Image Underst..

[8]  James W. Davis,et al.  A Two-Stage Template Approach to Person Detection in Thermal Imagery , 2005, 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05) - Volume 1.

[9]  Yuichi Matsumoto,et al.  Shrink boost for selecting multi-LBP histogram features in object detection , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Dirk Van den Poel,et al.  Random Multiclass Classification: Generalizing Random Forests to Random MNL and Random NB , 2007, DEXA.

[11]  Riad I. Hammoud,et al.  Thermal-Visible Video Fusion for Moving Target Tracking and Pedestrian Classification , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Dragoljub Pokrajac,et al.  People detection in low resolution infrared videos , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[13]  George Bebis,et al.  Robust Video-Based Surveillance by Integrating Target Detection with Tracking , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[14]  Michael Teutsch,et al.  Hot spot detection and classification in LWIR videos for person recognition , 2013, Defense, Security, and Sensing.

[15]  Pietro Perona,et al.  Integral Channel Features , 2009, BMVC.

[16]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[17]  Qianqing Qin,et al.  Robust multi-stage approach for the detection of moving target from infrared imagery , 2012 .

[18]  Michael Arens,et al.  Feature based person detection beyond the visible spectrum , 2009, 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[19]  Takashi Naito,et al.  Pedestrian Recognition Using Second-Order HOG Feature , 2009, ACCV.

[20]  Seungjin Choi,et al.  Independent Component Analysis , 2009, Handbook of Natural Computing.

[21]  Dariu Gavrila,et al.  An Experimental Study on Pedestrian Classification , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Rainer Stiefelhagen,et al.  Local appearance based face recognition using discrete cosine transform , 2005, 2005 13th European Signal Processing Conference.

[23]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[24]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[25]  Kim-Leng Poh,et al.  A Comparative Study of PCA, ICA and Class-Conditional ICA for Naïve Bayes Classifier , 2007, IWANN.

[26]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[27]  Satoshi Ito,et al.  Co-occurrence Histograms of Oriented Gradients for Pedestrian Detection , 2009, PSIVT.

[28]  Ramakant Nevatia,et al.  Pedestrian Detection in Infrared Images based on Local Shape Features , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  Baojun Zhao,et al.  Variational-Based Contour Tracking in Infrared Imagery , 2009, 2009 2nd International Congress on Image and Signal Processing.

[30]  Horst Bischof,et al.  On-Line Random Naive Bayes for Tracking , 2010, 2010 20th International Conference on Pattern Recognition.

[31]  Jiri Matas,et al.  Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..

[32]  David Nistér,et al.  Linear Time Maximally Stable Extremal Regions , 2008, ECCV.

[33]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[34]  Toby P. Breckon,et al.  Real-time people and vehicle detection from UAV imagery , 2011, Electronic Imaging.

[35]  Yupin Luo,et al.  Real-Time Pedestrian Detection and Tracking at Nighttime for Driver-Assistance Systems , 2009, IEEE Transactions on Intelligent Transportation Systems.

[36]  Jiri Matas,et al.  Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..