Proposal generation method for object detection in infrared image

Abstract In an infrared image, there is a significant difference between the region of the interested object and its surrounding background. Based on this observation, we propose an effective and efficient proposal generation method which uses a Multi-layer and Multi-size Superpixel Segmentation (MMSS) scheme for object detection in the infrared image. The SLIC (Simple Linear Iterative Clustering) algorithm is applied to partition an infrared image into multi-layer and multi-size superpixels. In each layer, only the individual superpixel and the merging of two adjacent superpixels are used to create the candidate pool of object proposals. A superpixel-based center-surround feature is then defined to measure the discrepancy between the region of the proposal and its surrounding background. To evaluate the performance of the MMSS-based method of proposal generation method, we create an Infrared Interested Object Image Dataset (IIOID), in which the infrared images are collected from several benchmarks and the ground-truth of the interested object segmentation is manually labeled. Compared with several state-of-the-art methods of proposal generation on IIOID, the MMSS-based method has overwhelming superiority in detection recall under different Intersection over Union (IoU) thresholds and is convenient for computation. Furthermore, we implement the MMSS-based method as a processing step for pedestrian detection. Experimental results on benchmark infrared pedestrian image dataset show that the detectors with our method of proposal generation method can greatly reduce the number of candidate windows to be detected and also suppress false positives.

[1]  Roland Siegwart,et al.  People detection and tracking from aerial thermal views , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[2]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[3]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[5]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Luc Van Gool,et al.  SEEDS: Superpixels Extracted via Energy-Driven Sampling , 2012, ECCV.

[7]  Rainer Stiefelhagen,et al.  Measuring and evaluating the compactness of superpixels , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[8]  Cristian Sminchisescu,et al.  CPMC: Automatic Object Segmentation Using Constrained Parametric Min-Cuts , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Pietro Perona,et al.  Fast Feature Pyramids for Object Detection , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Keiichi Yamada,et al.  A shape-independent method for pedestrian detection with far-infrared images , 2004, IEEE Transactions on Vehicular Technology.

[11]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[13]  Philip H. S. Torr,et al.  BING: Binarized normed gradients for objectness estimation at 300fps , 2014, Computational Visual Media.

[14]  Rama Chellappa,et al.  Entropy-Rate Clustering: Cluster Analysis via Maximizing a Submodular Function Subject to a Matroid Constraint , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Xia Liu,et al.  Pedestrian detection and tracking with night vision , 2005, IEEE Transactions on Intelligent Transportation Systems.

[16]  Koen E. A. van de Sande,et al.  Selective Search for Object Recognition , 2013, International Journal of Computer Vision.

[18]  Armin B. Cremers,et al.  Efficient Pedestrian Detection via Rectangular Features Based on a Statistical Shape Model , 2015, IEEE Transactions on Intelligent Transportation Systems.

[19]  David Gerónimo Gómez,et al.  Survey of Pedestrian Detection for Advanced Driver Assistance Systems , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Jiang-tao Wang,et al.  On pedestrian detection and tracking in infrared videos , 2012, Pattern Recognit. Lett..

[21]  Daniel P. Huttenlocher,et al.  Efficient Graph-Based Image Segmentation , 2004, International Journal of Computer Vision.

[22]  Santiago Manen,et al.  Prime Object Proposals with Randomized Prim's Algorithm , 2013, 2013 IEEE International Conference on Computer Vision.

[23]  Yupin Luo,et al.  Real-Time Pedestrian Detection and Tracking at Nighttime for Driver-Assistance Systems , 2009, IEEE Transactions on Intelligent Transportation Systems.

[24]  Luc Van Gool,et al.  The Pascal Visual Object Classes Challenge: A Retrospective , 2014, International Journal of Computer Vision.

[25]  Margrit Betke,et al.  A Thermal Infrared Video Benchmark for Visual Analysis , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[26]  Thomas Deselaers,et al.  Measuring the Objectness of Image Windows , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.