Vehicle detection in wide area aerial surveillance using Temporal Context

Moving vehicle detection from wide area aerial surveillance is an important and challenging task, which can be aided by context information. In this paper, we present a Temporal Context(TC) which can capture the road information. In contrast with previous methods to exploit road information, TC does not need to get the location of the road first or to use the Geographical Information System's (GIS) information. We first use background subtraction to generate the candidates, then build TC based on the candidates that have been classified as positive by Histograms of Oriented Gradient(HOG) with Multiple Kernel Learning(MKL). For each positive candidate, a region around the candidate is divided into several subregions based on the direction of the candidate, then each subregion is divided into 12 bins with a fixed length; and finally the TC, a histogram, is built according to the positions of the positive candidates in 8 consecutive frames. In order to benefit from both the appearance and context information, we use MKL to combine TC and HOG. To evaluate the effect of TC, we use the publicly available CLIF 2006 dataset, and label the vehicles in 102 frames which are 2672 × 1200 subregions that contain expressway of the original 2672 × 4008 images. The experiments demonstrate that the proposed TC is useful to remove the false positives that are away from the road, and the combination of TC and HOG with MKL outperforms the use of TC or HOG only.

[1]  Shaogang Gong,et al.  Quantifying and Transferring Contextual Information in Object Detection , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Gert R. G. Lanckriet,et al.  Multi-class object localization by combining local contextual interactions , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[3]  Guna Seetharaman,et al.  Flux Tensor Constrained Geodesic Active Contours with Sensor Fusion for Persistent Object Tracking , 2007, J. Multim..

[4]  Heesoo Myeong,et al.  Learning object relationships via graph-based context model , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[6]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7]  Li Bai,et al.  Multiple Kernel Learning for vehicle detection in wide area motion imagery , 2012, 2012 15th International Conference on Information Fusion.

[8]  Manik Varma,et al.  Learning The Discriminative Power-Invariance Trade-Off , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[9]  Alexei A. Efros,et al.  An empirical study of context in object detection , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Daphne Koller,et al.  Learning Spatial Context: Using Stuff to Find Things , 2008, ECCV.

[11]  Guna Seetharaman,et al.  Persistent target tracking using likelihood fusion in wide-area and full motion video sequences , 2012, 2012 15th International Conference on Information Fusion.

[12]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[13]  Ramakant Nevatia,et al.  Car detection in low resolution aerial image , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[14]  Jian Dong,et al.  Contextualizing Object Detection and Classification , 2015, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Li Bai,et al.  Evaluation of visual tracking in extremely low frame rate wide area motion imagery , 2011, 14th International Conference on Information Fusion.

[16]  S. V. N. Vishwanathan,et al.  SPF-GMKL: generalized multiple kernel learning with a million kernels , 2012, KDD.

[17]  Ramakant Nevatia,et al.  Car detection in low resolution aerial images , 2003, Image Vis. Comput..

[18]  David G. Lowe,et al.  Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration , 2009, VISAPP.

[19]  Andrea Vedaldi,et al.  Objects in Context , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[20]  Erik Blasch,et al.  Context-driven moving vehicle detection in wide area motion imagery , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[21]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Ethem Alpaydin,et al.  Multiple Kernel Learning Algorithms , 2011, J. Mach. Learn. Res..

[23]  Erik Blasch,et al.  Using maximum consistency context for multiple target association in wide area traffic scenes , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[24]  Harpreet S. Sawhney,et al.  Vehicle detection and tracking in wide field-of-view aerial video , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[25]  Gérard G. Medioni,et al.  Inferring tracklets for multi-object tracking , 2011, CVPR 2011 WORKSHOPS.

[26]  Jing Xiao,et al.  Contextual boost for pedestrian detection , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Guna Seetharaman,et al.  Velocity vectors for features of sequential oceanographic images , 1998, IEEE Trans. Geosci. Remote. Sens..

[28]  Genshe Chen,et al.  Wide-area motion imagery (WAMI) exploitation tools for enhanced situation awareness , 2012, 2012 IEEE Applied Imagery Pattern Recognition Workshop (AIPR).

[29]  Manik Varma,et al.  More generality in efficient multiple kernel learning , 2009, ICML '09.

[30]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[31]  Guna Seetharaman,et al.  Efficient feature extraction and likelihood fusion for vehicle tracking in low frame rate airborne video , 2010, 2010 13th International Conference on Information Fusion.

[32]  Jake Porway,et al.  A hierarchical and contextual model for aerial image understanding , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[33]  Haroon Idrees,et al.  Detection and Tracking of Large Number of Targets in Wide Area Surveillance , 2010, ECCV.

[34]  Gang Hua,et al.  Context aware topic model for scene recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[35]  Larry S. Davis,et al.  Learning What and How of Contextual Models for Scene Labeling , 2010, ECCV.