Robust and fast detection of moving vehicles in aerial videos using sliding windows

The detection of vehicles driving on busy urban streets in videos acquired by airborne cameras is challenging due to the large distance between camera and vehicles, simultaneous vehicle and camera motion, shadows, or low contrast due to weak illumination. However, it is an important processing step for applications such as automatic traffic monitoring, detection of abnormal behaviour, border protection, or surveillance of restricted areas. In contrast to commonly applied object segmentation methods based on background subtraction or frame differencing, we detect moving vehicles using the combination of a track-before-detect (TBD) approach and machine learning: an AdaBoost classifier learns the appearance of vehicles in low resolution and is applied within a sliding window algorithm to detect vehicles inside a region of interest determined by the TBD approach. Our main contribution lies in the identification, optimization, and evaluation of the most important parameters to achieve both high detection rates and real-time processing.

[1]  Hsu-Yung Cheng,et al.  Vehicle Detection in Aerial Surveillance Using Dynamic Bayesian Networks , 2012, IEEE Transactions on Image Processing.

[2]  Haroon Idrees,et al.  Detection and Tracking of Large Number of Targets in Wide Area Surveillance , 2010, ECCV.

[3]  Michael Teutsch,et al.  Evaluation of object segmentation to improve moving vehicle detection in aerial videos , 2014, 2014 11th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[4]  Yong Wang,et al.  A Novel Vehicle Detection Method With High Resolution Highway Aerial Image , 2013, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[5]  Tomaso A. Poggio,et al.  A Trainable System for Object Detection , 2000, International Journal of Computer Vision.

[6]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[7]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[8]  Mohak Shah,et al.  Evaluating Learning Algorithms: A Classification Perspective , 2011 .

[9]  Bernt Schiele,et al.  Robust Object Detection with Interleaved Categorization and Segmentation , 2008, International Journal of Computer Vision.

[10]  Yichen Wei,et al.  Efficient histogram-based sliding window , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[11]  Toby P. Breckon,et al.  Real-time people and vehicle detection from UAV imagery , 2011, Electronic Imaging.

[12]  Samuel J. Davey,et al.  A Comparison of Detection Performance for Several Track-before-Detect Algorithms , 2008, 2008 11th International Conference on Information Fusion.

[13]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[14]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[15]  Paul A. Viola,et al.  Multiple-Instance Pruning For Learning Efficient Cascade Detectors , 2007, NIPS.

[16]  Jing Zhang,et al.  Framework for Performance Evaluation of Face, Text, and Vehicle Detection and Tracking in Video: Data, Metrics, and Protocol , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Yuichi Matsumoto,et al.  Shrink boost for selecting multi-LBP histogram features in object detection , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  C. Lawrence Zitnick,et al.  Edge Boxes: Locating Object Proposals from Edges , 2014, ECCV.

[19]  Uwe Stilla,et al.  Airborne Vehicle Detection in Dense Urban Areas Using HoG Features and Disparity Maps , 2013, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[20]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[21]  Xuelong Li,et al.  Vehicle Detection and Motion Analysis in Low-Altitude Airborne Video Under Urban Environment , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Mubarak Shah,et al.  Multiframe Many–Many Point Correspondence for Vehicle Tracking in High Density Wide Area Aerial Videos , 2013, International Journal of Computer Vision.

[23]  Pietro Perona,et al.  The Fastest Pedestrian Detector in the West , 2010, BMVC.

[24]  Harpreet S. Sawhney,et al.  Vehicle detection and tracking in wide field-of-view aerial video , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[25]  Jiejie Zhu,et al.  Pedestrian Detection in Low-Resolution Imagery by Learning Multi-scale Intrinsic Motion Structures (MIMS) , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Uwe Franke,et al.  6D-Vision: Fusion of Stereo and Motion for Robust Environment Perception , 2005, DAGM-Symposium.

[27]  Luc Van Gool,et al.  Pedestrian detection at 100 frames per second , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Horst Bischof,et al.  On-line Boosting for Car Detection from Aerial Images , 2007, 2007 IEEE International Conference on Research, Innovation and Vision for the Future.

[29]  Pietro Perona,et al.  Integral Channel Features , 2009, BMVC.

[30]  Rita Cucchiara,et al.  Multistage Particle Windows for Fast and Accurate Object Detection , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Mennatullah Siam,et al.  Robust autonomous visual detection and tracking of moving targets in UAV imagery , 2012, 2012 IEEE 11th International Conference on Signal Processing.

[32]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Philip H. S. Torr,et al.  BING: Binarized normed gradients for objectness estimation at 300fps , 2014, Computational Visual Media.

[34]  Carlo Tomasi,et al.  Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.