Spatial pyramid context-aware moving vehicle detection and tracking in urban aerial imagery

Persistent detection and tracking of moving vehicles in airborne imagery provide indispensable information for many traffic surveillance applications including traffic monitoring and management, navigation systems, activity recognition and event detection. This paper presents a collaborative Spatial Pyramid Context-aware detection and Tracking system (SPCT) for moving vehicles in dense urban aerial imagery. The proposed system is composed of one master tracker that usually relies on visual object features and two auxiliary trackers based on object temporal motion information that will be called dynamically to assist master tracker. SPCT utilizes image spatial context at different level to make the video tracking system resistant to occlusion, background noise and improve target localization accuracy. We chose a pre-selected seven-channel complementary features including RGB color, intensity and spatial pyramid of HoG (PHoG) and exploit integral histogram as building block to meet the demands of real-time performance. The extensive experiments on ARGUS and ABQ wide aerial video and comparison with state-of-the-art single object trackers confirm that combining complementary tracking cues in an intelligent fusion framework is essential to address the challenges of persistent tracking in low frame rate Wide Aerial Motion Imagery (WAMI).

[1]  Guna Seetharaman,et al.  Efficient GPU Implementation of the Integral Histogram , 2012, ACCV Workshops.

[2]  Yanxi Liu,et al.  Online Selection of Discriminative Tracking Features , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[4]  Luca Bertinetto,et al.  Staple: Complementary Learners for Real-Time Tracking , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Fatih Murat Porikli,et al.  Integral histogram: a fast way to extract histograms in Cartesian spaces , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[6]  Senem Velipasalar,et al.  Autonomous altitude measurement and landing area detection for indoor UAV applications , 2016, 2016 13th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[7]  Supakorn Siddhichai,et al.  A gradient-based foreground detection technique for object tracking in a traffic monitoring system , 2005, IEEE Conference on Advanced Video and Signal Based Surveillance, 2005..

[8]  Jiri Matas,et al.  Forward-Backward Error: Automatic Detection of Tracking Failures , 2010, 2010 20th International Conference on Pattern Recognition.

[9]  Ognjen Arandjelovic,et al.  Automatic vehicle tracking and recognition from aerial image sequences , 2015, 2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[10]  Bruce A. Draper,et al.  Visual object tracking using adaptive correlation filters , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[11]  Gerard Medioni,et al.  Motion propagation detection association for multi-target tracking in wide area aerial surveillance , 2015, 2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[12]  G. Seetharaman,et al.  Wide-Area Persistent Airborne Video: Architecture and Challenges , 2011 .

[13]  Tahir Nawaz,et al.  Tracking performance evaluation on PETS 2015 Challenge datasets , 2015, 2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[14]  Guna Seetharaman,et al.  Fast Structure from Motion for Sequential and Wide Area Motion Imagery , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[15]  Guilherme N. DeSouza,et al.  Vinobot and Vinoculer: Two Robotic Platforms for High-Throughput Field Phenotyping , 2017, Sensors.

[16]  Guna Seetharaman,et al.  Efficient feature extraction and likelihood fusion for vehicle tracking in low frame rate airborne video , 2010, 2010 13th International Conference on Information Fusion.

[17]  Aykut Erdem,et al.  Deformable part-based tracking by coupled global and local correlation filters , 2016, J. Vis. Commun. Image Represent..

[18]  Ehud Rivlin,et al.  Robust Fragments-based Tracking using the Integral Histogram , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[19]  Erik Blasch,et al.  Moving object detection for vehicle tracking in Wide Area Motion Imagery using 4D filtering , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[20]  Guna Seetharaman,et al.  Persistent target tracking using likelihood fusion in wide-area and full motion video sequences , 2012, 2012 15th International Conference on Information Fusion.

[21]  Ming-Hsuan Yang,et al.  Robust Object Tracking with Online Multiple Instance Learning , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Rui Caseiro,et al.  High-Speed Tracking with Kernelized Correlation Filters , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Jake K. Aggarwal,et al.  Robust Vehicle Detection for Tracking in Highway Surveillance Videos Using Unsupervised Learning , 2009, 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance.

[24]  Zhenyu He,et al.  The Visual Object Tracking VOT2016 Challenge Results , 2016, ECCV Workshops.

[25]  John A. Antoniades,et al.  Autonomous real-time ground ubiquitous surveillance-imaging system (ARGUS-IS) , 2008, SPIE Defense + Commercial Sensing.

[26]  François Brémond,et al.  A hybrid framework for online recognition of activities of daily living in real-world settings , 2016, 2016 13th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[27]  Guna Seetharaman,et al.  Feature selection for appearance-based vehicle tracking in geospatial video , 2013, Defense, Security, and Sensing.

[28]  Guna Seetharaman,et al.  Semantic Depth Map Fusion for Moving Vehicle Detection in Aerial Video , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[29]  Jianke Zhu,et al.  A Scale Adaptive Kernel Correlation Filter Tracker with Feature Integration , 2014, ECCV Workshops.

[30]  Andrew Zisserman,et al.  Representing shape with a spatial pyramid kernel , 2007, CIVR '07.