Accurate coverage summarization of UAV videos

A predominant fraction of UAV videos are never watched or analyzed and there is growing interest in having a summary view of the UAV videos for obtaining a better overall perspective of the visual content. Real time summarization of the UAV video events is also important from tactical perspective. Our research focuses on developing resilient algorithms for summarizing videos that can be efficiently processed either onboard or offline. Our previous work [2] on the video summarization has focused on the event summarization. More recently, we have investigated the challenges in providing the coverage summarization of the video content from UAV videos. Different from the traditional coverage summarization taking SfM approach (e.g., [7]) on SIFT-based [14] feature points, there are several additional challenges including jitter, low resolution, contrast, lack of salient features in UAV videos. We propose a novel correspondence algorithm that exploits the 3D context that can potentially alleviate the correspondence ambiguity. Our results on VIRAT dataset shows that our algorithm can find many correct correspondences in low resolution imagery while avoiding many false positives from the traditional algorithms.

[1]  Genshe Chen,et al.  Wide-area motion imagery (WAMI) exploitation tools for enhanced situation awareness , 2012, 2012 IEEE Applied Imagery Pattern Recognition Workshop (AIPR).

[2]  P. Anandan,et al.  Mosaic based representations of video sequences and their applications , 1995, Proceedings of IEEE International Conference on Computer Vision.

[3]  Afshin Dehghan,et al.  GMCP-Tracker: Global Multi-object Tracking Using Generalized Minimum Clique Graphs , 2012, ECCV.

[4]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[5]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[6]  Sharath Pankanti,et al.  Efficient UAV video event summarization , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[7]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[8]  Min Chen,et al.  Action-Based Multifield Video Visualization , 2008, IEEE Transactions on Visualization and Computer Graphics.

[9]  Yael Pritch,et al.  Clustered Synopsis of Surveillance Video , 2009, 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance.

[10]  Yael Pritch,et al.  This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008 1 Non-Chronological Video , 2022 .

[11]  Larry S. Davis,et al.  AVSS 2011 demo session: A large-scale benchmark dataset for event recognition in surveillance video , 2011, AVSS.

[12]  Krzysztof Jaskot,et al.  The Prototype of an Unmanned Underwater Vehicle – Mechanical Construction, the Operator Panel , 2014 .

[13]  Kiyoharu Aizawa,et al.  Efficient retrieval of life log based on context and content , 2004, CARPE'04.

[14]  Stan Z. Li,et al.  Online content-aware video condensation , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Zhongjian Li,et al.  Ground Moving Target Tracking Control System Design for UAV Surveillance , 2007, 2007 IEEE International Conference on Automation and Logistics.

[16]  Richard Szeliski,et al.  Building Rome in a day , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[17]  Afshin Dehghan,et al.  Visual business recognition: a multimodal approach , 2013, MM '13.

[18]  Dorin Comaniciu,et al.  Real-time tracking of non-rigid objects using mean shift , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[19]  Haroon Idrees,et al.  Detection and Tracking of Large Number of Targets in Wide Area Surveillance , 2010, ECCV.

[20]  Imad Jawhar,et al.  UAVs for smart cities: Opportunities and challenges , 2014, 2014 International Conference on Unmanned Aircraft Systems (ICUAS).

[21]  David Salesin,et al.  Schematic storyboarding for video visualization and editing , 2006, SIGGRAPH 2006.

[22]  Yunhong Wang,et al.  Visual Saliency Based Aerial Video Summarization by Online Scene Classification , 2011, 2011 Sixth International Conference on Image and Graphics.

[23]  Yael Pritch,et al.  Webcam Synopsis: Peeking Around the World , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[24]  P. Anandan,et al.  About Direct Methods , 1999, Workshop on Vision Algorithms.

[25]  Carlo Tomasi,et al.  Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Gérard G. Medioni,et al.  Detection and tracking of moving objects from a moving platform in presence of strong parallax , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[27]  Witold Czajewski,et al.  Moving Objects Detection and Tracking Framework for UAV-based Surveillance , 2010, 2010 Fourth Pacific-Rim Symposium on Image and Video Technology.

[28]  B. Brisco,et al.  Precision Agriculture and the Role of Remote Sensing: A Review , 1998 .

[29]  Yael Pritch,et al.  Making a Long Video Short: Dynamic Video Synopsis , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[30]  Ralf P. Botchen,et al.  Action-based Multi-field Video Visualization , 2007 .