SkyStitch: A Cooperative Multi-UAV-based Real-time Video Surveillance System with Stitching

Recent advances in unmanned aerial vehicle (UAV) technologies have made it possible to deploy an aerial video surveillance system to provide an unprecedented aerial perspective for ground monitoring in real time. Multiple UAVs would be required to cover a large target area, and it is difficult for users to visualize the overall situation if they were to receive multiple disjoint video streams. To address this problem, we designed and implemented SkyStitch, a multiple-UAV video surveillance system that provides a single and panoramic video stream to its users by stitching together multiple aerial video streams. SkyStitch addresses two key design challenges: (i) the high computational cost of stitching and (ii) the difficulty of ensuring good stitching quality under dynamic conditions. To improve the speed and quality of video stitching, we incorporate several practical techniques like distributed feature extraction to reduce workload at the ground station, the use of hints from the flight controller to improve stitching efficiency and a Kalman filter-based state estimation model to mitigate jerkiness. Our results show that SkyStitch can achieve a stitching rate that is 4 times faster than existing state-of-the-art methods and also improve perceptual stitching quality. We also show that SkyStitch can be easily implemented using commercial off-the-shelf hardware.

[1]  Richard Szeliski,et al.  Seamless Image Stitching of Scenes with Large Motions and Exposure Differences , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[2]  Ayman Kaheel,et al.  Fast stitching of videos captured from freely moving devices by exploiting temporal redundancy , 2010, 2010 IEEE International Conference on Image Processing.

[3]  Wei Xu,et al.  Panoramic video stitching from commodity HDTV cameras , 2013, Multimedia Systems.

[4]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[5]  Paul A. Beardsley,et al.  Design and control of a spherical omnidirectional blimp , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6]  N. Trawny,et al.  Indirect Kalman Filter for 3 D Attitude Estimation , 2005 .

[7]  Bernhard Rinner,et al.  Incremental Mosaicking of Images from Autonomous, Small-Scale UAVs , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[8]  Bernhard Rinner,et al.  Prioritized data transmission in airborne camera networks for wide area surveillance and image mosaicking , 2011, CVPR 2011 WORKSHOPS.

[9]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[10]  Matthew A. Brown,et al.  Automatic Panoramic Image Stitching using Invariant Features , 2007, International Journal of Computer Vision.

[11]  Christian Früh,et al.  Google Street View: Capturing the World at Street Level , 2010, Computer.

[12]  Stefan Roth,et al.  Real-time Stereo-Image Stitching using GPU-based Belief Propagation , 2009, VMV.

[13]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[14]  Koen E. A. van de Sande,et al.  A comparison of color features for visual concept classification , 2008, CIVR '08.

[15]  Jason de Villiers Real-time photogrammetric stitching of high resolution video on COTS hardware , 2009, 2009 International Symposium on Optomechatronic Technologies.

[16]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[17]  Yan Lu,et al.  Near Real-Time Dissemination of Geo-Referenced Imagery by an Enterprise Server , 2006 .

[18]  Gang Hua,et al.  Discriminant Embedding for Local Image Descriptors , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[19]  E. Malis,et al.  Deeper understanding of the homography decomposition for vision-based control , 2007 .

[20]  Pål Halvorsen,et al.  Efficient implementation and processing of a real-time panorama video pipeline with emphasis on color correction , 2013 .

[21]  Ayman Kaheel,et al.  Mobicast: a system for collaborative event casting using mobile phones , 2009, MUM.

[22]  Christopher S. Madden,et al.  Measuring Latency for Video Surveillance Systems , 2009, 2009 Digital Image Computing: Techniques and Applications.

[23]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.