Persistent 3D stabilization for aerial imagery

In this paper, we present a novel 3D stabilization method for aerial imagery. While most existing aerial surveillance applications rely on 2D stabilization, which generates parallax errors in urban scenes, we aim to compensate camera motion perfectly as if the scene had been captured by a static camera. It is not only a better way to visualize events but also a more reliable inputs for higher level surveillance tasks. We tackle this problem by proposing a novel approach which generates accurate long-term dense mapping and handles occlusion robustly. Accurate dense mapping from a given frame to the reference frame is achieved in two steps: 1. Identify and compensate outlier flow from independent moving objects. 2. Accumulate the compensated mapping in two paths to avoid drifting. To handle occlusion regions, we formulate occlusion segmentation as an optimization problem considering both mapping relation and smoothness. Seamless rendering is achieved by gradient guidance of a dynamic background model. We evaluate our method on real-world aerial sequences with more than 100 frames with strong parallax. Our method outperforms state-of-the-art 2D and 3D stabilization approaches both qualitatively and quantitatively. Moreover, we demonstrate that when using our stabilization result as an input, the number of false detection significantly reduces in an independent moving object detection method, which is widely applied to wide area aerial surveillance applications.

[1]  Haroon Idrees,et al.  Detection and Tracking of Large Number of Targets in Wide Area Surveillance , 2010, ECCV.

[2]  Janusz Konrad,et al.  Geometry-based estimation of occlusions from video frame pairs , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[3]  Steven G. Johnson,et al.  The Design and Implementation of FFTW3 , 2005, Proceedings of the IEEE.

[4]  Michael Gleicher,et al.  Content-preserving warps for 3D video stabilization , 2009, ACM Trans. Graph..

[5]  Harpreet S. Sawhney,et al.  Vehicle detection and tracking in wide field-of-view aerial video , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Michael Gleicher,et al.  Content-preserving warps for 3D video stabilization , 2009, ACM Trans. Graph..

[7]  Horst Bischof,et al.  A Duality Based Approach for Realtime TV-L1 Optical Flow , 2007, DAGM-Symposium.

[8]  Raanan Fattal,et al.  Video stabilization using epipolar geometry , 2012, TOGS.

[9]  Jian Sun,et al.  SteadyFlow: Spatially Smooth Optical Flow for Video Stabilization , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Changchang Wu,et al.  SiftGPU : A GPU Implementation of Scale Invariant Feature Transform (SIFT) , 2007 .

[11]  Rama Chellappa,et al.  Evaluation of image stabilization algorithms , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[12]  Patrick Pérez,et al.  Poisson image editing , 2003, ACM Trans. Graph..

[13]  Gerard Medioni,et al.  Motion propagation detection association for multi-target tracking in wide area aerial surveillance , 2015, 2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[14]  Anthony Hoogs,et al.  Real-time multi-target tracking at 210 megapixels/second in Wide Area Motion Imagery , 2014, IEEE Winter Conference on Applications of Computer Vision.

[15]  Gérard G. Medioni,et al.  Persistent Tracking for Wide Area Aerial Surveillance , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Jongmoo Choi,et al.  Aerial Implicit 3D Video Stabilization Using Epipolar Geometry Constraint , 2014, 2014 22nd International Conference on Pattern Recognition.

[17]  Gérard G. Medioni,et al.  Using 3D scene structure to improve tracking , 2011, CVPR 2011.

[18]  Jack Dongarra,et al.  Special Issue on Program Generation, Optimization, and Platform Adaptation , 2005, Proc. IEEE.

[19]  P. Anandan,et al.  Hierarchical Model-Based Motion Estimation , 1992, ECCV.

[20]  Zihan Zhou,et al.  Plane-Based Content Preserving Warps for Video Stabilization , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Jongmoo Choi,et al.  Activity recognition in wide aerial video surveillance using entity relationship models , 2012, SIGSPATIAL/GIS.

[22]  Gérard G. Medioni,et al.  3-D Mediated Detection and Tracking in Wide Area Aerial Surveillance , 2015, 2015 IEEE Winter Conference on Applications of Computer Vision.

[23]  Jian Sun,et al.  Bundled camera paths for video stabilization , 2013, ACM Trans. Graph..

[24]  Mubarak Shah,et al.  Multiframe Many–Many Point Correspondence for Vehicle Tracking in High Density Wide Area Aerial Videos , 2013, International Journal of Computer Vision.

[25]  Song-Chun Zhu,et al.  Joint inference of groups, events and human roles in aerial videos , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).