Real-Time Selfie Video Stabilization

We propose a novel real-time selfie video stabilization method. Our method is completely automatic and runs at 26 fps. We use a 1D linear convolutional network to directly infer the rigid moving least squares warping which implicitly balances between the global rigidity and local flexibility. Our network structure is specifically designed to stabilize the background and foreground at the same time, while providing optional control of stabilization focus (relative importance of foreground vs. background) to the users. To train our network, we collect a selfie video dataset with 1005 videos, which is significantly larger than previous selfie video datasets. We also propose a grid approximation to the rigid moving least squares that enables the real-time frame warping. Our method is fully automatic and produces visually and quantitatively better results than previous real-time general video stabilization methods. Compared to previous offline selfie video methods, our approach produces comparable quality with a speed improvement of orders of magnitude. Our code and selfie video dataset is available at https://github.com/jiy173/selfievideostabilization.

[1]  Maja Pantic,et al.  MobiFace: A Novel Dataset for Mobile Face Tracking in the Wild , 2018, 2019 14th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2019).

[2]  Jian Sun,et al.  MeshFlow: Minimum Latency Online Video Stabilization , 2016, ECCV.

[3]  Michael Gleicher,et al.  Content-preserving warps for 3D video stabilization , 2009, ACM Trans. Graph..

[4]  Jiyang Yu,et al.  Robust Video Stabilization by Optimization in CNN Weight Space , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Carlo Tomasi,et al.  Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Jian Sun,et al.  Bundled camera paths for video stabilization , 2013, ACM Trans. Graph..

[7]  Shi-Min Hu,et al.  Deep Online Video Stabilization With Multi-Grid Warping Transformation Learning , 2019, IEEE Transactions on Image Processing.

[8]  Michael Gleicher,et al.  Subspace video stabilization , 2011, TOGS.

[9]  John K. Tsotsos,et al.  Fast Visual Object Tracking with Rotated Bounding Boxes , 2019, ArXiv.

[10]  Apostol Natsev,et al.  YouTube-8M: A Large-Scale Video Classification Benchmark , 2016, ArXiv.

[11]  Jiyang Yu,et al.  Selfie Video Stabilization , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Jun Hu,et al.  Deep Video Stabilization Using Adversarial Networks , 2018, Comput. Graph. Forum.

[13]  Jian Sun,et al.  SteadyFlow: Spatially Smooth Optical Flow for Video Stabilization , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Davis E. King,et al.  Dlib-ml: A Machine Learning Toolkit , 2009, J. Mach. Learn. Res..

[15]  Xiaoming Liu,et al.  Face Alignment in Full Pose Range: A 3D Total Solution , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Andrea Vedaldi,et al.  Deep Image Prior , 2017, International Journal of Computer Vision.

[17]  Mingyu Li,et al.  CodingFlow: Enable Video Coding for Video Stabilization , 2017, IEEE Transactions on Image Processing.

[18]  Youn-Long Lin,et al.  HarDNet: A Low Memory Traffic Network , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[19]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Irfan A. Essa,et al.  Auto-directed video stabilization with robust L1 optimal camera paths , 2011, CVPR 2011.

[21]  Feng Liu,et al.  Spatially and Temporally Optimized Video Stabilization , 2013, IEEE Transactions on Visualization and Computer Graphics.

[22]  In So Kweon,et al.  Deep Iterative Frame Interpolation for Full-frame Video Stabilization , 2019, ACM Trans. Graph..

[23]  Ian D. Reid,et al.  Light-Weight RefineNet for Real-Time Semantic Segmentation , 2018, BMVC.

[24]  Chia-Kai Liang,et al.  Steadiface: Real-Time Face-Centric Stabilization On Mobile Phones , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[25]  Raanan Fattal,et al.  Video stabilization using epipolar geometry , 2012, TOGS.

[26]  Scott Schaefer,et al.  Image deformation using moving least squares , 2006, ACM Trans. Graph..

[27]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.