Learning Structure-And-Motion-Aware Rolling Shutter Correction

An exact method of correcting the rolling shutter (RS) effect requires recovering the underlying geometry, i.e. the scene structures and the camera motions between scanlines or between views. However, the multiple-view geometry for RS cameras is much more complicated than its global shutter (GS) counterpart, with various degeneracies. In this paper, we first make a theoretical contribution by showing that RS two-view geometry is degenerate in the case of pure translational camera motion. In view of the complex RS geometry, we then propose a Convolutional Neural Network (CNN)-based method which learns the underlying geometry (camera motion and scene structure) from just a single RS image and perform RS image correction. We call our method structure-and-motion-aware RS correction because it reasons about the concealed motions between the scanlines as well as the scene structure. Our method learns from a large-scale dataset synthesized in a geometrically meaningful way where the RS effect is generated in a manner consistent with the camera motion and scene structure. In extensive experiments, our method achieves superior performance compared to other state-of-the-art methods for single image RS correction and subsequent Structure from Motion (SfM) applications.

[1]  David J. Kriegman,et al.  The Bas-Relief Ambiguity , 2004, International Journal of Computer Vision.

[2]  Zuzana Kukelova,et al.  R6P - Rolling Shutter Absolute Camera Pose , 2015, CVPR 2015.

[3]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Noah Snavely,et al.  Unsupervised Learning of Depth and Ego-Motion from Video , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Gregory D. Hager,et al.  Deep Supervision with Shape Concepts for Occlusion-Aware 3D Object Parsing , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Omar Ait-Aider,et al.  A Robust Method for Strong Rolling Shutter Effects Correction Using Lines with Automatic Feature Selection , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[7]  Ales Leonardis,et al.  Rolling Shutter Correction in Manhattan World , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[8]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[9]  Hongdong Li,et al.  Rolling Shutter Camera Relative Pose: Generalized Epipolar Geometry , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Tomás Pajdla,et al.  Degeneracies in Rolling Shutter SfM , 2016, ECCV.

[11]  Hans-Hellmut Nagel,et al.  The coupling of rotation and translation in motion estimation of planar surfaces , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Hao Su,et al.  A Point Set Generation Network for 3D Object Reconstruction from a Single Image , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Silvio Savarese,et al.  3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction , 2016, ECCV.

[14]  A. N. Rajagopalan,et al.  Unrolling the Shutter: CNN to Correct Motion Distortions , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  A. N. Rajagopalan,et al.  Occlusion-Aware Rolling Shutter Rectification of 3D Scenes , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[16]  Richard Szeliski,et al.  Shape Ambiguities in Structure From Motion , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Omar Ait-Aider,et al.  Rolling Shutter Pose and Ego-Motion Estimation Using Shape-from-Template , 2018, ECCV.

[18]  Ping Tan,et al.  A Global Linear Method for Camera Pose Registration , 2013, 2013 IEEE International Conference on Computer Vision.

[19]  Irfan A. Essa,et al.  Calibration-free rolling shutter removal , 2012, 2012 IEEE International Conference on Computational Photography (ICCP).

[20]  Loong Fah Cheong,et al.  Baseline Desensitizing in Translation Averaging , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[21]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  François Berry,et al.  Structure and kinematics triangulation with a rolling shutter stereo rig , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[23]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[24]  Marc Pollefeys,et al.  A minimal solution to the rolling shutter pose estimation problem , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[25]  Jason J. Corso,et al.  A Continuous Occlusion Model for Road Scene Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Takayuki Okatani,et al.  Self-Calibration-Based Approach to Critical Motion Sequences of Rolling-Shutter Structure from Motion , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  A. N. Rajagopalan,et al.  From Bows to Arrows: Rolling Shutter Rectification of Urban Scenes , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[29]  Noah Snavely,et al.  Robust Global Translations with 1DSfM , 2014, ECCV.

[30]  Berthold K. P. Horn Motion fields are hardly ever ambiguous , 1988, International Journal of Computer Vision.

[31]  Kyungdon Joo,et al.  High Quality Structure from Small Motion for Rolling Shutter Cameras , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[32]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[33]  Rob Fergus,et al.  Depth Map Prediction from a Single Image using a Multi-Scale Deep Network , 2014, NIPS.

[34]  Gilad Adiv,et al.  Inherent Ambiguities in Recovering 3-D Motion and Structure from a Noisy Flow Field , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[35]  Thomas Brox,et al.  A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  Michael Felsberg,et al.  Rolling shutter bundle adjustment , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[37]  Yong-Sheng Chen,et al.  Pyramid Stereo Matching Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[38]  Michael J. Black,et al.  Secrets of optical flow estimation and their principles , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[39]  Yuandong Tian,et al.  Single Image 3D Interpreter Network , 2016, ECCV.

[40]  Jörg Stückler,et al.  Direct Sparse Odometry with Rolling Shutter , 2018, ECCV.

[41]  Guosheng Lin,et al.  Deep convolutional neural fields for depth estimation from a single image , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[42]  Loong Fah Cheong,et al.  Rolling-Shutter-Aware Differential SfM and Image Rectification , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).