Deep360Up: A Deep Learning-Based Approach for Automatic VR Image Upright Adjustment

Spherical VR cameras can capture high-quality immersive VR images with a 360° field of view. However, in practice, when the camera orientation is not straight, the acquired VR image appears tilted when displayed on a VR headset, which diminishes the quality of the VR experience. To overcome this problem, we present a deep learning-based approach that can automatically estimate the orientation of a VR image and return its upright version. In contrast to existing methods, our approach does not require the presence of lines or horizon in the image, and thus can be applied on a wide range of scenes. Extensive experiments and comparisons with state-of-the-art methods have successfully confirmed the validity of our approach.

[1]  Kostas Daniilidis,et al.  Catadioptric Projective Geometry , 2001, International Journal of Computer Vision.

[2]  Thomas Brox,et al.  Image Orientation Estimation with Convolutional Networks , 2015, GCPR.

[3]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[4]  Seungyong Lee,et al.  Automatic Upright Adjustment of Photographs With Robust Camera Calibration , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Stephen T. Barnard,et al.  Interpreting Perspective Image , 1983, Artif. Intell..

[6]  Roland Siegwart,et al.  Appearance-Guided Monocular Omnidirectional Visual Odometry for Outdoor Ground Vehicles , 2008, IEEE Transactions on Robotics.

[7]  In-So Kweon,et al.  Stabilizing Omnidirectional Videos Using 3D Structure and Spherical Image Warping , 2011, MVA.

[8]  João Pedro Barreto,et al.  A unifying geometric representation for central projection systems , 2006, Comput. Vis. Image Underst..

[9]  Seungyong Lee,et al.  Robust upright adjustment of 360 spherical panoramas , 2017, The Visual Computer.

[10]  Andrew C. Gallagher Using vanishing points to correct camera rotation in images , 2005, The 2nd Canadian Conference on Computer and Robot Vision (CRV'05).

[11]  Pascal Vasseur,et al.  Motion estimation by decoupling rotation and translation in catadioptric vision , 2010, Comput. Vis. Image Underst..

[12]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[13]  Krista A. Ehinger,et al.  Recognizing scene viewpoint using panoramic place representation , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Nassir Navab,et al.  Robust Optimization for Deep Regression , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[15]  Katsushi Ikeuchi,et al.  Omnidirectional texturing based on robust 3D registration through Euclidean reconstruction from two spherical images , 2010, Comput. Vis. Image Underst..

[16]  Pascal Vasseur,et al.  UAV Attitude estimation by vanishing points in catadioptric images , 2008, 2008 IEEE International Conference on Robotics and Automation.

[17]  William A. P. Smith,et al.  Structure-From-Motion in Spherical Video Using the von Mises-Fisher Distribution , 2017, IEEE Transactions on Image Processing.

[18]  Yannick Hold-Geoffroy,et al.  A Perceptual Measure for Deep Single Image Camera Calibration , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[19]  Seth J. Teller,et al.  Scalable Extrinsic Calibration of Omni-Directional Image Networks , 2002, International Journal of Computer Vision.

[20]  Marc Pollefeys,et al.  3-line RANSAC for orthogonal vanishing point detection , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[21]  Hao Tang,et al.  Pitch and Roll Camera Orientation from a Single 2D Image Using Convolutional Neural Networks , 2017, 2017 14th Conference on Computer and Robot Vision (CRV).

[22]  Christopher Mei Laser-augmented omnidirectional vision for 3D localisation and mapping , 2007 .

[23]  Michael Guerzhoy,et al.  Automatic Photo Orientation Detection with Convolutional Neural Networks , 2017, 2017 14th Conference on Computer and Robot Vision (CRV).

[24]  Reinhard Koch,et al.  Vanishing Point Estimation and Line Classification in a Manhattan World with a Unifying Camera Model , 2016, International Journal of Computer Vision.

[25]  Kostas Daniilidis,et al.  Monocular visual odometry in urban environments using an omnidirectional camera , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[26]  Paul C. Leopardi A PARTITION OF THE UNIT SPHERE INTO REGIONS OF EQUAL AREA AND SMALL DIAMETER , 2006 .

[27]  Pascal Vasseur,et al.  Rotation estimation and vanishing point extraction by omnidirectional vision in urban environment , 2012, Int. J. Robotics Res..

[28]  Matthew Brand,et al.  SKYLINE2GPS: Localization in urban canyons using omni-skylines , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[29]  Marc Pollefeys,et al.  Globally Optimal Consensus Set Maximization through Rotation Search , 2012, ACCV.

[30]  Xianghua Ying,et al.  Can We Consider Central Catadioptric Cameras and Fisheye Cameras within a Unified Imaging Model , 2004, ECCV.

[31]  Kyungdon Joo,et al.  Globally Optimal Inlier Set Maximization for Atlanta Frame Estimation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[32]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Pascal Vasseur,et al.  UAV Attitude Computation by Omnidirectional Vision in Urban Environment , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[34]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Pascal Vasseur,et al.  Omnidirectional vision on UAV for attitude computation , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[36]  Pascal Vasseur,et al.  Robust Attitude Estimation with Catadioptric Vision , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[37]  Johannes Kopf,et al.  360° video stabilization , 2016, ACM Trans. Graph..