论文信息 - Inferring sun direction to improve visual odometry: A deep learning approach

Inferring sun direction to improve visual odometry: A deep learning approach

We present a method to incorporate global orientation information from the sun into a visual odometry pipeline using only the existing image stream, in which the sun is typically not visible. We leverage recent advances in Bayesian convolutional neural networks (BCNNs) to train and implement a sun detection model (dubbed Sun-BCNN) that infers a 3D sun direction vector from a single RGB image. Crucially, our method also computes a principled uncertainty associated with each prediction, using a Monte Carlo dropout scheme. We incorporate this uncertainty into a sliding window stereo visual odometry pipeline where accurate uncertainty estimates are critical for optimal data fusion. We evaluate our method on 21.6 km of urban driving data from the KITTI odometry benchmark where it achieves a median error of approximately 12° and yields improvements of up to 42% in translational average root mean squared error (ARMSE) and 32% in rotational ARMSE compared with standard visual odometry. We further evaluate our method on an additional 10 km of visual navigation data from the Devon Island Rover Navigation dataset, achieving a median error of less than 8° and yielding similar improvements in estimation error. In addition to reporting on the accuracy of Sun-BCNN and its impact on visual odometry, we analyze the sensitivity of our model to cloud cover, investigate the possibility of model transfer between urban and planetary analogue environments, and examine the impact of different methods for computing the mean and covariance of a norm-constrained vector on the accuracy and consistency of the estimated sun directions. Finally, we release Sun-BCNN as open-source software.

[1] Roberto Cipolla,et al. PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[2] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..

[3] Alexei A. Efros,et al. Estimating the Natural Illumination Conditions from a Single Outdoor Image , 2012, International Journal of Computer Vision.

[4] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[5] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6] Carl Christian Liebe,et al. Sun sensing on the Mars exploration rovers , 2002, Proceedings, IEEE Aerospace Conference.

[7] Ji Zhang,et al. Visual-lidar odometry and mapping: low-drift, robust, and fast , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[8] Bolei Zhou,et al. Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.

[9] Nicholas Roy,et al. PROBE-GK: Predictive robust estimation using generalized kernels , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[10] Pablo Fernández Alcantarilla,et al. Noise Models in Feature-based Stereo Visual Odometry , 2016, ArXiv.

[11] Jonathan Kelly,et al. Reducing drift in visual odometry by inferring sun direction using a Bayesian Convolutional Neural Network , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[12] Michael Bosse,et al. Keyframe-based visual–inertial odometry using nonlinear optimization , 2015, Int. J. Robotics Res..

[13] Gaurav S. Sukhatme,et al. Combined Visual and Inertial Navigation for an Unmanned Aerial Vehicle , 2008, FSR.

[14] Yarin Gal,et al. Uncertainty in Deep Learning , 2016 .

[15] Ivan Petrovic,et al. Stereo odometry based on careful feature selection and tracking , 2015, 2015 European Conference on Mobile Robots (ECMR).

[16] Paul Newman,et al. Scene Signatures: Localised and Point-less Features for Localisation , 2014, Robotics: Science and Systems.

[17] John Enright,et al. The Devon Island rover navigation dataset , 2012, Int. J. Robotics Res..

[18] John Enright,et al. Visual odometry aided by a sun sensor and inclinometer , 2011 .

[19] Roberto Cipolla,et al. Modelling uncertainty in deep learning for camera relocalization , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[20] Clark F. Olson,et al. Rover navigation using stereo ego-motion , 2003, Robotics Auton. Syst..

[21] Timothy D. Barfoot,et al. State Estimation for Robotics , 2017 .

[22] Sanja Fidler,et al. Find your way by observing the sun and other semantic cues , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[23] Andreas Geiger,et al. Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[24] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[25] Julius Ziegler,et al. StereoScan: Dense 3d reconstruction in real-time , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[26] John Enright,et al. Sun Sensor Navigation for Planetary Rovers: Theory and Field Testing , 2011, IEEE Transactions on Aerospace and Electronic Systems.

[27] Larry H. Matthies,et al. Visual odometry on the Mars exploration rovers - a tool to ensure accurate driving and science imaging , 2006, IEEE Robotics & Automation Magazine.

[28] R. Fisher. Dispersion on a sphere , 1953, Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences.

[29] Jörg Stückler,et al. Large-scale direct SLAM with stereo cameras , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[30] Timothy D. Barfoot,et al. Visual teach and repeat for long-range rover autonomy , 2010 .

[31] Larry H. Matthies,et al. Two years of Visual Odometry on the Mars Exploration Rovers , 2007, J. Field Robotics.

[32] Robert C. Bolles,et al. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[33] Jonathan Kelly,et al. Improving the Accuracy of Stereo Visual Odometry Using Visual Illumination Estimation , 2016, ISER 2016.

[34] Zoubin Ghahramani,et al. Bayesian Convolutional Neural Networks with Bernoulli Approximate Variational Inference , 2015, ArXiv.

[35] Zoubin Ghahramani,et al. Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , 2015, ICML.

[36] Friedrich Fraundorfer,et al. Visual Odometry Part I: The First 30 Years and Fundamentals , 2022 .

[37] Paul Newman,et al. 1 year, 1000 km: The Oxford RobotCar dataset , 2017, Int. J. Robotics Res..

[38] Andreas Geiger,et al. Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..